What are your preferred methods for integrating data from different sources?

Gernot Bernkopf
3 replies
# Direct data transfers: Do you prefer tools that move data directly from source to destination (like ETL pipelines), or do you rely more on virtualizing access to data without moving it? # Real-time vs. batch processing: Is real-time data integration critical for your applications, or are batch processes sufficient? What tools do you use to achieve this?

Replies

Simon🍋
ETL tools like Talend or custom Python scripts, depending on data complexity. SQL for joining disparate sources once loaded.
Aria Madeline Sinclair
Fivetran for pulling data from various SaaS tools into a data warehouse. dbt for transformation and modeling once the raw data is loaded. Then good old SQL for analysis across tables.
Benjamin Thomas Clark
Depends on data volume and refresh needs. Cloud ETL like Fivetran or Stitch for scheduled extracts, transformations in the warehouse. Python for advanced pipelines or real-time streaming. Looker or Tableau for visual joining once loaded.