What are your preferred methods for integrating data from different sources?
Gernot Bernkopf
4 replies
# Direct data transfers: Do you prefer tools that move data directly from source to destination (like ETL pipelines), or do you rely more on virtualizing access to data without moving it?
# Real-time vs. batch processing: Is real-time data integration critical for your applications, or are batch processes sufficient? What tools do you use to achieve this?
Replies
Simon🍋@simonas_kauzonas
Launching soon!
ETL tools like Talend or custom Python scripts, depending on data complexity. SQL for joining disparate sources once loaded.
Share
Fivetran for pulling data from various SaaS tools into a data warehouse. dbt for transformation and modeling once the raw data is loaded. Then good old SQL for analysis across tables.
Depends on data volume and refresh needs. Cloud ETL like Fivetran or Stitch for scheduled extracts, transformations in the warehouse. Python for advanced pipelines or real-time streaming. Looker or Tableau for visual joining once loaded.
Combination of tools for me - Fivetran for extracting data from SaaS apps, dbt for transforming & modeling the data, and Looker to join sources for analysis. Python when I need more customization. Curious what others are using?