Insights

Blog

Dec 23, 2024

Building a Scalable Reverse ETL Pipeline with Apache Airflow and Snowflake

Discover how to construct a cost-effective and scalable reverse ETL pipeline using Apache Airflow and Snowflake to seamlessly synchronize data from your Snowflake warehouse to HubSpot. This guide covers essential aspects such as state management, dynamic batching, and API integration to ensure efficient and reliable data synchronization. Airflow / Snowflake / ETL / Data Engineering / Python / SQL

Nov 19, 2023

Optimizing Data Quality: dbt Model Coverage Analysis

Explore the power of Python and GitHub Workflows in enhancing data quality with our comprehensive guide. Learn how to analyze dbt model coverage, ensuring robust documentation and efficient testing in your data engineering projects. dbt / Python / CICD

Sep 23, 2023

Harness the Power of dbt Core in Airflow with Astronomer Cosmos on MWAA

Learn how to set up and run Astronomer Cosmos, an open-source package that allows you to run dbt core projects as Apache Airflow DAGs and Task Groups with ease. This blog post, provides a comprehensive guide to running Cosmos on MWAA (Amazon Managed Workflows for Apache Airflow). dbt / Airflow

Aug 30, 2023

Splitting Nested JSON Into Individual Rows in Redshift

Explore the art of transforming JSON data in Amazon Redshift databases, mastering elegant techniques to split and extract data from nested structures. Uncover time-saving strategies for managing one-to-many relationships and enhancing data manipulation workflows. Data Engineering / ETL

Jun 10, 2023

The Power of Pre-Commit and SQLFluff

The Power of Pre-Commit and SQLFluff - Ensuring Consistent SQL Code Across Your Team's Projects Data Engineering / SQL / CICD