COVID Data Aggregation

  • Airflow
  • Python
  • SQL
  • Amazon S3
  • Snowflake

What is it?

A repo of Apache Airflow DAGs that fetch and aggregate daily state-related COVID data from the covidtracking.com API, push that data as a series of CSV files to Amazon S3, and then load those CSVs downstream into Snowflake tables using the S3 to Snowflake Transfer Operator.

These DAGs are used for various Airflow trainings and help explain the fundamentals of Airflow from the ground up.