Twitter data is extracted with ETL data pipelines, and the workflows are implemented in Airflow.
Updated Feb 15, 2023 - Python
Apache Airflow Cheatsheet
Automate Apache Spark in Hadoop with Airflow in Cloud
Starter package to setup Apache Airflow locally.
Scheduled data extraction, transformation, and loading using Apache Airflow and PySpark.
This repo contains the concepts of Apache Airflow and the practical implementation I'll be doing while learning.
An example Apache Airflow DAG-definition source repository, to be used with the Airflow DAG Aggregator.
A simple DAG for triggering a Cloud Data Fusion pipeline using Apache Airflow.
Udacity project within the Data Engineer Nanodegree
Project from Udacity's Data Engineering Nanodegree course
Playing around with Airflow
Setup for Apache Airflow with Docker.
A simple DataOps setup for a wine dataset on Docker
Celery and Kubernetes operators are used to manage data engineering pipelines for stock and cryptocurrency prices
An ETL pipeline that uses autoscaling
This repository contains the projects I completed in the Udacity Data Engineering Nanodegree.