-
Notifications
You must be signed in to change notification settings - Fork 0
This project presents a robust data pipeline using Apache Airflow for orchestration, Apache Kafka for real-time data streaming, and MongoDB for data storage. It automates the process of web scraping to collect large companies' data, transforms and processes this data, and then stores it efficiently.
WALIDAADI/ETL_using_Airflow
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
About
This project presents a robust data pipeline using Apache Airflow for orchestration, Apache Kafka for real-time data streaming, and MongoDB for data storage. It automates the process of web scraping to collect large companies' data, transforms and processes this data, and then stores it efficiently.
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published