AirflowDataPipeline is a data collection project that automates extracting data from a website and storing it in a SQLite database using Apache Airflow. The pipeline is designed to run on a daily schedule, so the latest data is always available in the database: on each run, data is collected from the website's API in JSON format, then transformed and loaded into the database. In short, AirflowDataPipeline offers a simple and reliable way to collect and manage website data with Airflow.
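In outline, the daily DAG might look something like the sketch below. This is only an illustration, not the project's actual code: the API URL, file paths, table name, and schema are placeholder assumptions; only the DAG id `dag_1` is taken from the run steps that follow.

```python
import json
import sqlite3
from datetime import datetime

import pandas as pd
import requests
from airflow import DAG
from airflow.operators.python import PythonOperator

# Placeholder sketch of a daily extract-transform-load DAG; not the project's actual dag_1.
API_URL = "https://example.com/api/data"  # placeholder: the website's API endpoint
RAW_PATH = "/tmp/raw.json"                # placeholder: staging file for the raw JSON
DB_PATH = "/tmp/pipeline.db"              # placeholder: the SQLite database file


def extract():
    # Pull the latest records from the API as JSON and stage them on disk.
    payload = requests.get(API_URL, timeout=30).json()
    with open(RAW_PATH, "w") as f:
        json.dump(payload, f)


def transform_and_load():
    # Flatten the staged JSON into tabular form and append it to SQLite.
    with open(RAW_PATH) as f:
        records = json.load(f)
    df = pd.json_normalize(records)
    with sqlite3.connect(DB_PATH) as conn:
        df.to_sql("records", conn, if_exists="append", index=False)


with DAG(
    dag_id="dag_1",
    start_date=datetime(2023, 1, 1),
    schedule_interval="@daily",  # run once per day
    catchup=False,
) as dag:
    PythonOperator(task_id="extract", python_callable=extract) >> PythonOperator(
        task_id="transform_and_load", python_callable=transform_and_load
    )
```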
To run with Docker:
- download the repository
- open a terminal in your favorite IDE or on the command line
- in terminal:
- cd docker_demo
- docker build -t image_name .
- docker run -p 8080:8080 image_name
- open http://localhost:8080 in your favorite browser
Log in as admin; the auto-generated password is in standalone_admin_password.txt in the container's file system (you can view it via Docker Desktop)
- Run dag_1 > Trigger DAG
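As an alternative to the UI button, a DAG run can also be started from code through Airflow's stable REST API. This is only a sketch: it assumes the webserver is reachable at localhost:8080 and that the basic-auth API backend is enabled, which is configuration-dependent.

```python
import requests

# Sketch: trigger dag_1 via the stable REST API instead of the UI.
# Assumes the basic-auth API backend is enabled (configuration-dependent)
# and that ADMIN_PASSWORD holds the value from standalone_admin_password.txt.
ADMIN_PASSWORD = "replace-me"

response = requests.post(
    "http://localhost:8080/api/v1/dags/dag_1/dagRuns",
    auth=("admin", ADMIN_PASSWORD),
    json={"conf": {}},
)
print(response.status_code, response.json())
```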
To run locally without Docker, open a Linux command line:
- mkdir /workspace
- apt update
- apt upgrade
- apt install python3-pip
- apt install python3-venv
- python3 -m venv /workspace/venv
- source /workspace/venv/bin/activate
- pip install virtualenv
- pip install pandas
- pip install apache-airflow
- export AIRFLOW_HOME=/workspace/airflow
- airflow version
- add your DAG files to /workspace/airflow/dags
- airflow standalone
- open http://localhost:8080 in your favorite browser
Log in as admin; the password is in /workspace/airflow/standalone_admin_password.txt or printed in the terminal output of airflow standalone
- Run dag_1 > Trigger DAG
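After a run completes, you can sanity-check that rows actually landed in the database. A minimal sketch, reusing the placeholder database path and table name from the DAG sketch above:

```python
import sqlite3

# Count the rows the DAG loaded; the path and table name are the
# placeholders from the sketch above, not the project's real ones.
with sqlite3.connect("/tmp/pipeline.db") as conn:
    (count,) = conn.execute("SELECT COUNT(*) FROM records").fetchone()
print(f"{count} rows in the records table")
```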