This is a repo with links to everything you'd ever want to learn about data engineering
-
Updated
May 26, 2024
This is a repo with links to everything you'd ever want to learn about data engineering
OpenMetadata is a unified platform for discovery, observability, and governance powered by a central metadata repository, in-depth lineage, and seamless team collaboration.
Compare tables within or across databases
This repository provides various demos/examples of using Snowpark for Python.
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
Efficient data transformation and modeling framework that is backwards compatible with dbt.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Data Engineering Pilipinas is a community for data engineers, data analysts, data scientists, developers, AI / ML engineers, and users of closed and open source data tools and methods / techniques in the Philippines. Data Engineering Pilipinas is a PyData group.
Airflow & DBT Cloud Integrated Project Presented at Lagos DBT Community Meetup & DataFestAfrica 23
Apply for a job at Olist's Data Team: https://olist.gupy.io/
Predict stock price based on financial news feeds
A Data Platform built for AWS, powered by Kubernetes.
Roadmap for Data Engineering
Recohut - Learn data engineering, data science
Forecasting Solar Power: Analysis of using a LSTM Neural Network
An open source development framework to help you build data workflows and modern data architecture on AWS.
Code and data for the Modern Polars book
Tutorial on how to setup Trino and Apache Ranger using docker
Add a description, image, and links to the dataengineering topic page so that developers can more easily learn about it.
To associate your repository with the dataengineering topic, visit your repo's landing page and select "manage topics."