Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
-
Updated
Jun 5, 2024
Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Services
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
dbt adapter for SQL Server and Azure SQL
Handbook for the data and analytics engineering professions
The Tuva Project Docs i.e. where we write and share our knowledge about healthcare data and analytics.
Airbnb Analytics Engineering and Data Warehousing Project
🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.
dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)
Amora Data Build Tool enables analysts and engineers to transform data on the data warehouse (BigQuery) by writing Amora Models that describe the data schema using Python's "PEP484 - Type Hints" and select statements with SQLAlchemy. Amora is able to transform Python code into SQL data transformation jobs that run inside the warehouse.
Neste projeto, são realizadas transformações nos dados da empresa Northwind. ✒️
🥪🏭 A simple CLI for generating synthetic Jaffle Shop data.
A curated list of awesome dbt resources
Streamlit-based analytics dashboard visualizing real-time economic indicators. This project uses cron jobs to provide real-time updates of common economic indicators
Automatically generate DBML files from Snowflake databases for quickly reverse engineer interactive ER diagrams and documentation from your Snowflake DB. Ideal for data engineers and analysts, it supports custom primary key configurations and relationship inference.
A starter dbt project and synthetic claims dataset for trying out the Tuva Project.
Data Engineering project focused on analyzing ordering, invoicing, and sales data at a hotel. It leverages a dataset sourced from Zenodo. The project architecture employs cloud-based technologies, including Google Cloud Platform, Terraform for infrastructure provisioning, Mage for workflow orchestration, Google Cloud
Supplementary Materials for the The Complete dbt (Data Build Tool) Bootcamp Udemy course
Add a description, image, and links to the analytics-engineering topic page so that developers can more easily learn about it.
To associate your repository with the analytics-engineering topic, visit your repo's landing page and select "manage topics."