delta-lake
Here are 140 public repositories matching this topic...
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. InfoWorld’s 2023 BOSSIE Award for best open source software.
-
Updated
May 30, 2024 - Java
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
-
Updated
May 30, 2024 - Scala
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
-
Updated
May 30, 2024 - Java
This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you could also launch an EMR notebook via cluster template to check the outcome from the EMR Serverless application.
-
Updated
May 30, 2024 - TypeScript
Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.
-
Updated
May 29, 2024 - Python
Hackolade plugin for Delta Lake on Databricks
-
Updated
May 29, 2024 - JavaScript
🦖 Efficiently evolve your old fixed-length data files into more modern file formats, fully parallelized!
-
Updated
May 29, 2024 - Rust
A native Rust library for Delta Lake, with bindings into Python
-
Updated
May 29, 2024 - Rust
An open protocol for secure data sharing
-
Updated
May 29, 2024 - Scala
DataPulse is a platform for developers to build, schedule and monitor data pipelines.
-
Updated
May 29, 2024 - JavaScript
Analytical database for data-driven Web applications 🪶
-
Updated
May 29, 2024 - Rust
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
-
Updated
May 29, 2024 - Rust
Free High-Quality Financial Data in Azure
-
Updated
May 29, 2024 - Python
-
Updated
May 28, 2024 - Jupyter Notebook
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
-
Updated
May 28, 2024 - Java
Schema mappings in SQL and PySpark for ELT pipelines to normalize data to OCSF
-
Updated
May 28, 2024 - Python
Amazon SageMaker Local Mode Examples
-
Updated
May 21, 2024 - Python
A Minimalistic Rust Implementation of Delta Sharing Server.
-
Updated
May 21, 2024 - Rust
Improve this page
Add a description, image, and links to the delta-lake topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the delta-lake topic, visit your repo's landing page and select "manage topics."