Upserts, Deletes And Incremental Processing on Big Data.
-
Updated
May 28, 2024 - Java
Upserts, Deletes And Incremental Processing on Big Data.
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. InfoWorld’s 2023 BOSSIE Award for best open source software.
World's most powerful data catalog service with providing a high-performance, geo-distributed and federated metadata lake.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
lakeFS - Data version control for your data lake | Git for data
An Git-like version control file system for data lineage & data collaboration.
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
Postgres for Search and Analytics
Open Control Plane for Tables in Data Lakehouse
VRE infrastructure running at CERN
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
汇总Apache Hudi相关资料
Blog posts for ParadeDB as .mdx, hosted on Mintlify
An IDE and translation engine for detection engineers and threat hunters. Be faster, write smarter, keep 100% privacy.
Repository for tutorials, information and notes on technology in general.
Add a description, image, and links to the datalake topic page so that developers can more easily learn about it.
To associate your repository with the datalake topic, visit your repo's landing page and select "manage topics."