Stars
🦜🔗 The platform for reliable agents.
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…
Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.
🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 12 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 𝘭𝘦𝘴𝘴𝘰𝘯𝘴
DuckDB extension that adds support for SQL/PGQ and graph algorithms
A modern replacement for Redis and Memcached
New Generation Opensource Data Stack Demo
Apache DataFusion Ballista Distributed Query Engine
A composable and fully extensible C++ execution engine library for data management systems.
Fast map matching, an open source framework in C++
OSM planet dump high performance data loader. Transform OpenStreetMap World/Region PBF dump into partitioned by H3 regions PostGIS pgsnapshot (lossless) OSM schema representation and/or into ArrowI…
Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg
jscastro76 / threebox
Forked from peterqliu/threeboxA Three.js plugin for Mapbox GL JS, with support for animations and advanced 3D rendering.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
LocationTech SFCurve is a Scala library for the creation, transformation, and querying of space-filling curves
Specification for storing geospatial vector data (point, line, polygon) in Parquet
Universal solution for geospatial data tailored to data lakehouse systems for the first time in the industry
Apache Superset is a Data Visualization and Data Exploration Platform
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
Geo Assist is a spatial library to manage spatial data in-memory.
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
Apache Doris is an easy-to-use, high performance and unified analytics database.
Worldwide building footprints derived from satellite imagery