Stars
🐶 Kubernetes CLI To Manage Your Clusters In Style!
Empowering everyone to build reliable and efficient software.
Apache Spark - A unified analytics engine for large-scale data processing
Postgres change data capture to streams, queues, and search indexes like Kafka, SQS, Elasticsearch, HTTP endpoints, and more
⚡️ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-powered business intelligence in seconds.
Distributed stream processing engine in Rust
AutoMQ is a diskless Kafka® on S3. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency. Multi-AZ Availability.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
A lightweight data processing framework built on DuckDB and 3FS.
Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.
Java bindings for H3, a hierarchical hexagonal geospatial indexing system
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
Multi-container environment with Hadoop, Spark and Hive
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)