Cloudera Inc.
Stars
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
Picocli is a modern framework for building powerful, user-friendly, GraalVM-enabled command line apps with ease. It supports colors, autocompletion, subcommands, and more. In 1 source file so apps …
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Small set of tools for JVM troubleshooting, monitoring and profiling.
A Chaos Engineering Platform for Kubernetes.
Scalable, reliable, distributed storage system optimized for data analytics and object store workloads.
FlatBuffers: Memory Efficient Serialization Library
Fault tolerance and resilience patterns for the JVM
Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.