xiaoyuyao
🎯
Focusing
  • Cloudera Inc.


World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java 2,669 712 Updated Jan 16, 2026

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Java 1,397 168 Updated Jan 16, 2026

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,292 980 Updated Jan 15, 2026

Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.

Java 285 79 Updated Nov 26, 2025

Picocli is a modern framework for building powerful, user-friendly, GraalVM-enabled command line apps with ease. It supports colors, autocompletion, subcommands, and more. In 1 source file so apps …

Java 5,278 446 Updated Oct 30, 2025

Apache Iceberg

Java 8,440 2,969 Updated Jan 16, 2026

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 8,441 1,593 Updated Jan 17, 2026

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,549 1,986 Updated Jan 17, 2026

Apache Hadoop Ozone

Java 1 1 Updated Jul 13, 2021

Small set of tools for JVM troubleshooting, monitoring and profiling.

Java 3,338 523 Updated Jan 26, 2024

A Chaos Engineering Platform for Kubernetes.

Go 7,489 919 Updated Jan 11, 2026

Scalable, reliable, distributed storage system optimized for data analytics and object store workloads.

Java 1,142 593 Updated Jan 16, 2026

FlatBuffers: Memory Efficient Serialization Library

C++ 25,409 3,479 Updated Dec 22, 2025

An HDFS DataNode based on the Spring Framework

Java 5 Updated Oct 4, 2022

Fault tolerance and resilience patterns for the JVM

Java 4,300 309 Updated Dec 28, 2025

Apache Hadoop

Java 15,448 9,185 Updated Jan 16, 2026

Mirror of Apache Hadoop

Java 2 Updated Oct 7, 2019

Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.

Java 320 82 Updated May 15, 2025