Highlights
Stars
Apache DataFusion Comet Spark Accelerator
The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
Uniffle is a high performance, general purpose Remote Shuffle Service.
FlatBuffers: Memory Efficient Serialization Library
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Firebase + Flutter sample apps with code snippets, supported by comprehensive articles for each implementation.
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
Upserts, Deletes And Incremental Processing on Big Data.
Focus on what matters instead of fighting with Git.
Advanced data structure and algorithm for system design,系统设计需要了解的算法
Custom memory allocators in C++ to improve the performance of dynamic memory allocation
Lab Materials for MIT 6.S191: Introduction to Deep Learning
A redesign of Nodejs.org built using Gatsby.js with React.js, TypeScript, and Remark.
A collection of awesome readme templates to display on your profile
📨 An open letter to GitHub from the maintainers of open source projects
🔗 Some useful websites for programmers.
Visually explore, understand, and present your data.
A High-Quality Real Time Upscaler for Anime Video
A friendly programming language from the future
Ah shhgit! Find secrets in your code. Secrets detection for your GitHub, GitLab and Bitbucket repositories.
Git Source Code Mirror - This is a publish-only repository but pull requests can be turned into patches to the mailing list via GitGitGadget (https://gitgitgadget.github.io/). Please follow Documen…
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
A list of developer portfolios for your inspiration