

Distributed Compiler based on Triton for Parallel Systems

Python 1,186 97 Updated Oct 17, 2025
Python 27 3 Updated Oct 23, 2025

Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.

Python 1,603 106 Updated Oct 22, 2025

Fast OS-level support for GPU checkpoint and restore

C++ 245 26 Updated Sep 28, 2025

CloudSim: A Framework For Modeling And Simulation Of Cloud Computing Infrastructures And Services

Java 944 538 Updated Sep 25, 2025

Collaborative Datacenter Simulation and Exploration for Everybody

Kotlin 97 63 Updated Oct 17, 2025

Perplexity GPU Kernels

C++ 500 65 Updated Sep 19, 2025

Borg cluster traces from Google

TeX 993 204 Updated Aug 14, 2025

Library for text-to-text regression that applies to any input string representation and allows pretraining and fine-tuning over multiple regression tasks.

Python 277 36 Updated Oct 22, 2025

Model Context Protocol Servers

TypeScript 71,076 8,487 Updated Oct 20, 2025
C++ 308 26 Updated Oct 1, 2025

m3fs (Make 3FS) is a toolset designed to deploy a 3FS cluster.

Go 52 9 Updated Jul 18, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,337 277 Updated Oct 21, 2025

The P programming language.

C# 3,450 203 Updated Sep 30, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,400 952 Updated Oct 22, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,659 588 Updated Oct 23, 2025

RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

Python 3,186 328 Updated Oct 20, 2025

Open-source implementation of AlphaEvolve

Python 4,238 625 Updated Oct 20, 2025

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 267 28 Updated Oct 22, 2025

Microsoft Collective Communication Library

C++ 368 32 Updated Sep 20, 2023

Analyze computation-communication overlap in V3/R1.

1,110 143 Updated Mar 21, 2025

DeepSeek-V3/R1 inference performance simulator

Jupyter Notebook 170 22 Updated Mar 27, 2025

A High-Throughput Parallel Lossless Compressor for Scientific Data

C++ 72 16 Updated Jan 22, 2023

Expert Parallelism Load Balancer

Python 1,282 195 Updated Mar 24, 2025

Achieve state-of-the-art inference performance with modern accelerators on Kubernetes

Shell 1,914 205 Updated Oct 23, 2025

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,241 174 Updated Aug 19, 2025

Unified Collective Communication Library

C 278 117 Updated Oct 22, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,661 659 Updated Oct 22, 2025

Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)

C 1,479 489 Updated Oct 23, 2025