Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View zhangmenghao's full-sized avatar

Block or report zhangmenghao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
C++ 2 Updated Oct 23, 2025

Secure and fast microVMs for serverless computing.

Rust 30,960 2,128 Updated Nov 3, 2025

Venus Collective Communication Library, supported by SII and Infrawaves.

C++ 108 4 Updated Nov 3, 2025

NVIDIA Inference Xfer Library (NIXL)

C++ 702 177 Updated Nov 5, 2025

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 812 73 Updated Nov 5, 2025

Analyze computation-communication overlap in V3/R1.

1,112 143 Updated Mar 21, 2025

Aims to implement dual-port and multi-qp solutions in deepEP ibrc transport

Cuda 66 3 Updated May 9, 2025

Managed collective communication service

Rust 22 4 Updated Sep 2, 2024
Python 42 6 Updated Aug 27, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 2 Updated Apr 7, 2025

Kolors Team

Python 4,566 352 Updated Nov 13, 2024

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,930 286 Updated May 15, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,289 1,103 Updated Nov 5, 2025

DLRover: An Automatic Distributed Deep Learning System

Python 1,580 198 Updated Nov 5, 2025
C++ 162 30 Updated Nov 5, 2025

CUDA checkpoint and restore utility

C 380 22 Updated Sep 15, 2025

ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale

C++ 457 162 Updated Oct 31, 2025

Lumina is a user-friendly tool to test the correctness and performance of hardware network stacks.

Python 26 6 Updated Jan 8, 2024

Benchmark Test Suite for RDMA Networks

C++ 57 4 Updated Apr 15, 2023

Checkpoint/Restore tool

C 3,462 676 Updated Nov 3, 2025

Initializer for KServe Cluster

Shell 1 1 Updated Jul 29, 2024

P4 codes for research projects

P4 217 58 Updated Nov 3, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,098 31,040 Updated Nov 5, 2025

Large Language Model (LLM) Systems Paper List

1,580 86 Updated Nov 4, 2025

PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for evaluation of training and inference platforms.

Python 153 66 Updated Oct 15, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,041 3,175 Updated Nov 4, 2025

Zeta is a distributed platform for developing and deploying complex, elastic, and highly available multi-tenant network services.

C 20 10 Updated Mar 31, 2023

nsfc - 国家自然科学基金项目LaTeX模版(面青地)

TeX 534 141 Updated Mar 15, 2025
Next