AI Agent that handles engineering tasks end-to-end: integrates with developers’ tools, plans, executes, and iterates until it achieves a successful result.

Rust 3,356 288 Updated Sep 23, 2025

microsoft / tokenweave

Efficient Compute-Communication Overlap for Distributed LLM Inference

Python 61 4 Updated Oct 1, 2025

efeslab / Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Jupyter Notebook 909 44 Updated Oct 22, 2025

NVIDIA / cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 8,683 1,499 Updated Oct 22, 2025

infinigence / FlashOverlap

A lightweight design for computation-communication overlap.

Cuda 182 8 Updated Oct 10, 2025

NVIDIA / nccl-tests

NCCL Tests

Cuda 1,306 322 Updated Oct 25, 2025

ByteDance-Seed / Triton-distributed

Distributed Compiler based on Triton for Parallel Systems

Python 1,201 99 Updated Oct 17, 2025

bytedance / flux

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,154 82 Updated Aug 28, 2025

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 20,198 2,090 Updated Oct 28, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 3,968 542 Updated Oct 28, 2025

juanfont / headscale

An open source, self-hosted implementation of the Tailscale control server

Go 32,005 1,706 Updated Oct 28, 2025

uccl-project / uccl

Ultra and Unified CCL

C++ 629 51 Updated Oct 28, 2025

NVIDIA / nccl

Optimized primitives for collective multi-GPU communication

C++ 4,186 1,049 Updated Oct 18, 2025

SWE-bench / SWE-smith

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

Python 435 69 Updated Oct 27, 2025

ai-dynamo / nixl

NVIDIA Inference Xfer Library (NIXL)

C++ 688 171 Updated Oct 28, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,160 409 Updated Oct 28, 2025

LoongServe / LoongServe

Jupyter Notebook 124 12 Updated Nov 11, 2024

OpenHands / OpenHands

🙌 OpenHands: Code Less, Make More

Python 64,497 7,831 Updated Oct 28, 2025

zcox10 / repo-coder-v2

Python 1 1 Updated May 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yuhong Zhong yuhong-zhong

Achievements

Achievements

Highlights

Block or report yuhong-zhong

Stars

SPFresh / SPFresh

marius-team / quake

mmulet / term.everything

skhynix / cMPI

yichuan-w / LEANN

e2b-dev / infra

flexflow / flexflow-train

zilliztech / claude-context

hashicorp / raft

asg017 / sqlite-vec

bytedance / trae-agent

smallcloudai / refact