AI Agent that handles engineering tasks end-to-end: integrates with developers’ tools, plans, executes, and iterates until it achieves a successful result.

Rust 3,362 291 Updated Sep 23, 2025

microsoft / tokenweave

Efficient Compute-Communication Overlap for Distributed LLM Inference

Python 61 4 Updated Oct 31, 2025

efeslab / Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Jupyter Notebook 910 44 Updated Oct 29, 2025

NVIDIA / cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 8,720 1,510 Updated Nov 4, 2025

infinigence / FlashOverlap

A lightweight design for computation-communication overlap.

Cuda 183 8 Updated Oct 10, 2025

NVIDIA / nccl-tests

NCCL Tests

Cuda 1,323 326 Updated Nov 3, 2025

ByteDance-Seed / Triton-distributed

Distributed Compiler based on Triton for Parallel Systems

Python 1,210 104 Updated Oct 17, 2025

bytedance / flux

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,161 82 Updated Aug 28, 2025

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 20,333 2,110 Updated Nov 3, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 4,013 558 Updated Nov 4, 2025

juanfont / headscale

An open source, self-hosted implementation of the Tailscale control server

Go 32,506 1,731 Updated Nov 2, 2025

uccl-project / uccl

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 800 73 Updated Nov 4, 2025

NVIDIA / nccl

Optimized primitives for collective multi-GPU communication

C++ 4,204 1,059 Updated Nov 4, 2025

SWE-bench / SWE-smith

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

Python 440 72 Updated Nov 3, 2025

ai-dynamo / nixl

NVIDIA Inference Xfer Library (NIXL)

C++ 699 177 Updated Nov 4, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,208 420 Updated Nov 4, 2025

LoongServe / LoongServe

Jupyter Notebook 124 12 Updated Nov 11, 2024

OpenHands / OpenHands

🙌 OpenHands: Code Less, Make More

Python 64,680 7,863 Updated Nov 4, 2025

zcox10 / repo-coder-v2

Python 1 1 Updated May 12, 2025

Yuhong Zhong yuhong-zhong

Stars

SPFresh / SPFresh

marius-team / quake

mmulet / term.everything

skhynix / cMPI

yichuan-w / LEANN

e2b-dev / infra

flexflow / flexflow-train

zilliztech / claude-context

hashicorp / raft

asg017 / sqlite-vec

bytedance / trae-agent

smallcloudai / refact