Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View zhyncs's full-sized avatar
🎯
🎯

Organizations

@baidu @togethercomputer @flashinfer-ai

Block or report zhyncs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

High Performance LLM Inference Operator Library

C++ 562 48 Updated Jan 28, 2026

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,515 240 Updated Jan 29, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 3,412 233 Updated Jan 14, 2026

CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…

MLIR 803 59 Updated Jan 14, 2026

A TUI-based utility for real-time monitoring of InfiniBand traffic and performance metrics on the local node

C 63 6 Updated Dec 19, 2025

Lightweight coding agent that runs in your terminal

Rust 58,020 7,518 Updated Jan 29, 2026

Helpful kernel tutorials and examples for tile-based GPU programming

Python 615 37 Updated Jan 29, 2026

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 944 130 Updated Jan 29, 2026

cocoon

C++ 651 58 Updated Dec 25, 2025

Nex Venus Communication Library

C++ 72 7 Updated Nov 17, 2025

Fast and Furious AMD Kernels

C++ 346 49 Updated Jan 24, 2026

Perplexity open source garden for inference technology

Rust 350 29 Updated Dec 25, 2025

Autonomous GPU Kernel Generation via Deep Agents

Python 223 28 Updated Jan 29, 2026

A Lightweight LLM Post-Training Library

Python 2,132 228 Updated Jan 29, 2026

Kimi Code CLI is your next CLI agent.

Python 4,660 450 Updated Jan 28, 2026

High-throughput tensor loading for PyTorch

Python 221 14 Updated Jan 22, 2026

Contexts Optical Compression

Python 22,280 2,043 Updated Jan 27, 2026

Ship correct and fast LLM kernels to PyTorch

Python 139 15 Updated Jan 14, 2026

The Modular Platform (includes MAX & Mojo)

Mojo 25,515 2,768 Updated Jan 28, 2026

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 732 99 Updated Jan 29, 2026

Ascend TileLang adapter

C++ 199 70 Updated Jan 29, 2026

A unified inference and post-training framework for accelerated video generation.

Python 3,015 248 Updated Jan 29, 2026

Tile primitives for speedy kernels

Cuda 3,108 232 Updated Jan 29, 2026

Building the Virtuous Cycle for AI-driven LLM Systems

Python 141 20 Updated Jan 29, 2026

a size profiler for cuda binary

Python 69 Updated Jan 15, 2026

🐹 Deep clean and optimize your Mac.

Shell 32,286 870 Updated Jan 29, 2026

Post-training with Tinker

Python 2,776 308 Updated Jan 29, 2026

Verify Precision of all Kimi K2 API Vendor

Python 501 27 Updated Jan 27, 2026
Next