Intel · Shanghai (UTC+08:00)
Stars
Helm charts for deploying models with llm-d
Distributed KV cache scheduling & offloading libraries
Accessible large language models via k-bit quantization for PyTorch.
AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solution.
Achieve state of the art inference performance with modern accelerators on Kubernetes
Offline optimization of your disaggregated Dynamo graph
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
📰 Must-read papers and blogs on Speculative Decoding ⚡️
DeepEP: an efficient expert-parallel communication library
A Datacenter Scale Distributed Inference Serving Framework
The simplest, fastest repository for training/finetuning small-sized VLMs.
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
Real time interactive streaming digital human
Open Source framework for voice and multimodal conversational AI
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
FlashInfer: Kernel Library for LLM Serving
Fast inference from large language models via speculative decoding
An Application Framework for AI Engineering
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
[ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
Development repository for the Triton language and compiler