Intel - Shanghai
Stars
llm-d benchmark scripts and tooling
intel / sycl-tla
Forked from NVIDIA/cutlass
SYCL* Templates for Linear Algebra (SYCL*TLA) - a SYCL-based CUTLASS implementation for Intel GPUs
Distributed KV cache scheduling & offloading libraries
Accessible large language models via k-bit quantization for PyTorch.
AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solution.
Achieve state of the art inference performance with modern accelerators on Kubernetes
Offline optimization of your disaggregated Dynamo graph
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
📰 Must-read papers and blogs on Speculative Decoding ⚡️
DeepEP: an efficient expert-parallel communication library
HabanaAI / vllm-fork
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
A Datacenter Scale Distributed Inference Serving Framework
The simplest, fastest repository for training/finetuning small-sized VLMs.
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
Real time interactive streaming digital human
Open Source framework for voice and multimodal conversational AI
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
FlashInfer: Kernel Library for LLM Serving
Fast inference from large language models via speculative decoding
An Application Framework for AI Engineering
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
[ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning