Thanks to visit codestin.com
Credit goes to Github.com

gaocegege

Follow

🐮

Programming

Ce Gao gaocegege

🐮

Programming

Follow

AI Infrastructure / MLSys | Co-founder & CEO @tensorchord | Co-chair @kubeflow | SJTU

3.3k followers · 2.3k following

Sponsors

Achievements

Achievements

Highlights

Developer Program Member

Organizations

Lists (7)

Sort

Kubernetes

Life

Machine Learning

Machine Learning Infra

29 repositories

MIDI PR

Security

Storage

Starred repositories

EnterpriseDB / pgpu

PGPU extends postgres with GPU acceleration

Rust 29 2 Updated Feb 3, 2026

langchain-ai / deepagents

Deep Agents is an agent harness built on langchain and langgraph. Deep Agents are equipped with a planning tool, a filesystem backend, and the ability to spawn subagents - making them well-equipped…

Python 9,278 1,469 Updated Feb 13, 2026

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 2,728 422 Updated Feb 13, 2026

apache / tvm-ffi

Open ABI and FFI for Machine Learning Systems

C++ 337 59 Updated Feb 12, 2026

sgl-project / sglang-jax

JAX backend for SGL

Python 237 68 Updated Feb 13, 2026

Michael-A-Kuykendall / shimmy

⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.

Rust 3,668 280 Updated Jan 16, 2026

generalaction / emdash

Emdash is the Open-Source Agentic Development Environment (🧡 YC W26). Run multiple coding agents in parallel. Use any provider.

TypeScript 1,305 113 Updated Feb 13, 2026

tile-ai / tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,177 449 Updated Feb 13, 2026

IvorySQL / IvorySQL

Open Source Oracle Compatible PostgreSQL.

C 991 167 Updated Feb 10, 2026

hao-ai-lab / LookaheadReasoning

[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning

Python 65 6 Updated Oct 31, 2025

ByteDance-Seed / seed-oss

Python 868 45 Updated Sep 15, 2025

TabbyML / pochi

TypeScript 81 42 Updated Feb 13, 2026

Eugeny / tabby

A terminal for a more modern age

TypeScript 68,877 3,866 Updated Feb 5, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 11,674 1,565 Updated Nov 3, 2025

hhy3 / awesome-vector-search

14 Updated Feb 6, 2026

xhyumiracle / Awesome-AgenticLLM-RL-Papers

1,554 71 Updated Jan 20, 2026

vllm-project / semantic-router

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Go 3,188 536 Updated Feb 13, 2026

supabase-community / database-build

In-browser Postgres sandbox with AI assistance (formerly postgres.new)

TypeScript 2,928 271 Updated Feb 10, 2025

warehouse-pg / warehouse-pg

An Open Source alternative to the Greenplum® Database

C 88 28 Updated Feb 9, 2026

qdrant / benchmark-cohere-wiki-50m

Scripts for benchmarking Qdrant with Cohere Wiki dataset

Python 6 1 Updated May 5, 2025

stepfun-ai / StepMesh

C++ 343 36 Updated Jan 28, 2026

envoyproxy / ai-gateway

Manages Unified Access to Generative AI Services built on Envoy Gateway

Go 1,380 168 Updated Feb 12, 2026

QwenLM / Qwen3-Coder

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.

Python 15,503 1,081 Updated Feb 3, 2026

boson-ai / RPBench-Auto

An automated pipeline for evaluating LLMs for role-playing.

Python 204 10 Updated Sep 14, 2024

meta-pytorch / torchtune

PyTorch native post-training library

Python 5,673 703 Updated Feb 13, 2026

pgsty / dbrank

DBMS Trending Data Analysis

3 Updated Jul 31, 2025

MoonshotAI / Kimi-K2

Kimi K2 is the large language model series developed by Moonshot AI team

10,365 774 Updated Jan 21, 2026

vllm-project / recipes

Common recipes to run vLLM

Jupyter Notebook 413 142 Updated Feb 13, 2026

HKUDS / RAG-Anything

"RAG-Anything: All-in-One RAG Framework"

Python 13,076 1,561 Updated Jan 26, 2026

RahulSChand / gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 1,390 86 Updated Dec 3, 2024

Starred topics

Tensorflow