Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View gaocegege's full-sized avatar
🐮
Programming
🐮
Programming

Sponsors

@dravenk
@rezmoss

Block or report gaocegege

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

PGPU extends postgres with GPU acceleration

Rust 29 2 Updated Feb 3, 2026

Deep Agents is an agent harness built on langchain and langgraph. Deep Agents are equipped with a planning tool, a filesystem backend, and the ability to spawn subagents - making them well-equipped…

Python 9,278 1,469 Updated Feb 13, 2026

A framework for efficient model inference with omni-modality models

Python 2,728 422 Updated Feb 13, 2026

Open ABI and FFI for Machine Learning Systems

C++ 337 59 Updated Feb 12, 2026

JAX backend for SGL

Python 237 68 Updated Feb 13, 2026

⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.

Rust 3,668 280 Updated Jan 16, 2026

Emdash is the Open-Source Agentic Development Environment (🧡 YC W26). Run multiple coding agents in parallel. Use any provider.

TypeScript 1,305 113 Updated Feb 13, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,177 449 Updated Feb 13, 2026

Open Source Oracle Compatible PostgreSQL.

C 991 167 Updated Feb 10, 2026

[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning

Python 65 6 Updated Oct 31, 2025
Python 868 45 Updated Sep 15, 2025
TypeScript 81 42 Updated Feb 13, 2026

A terminal for a more modern age

TypeScript 68,877 3,866 Updated Feb 5, 2026

Nano vLLM

Python 11,674 1,565 Updated Nov 3, 2025

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Go 3,188 536 Updated Feb 13, 2026

In-browser Postgres sandbox with AI assistance (formerly postgres.new)

TypeScript 2,928 271 Updated Feb 10, 2025

An Open Source alternative to the Greenplum® Database

C 88 28 Updated Feb 9, 2026

Scripts for benchmarking Qdrant with Cohere Wiki dataset

Python 6 1 Updated May 5, 2025
C++ 343 36 Updated Jan 28, 2026

Manages Unified Access to Generative AI Services built on Envoy Gateway

Go 1,380 168 Updated Feb 12, 2026

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.

Python 15,503 1,081 Updated Feb 3, 2026

An automated pipeline for evaluating LLMs for role-playing.

Python 204 10 Updated Sep 14, 2024

PyTorch native post-training library

Python 5,673 703 Updated Feb 13, 2026

DBMS Trending Data Analysis

3 Updated Jul 31, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

10,365 774 Updated Jan 21, 2026

Common recipes to run vLLM

Jupyter Notebook 413 142 Updated Feb 13, 2026

"RAG-Anything: All-in-One RAG Framework"

Python 13,076 1,561 Updated Jan 26, 2026

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 1,390 86 Updated Dec 3, 2024
Next