Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View ywqzzy's full-sized avatar
🎯
Keep retring
🎯
Keep retring
  • Beijing

Block or report ywqzzy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

rocksdb in rust

Rust 12 Updated Oct 27, 2025

The best ChatGPT that $100 can buy.

Python 33,583 3,712 Updated Oct 25, 2025

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 1,549 131 Updated Sep 22, 2025

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 14,859 1,116 Updated Oct 27, 2025

An transformer based LLM. Written completely in Rust

Rust 2,909 244 Updated Oct 10, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,866 139 Updated Aug 26, 2025

The financial transactions database designed for mission critical safety and performance.

Zig 14,156 703 Updated Oct 28, 2025

Distributed query engine providing simple and reliable data processing for any modality and scale

Rust 4,648 324 Updated Oct 27, 2025

Static suckless single batch CUDA-only qwen3-0.6B mini inference engine

Cuda 503 41 Updated Sep 8, 2025

Materials for learning SGLang

621 50 Updated Oct 26, 2025

Java raft/config/mq/rpc engine, zero dependencies, 10X faster

Java 204 18 Updated Oct 8, 2025
C++ 309 26 Updated Oct 1, 2025

Patterns and resources of low latency programming.

727 27 Updated Jul 30, 2025

Ultra and Unified CCL

C++ 628 51 Updated Oct 27, 2025

The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing

Rust 1,629 187 Updated Oct 27, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,315 476 Updated Oct 26, 2025

Nano vLLM

Python 7,218 929 Updated Aug 31, 2025

Open-source vector similarity search for Postgres

C 18,096 931 Updated Oct 22, 2025

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Rust 26,777 1,870 Updated Oct 27, 2025

DuckLake is an integrated data lake and catalog format

C++ 2,170 104 Updated Oct 18, 2025

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 4,869 680 Updated Oct 18, 2025

VictoriaMetrics: fast, cost-effective monitoring solution and time series database

Go 15,234 1,479 Updated Oct 27, 2025

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

C++ 519 49 Updated Sep 13, 2025

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,361 225 Updated Oct 12, 2025

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,636 317 Updated Aug 19, 2025

Large Language Model (LLM) Systems Paper List

1,567 83 Updated Oct 18, 2025

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

C++ 906 106 Updated Oct 27, 2025

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

Python 8,158 880 Updated Oct 27, 2025

Accelerate inference without tears

Python 363 21 Updated Oct 14, 2025

Upserts, Deletes And Incremental Processing on Big Data.

Java 6,001 2,438 Updated Oct 27, 2025
Next