Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View answerleo's full-sized avatar

Block or report answerleo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Havenask is a large-scale distributed information search system widely used within Alibaba Group

C++ 1,796 336 Updated Nov 3, 2025

A distributed, fast open-source graph database featuring horizontal scalability and high availability

C++ 11,963 1,282 Updated Oct 22, 2025

PerfKit Benchmarker (PKB) contains a set of benchmarks to measure and compare cloud offerings. The benchmarks use default settings to reflect what most users will see. PerfKit Benchmarker is licens…

Python 1,992 543 Updated Jan 16, 2026

Enjoy the magic of Diffusion models!

Python 11,478 1,095 Updated Jan 15, 2026

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 1,681 338 Updated Jan 16, 2026

FlashInfer: Kernel Library for LLM Serving

Python 4,687 653 Updated Jan 16, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,503 4,080 Updated Jan 17, 2026

A guidance language for controlling large language models.

Jupyter Notebook 21,184 1,138 Updated Jan 6, 2026

Sourcetrail - free and open-source interactive source explorer

C++ 16,380 1,635 Updated Dec 13, 2021

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 40,113 2,705 Updated Jan 17, 2026

A family of compressed models obtained via pruning and knowledge distillation

363 18 Updated Nov 6, 2025

[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filli…

Python 1,172 74 Updated Sep 30, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,592 509 Updated Jan 17, 2026

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,237 1,289 Updated May 23, 2024

Transformer related optimization, including BERT, GPT

C++ 6,383 928 Updated Mar 27, 2024

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,651 2,017 Updated Jan 17, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,709 12,641 Updated Jan 17, 2026
C++ 520 43 Updated Jan 6, 2026

Continuous Profiling Platform. Debug performance issues down to a single line of code

Go 11,147 716 Updated Jan 17, 2026

DeepSeek Coder: Let the Code Write Itself

Python 22,651 2,706 Updated Nov 11, 2025

Modeling, training, eval, and inference code for OLMo

Python 6,287 696 Updated Nov 24, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,885 1,899 Updated Dec 31, 2025

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,879 299 Updated Jan 16, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,690 192 Updated Jun 25, 2024

Tutel MoE: Optimized Mixture-of-Experts Library, Support GptOss/DeepSeek/Kimi-K2/Qwen3 using FP8/NVFP4/MXFP4

C 956 107 Updated Dec 21, 2025
Python 1,522 220 Updated Jun 26, 2025

APM, Application Performance Monitoring System

Java 24,687 6,627 Updated Jan 16, 2026

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

792 56 Updated May 8, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 63,083 7,917 Updated Oct 4, 2025

收集和梳理垂直领域的开源模型、数据集及评测基准。

2,564 201 Updated Dec 26, 2023
Next