Stars
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…
verl: Volcano Engine Reinforcement Learning for LLMs
A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.
Minimal yet performant LLM examples in pure JAX
SGLang is a fast serving framework for large language models and vision language models.
MCP 资源精选, MCP指南,Claude MCP,MCP Servers, MCP Clients
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
A self-learning tutorail for CUDA High Performance Programing.
Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
🧑🚀 全世界最好的LLM资料总结(语音视频生成、Agent、辅助编程、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.
Minimalistic 4D-parallelism distributed training framework for education purpose
机器学习工程师、算法工程师、软件工程师、数据科学家-面试指南 | Interview guide for MLE, SDE, DS
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!
My learning notes/codes for ML SYS.
🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.
Minimalistic large language model 3D-parallelism training
FlashInfer: Kernel Library for LLM Serving
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
💯 Curated coding interview preparation materials for busy software engineers
提供多款 Shadowrocket 规则,拥有强劲的广告过滤功能。每日 8 时重新构建规则。
Measuring Massive Multitask Language Understanding | ICLR 2021
Multi-backend recommender systems with Keras 3
适用于 Quantumult X 规则整理集合. 所有内容源自 互联网,仅作为收集和整理
converts Vertex AI API to OpenAI API format.
Efficient Triton Kernels for LLM Training
Open weights language model from Google DeepMind, based on Griffin.