Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View ZhaoyueCheng's full-sized avatar

Block or report ZhaoyueCheng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Nano vLLM

Python 7,222 930 Updated Aug 31, 2025

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…

3,355 343 Updated Jul 25, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,813 2,360 Updated Oct 28, 2025

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 1,694 78 Updated Sep 8, 2025

Minimal yet performant LLM examples in pure JAX

Python 187 24 Updated Sep 23, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,446 3,184 Updated Oct 28, 2025

MCP 资源精选, MCP指南,Claude MCP,MCP Servers, MCP Clients

4,800 289 Updated Oct 1, 2025

HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training

C++ 1,037 205 Updated Sep 15, 2025

A self-learning tutorail for CUDA High Performance Programing.

JavaScript 758 75 Updated Jun 30, 2025

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 1,496 297 Updated Oct 27, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,593 1,628 Updated Sep 30, 2025

🧑‍🚀 全世界最好的LLM资料总结(语音视频生成、Agent、辅助编程、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.

6,417 633 Updated Oct 25, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,866 139 Updated Aug 26, 2025

Machine Learning FAQ

392 84 Updated Jan 13, 2023

机器学习工程师、算法工程师、软件工程师、数据科学家-面试指南 | Interview guide for MLE, SDE, DS

220 18 Updated Jul 12, 2025

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Python 8,571 651 Updated Oct 27, 2025

My learning notes/codes for ML SYS.

Python 3,999 242 Updated Oct 6, 2025

🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.

TypeScript 34,714 3,151 Updated Oct 28, 2025

Minimalistic large language model 3D-parallelism training

Python 2,274 251 Updated Sep 3, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 3,969 543 Updated Oct 28, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 51,750 7,541 Updated Oct 28, 2025

💯 Curated coding interview preparation materials for busy software engineers

TypeScript 131,427 15,946 Updated Aug 27, 2025

提供多款 Shadowrocket 规则,拥有强劲的广告过滤功能。每日 8 时重新构建规则。

17,924 1,116 Updated Oct 27, 2025

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 1,509 113 Updated May 28, 2023

Multi-backend recommender systems with Keras 3

Python 145 17 Updated Oct 22, 2025

适用于 Quantumult X 规则整理集合. 所有内容源自 互联网,仅作为收集和整理

JavaScript 3,655 364 Updated Oct 24, 2025

converts Vertex AI API to OpenAI API format.

TypeScript 13 4 Updated Oct 23, 2024

Fast CUDA matrix multiplication from scratch

Cuda 915 131 Updated Sep 2, 2025

Efficient Triton Kernels for LLM Training

Python 5,775 420 Updated Oct 28, 2025

Open weights language model from Google DeepMind, based on Griffin.

Python 653 33 Updated Jun 4, 2025
Next