Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View zhixin612's full-sized avatar
  • Tianjin University
  • Tianjin, China
  • 16:31 (UTC +08:00)

Highlights

  • Pro

Organizations

@TJU-NSL

Block or report zhixin612

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

"AI-Trader: Can AI Beat the Market?"

Python 1,308 304 Updated Oct 28, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 8,215 808 Updated Oct 17, 2025

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 463 42 Updated Oct 27, 2025

Research prototype of PRISM — a cost-efficient multi-LLM serving system with flexible time- and space-based GPU sharing.

Python 37 1 Updated Aug 15, 2025

Fast and memory-efficient exact attention

Python 20,204 2,090 Updated Oct 28, 2025

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…

93,417 25,243 Updated Oct 19, 2025
Python 24 3 Updated Oct 28, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 2,903 213 Updated Oct 28, 2025

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 790 58 Updated Oct 20, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,271 230 Updated Oct 28, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,405 6,452 Updated Oct 28, 2025

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 608 74 Updated Oct 28, 2025

🦜🔗 Build context-aware reasoning applications

Python 118,236 19,469 Updated Oct 27, 2025

2021年最新总结,推荐工程师合适读本,计算机科学,软件技术,创业,思想类,数学类,人物传记书籍

10,688 3,176 Updated Jun 20, 2025

Large Language Model (LLM) Systems Paper List

1,567 83 Updated Oct 18, 2025

A framework for generating realistic LLM serving workloads

Python 73 4 Updated Oct 9, 2025

Awesome LLMs on Device: A Comprehensive Survey

1,237 109 Updated Jan 12, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

8,398 551 Updated Sep 11, 2025

A collection of prompts, system prompts and LLM instructions

HTML 3,949 543 Updated Sep 30, 2025

关于2025年CS保研实验室/导师招生广告的汇总。欢迎想要打广告的小伙伴积极PR,资瓷一下互联网精神吼不吼啊?

177 41 Updated Sep 16, 2025

Tile primitives for speedy kernels

Cuda 2,844 190 Updated Oct 24, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,818 2,361 Updated Oct 28, 2025

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,681 282 Updated Oct 28, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,928 285 Updated May 15, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,526 1,846 Updated Oct 24, 2025

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,639 317 Updated Aug 19, 2025

中国大模型

6,299 536 Updated Nov 30, 2024

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,373 660 Updated Oct 28, 2025
Next