Stars
An open phone agent model and framework, unlocking the AI phone for everyone
A LaTeX style and template for paper preprints (based on the NIPS style)
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
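Step-DPO applies preference optimization at the granularity of individual reasoning steps rather than whole answers. As a rough orientation, here is a minimal sketch of the DPO objective it builds on, written over per-step log-probabilities; the tensor names are assumptions for illustration, not the repo's API.

```python
import torch
import torch.nn.functional as F

# Sketch of a DPO-style loss over a preferred vs. dispreferred step.
# Inputs are summed log-probs of each step under the policy and under
# a frozen reference model (names assumed, not from the repo).
def dpo_loss(policy_chosen_logp: torch.Tensor,
             policy_rejected_logp: torch.Tensor,
             ref_chosen_logp: torch.Tensor,
             ref_rejected_logp: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    # Margin between how much the policy (vs. the reference)
    # prefers the chosen step over the rejected one.
    margin = (policy_chosen_logp - ref_chosen_logp) \
        - (policy_rejected_logp - ref_rejected_logp)
    return -F.logsigmoid(beta * margin).mean()
```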
Muon is an optimizer for hidden layers in neural networks
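Muon's core operation is to orthogonalize the momentum-buffered update of each 2-D hidden-layer weight matrix using a Newton-Schulz iteration. Below is a minimal sketch of that iteration, with the quintic coefficients published in the repo; it is an illustration of the technique, not the repo's exact code.

```python
import torch

def newton_schulz_orthogonalize(G: torch.Tensor, steps: int = 5) -> torch.Tensor:
    # Quintic Newton-Schulz iteration that pushes the singular values
    # of G toward 1, i.e. approximately replaces G with the nearest
    # orthogonal matrix. Coefficients follow the Muon repository.
    a, b, c = 3.4445, -4.7750, 2.0315
    X = G / (G.norm() + 1e-7)  # normalize so the iteration converges
    transposed = X.size(0) > X.size(1)
    if transposed:
        X = X.T  # iterate on the wide orientation
    for _ in range(steps):
        A = X @ X.T
        X = a * X + (b * A + c * A @ A) @ X
    return X.T if transposed else X
```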
An open-source AI agent that brings the power of Gemini directly into your terminal.
NETLIB LP dataset in .mps format, containing 114 feasible and 29 infeasible instances.
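One hedged way to load and solve an instance from this dataset in Python is PuLP, which can parse MPS files and pass them to its bundled CBC solver; the file name below is an assumed instance from the dataset, placed in the working directory.

```python
from pulp import LpProblem, LpStatus

# "afiro.mps" is an assumed NETLIB instance file from this dataset.
variables, problem = LpProblem.fromMPS("afiro.mps")
problem.solve()
print(LpStatus[problem.status], problem.objective.value())
```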
Recent research papers about Foundation Models for Combinatorial Optimization
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
FlashMLA: Efficient Multi-head Latent Attention Kernels
Fully open reproduction of DeepSeek-R1
Train transformer language models with reinforcement learning.
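This is Hugging Face's TRL. A minimal sketch of its GRPO entry point with a toy length-based reward, assuming the `GRPOTrainer` API of recent TRL releases; the dataset and model ids are examples, not requirements.

```python
from datasets import load_dataset
from trl import GRPOConfig, GRPOTrainer

dataset = load_dataset("trl-lib/tldr", split="train")

# Toy reward: prefer completions close to 50 characters.
def reward_len(completions, **kwargs):
    return [-abs(len(c) - 50) for c in completions]

trainer = GRPOTrainer(
    model="Qwen/Qwen2-0.5B-Instruct",
    reward_funcs=reward_len,
    args=GRPOConfig(output_dir="grpo-out"),
    train_dataset=dataset,
)
trainer.train()
```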
verl: Volcano Engine Reinforcement Learning for LLMs
An easy-to-use, scalable, and high-performance RLHF framework based on Ray (PPO, GRPO, REINFORCE++, TIS, vLLM, dynamic sampling, and async agentic RL)
Minimal reproduction of DeepSeek R1-Zero
Optimization modeling using MIP solvers and large language models
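Pipelines like this prompt an LLM to translate a word problem into solver code. As a hedged illustration of the kind of artifact they target, here is a tiny PuLP MIP; the problem data is made up for the example.

```python
from pulp import LpProblem, LpVariable, LpMaximize

# Toy production-planning MIP of the sort an LLM-modeling
# pipeline might emit (all numbers invented for illustration).
prob = LpProblem("production", LpMaximize)
x = LpVariable("chairs", lowBound=0, cat="Integer")
y = LpVariable("tables", lowBound=0, cat="Integer")
prob += 30 * x + 50 * y       # profit objective
prob += 2 * x + 4 * y <= 100  # wood constraint
prob += x + y <= 40           # labor constraint
prob.solve()
print(x.value(), y.value())
```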
Lets your Claude think
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A library to generate LaTeX expressions from Python code.
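This is latexify; a minimal sketch along the lines of its documented decorator usage, which renders a Python function as the equivalent LaTeX string while keeping it callable:

```python
import math
import latexify

@latexify.function
def solve(a, b, c):
    return (-b + math.sqrt(b**2 - 4 * a * c)) / (2 * a)

print(solve)      # prints the LaTeX form of the quadratic formula
print(solve(1, -5, 6))  # still a normal Python function -> 3.0
```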
A series of math-specific large language models built on our Qwen2 series.
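These checkpoints load through standard Hugging Face transformers; a minimal sketch, assuming the `Qwen/Qwen2-Math-7B-Instruct` model id from the Qwen2-Math release:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2-Math-7B-Instruct"  # assumed checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "Solve x^2 - 5x + 6 = 0."}]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=256)[0]))
```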
Code for the paper: Why Transformers Need Adam: A Hessian Perspective
Code for the paper "Language Models are Unsupervised Multitask Learners"
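The GPT-2 repo itself ships TensorFlow inference code; a common alternative route is sampling from the released weights through Hugging Face transformers (a swapped-in approach, not this repo's code):

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

inputs = tokenizer("Language models are", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20, do_sample=True)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```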
Production-ready platform for agentic workflow development.
ORLM: Training Large Language Models for Optimization Modeling
Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793
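Adam-mini's core idea is to share one second-moment scalar per parameter block instead of one per coordinate. A minimal sketch of that update for a single block (simplified to one whole tensor); this illustrates the rule, not the repo's optimizer API.

```python
import math
import torch

def adam_mini_step(p, grad, state, lr=1e-3, betas=(0.9, 0.999), eps=1e-8):
    # Adam update, except the second moment v is one scalar shared
    # across the whole block: the running mean of grad^2.
    state["t"] = state.get("t", 0) + 1
    m = state.setdefault("m", torch.zeros_like(p))
    m.mul_(betas[0]).add_(grad, alpha=1 - betas[0])
    state["v"] = betas[1] * state.get("v", 0.0) \
        + (1 - betas[1]) * grad.pow(2).mean().item()
    m_hat = m / (1 - betas[0] ** state["t"])
    v_hat = state["v"] / (1 - betas[1] ** state["t"])
    with torch.no_grad():
        p.add_(m_hat, alpha=-lr / (math.sqrt(v_hat) + eps))
```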
The simplest and most practical Node.js backend template, suitable for quickly setting up small-scale backends. | The most satisfying way to spin up a small Node.js backend