Thanks to visit codestin.com
Credit goes to github.com

RyanLiu112

Follow

🎯

Focusing

Runze Liu RyanLiu112

🎯

Focusing

Follow

Master's student @ THU

44 followers · 18 following

Tsinghua University
China
09:34 (UTC +08:00)
https://ryanliu112.github.io
https://scholar.google.com/citations?user=LiIfGakAAAAJ

Achievements

Achievements

Highlights

Pro

Pinned Loading

TsinghuaC3I/Awesome-RL-for-LRMs TsinghuaC3I/Awesome-RL-for-LRMs Public

A Survey of Reinforcement Learning for Large Reasoning Models

2k 112
TsinghuaC3I/MARTI TsinghuaC3I/MARTI Public

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 336 34
compute-optimal-tts compute-optimal-tts Public

Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".

Python 275 23
Awesome-Process-Reward-Models Awesome-Process-Reward-Models Public

A comprehensive collection of process reward models.

117 3
GenPRM GenPRM Public

[AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".

Python 85 2
wizard-III/ArcherCodeR wizard-III/ArcherCodeR Public

ArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement learning.

Python 43 2