Yuchen Fan YuchenFan48

💭

Moving Forward

23 followers · 4 following

YuchenFan48/README.md

Hi 👋, I'm Yuchen Fan

🔭 I’m currently working on Reinforcement Learning and LLMs
📫 How to reach me: [email protected]
⚡ Fun fact: I am crazy about skiing.

Pinned

PRIME-RL/ImplicitPRM (Public)

Repo of paper "Free Process Rewards without Process Labels"

Python · 169 stars · 11 forks

TsinghuaC3I/Awesome-RL-for-LRMs (Public)

A Survey of Reinforcement Learning for Large Reasoning Models

TeX · 2.3k stars · 129 forks

TsinghuaC3I/SSRL (Public)

SSRL: Self-Search Reinforcement Learning

Python · 206 stars · 14 forks

THUDM/slime (Public)

slime is an LLM post-training framework for RL Scaling.

Python · 4.4k stars · 570 forks