YuchenFan48/README.md

Hi 👋, I'm Yuchen Fan

  • 🔭 I’m currently working on Reinforcement Learning and LLMs

  • 📫 How to reach me: [email protected]

  • ⚡ Fun fact: I am crazy about skiing.

Connect with me:

 yuchenfan48

Pinned

  1. PRIME-RL/ImplicitPRM (Public)

    Repo of paper "Free Process Rewards without Process Labels"

    Python · 169 stars · 11 forks

  2. TsinghuaC3I/Awesome-RL-for-LRMs (Public)

    A Survey of Reinforcement Learning for Large Reasoning Models

    TeX · 2.3k stars · 129 forks

  3. TsinghuaC3I/SSRL (Public)

    SSRL: Self-Search Reinforcement Learning

    Python · 206 stars · 14 forks

  4. THUDM/slime (Public)

    slime is an LLM post-training framework for RL Scaling.

    Python · 4.4k stars · 570 forks