Pinned Loading
-
HuatuoGPT-o1
HuatuoGPT-o1 PublicForked from FreedomIntelligence/HuatuoGPT-o1
Medical o1, Towards medical complex reasoning with LLMs
Python
-
Search-R1
Search-R1 PublicForked from PeterGriffinJin/Search-R1
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Python
-
-
-
verl
verl PublicForked from verl-project/verl
verl: Volcano Engine Reinforcement Learning for LLMs
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.