- 👋 Hi, I’m @CSfufu
- I am currently focus on MLLM reasoning and Reinforcement Learning.
-
Zhejiang University
- Shanghai China
-
04:26
(UTC +08:00)
Highlights
- Pro
Pinned Loading
-
XiaoYee/Awesome_Efficient_LRM_Reasoning
XiaoYee/Awesome_Efficient_LRM_Reasoning Public😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
-
Revisual-R1
Revisual-R1 Public🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement learning—to …
-
hiyouga/EasyR1
hiyouga/EasyR1 PublicEasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
-
shawn0728/ARES
shawn0728/ARES Public🌴 ARES is an open-source framework for adaptive multimodal reasoning, featuring a two-stage pipeline—Adaptive Cold-Start and Entropy-Shaped Policy Optimization—to balance reasoning depth and effici…
Python 9
-
volcengine/verl
volcengine/verl Publicverl: Volcano Engine Reinforcement Learning for LLMs
If the problem persists, check the GitHub status page or contact support.