r1
Here are 55 public repositories matching this topic...
🚀🚀🚀 A collection of awesome public projects about Large Language Models (LLM), Vision Language Models (VLM), Vision Language Action (VLA), AI Generated Content (AIGC), and the related datasets and applications.
Updated May 3, 2025
Explore the Multimodal “Aha Moment” on 2B Model
Updated Mar 18, 2025 - Python
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
Updated Apr 30, 2025 - Python
Latest Advances on Long Chain-of-Thought Reasoning
Updated Apr 13, 2025
Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".
Updated Feb 19, 2025 - Python
Model Context Protocol server for DeepSeek's advanced language models
Updated Mar 27, 2025 - JavaScript
😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond
Updated Apr 28, 2025
Doge Family of Small Language Model
Updated May 3, 2025 - Python
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"
Updated May 6, 2025 - Python
Code for "UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning"
Updated May 6, 2025 - Python
A comprehensive collection of process reward models.
Updated Apr 23, 2025
Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".
Updated Apr 24, 2025 - Python
Official implementation of GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents
Updated May 5, 2025 - Python
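Uses langchain for task planning and builds conversation-scenario resources for each subtask. An MCTS task executor then lets each subtask draw on the resources in its context and explore through self-reflection to reach its own best answer to the problem. This approach depends on the model's alignment preferences; for each preference we designed an engineering framework that applies a sampling strategy over the model's self-assigned rewards for different answers.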
Updated May 6, 2025 - Jupyter Notebook
Auto-generate fallback and meter display from existing group info in d&b audiotechnik's R1 and ArrayCalc software.
Updated May 4, 2025 - Python