r1
Here are 28 public repositories matching this topic...
Explore the Multimodal “Aha Moment” on 2B Model
-
Updated
Mar 18, 2025 - Python
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
-
Updated
Oct 21, 2025 - Python
Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".
-
Updated
Feb 19, 2025 - Python
Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents
-
Updated
May 5, 2025 - Python
Doge Family of Small Language Models
-
Updated
Aug 13, 2025 - Python
[ACL 2025] The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning" in PyTorch.
-
Updated
May 16, 2025 - Python
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"
-
Updated
Oct 23, 2025 - Python
Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"
-
Updated
May 26, 2025 - Python
Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".
-
Updated
Jun 4, 2025 - Python
Auto-generate fallback and meter display from existing group info in d&b audiotechnik's R1 and ArrayCalc software.
-
Updated
May 4, 2025 - Python
Recreating the minimal training methods of DeepSeek-R1 for small langauge models.
-
Updated
Feb 10, 2025 - Python
Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization
-
Updated
Mar 12, 2025 - Python
Improve this page
Add a description, image, and links to the r1 topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the r1 topic, visit your repo's landing page and select "manage topics."