Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View Cartus's full-sized avatar
🍭
Focusing
🍭
Focusing

Highlights

  • Pro

Block or report Cartus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Language Models (LLMs).

70 5 Updated Oct 23, 2025

Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"

Python 47 Updated Sep 29, 2025

[NeurIPS 2025🔥]Main source code of SRPO framework.

Python 176 18 Updated Sep 21, 2025

Code repo for FaStFact: Faster, Stronger Long-Form Factuality Evaluations in LLMs.

3 Updated Sep 20, 2025
Python 2 Updated Sep 8, 2025

Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training

Python 39 3 Updated Aug 25, 2025

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization

Python 9 Updated Aug 20, 2025

The official implemention of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration"

Python 15 Updated Oct 10, 2025

[ICML 2025🔥] ParallelComp: Parallel Long-Context Compressor for Length Extrapolation

Python 28 Updated Jun 16, 2025

An LLM framework for deep and efficient scientific peer review

Python 6 2 Updated Jun 7, 2025

Extract information from various climate scientific graphics to combat misinformation and support scientific communication

Python 6 Updated Sep 23, 2025

Latest Advances on Federated LLM Learning

72 4 Updated Jul 7, 2025

[NeurIPS'25] EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code

Python 4 Updated Oct 22, 2025

Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"

Python 112 10 Updated Aug 28, 2025

Code for Research Project TLDR

Python 23 Updated Jul 28, 2025

[NeurIPS 2025 D&B (Spotlight🌟)] TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenario

Python 18 Updated Oct 5, 2025

[PVLDB 2024 Best Paper Nomination] TFB: Towards Comprehensive and Fair Benchmarking of Time Series Forecasting Methods

Shell 1,050 77 Updated Oct 15, 2025

[NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling

Python 590 51 Updated Jun 16, 2025

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,228 57 Updated Oct 18, 2025

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Python 204 9 Updated Sep 26, 2025

A series of technical report on Slow Thinking with LLM

Python 742 41 Updated Aug 13, 2025

The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static to dynamic evaluation"

45 2 Updated Sep 13, 2025
Python 16 3 Updated Nov 20, 2024

Efficient triton implementation of Native Sparse Attention.

Python 239 17 Updated May 23, 2025

CiteCheck: Towards Accurate Citation Faithfulness Detection

Python 3 Updated Feb 18, 2025

Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"

Python 138 5 Updated Oct 23, 2025

Official Repo for Open-Reasoner-Zero

Python 2,054 119 Updated Jun 2, 2025

Latest Advances on System-2 Reasoning

Python 1,257 72 Updated Jun 8, 2025
Next