University of Cambridge
- Tonga
(UTC +08:00) - https://cartus.github.io/
- @ZhijiangG
Highlights
- Pro
Stars
A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Language Models (LLMs).
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
[NeurIPS 2025🔥] Main source code of the SRPO framework.
Code repo for FaStFact: Faster, Stronger Long-Form Factuality Evaluations in LLMs.
Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training
Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization
The official implementation of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration"
[ICML 2025🔥] ParallelComp: Parallel Long-Context Compressor for Length Extrapolation
An LLM framework for deep and efficient scientific peer review
Extract information from various climate scientific graphics to combat misinformation and support scientific communication
Latest Advances on Federated LLM Learning
[NeurIPS'25] EffiBench-X: A Multi-Language Benchmark for Measuring Efficiency of LLM-Generated Code
Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"
[NeurIPS 2025 D&B (Spotlight🌟)] TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios
[PVLDB 2024 Best Paper Nomination] TFB: Towards Comprehensive and Fair Benchmarking of Time Series Forecasting Methods
[NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling
This repository provides a valuable reference for researchers in the field of multimodality; start your exploration of RL-based Reasoning MLLMs here!
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources
A series of technical reports on Slow Thinking with LLMs
The official GitHub repository of the paper "Recent advances in large language model benchmarks against data contamination: From static to dynamic evaluation"
Efficient triton implementation of Native Sparse Attention.
CiteCheck: Towards Accurate Citation Faithfulness Detection
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"
Official Repo for Open-Reasoner-Zero
Latest Advances on System-2 Reasoning