Thanks to visit codestin.com
Credit goes to github.com

Rainbowend

Follow

Rainbowend

Follow

1 follower · 8 following

Popular repositories Loading

MPTS MPTS Public

Forked from thu-rllab/MPTS

Model Predictive Task Sampling

Python
MoPPS MoPPS Public

Forked from thu-rllab/MoPPS

[KDD 2026] Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning of Reasoning Models?

Python
PDTS PDTS Public

Forked from thu-rllab/PDTS

[ICML 2025] Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments

Python