Renmin University of China
university
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Agentic Entropy-Balanced Policy Optimization
Toward Effective Tool-Integrated Reasoning via Self-Evolved Preference Learning
models
0
None public yet
datasets
0
None public yet