Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).
-
Updated
Feb 10, 2024 - Python
Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).
🌟 Align diffusion processes with detailed human preferences to improve machine learning models for richer, more accurate outputs.
Add a description, image, and links to the srpo topic page so that developers can more easily learn about it.
To associate your repository with the srpo topic, visit your repo's landing page and select "manage topics."