Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Official code implementation of ICML 2025 paper: Reward-free World Models for Online Imitation Learning

Notifications You must be signed in to change notification settings

TobyLeelsz/iqmpc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Reward-free World Models for Online Imitation Learning

Official code implementation of ICML 2025 paper: Reward-free World Models for Online Imitation Learning [Paper Link]

Shangzhe Li, Zhiao Huang, Hao Su

demo_IQMPC

Introduction

IQ-MPC is a world model designed for online imitation learning. It leverages the inverse soft-Q objective to train the critic, enabling effective policy learning from limited expert demonstrations and online reward-free interactions. Built upon the architecture of TD-MPC2, IQ-MPC excels in handling complex tasks such as dexterous hand manipulation and high-dimensional locomotion.

Environment Setup and Running the Code

  1. Setup the environment using the following commands:
conda env create -f conda_env/environment.yaml
conda activate iqmpc
  1. Download the expert datasets here, which includes the expert datasets for 6 locomotion tasks and 3 dexterous hand manipulation tasks. All of the expert demonstrations are sampled from a trained single-task TD-MPC2 agent.
  2. Set the task in tdmpc2/config.json and the correct expert dataset path corresponding to the task.
  3. Run the training code:
python3 tdmpc2/train.py

Acknowledgement

This repository is created based on the original TD-MPC2 implementation repository: TD-MPC2 Official Implementation.

Citation

If you find our work helpful to your research, please consider citing our paper as follows:

@inproceedings{li2025reward,
  title={Reward-free World Models for Online Imitation Learning},
  author={Shangzhe Li and Zhiao Huang and Hao Su},
  booktitle={International Conference on Machine Learning (ICML)},
  year={2025}
}

About

Official code implementation of ICML 2025 paper: Reward-free World Models for Online Imitation Learning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages