Reward-free World Models for Online Imitation Learning

Official code implementation of ICML 2025 paper: Reward-free World Models for Online Imitation Learning [Paper Link]

Shangzhe Li, Zhiao Huang, Hao Su

Introduction

IQ-MPC is a world model designed for online imitation learning. It leverages the inverse soft-Q objective to train the critic, enabling effective policy learning from limited expert demonstrations and online reward-free interactions. Built upon the architecture of TD-MPC2, IQ-MPC excels in handling complex tasks such as dexterous hand manipulation and high-dimensional locomotion.

Environment Setup and Running the Code

Setup the environment using the following commands:

conda env create -f conda_env/environment.yaml
conda activate iqmpc

Download the expert datasets here, which includes the expert datasets for 6 locomotion tasks and 3 dexterous hand manipulation tasks. All of the expert demonstrations are sampled from a trained single-task TD-MPC2 agent.
Set the task in tdmpc2/config.json and the correct expert dataset path corresponding to the task.
Run the training code:

python3 tdmpc2/train.py

Acknowledgement

This repository is created based on the original TD-MPC2 implementation repository: TD-MPC2 Official Implementation.

Citation

If you find our work helpful to your research, please consider citing our paper as follows:

@inproceedings{li2025reward,
  title={Reward-free World Models for Online Imitation Learning},
  author={Shangzhe Li and Zhiao Huang and Hao Su},
  booktitle={International Conference on Machine Learning (ICML)},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
conda_env		conda_env
images		images
tdmpc2		tdmpc2
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Reward-free World Models for Online Imitation Learning

Introduction

Environment Setup and Running the Code

Acknowledgement

Citation

About

Uh oh!

Releases

Packages

Languages

TobyLeelsz/iqmpc

Folders and files

Latest commit

History

Repository files navigation

Reward-free World Models for Online Imitation Learning

Introduction

Environment Setup and Running the Code

Acknowledgement

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages