Project Page | arXiv | Data
Zihan Wang, Jeff Tan, Tarasha Khurana*, Neehar Peri*, Deva Ramanan
Carnegie Mellon University
* Equal Contribution
```bash
git clone --recursive https://github.com/MonoFusion/MonoFusion.git
cd MonoFusion
conda create -n monofusion python=3.10
conda activate monofusion
pip install -r requirements.txt
pip install git+https://github.com/nerfstudio-project/gsplat.git
# extra deps for preprocessing
cd preproc && ./setup_dependencies.sh && cd -
```
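Before moving on, it can help to confirm the environment imports cleanly; a minimal check, assuming `torch` is pulled in by `requirements.txt`:

```bash
# Sanity check: core dependencies import and CUDA is visible
# (torch is assumed to come from requirements.txt).
python -c "import torch, gsplat; print(torch.__version__, torch.cuda.is_available())"
```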
1. Prepare raw data via ExoRecon

`cd preproc/ExoRecon` and follow `README.md` there:

```bash
conda env create -f egorecon.yml
conda activate egorecon
python -m pip install -e projectaria_tools_pkg
./push_all_data.sh  # downloads + restructures Ego-Exo4D takes
```

- Each take should end up as `MonoFusion/raw_data/<SEQ_NAME>/` containing `aria01.vrs`, `frame_aligned_videos/`, `trajectory/Dy_train_meta.json`, and `timestep.txt`.
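As a quick sanity check of that layout, a minimal sketch (run from the repo root; the loop just echoes anything missing):

```bash
# Verify the expected files exist for one take before preprocessing.
SEQ_NAME=<SEQ_NAME>  # replace with your take name
for f in aria01.vrs frame_aligned_videos trajectory/Dy_train_meta.json timestep.txt; do
  [ -e "raw_data/${SEQ_NAME}/${f}" ] || echo "missing: raw_data/${SEQ_NAME}/${f}"
done
```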
2. Preprocess the sequence

```bash
cd preproc
python process_custom.py \
    --img-dirs ../raw_data/<SEQ_NAME>/images \
    --gpus 0 1
```

- Generates depth, masks, TAPIR tracks, and DUSt3R alignment into `../data/<SEQ_NAME>/`.
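Before training, a quick look at the outputs can catch a failed preprocessing run early (plain shell, nothing repo-specific):

```bash
# List whatever preprocessing produced for this sequence;
# an empty folder usually means one of the stages failed.
ls -R ../data/<SEQ_NAME>/ | head -50
```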
3. Optimize

```bash
# edit opt.sh so SEQ_NAME matches the <SEQ_NAME> used during preprocessing
bash opt.sh <experiment_prefix>
```

- The script appends a timestamp, calls `dance_glb.py`, logs to `./results_<SEQ_NAME>/<experiment_prefix>_<timestamp>/`, and saves checkpoints under `checkpoints/` inside that folder.
- Advanced runs can invoke `python dance_glb.py --seq_name <SEQ_NAME> --exp <NAME> [Tyro args]` directly, as in the sketch below.
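A minimal sketch of the launcher's flow, per the description above; the variable names and exact invocation are assumptions, so treat `opt.sh` in the repo as authoritative:

```bash
#!/usr/bin/env bash
# Hypothetical sketch of opt.sh's behavior (not the actual script).
SEQ_NAME=<SEQ_NAME>               # must match the name used during preprocessing
EXP_PREFIX=$1                     # experiment prefix passed on the command line
TIMESTAMP=$(date +%Y%m%d_%H%M%S)  # appended so each run gets a fresh results folder
RUN_NAME="${EXP_PREFIX}_${TIMESTAMP}"
mkdir -p "./results_${SEQ_NAME}/${RUN_NAME}"
python dance_glb.py --seq_name "$SEQ_NAME" --exp "$RUN_NAME"
```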
4. Visualize

```bash
bash vis.sh ./results_<SEQ_NAME>/<RUN_NAME> 7007
```

- The first argument (`WORK_DIR`) is the exact results path produced in step 3.
- The second is any open TCP port; the script launches `run_rendering.py` for inspection.
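If in doubt about the port, a quick availability check before launching (generic Linux shell tooling, not part of the repo):

```bash
# Launch the viewer only if the chosen TCP port is free.
PORT=7007
if ss -tln | grep -q ":${PORT}\b"; then
  echo "port ${PORT} is in use; pick another"
else
  bash vis.sh ./results_<SEQ_NAME>/<RUN_NAME> "$PORT"
fi
```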
If you find our data, code, or project useful, please consider citing our work:
```bibtex
@InProceedings{Wang_2025_ICCV,
    author    = {Wang, Zihan and Tan, Jeff and Khurana, Tarasha and Peri, Neehar and Ramanan, Deva},
    title     = {MonoFusion: Sparse-View 4D Reconstruction via Monocular Fusion},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2025},
    pages     = {8252-8263}
}
```
Our code builds on Shape-of-Motion; thanks for the wonderful codebase!