Real-time High-fidelity Gaussian Human Avatars with Position-based Interpolation of Spatially Distributed MLPs
Project page · Paper · Video
- It's recommended to use python=3.10, cuda=12.1, and torch=2.4.1 to set up the environment, so that all packages can be installed as prebuilt wheels and no compilation is needed.

```
conda create -n mmlphuman python=3.10
conda activate mmlphuman
pip install torch==2.4.1 torchvision "numpy<2.0" --index-url https://download.pytorch.org/whl/cu121
pip install iopath ninja jaxtyping rich
pip install gsplat --index-url https://docs.gsplat.studio/whl/pt24cu121
pip install --no-index --no-cache-dir pytorch3d -f https://dl.fbaipublicfiles.com/pytorch3d/packaging/wheels/py310_cu121_pyt241/download.html
pip install imageio numba omegaconf open3d opencv-python scipy smplx scikit-image tensorboardx tensorboard trimesh websockets torchmetrics websocket-client dearpygui plyfile torch_pca
```

- Download the SMPL-X model and place the neutral npz file at `./smpl_model/smplx/SMPLX_NEUTRAL.npz`.
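After installation, you can quickly confirm that the key packages resolved correctly. This is a minimal stdlib-only sketch (not part of the repository); the module names mirror the pip commands above, noting that some pip names differ from import names (e.g. `opencv-python` imports as `cv2`):

```python
import importlib.util

# Import names for the packages installed above (assumed list; extend as needed).
REQUIRED = ["torch", "torchvision", "gsplat", "pytorch3d", "cv2",
            "open3d", "smplx", "trimesh", "omegaconf", "dearpygui"]

def missing_packages(names):
    """Return the subset of module names that cannot be found."""
    return [n for n in names if importlib.util.find_spec(n) is None]

if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages found.")
```
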
- Download the AvatarReX, ActorsHQ, or THuman4.0 dataset.
- For the ActorsHQ dataset, download the SMPL-X registration from here, and place `smpl_params.npz` at the root path of the corresponding subject.
- Generate the LBS weight volume:
  - Follow this link to compile the executable `PointInterpolant`.
  - Change the executable file path in `script/gen_weight_volume.py`, then run:
    ```
    cd script
    python gen_weight_volume.py --data_dir {DATASET_DIR} --smpl_path ../smpl_model/smplx/SMPLX_NEUTRAL.npz
    ```
- (Optional) Prepare the template mesh. This is recommended for datasets with loose clothing. We use a template mesh for the following subjects:
  - AvatarReX: avatarrex_zzr, avatarrex_lbn1
  - THuman4.0: subject02
  - ActorsHQ: Actor01, Actor04
- Templates for avatarrex_zzr, avatarrex_lbn1, Actor01, and Actor04 can be downloaded from here. We include the template for subject02 in `./template`. Follow this link if you want to reconstruct a template from a dataset.
- Place the template file at `{DATASET_DIR}/gaussian/template.ply`. If no template is provided, the SMPL-X mesh will be used as the template during training.
The dataset will look like this after preparation.
AvatarReX dataset
```
├── 22010708
├── 22010710
├── calibration_full.json
├── gaussian
│   ├── lbs_weights_grid.npz
│   └── template.ply
└── smpl_params.npz
```
THuman4.0 dataset
```
├── calibration.json
├── gaussian
│   ├── lbs_weights_grid.npz
│   └── template.ply
├── images
│   ├── cam00
│   └── cam01
├── masks
│   ├── cam00
│   └── cam01
└── smpl_params.npz
```
ActorsHQ dataset
```
├── calibration.csv
├── gaussian
│   ├── lbs_weights_grid.npz
│   └── template.ply
├── masks
│   ├── Cam001
│   └── Cam002
├── rgbs
│   ├── Cam001
│   └── Cam002
└── smpl_params.npz
```
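Before launching training, it can help to sanity-check that the prepared directory matches the layouts above. A minimal stdlib sketch (not part of the repository; it checks only the files shared by all three layouts, since the calibration file name differs per dataset):

```python
from pathlib import Path

# Files common to the AvatarReX / THuman4.0 / ActorsHQ layouts above.
REQUIRED = [
    "smpl_params.npz",
    "gaussian/lbs_weights_grid.npz",
]
# gaussian/template.ply is optional: the SMPL-X mesh is used if it is absent.

def check_dataset(root):
    """Return the list of required files missing under the dataset root."""
    root = Path(root)
    return [rel for rel in REQUIRED if not (root / rel).is_file()]
```

Run `check_dataset("{DATASET_DIR}")` and fix any paths it reports before training.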
```
python train.py --config ./config/{DATASET}.yaml --data_dir {DATASET_DIR} --out_dir {MODEL_DIR}
```
Training takes about 17 hours on an RTX 3090. Resuming training from a checkpoint is not yet implemented, so be careful not to interrupt a training run.
To visualize the results during training, open the viewer, set the IP and port, and connect:
```
cd viewer
python net_viewer.py
```
To visualize a trained model:
```
python visualize.py --model_dir {MODEL_DIR} --ip {IP} --port {PORT}
cd viewer
python net_viewer.py
```
To render images using training poses and training cameras, modify the test part in the config, and run:
```
python test.py --config ./config/{DATASET}.yaml --model_dir {MODEL_DIR} --out_dir {IMAGE_OUT_DIR} --data_dir {DATASET_DIR}
```
To render images using novel poses and novel views, run:
```
python test.py --config ./config/{DATASET}.yaml --model_dir {MODEL_DIR} --out_dir {IMAGE_OUT_DIR} --cam_path {CAM_FILE} --pose_path {POSE_FILE} [--test]
```
{CAM_FILE} is a JSON file and can be exported from the viewer. {POSE_FILE} refers to an AMASS pose file or smpl_params.npz.
To test the rendering speed, run:
```
python test.py --config ./config/{DATASET}.yaml --model_dir {MODEL_DIR} --cam_path {CAM_FILE} --pose_path {POSE_FILE} --test_speed [--test]
```
Evaluation example code is provided in `script/eval.ipynb`.
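The evaluation notebook itself is not reproduced here, but the core image-quality metric it likely reports, PSNR, reduces to a short formula. A minimal pure-Python sketch over flattened pixel values (an illustration only; the notebook may use torchmetrics, which the installation step includes):

```python
import math

def psnr(pred, gt, max_val=1.0):
    """Peak signal-to-noise ratio between two flat pixel sequences in [0, max_val]."""
    mse = sum((p - g) ** 2 for p, g in zip(pred, gt)) / len(pred)
    if mse == 0.0:
        return float("inf")  # identical images
    return 10.0 * math.log10(max_val ** 2 / mse)
```

Higher is better; renders are compared against ground-truth frames pixel by pixel.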
This project uses the gsplat rasterizer. We also use some data processing and evaluation code from AnimatableGaussians and Gaussian-Splatting. We sincerely thank the authors for their wonderful work.
```
@article{zhan2025realtime,
  title={Real-time High-fidelity Gaussian Human Avatars with Position-based Interpolation of Spatially Distributed MLPs},
  author={Zhan, Youyi and Shao, Tianjia and Yang, Yin and Zhou, Kun},
  journal={arXiv preprint arXiv:2504.12909},
  year={2025}
}
```