Code release for our preprint:
Brent Yi, Vickie Ye, Maya Zheng, Yunqi Li, Lea Müller, Georgios Pavlakos, Yi Ma, Jitendra Malik, and Angjoo Kanazawa. Estimating Body and Hand Motion in an Ego-sensed World. arXiv, 2024.
- Oct 7, 2024: Initial release. (training code, core implementation details)
- Oct 14, 2024: Added model checkpoint, dataset preprocessing, inference, and visualization scripts.
- May 6, 2025: Updated scripts + instructions for dataset preprocessing, which is now self-contained in this repository.
TL;DR: We use egocentric SLAM poses and images to estimate 3D human body pose, height, and hands.
(Demo video: egoallo_coffee.mov)
This repository is structured as follows:
```
.
├── download_checkpoint_and_data.sh
│   - Download model checkpoint and sample data.
├── 0a_preprocess_training_data.py
│   - First preprocessing script for training data (raw AMASS → npz).
├── 0b_preprocess_training_data.py
│   - Second preprocessing script (npz → unified HDF5).
├── 1_train_motion_prior.py
│   - Training script for motion diffusion model.
├── 2_run_hamer_on_vrs.py
│   - Run HaMeR on inference data (expects Aria VRS).
├── 3_aria_inference.py
│   - Run full pipeline on inference data.
├── 4_visualize_outputs.py
│   - Visualize outputs from inference.
├── 5_eval_body_metrics.py
│   - Compute and print body estimation accuracy metrics.
│
├── src/egoallo/
│   ├── data/        - Dataset utilities.
│   ├── transforms/  - SO(3) / SE(3) transformation helpers.
│   └── *.py         - All core implementation.
│
└── pyproject.toml   - Python dependencies/package metadata.
```
EgoAllo requires Python 3.12 or newer.
- Clone the repository.

  ```bash
  git clone https://github.com/brentyi/egoallo.git
  ```
- Install general dependencies.

  ```bash
  cd egoallo
  pip install -e .
  ```
- Download and unzip the model checkpoint and sample data.

  ```bash
  bash download_checkpoint_and_data.sh
  ```

  You can also download the zip files manually: here are links to the checkpoint and example trajectories.
- Download the SMPL-H model file.

  You can find the "Extended SMPL+H model" (16 shape parameters) on the MANO project webpage. Our scripts assume an npz file located at `./data/smplh/neutral/model.npz`, but this can be overridden at the command line (`--smplh-npz-path {your path}`). A quick sanity check for this file is sketched after this list.
- Visualize model outputs.

  The example trajectories directory includes example outputs from our model. You can visualize them with:

  ```bash
  python 4_visualize_outputs.py --search-root-dir ./egoallo_example_trajectories
  ```
- Installing inference dependencies.

  Our guidance optimization uses a Levenberg-Marquardt optimizer that's implemented in JAX. If you want to run this on an NVIDIA GPU, you'll need to install JAX with CUDA support:

  ```bash
  # Also see: https://jax.readthedocs.io/en/latest/installation.html
  pip install "jax[cuda12]==0.6.1"
  ```

  You'll also need jaxls:

  ```bash
  pip install git+https://github.com/brentyi/jaxls.git
  ```

  A quick check that the CUDA backend is active is sketched after this list.
- Running inference on example data.

  Here's an example command for running EgoAllo on the "coffeemachine" sequence:

  ```bash
  python 3_aria_inference.py --traj-root ./egoallo_example_trajectories/coffeemachine
  ```

  You can run `python 3_aria_inference.py --help` to see the full list of options.
- Running inference on your own data.

  To run inference on your own data, you can copy the structure of the example trajectories. The key files are:

  - A VRS file from Project Aria, which contains calibrations and images.
  - SLAM outputs from Project Aria's MPS: `closed_loop_trajectory.csv` and `semidense_points.csv.gz`.
  - (optional) HaMeR outputs, which we save to a `hamer_outputs.pkl` file.
  - (optional) Project Aria wrist and palm tracking outputs.

  A sketch for sanity-checking these inputs is included after this list.
- Running HaMeR on your own data.

  To generate the `hamer_outputs.pkl` file, you'll need to install `hamer_helper`. Then, as an example for running on our coffeemachine sequence:

  ```bash
  python 2_run_hamer_on_vrs.py --traj-root ./egoallo_example_trajectories/coffeemachine
  ```
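If you're unsure whether you grabbed the right SMPL-H file, a check like the following can help. This is an illustrative sketch rather than part of the repository, and it assumes the standard extended SMPL+H npz layout (where the last dimension of `shapedirs` is the number of shape parameters):

```python
# Sanity-check the downloaded SMPL-H model file (illustrative; not part of the repo).
# Assumes the standard extended SMPL+H npz layout.
import numpy as np

smplh = np.load("./data/smplh/neutral/model.npz", allow_pickle=True)
print(sorted(smplh.keys()))
if "shapedirs" in smplh:
    print("Number of shape parameters:", smplh["shapedirs"].shape[-1])  # Expect 16.
```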
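After installing the inference dependencies, you can confirm that JAX actually sees your GPU. This snippet is just a convenience check, not something the pipeline requires:

```python
# Quick check that JAX's CUDA backend is active (illustrative; not part of the repo).
import jax

print(jax.devices())          # Should list a CUDA device, e.g. [CudaDevice(id=0)].
print(jax.default_backend())  # Should print "gpu" rather than "cpu".
```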
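For your own recordings, a small script like the one below can help verify that the expected inputs are in place before running the full pipeline. The directory name and layout here are assumptions on our part; mirror the example trajectories for the exact structure the inference script expects:

```python
# Sanity-check a trajectory directory before running 3_aria_inference.py.
# Filenames and layout here are assumptions; mirror the example trajectories.
from pathlib import Path

traj_root = Path("./my_trajectory")  # Hypothetical directory for your own recording.

print("VRS files:", [p.name for p in traj_root.rglob("*.vrs")])
for name in ["closed_loop_trajectory.csv", "semidense_points.csv.gz", "hamer_outputs.pkl"]:
    found = list(traj_root.rglob(name))
    print(name, "->", found[0] if found else "not found (hamer_outputs.pkl is optional)")

# If projectaria_tools is installed, the MPS trajectory can be loaded like this:
# from projectaria_tools.core import mps
# poses = mps.read_closed_loop_trajectory(str(next(traj_root.rglob("closed_loop_trajectory.csv"))))
# print("Number of closed-loop poses:", len(poses))
```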
To train the motion prior model, we use data from the AMASS dataset. Due to licensing constraints, we cannot redistribute the preprocessed data. Instead, we provide two sequential preprocessing scripts:
- Download the AMASS dataset.

  Download the AMASS dataset from the official website. We use the following splits:

  - Training: ACCAD, BioMotionLab_NTroje, BMLhandball, BMLmovi, CMU, DanceDB, DFaust_67, EKUT, Eyes_Japan_Dataset, KIT, MPI_Limits, TCD_handMocap, TotalCapture
  - Validation: HumanEva, MPI_HDM05, SFU, MPI_mosh
  - Testing: Transitions_mocap, SSM_synced
- Run the first preprocessing script.

  ```bash
  python 0a_preprocess_training_data.py --help
  python 0a_preprocess_training_data.py --data-root /path/to/amass --smplh-root ./data/smplh
  ```

  This script, adapted from HuMoR, processes raw AMASS data by:

  - Converting to gender-neutral SMPL-H parameters
  - Computing contact labels for feet, hands, and knees
  - Filtering out problematic sequences (treadmill walking, sequences with foot skating)
  - Downsampling to 30 fps (a rough sketch of this step is shown after this list)
- Run the second preprocessing script.

  ```bash
  python 0b_preprocess_training_data.py --help
  python 0b_preprocess_training_data.py --data-npz-dir ./data/processed_30fps_no_skating/
  ```

  This converts the processed npz files to a unified HDF5 format for more efficient training, with chunk sizes optimized for reading sequences; see the chunked-write sketch after this list.
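As a rough illustration of the downsampling step in the first script, here is a generic nearest-frame resampler. It is not necessarily the exact logic the script uses:

```python
# Generic nearest-frame resampling to 30 fps (illustrative; the actual script may differ).
import numpy as np

def resample_to_30fps(poses: np.ndarray, src_fps: float, target_fps: float = 30.0) -> np.ndarray:
    """poses: (T, D) array sampled at src_fps; returns the frames nearest to the 30 fps timestamps."""
    num_src = poses.shape[0]
    duration = num_src / src_fps
    target_times = np.arange(0.0, duration, 1.0 / target_fps)
    src_indices = np.clip(np.round(target_times * src_fps).astype(int), 0, num_src - 1)
    return poses[src_indices]

# Example: a 120 fps AMASS sequence of 480 frames becomes 120 frames at 30 fps.
print(resample_to_30fps(np.zeros((480, 156)), src_fps=120.0).shape)  # (120, 156)
```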
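And as a minimal sketch of how per-sequence arrays can be packed into a chunked HDF5 file: the dataset keys, feature dimensions, and chunk sizes below are placeholders, not what `0b_preprocess_training_data.py` actually writes.

```python
# Minimal chunked HDF5 packing sketch (illustrative; keys/chunk sizes in the real script differ).
import h5py
import numpy as np

sequences = {
    "CMU/01/01_01": np.random.randn(900, 156).astype(np.float32),   # ~30 s at 30 fps, hypothetical feature dim
    "KIT/3/walk_01": np.random.randn(450, 156).astype(np.float32),
}

with h5py.File("training_data.hdf5", "w") as f:
    for name, arr in sequences.items():
        # Chunking along time lets short training windows be read
        # without pulling the whole sequence off disk.
        f.create_dataset(name, data=arr, chunks=(min(32, arr.shape[0]), arr.shape[1]))
```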
This repository currently contains:
- The `egoallo` package, which contains reference training and sampling implementation details.
- Training script.
- Model checkpoints.
- Dataset preprocessing scripts.
- Inference script.
- Visualization script.
- Setup instructions.
While we've put effort into cleaning up our code for release, this is research code and there's room for improvement. If you have questions or comments, please reach out!