LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning

Rui Li¹ · Biao Zhang¹ · Zhenyu Li¹ · Federico Tombari²,³ · Peter Wonka¹

¹KAUST · ²Google · ³Technical University of Munich

arXiv 2025

Paper PDF · Project Page · Hugging Face

LaRI models unseen 3D geometry with layered point maps in a single feed-forward pass, enabling complete, efficient, and view-aligned geometric reasoning from a single image.

teaser

📋 TODO List

  • Inference code & Gradio demo
  • Evaluation data & code
  • Training data & code
  • Release the GT generation code (estimated time: within July 2025)

🛠️ Environment Setup

  1. Create the conda environment and install the required libraries:
conda create -n lari python=3.10 -y
conda activate lari
pip install -r requirements.txt
  2. Install PyTorch3D following these instructions.
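
To quickly verify the setup, the following one-liner should print the installed versions without errors (a minimal sanity check; a CUDA-capable GPU is recommended, but the import check itself does not require one):

# both imports should succeed if the environment is set up correctly
python -c "import torch, pytorch3d; print('torch', torch.__version__, '| CUDA:', torch.cuda.is_available()); print('pytorch3d', pytorch3d.__version__)"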

🚀 Quick Start

We currently provide the object-level model on the Hugging Face Model Hub. Try the provided examples or use your own images via either of the methods below:

Gradio Demo

Launch the Gradio interface locally:

python app.py

Or try it online via HuggingFace Demo.

Command Line

Run object-level modeling with:

python demo.py --image_path assets/cole_hardware.png

The input image path is specified via --image_path. Set --is_remove_background to remove the background. Layered depth maps and the 3D model will be saved in the ./results directory by default.
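
For example, to run the model on your own photo and remove its background in one call (a usage sketch that assumes --is_remove_background is a plain on/off switch, as described above; if your version expects an explicit value, pass one accordingly):

python demo.py --image_path /path/to/your_image.png --is_remove_background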

📊 Evaluation

Pre-trained Weights and Evaluation Data

| Scene Type   | Pre-trained Weights | Evaluation Data               |
|--------------|---------------------|-------------------------------|
| Object-level | checkpoint          | Google Scanned Objects (data) |
| Scene-level  | checkpoint          | SCRREAM (data)                |

Download the pre-trained weights and unzip the evaluation data.

Object-level Evaluation

./scripts/eval_object.sh

Scene-level Evaluation

./scripts/eval_scene.sh

NOTE: For both object- and scene-level evaluation, set data_path and test_list_path to your own absolute paths, set --pretrained to your model checkpoint path, and set --output_dir to the directory where the evaluation results should be stored.
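
As a rough sketch, the values to adapt before launching look like the following (the variable names and paths below are illustrative placeholders, not the actual script contents; open the script to find the real entries):

# illustrative placeholders -- edit the corresponding entries inside scripts/eval_object.sh or scripts/eval_scene.sh
# data_path=/abs/path/to/evaluation_data        # unzipped evaluation data
# test_list_path=/abs/path/to/test_list.txt     # list of test samples
# --pretrained /abs/path/to/checkpoint.pth      # downloaded pre-trained weights
# --output_dir ./eval_results                   # where evaluation results are written
./scripts/eval_object.sh    # or ./scripts/eval_scene.sh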

💻 Training

💾 Dataset setup

1. Objaverse (object-level)

Download the processed Objaverse dataset and extract all archives (objaverse_chunk_<ID>.tar.gz) into the target folder, for example:

mkdir ./datasets/objaverse_16k
tar -zxvf  ./objaverse_chunk_<ID>.tar.gz -C ./datasets/objaverse_16k
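
If you have downloaded several chunks, a small loop extracts them all in one pass (a sketch, assuming the archives sit in the current directory):

mkdir -p ./datasets/objaverse_16k
for f in ./objaverse_chunk_*.tar.gz; do
    tar -zxvf "$f" -C ./datasets/objaverse_16k
done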

2. 3D-FRONT (scene-level)

Download the processed 3D-FRONT dataset and extract all archives (front3d_chunk_<ID>.tar.gz) into the target folder, for example:

mkdir ./datasets/3dfront
tar -zxvf  ./front3d_chunk_<ID>.tar.gz -C ./datasets/3dfront
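
The same pattern works for the 3D-FRONT chunks (again assuming the archives are in the current directory):

mkdir -p ./datasets/3dfront
for f in ./front3d_chunk_*.tar.gz; do
    tar -zxvf "$f" -C ./datasets/3dfront
done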

3. ScanNet++ (scene-level)

  • Download the ScanNet++ dataset, as well as the ScanNet++ toolbox.
  • Copy the .yml configuration files to the ScanNet++ toolbox folder, for example:
cd /path/to/lari
cp -r ./scripts/scannetpp_proc/*.yml /path/to/scannetpp/scannetpp/dslr/configs
  • Run the following commands in the ScanNet++ toolbox folder to downscale and undistort the data (a consolidated sketch of all preprocessing steps follows this list).
cd /path/to/scannetpp
# downscale the images
python -m dslr.downscale dslr/configs/downscale_lari.yml
# undistort the images
python -m dslr.undistort dslr/configs/undistort_lari.yml
  • Download the ScanNet++ annotation from here and extract it to the data subfolder of your ScanNet++ path, for example:
tar -zxvf  ./scannetpp_48k_annotation.tar.gz -C ./datasets/scannetpp_v2/data
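
Putting the steps above together, a consolidated sketch of the ScanNet++ preprocessing looks as follows (/path/to/lari, /path/to/scannetpp, and the download location of the annotation archive are placeholders for your own checkouts and paths):

# 1) copy the provided configs into the ScanNet++ toolbox (run from the LaRI repo)
cd /path/to/lari
cp -r ./scripts/scannetpp_proc/*.yml /path/to/scannetpp/scannetpp/dslr/configs
# 2) downscale and undistort the DSLR images (run from the ScanNet++ toolbox)
cd /path/to/scannetpp
python -m dslr.downscale dslr/configs/downscale_lari.yml
python -m dslr.undistort dslr/configs/undistort_lari.yml
# 3) unpack the annotation into the data subfolder of the ScanNet++ dataset path
cd /path/to/lari
tar -zxvf /path/to/downloads/scannetpp_48k_annotation.tar.gz -C ./datasets/scannetpp_v2/data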

🔥 Train the model

Download the MoGe pre-trained weights. For training with object-level data (Objaverse), run

./scripts/train_object.sh

For training with scene-level data (3D-FRONT and ScanNet++), run

./scripts/train_scene.sh

For both training settings, set data_path, train_list_path, and test_list_path of each dataset to your own absolute paths, set pretrained_path to the path of the downloaded MoGe weights, and set --output_dir and --wandb_dir to specify where to store the training outputs and logs.
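
As with evaluation, a rough sketch of what to adjust before launching (names and paths are illustrative placeholders; check the actual scripts for the real variable names):

# illustrative placeholders -- edit the corresponding entries inside scripts/train_object.sh or scripts/train_scene.sh
# data_path=/abs/path/to/datasets/objaverse_16k      # dataset root (one per dataset)
# train_list_path=/abs/path/to/train_list.txt        # training split
# test_list_path=/abs/path/to/test_list.txt          # validation split
# pretrained_path=/abs/path/to/moge_weights.pt       # downloaded MoGe weights
# --output_dir ./train_results  --wandb_dir ./wandb  # outputs and logging
./scripts/train_object.sh    # or ./scripts/train_scene.sh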

✨ Acknowledgement

This project is largely based on DUSt3R, with some model weights and functions from MoGe, Zero-1-to-3, and Marigold. Many thanks to these awesome projects for their contributions.

📰 Citation

Please cite our paper if you find it helpful:

@article{li2025lari,
      title={LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning},
      author={Li, Rui and Zhang, Biao and Li, Zhenyu and Tombari, Federico and Wonka, Peter},
      journal={arXiv preprint arXiv:2504.18424},
      year={2025}
}
