Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration

CVPR 2025 Highlight


This repository is the official implementation of the CVPR 2025 highlight paper, Dr. Splat.

Download

git clone git@github.com:kaist-ami/Dr-Splat.git --recursive

Setup

conda create -n drsplat python=3.9
conda activate drsplat

pip install torch==2.2.1 torchvision==0.17.1 torchaudio==2.2.1 --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt

pip install submodules/langsplat-rasterization
pip install submodules/segment-anything-langsplat
pip install submodules/simple-knn


pip install ninja git+https://github.com/NVlabs/tiny-cuda-nn/#subdirectory=bindings/torch
pip install kmeans_pytorch

pip install faiss-cpu

Download Checkpoint

  • Download the SAM checkpoint from here and move it into the ckpts directory (a quick load check follows below).
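
A minimal sketch to verify the checkpoint loads. It assumes the ViT-H variant and the upstream segment-anything API; the filename below is an assumption, so use whichever checkpoint you actually downloaded.

from segment_anything import sam_model_registry

# Load the SAM checkpoint; the filename is an illustrative assumption.
sam = sam_model_registry["vit_h"](checkpoint="ckpts/sam_vit_h_4b8939.pth")
print(sum(p.numel() for p in sam.parameters()), "parameters loaded")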

Preliminaries

  • Prepare the camera poses of the scenes (e.g., from COLMAP) and a trained 3DGS model.

Feature (SAM Mask + CLIP embedding) Extraction

  • To construct a feature-embedded 3DGS with Dr. Splat, you first need to prepare one CLIP embedding per SAM mask; a conceptual sketch of this step follows the commands below.
mkdir "${COLMAP_PATH}/language_features"
CUDA_VISIBLE_DEVICES=${GPU_ID} python preprocessing.py \
                        --dataset_path ${CAMERA_PATH}
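
Conceptually, preprocessing pairs each SAM mask with a CLIP embedding of the masked region. The sketch below illustrates the idea only; it is not the repository's preprocessing.py, and the image path, OpenCLIP backbone tag, and tight-crop strategy are all illustrative assumptions.

import numpy as np
import torch
import open_clip
from PIL import Image
from segment_anything import sam_model_registry, SamAutomaticMaskGenerator

device = "cuda"
sam = sam_model_registry["vit_h"](checkpoint="ckpts/sam_vit_h_4b8939.pth").to(device)
mask_generator = SamAutomaticMaskGenerator(sam)

# Illustrative OpenCLIP backbone; the repo's exact model choice may differ.
model, _, preprocess = open_clip.create_model_and_transforms(
    "ViT-B-16", pretrained="laion2b_s34b_b88k")
model = model.to(device).eval()

image = np.array(Image.open("images/00000.jpg").convert("RGB"))  # hypothetical path
masks = mask_generator.generate(image)  # dicts with a boolean "segmentation" map

embeddings = []
for m in masks:
    ys, xs = np.nonzero(m["segmentation"])
    crop = image[ys.min():ys.max() + 1, xs.min():xs.max() + 1]  # tight crop of the mask
    inp = preprocess(Image.fromarray(crop)).unsqueeze(0).to(device)
    with torch.no_grad():
        feat = model.encode_image(inp)
    embeddings.append(torch.nn.functional.normalize(feat, dim=-1).squeeze(0).cpu())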

Training

  • Train Dr. Splat with direct CLIP embedding registration to the 3DGS (a sketch of the product-quantization side follows the command below).
CUDA_VISIBLE_DEVICES=${GPU_ID} python train.py \
                        -s ${CAMERA_PATH} \
                        -m ${OUTPUT_PATH} \
                        --start_checkpoint ${TRAINED_3DGS_PATH}/chkpnt30000.pth \
                        --feature_level 1 \
                        --name_extra pq_openclip \
                        --use_pq \
                        --pq_index ckpts/pq_index.faiss \
                        --port 55560 \
                        --topk 45 \
                        --eval  # enable to split the dataset into training and validation sets; omit otherwise
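
The --use_pq and --pq_index flags point to a faiss product-quantization (PQ) codebook: instead of storing a full language embedding per Gaussian, each Gaussian keeps only a compact PQ code. A prebuilt index is expected at ckpts/pq_index.faiss; the sketch below shows how such an index could be built and used, under illustrative assumptions (d, M, nbits, and the clip_embeddings.npy dump are all hypothetical, not the settings of the released index).

import faiss
import numpy as np

d, M, nbits = 512, 32, 8  # embedding dim, sub-vectors, bits per sub-code (illustrative)
embeddings = np.load("clip_embeddings.npy").astype("float32")  # (N, d), hypothetical dump

index = faiss.IndexPQ(d, M, nbits)
index.train(embeddings)
faiss.write_index(index, "ckpts/pq_index.faiss")

# Registration then amounts to storing, per Gaussian, the PQ code of its
# aggregated embedding and decoding it back at query time.
codes = index.sa_encode(embeddings)  # (N, 32) uint8 codes with these settings
approx = index.sa_decode(codes)      # approximate reconstruction of the embeddings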

Feature PCA Visualization

CUDA_VISIBLE_DEVICES=${GPU_ID} python render_pca.py \
                        -s ${CAMERA_PATH} \
                        -m ${TRAINED_DRSPLAT_PATH} \
                        --pq_index ckpts/pq_index.faiss \
                        --feature_level 1 \
                        -l language_features_dim3
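
PCA visualization projects the high-dimensional per-Gaussian (or per-pixel) features down to three channels that can be displayed as RGB. A minimal sketch of that projection, not render_pca.py's exact recipe:

import torch

def pca_to_rgb(features: torch.Tensor) -> torch.Tensor:
    # features: (N, d) embeddings; returns (N, 3) values in [0, 1].
    centered = features - features.mean(dim=0, keepdim=True)
    _, _, v = torch.pca_lowrank(centered, q=3)  # top-3 principal directions
    proj = centered @ v                         # (N, 3) projection
    lo = proj.min(dim=0).values
    hi = proj.max(dim=0).values
    return (proj - lo) / (hi - lo + 1e-8)       # rescale to [0, 1] for display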

Activation Visualization

# --img_label: the text query
# --img_save_label: name of the directory to save results in
# --threshold: 0.0 renders the full activation map; values greater than 0.0
#              give an alpha-blended result with the 3D scene
CUDA_VISIBLE_DEVICES=${GPU_ID} python render_activation.py \
                        -s ${CAMERA_PATH} \
                        -m ${TRAINED_DRSPLAT_PATH} \
                        --semantic_model sam \
                        --feature_level 1 \
                        --pq_index ckpts/pq_index.faiss \
                        --img_label sofa \
                        --img_save_label sofa_test \
                        --threshold 0.5 \
                        -l language_features_dim3
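
Under the hood, activation is a similarity between the query's CLIP text embedding and each Gaussian's registered feature. A hedged sketch of that scoring (the backbone tag and the random placeholder features are assumptions; render_activation.py's actual relevance computation may differ):

import torch
import open_clip

model, _, _ = open_clip.create_model_and_transforms(
    "ViT-B-16", pretrained="laion2b_s34b_b88k")  # illustrative backbone
tokenizer = open_clip.get_tokenizer("ViT-B-16")

with torch.no_grad():
    text_feat = model.encode_text(tokenizer(["sofa"]))
text_feat = torch.nn.functional.normalize(text_feat, dim=-1)  # (1, d)

# Placeholder for per-Gaussian features decoded from their PQ codes.
gaussian_feats = torch.nn.functional.normalize(
    torch.randn(100_000, text_feat.shape[-1]), dim=-1)

activation = gaussian_feats @ text_feat.T  # (N, 1) cosine scores
mask = activation.squeeze(1) > 0.5         # cf. the --threshold flag above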

Evaluation

  • TBA

Citation

@inproceedings{drsplat25,
    title={Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration},
    author={Kim, Jun-Seong and Kim, GeonU and Kim, Yu-Ji and Wang, Yu-Chiang Frank and Choe, Jaesung and Oh, Tae-Hyun},
    booktitle={CVPR},
    year={2025}
}
