Dunqiang Liu*,
Shujun Huang*,
Wen Li,
Siqi Shen,
Cheng Wang†
Xiamen University, ASC Lab
* Equal Contribution
† Corresponding Author
In the cross-modal place recognition stage, we introduce a multi-level
negative contrastive learning framework that minimizes the similarity between different locations at the global, instance, and
relation levels, fully leveraging the descriptive power of language for spatial localization. In the fine localization
stage, the language query and the retrieved cell are used to regress the final position.
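The contrastive objective at each level can be illustrated with a standard InfoNCE-style loss: a text query is pulled toward its matching cell and pushed away from negatives under a temperature-scaled softmax. The sketch below is a simplified pure-Python illustration of this idea (function name and toy similarity values are ours, not the repository's API):

```python
import math

def info_nce(sim_row, pos_idx, temperature=0.05):
    """InfoNCE loss for one query: -log softmax(sim / T)[pos_idx]."""
    logits = [s / temperature for s in sim_row]
    m = max(logits)  # subtract the max for numerical stability
    log_sum = math.log(sum(math.exp(l - m) for l in logits))
    return -((logits[pos_idx] - m) - log_sum)

# Toy similarities of one query against 4 candidate cells; index 0 is the match.
row = [0.9, 0.2, 0.1, -0.3]
loss_match = info_nce(row, 0)       # low loss: positive has the highest similarity
loss_mismatch = info_nce(row, 3)    # high loss: "positive" has the lowest similarity
```

A lower temperature (the commands below use 0.05) sharpens the softmax, so hard negatives dominate the gradient.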
Create the environment using the following commands.
git clone https://github.com/dqliua/MNCL.git
conda create -n mncl python=3.10
conda activate mncl
# Install the corresponding versions of torch and torchvision
conda install pytorch==1.11.0 torchvision==0.12.0 torchaudio==0.11.0 cudatoolkit=11.3 -c pytorch
pip install -r requirements.txt
We use the publicly available dataset KITTI360Pose. You can download the KITTI360Pose dataset from here.
For dataset details, kindly refer to Text2Pos.
The dataset folder should be organized as follows:
data
└── KITTI360Pose
└── k360_30-10_scG_pd10_pc4_spY_all
├── cells
├── direction
├── poses
├── street_centers
└── visloc
The table below lists the pretrained weights used in our method: the default text encoder and the 3D point cloud backbone. You can download them directly from the links provided.
| Component       | Model    | Download Link |
|-----------------|----------|---------------|
| Text Backbone   | Flan-T5  | Hugging Face  |
| Object Backbone | PointNet | Google Drive  |
After completing the above steps, the basic directory structure should look like this:
MNCL
├── checkpoints
│   ├── coarse.pth
│   ├── fine.pth
│   └── pointnet_acc0.86_lr1_p256_model.pth
├── data
│   └── KITTI360Pose
│       └── k360_30-10_scG_pd10_pc4_spY_all
│           ├── cells
│           ├── direction
│           ├── poses
│           ├── street_centers
│           └── visloc
├── dataloading
│   └── .....
├── datapreparation
│   └── .....
├── evaluation
│   └── .....
├── models
│   └── .....
├── t5-large
│   └── .....
└── training
    └── .....
After configuring the dependencies and preparing the dataset, use the following commands to train the coarse retrieval and fine localization stages, respectively.
Coarse Retrieval
python -m training.coarse \
--batch_size 64 \
--coarse_embed_dim 256 \
--shuffle \
--base_path ./data/KITTI360Pose/k360_30-10_scG_pd10_pc4_spY_all/ \
--use_features "class" "color" "position" "num" \
--no_pc_augment \
--fixed_embedding \
--epochs 32 \
--learning_rate 0.0001 \
--lr_scheduler step \
--lr_step 5 \
--lr_gamma 0.5 \
--temperature 0.05 \
--ranking_loss CCL \
--num_of_hidden_layer 3 \
--alpha 2 \
--hungging_model t5-large \
--folder_name PATH_TO_COARSE
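With `--lr_scheduler step`, `--lr_step 5`, and `--lr_gamma 0.5`, the learning rate is halved every 5 epochs. A minimal sketch of the resulting schedule (our own illustration, matching the flags above):

```python
def step_lr(epoch, base_lr=1e-4, step=5, gamma=0.5):
    """Effective learning rate at a given epoch under a step decay schedule."""
    return base_lr * gamma ** (epoch // step)

# Epochs 0-4 train at 1e-4, epochs 5-9 at 5e-5, epochs 10-14 at 2.5e-5, and so on.
```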
Fine Localization
python -m training.fine \
--batch_size 32 \
--fine_embed_dim 128 \
--shuffle \
--base_path ./data/KITTI360Pose/k360_30-10_scG_pd10_pc4_spY_all/ \
--use_features "class" "color" "position" "num" \
--no_pc_augment \
--fixed_embedding \
--epochs 32 \
--learning_rate 0.0003 \
--hungging_model t5-large \
--regressor_cell all \
--pmc_prob 0.5 \
--folder_name PATH_TO_FINE
Evaluate coarse retrieval only on the val set
python -m evaluation.coarse \
--base_path ./data/KITTI360Pose/k360_30-10_scG_pd10_pc4_spY_all/ \
--use_features "class" "color" "position" "num" \
--no_pc_augment \
--hungging_model t5-large \
--fixed_embedding \
--path_coarse ./checkpoints/{PATH_TO_COARSE}/{COARSE_MODEL_NAME}
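Coarse retrieval is conventionally scored by top-k recall: whether the ground-truth cell appears among the k best-matching cells for a query. A hedged sketch of the metric (our own illustration; the repository's evaluation script may compute it differently):

```python
def recall_at_k(ranked_cell_ids, gt_cell_id, ks=(1, 3, 5)):
    """Top-k recall for one query: is the ground-truth cell in the first k results?"""
    return {k: gt_cell_id in ranked_cell_ids[:k] for k in ks}

# Retrieval ranked "c2" second, so it is a hit at k=3 and k=5 but not k=1.
hits = recall_at_k(["c7", "c2", "c9", "c1"], gt_cell_id="c2")
# → {1: False, 3: True, 5: True}
```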
Evaluate the whole pipeline on the val set
python -m evaluation.pipeline --base_path ./data/KITTI360Pose/k360_30-10_scG_pd10_pc4_spY_all/ \
--use_features "class" "color" "position" "num" \
--no_pc_augment \
--no_pc_augment_fine \
--hungging_model t5-large \
--fixed_embedding \
--path_coarse ./checkpoints/{PATH_TO_COARSE}/{COARSE_MODEL_NAME} \
--path_fine ./checkpoints/{PATH_TO_FINE}/{FINE_MODEL_NAME}
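The full pipeline is typically judged by localization recall under distance thresholds: the fraction of queries whose regressed position falls within ε meters of the ground truth. A simplified single-query check (our own sketch; the threshold values are illustrative, not the repository's defaults):

```python
import math

def within_thresholds(pred_xy, gt_xy, thresholds=(5.0, 10.0, 15.0)):
    """For one query, report whether the predicted position is within each ε (meters)."""
    err = math.dist(pred_xy, gt_xy)
    return {eps: err <= eps for eps in thresholds}

# Prediction (3, 4) vs. ground truth (0, 0): error is exactly 5.0 m.
res = within_thresholds((3.0, 4.0), (0.0, 0.0))
# → {5.0: True, 10.0: True, 15.0: True}
```

Averaging these per-query indicators over the whole val or test split gives the recall curve at each ε.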
Test coarse retrieval only on the test set
python -m evaluation.coarse \
--base_path ./data/KITTI360Pose/k360_30-10_scG_pd10_pc4_spY_all/ \
--use_features "class" "color" "position" "num" \
--use_test_set \
--no_pc_augment \
--hungging_model t5-large \
--fixed_embedding \
--path_coarse ./checkpoints/{PATH_TO_COARSE}/{COARSE_MODEL_NAME}
Test the whole pipeline on the test set
python -m evaluation.pipeline --base_path ./data/KITTI360Pose/k360_30-10_scG_pd10_pc4_spY_all/ \
--use_features "class" "color" "position" "num" \
--use_test_set \
--no_pc_augment \
--no_pc_augment_fine \
--hungging_model t5-large \
--fixed_embedding \
--path_coarse ./checkpoints/{PATH_TO_COARSE}/{COARSE_MODEL_NAME} \
--path_fine ./checkpoints/{PATH_TO_FINE}/{FINE_MODEL_NAME}
If you find this work helpful, please consider citing our paper:
@inproceedings{liu2025text,
  title={Text to point cloud localization with multi-level negative contrastive learning},
  author={Liu, Dunqiang and Huang, Shujun and Li, Wen and Shen, Siqi and Wang, Cheng},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={39},
  number={5},
  pages={5397--5405},
  year={2025}
}