M3TR: A Generalist Model for Real-World HD Map Completion

Accepted for publication at IEEE Robotics and Automation Letters (RA-L)

Fabian Immel^{1 📧} , Richard Fehler¹ , Frank Bieder¹ , Jan-Hendrik Pauls² , Christoph Stiller²

¹ FZI Research Center for Information Technology ² Institute for Measurement and Control Systems, Karlsruhe Institute of Technology

(^📧) corresponding author

Project Page 🌐

ArXiv Preprint

Official implementation of M3TR: A Generalist Model for Real-World HD Map Completion

Introduction

Autonomous vehicles rely on HD maps for their operation, but offline HD maps eventually become outdated. For this reason, online HD map construction methods use live sensor data to infer map information instead. Research on real map changes shows that oftentimes entire parts of an HD map remain unchanged and can be used as a prior. We therefore introduce M3TR (Multi-Masking Map Transformer), a generalist approach for HD map completion both with and without offline HD map priors. As a necessary foundation, we address shortcomings in ground truth labels for Argoverse 2 and nuScenes and propose the first comprehensive benchmark for HD map completion. Unlike existing models that specialize in a single kind of map change, which is unrealistic for deployment, our Generalist model handles all kinds of changes, matching the effectiveness of Expert models. With our map masking as augmentation regime, we can even achieve a +1.4 mAP improvement without a prior. Finally, by fully utilizing prior HD map elements and optimizing query designs, M3TR outperforms existing methods by +4.3 mAP while being the first real-world deployable model for offline HD map priors.

Results on Argoverse 2 Geo Split (see paper for full evaluation)

Qualitative Results on Argoverse 2 Geo Split

Model Checkpoints

You can find the trained checkpoints of the Generalist model on Argoverse 2 and nuScenes here

Basic Usage

The environment can be found in the Dockerfile in the folder, simply build the corresponding docker image, which will contain dependencies and a copy of this codebase.

The code follows the structure of the MapTRv2 codebase, so the basic workflow is similar. After downloading the Argoverse 2 or nuScenes dataset, you need to create the labels for training:

python tools/m3tr/custom_av2_map_converter.py --data-root /datasets/public/argoverse20/sensor --out-root ./gen_labels/argoverse2_no_map_prior --masked-elements boundary centerline divider_dashed divider_solid ped_crossing

(nuScenes command similar)

The main difference in usage here compared to MapTR is the --masked-elements flag with which you specify the elements that are not included in the map prior. The above command would therefore generate labels without any prior, equivalent to the ones for the pure online HD map construction task.

--masked-elements accepts either a list of element types as above or special flags. Those are:

ego_lane: masks out all labels associated with the ego lane
ego_road: masks out all labels associated with the ego road
random: randomly selects a masking type for a sample
random_whole_dataset: duplicates each sample for each available masking type (e.g. 8 masking types = 8x the dataset annotations stacked, once for each masking type)

The model configs for different Expert models are the same, you control what type Expert is trained by the generated labels. Thus you need to change the directory in the ann_root field in the config depending on your generated labels and their output path.

When you pass random_whole_dataset as a flag for --masked-elements, that generates the labels for the Generalist model.

Note: The labels generated in this way follow the default split from MapTRv2. To mirror the evaluation in the paper and use the geographic split, follow the instructions in geographical-splits for MapTRv2.

To train a model, use the dist_train.sh script:

./tools/dist_train.sh ./projects/configs/m3tr/m3tr_av2_3d_r50_54ep_generalist.py 4

This command would train the generalist model for Argoverse 2 on 4 GPUs.

Acknowledgements

We're grateful for the open-source codebase of MapTRv2, which formed the basis for our project:

MapTRv2

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
assets		assets
docker_res		docker_res
figs		figs
ll2_wheels		ll2_wheels
mmdetection3d		mmdetection3d
projects		projects
tools		tools
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
requirement.txt		requirement.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

M3TR: A Generalist Model for Real-World HD Map Completion

Project Page 🌐

ArXiv Preprint

Introduction

Results on Argoverse 2 Geo Split (see paper for full evaluation)

Qualitative Results on Argoverse 2 Geo Split

Model Checkpoints

Basic Usage

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Uh oh!

License

Uh oh!

immel-f/m3tr

Folders and files

Latest commit

History

Repository files navigation

M3TR: A Generalist Model for Real-World HD Map Completion

Project Page 🌐

ArXiv Preprint

Introduction

Results on Argoverse 2 Geo Split (see paper for full evaluation)

Qualitative Results on Argoverse 2 Geo Split

Model Checkpoints

Basic Usage

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages