- Super fast and accurate 3D object detection based on LiDAR
- Fast training, fast inference
- An Anchor-free approach
- No Non-Max-Suppression
- Support distributed data parallel training
- Release pre-trained models
- The technical details are described here
- A great introduction and explanation is available from the Computer Vision and Perception for Self-Driving Cars Course (YouTube link)
- SFA3D is used for the second course in the Udacity Self-Driving Car Engineer Nanodegree Program: Sensor Fusion and Tracking (GitHub link)
Update 2020.09.06: Added ROS source code, contributed by @AhmedARadwan.
The implementation is here
The instructions for setting up a virtual environment are here.
git clone https://github.com/maudzung/SFA3D.git SFA3D
cd SFA3D/
pip install -r requirements.txt
Download the 3D KITTI detection dataset from here.
The downloaded data includes:
- Velodyne point clouds (29 GB)
- Training labels of object data set (5 MB)
- Camera calibration matrices of object data set (16 MB)
- Left color images of object data set (12 GB) (for visualization purposes only)
Please make sure that you organize the source code and dataset directories as shown in the folder structure below.
To visualize 3D point clouds with 3D boxes, execute:
cd sfa/data_process/
python kitti_dataset.py
The pre-trained model was pushed to this repo.
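Like the Complex-YOLO family referenced below, SFA3D takes bird's-eye-view (BEV) maps built from the Velodyne scans as input rather than the raw point cloud. The snippet below is a minimal, illustrative sketch of that preprocessing idea; the boundary values, map size, and channel layout are assumptions for illustration, not the exact code in sfa/data_process/:

```python
import numpy as np

# Illustrative region of interest and BEV resolution (assumed values,
# not necessarily those in sfa/config/kitti_config.py).
BOUNDARY = {"minX": 0.0, "maxX": 50.0, "minY": -25.0, "maxY": 25.0, "minZ": -2.73, "maxZ": 1.27}
BEV_HEIGHT, BEV_WIDTH = 608, 608

def load_velodyne(bin_path):
    """KITTI .bin scans are flat float32 arrays of (x, y, z, intensity)."""
    return np.fromfile(bin_path, dtype=np.float32).reshape(-1, 4)

def make_bev_map(points):
    """Build a 3-channel (height, intensity, density) BEV image."""
    # Keep only points inside the region of interest.
    mask = (
        (points[:, 0] >= BOUNDARY["minX"]) & (points[:, 0] < BOUNDARY["maxX"]) &
        (points[:, 1] >= BOUNDARY["minY"]) & (points[:, 1] < BOUNDARY["maxY"]) &
        (points[:, 2] >= BOUNDARY["minZ"]) & (points[:, 2] < BOUNDARY["maxZ"])
    )
    pts = points[mask]

    # Discretize x/y into BEV pixel coordinates.
    x_img = np.clip(((pts[:, 0] - BOUNDARY["minX"]) / (BOUNDARY["maxX"] - BOUNDARY["minX"]) * BEV_HEIGHT).astype(np.int32), 0, BEV_HEIGHT - 1)
    y_img = np.clip(((pts[:, 1] - BOUNDARY["minY"]) / (BOUNDARY["maxY"] - BOUNDARY["minY"]) * BEV_WIDTH).astype(np.int32), 0, BEV_WIDTH - 1)

    bev = np.zeros((3, BEV_HEIGHT, BEV_WIDTH), dtype=np.float32)
    counts = np.zeros((BEV_HEIGHT, BEV_WIDTH), dtype=np.float32)
    for xi, yi, z, inten in zip(x_img, y_img, pts[:, 2], pts[:, 3]):
        # Channel 0: normalized max height, channel 1: max intensity per cell.
        bev[0, xi, yi] = max(bev[0, xi, yi], (z - BOUNDARY["minZ"]) / (BOUNDARY["maxZ"] - BOUNDARY["minZ"]))
        bev[1, xi, yi] = max(bev[1, xi, yi], inten)
        counts[xi, yi] += 1
    # Channel 2: log-normalized point density per cell.
    bev[2] = np.minimum(1.0, np.log(counts + 1) / np.log(64))
    return bev

# Example: bev = make_bev_map(load_velodyne("dataset/kitti/training/velodyne/000000.bin"))
```

The pre-trained model can then be evaluated and demoed with: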
python test.py --gpu_idx 0 --peak_thresh 0.2
python demo_2_sides.py --gpu_idx 0 --peak_thresh 0.2
The data for the demonstration will be automatically downloaded by executing the above command.
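The --peak_thresh flag sets the minimum heatmap score a center peak needs to become a detection. Because the detector is anchor-free and NMS-free, duplicate suppression reduces to keeping local maxima on the center heatmap; the sketch below shows that CenterNet-style idea in isolation (a simplified stand-in, not the repo's actual decode function):

```python
import torch
import torch.nn.functional as F

def extract_peaks(heatmap, peak_thresh=0.2, top_k=50):
    """Keep local maxima of a center heatmap above peak_thresh (no box-level NMS needed).

    heatmap: (num_classes, H, W) tensor of sigmoid scores.
    Returns (scores, class_ids, ys, xs) of the selected peaks.
    """
    # A cell survives only if it equals the max of its 3x3 neighborhood,
    # i.e. it is a local maximum -- this replaces Non-Max-Suppression.
    pooled = F.max_pool2d(heatmap.unsqueeze(0), kernel_size=3, stride=1, padding=1).squeeze(0)
    heatmap = heatmap * (pooled == heatmap).float()

    scores, flat_idx = heatmap.flatten().topk(top_k)
    keep = scores >= peak_thresh
    scores, flat_idx = scores[keep], flat_idx[keep]

    num_classes, h, w = heatmap.shape
    class_ids = flat_idx // (h * w)
    ys = (flat_idx % (h * w)) // w
    xs = flat_idx % w
    return scores, class_ids, ys, xs

# Example: peaks on a random 3-class, 152x152 heatmap
scores, cls, ys, xs = extract_peaks(torch.rand(3, 152, 152).sigmoid(), peak_thresh=0.2)
```

Lowering --peak_thresh keeps more low-confidence peaks; raising it trades recall for precision.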
To train the model:
- Single machine (node), single GPU
python train.py --gpu_idx 0
- Single machine (node), multiple GPUs
python train.py --multiprocessing-distributed --world-size 1 --rank 0 --batch_size 64 --num_workers 8
- Two machines (two nodes), multiple GPUs
- First machine
python train.py --dist-url 'tcp://IP_OF_NODE1:FREEPORT' --multiprocessing-distributed --world-size 2 --rank 0 --batch_size 64 --num_workers 8
- Second machine
python train.py --dist-url 'tcp://IP_OF_NODE2:FREEPORT' --multiprocessing-distributed --world-size 2 --rank 1 --batch_size 64 --num_workers 8
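The --multiprocessing-distributed, --world-size, and --rank flags map onto PyTorch's standard torch.distributed setup. For orientation, here is a minimal single-node sketch of that pattern with a placeholder model and dataset (not the repo's train.py):

```python
import torch
import torch.distributed as dist
import torch.multiprocessing as mp
from torch.nn.parallel import DistributedDataParallel as DDP

def worker(local_rank, world_size):
    # One process per GPU; on a single node the local rank doubles as the global rank.
    dist.init_process_group(backend="nccl", init_method="tcp://127.0.0.1:29500",
                            world_size=world_size, rank=local_rank)
    torch.cuda.set_device(local_rank)

    model = torch.nn.Linear(10, 3).cuda(local_rank)   # placeholder model
    model = DDP(model, device_ids=[local_rank])       # gradients sync automatically across processes

    # A DistributedSampler gives each process a disjoint shard of the dataset.
    dataset = torch.utils.data.TensorDataset(torch.randn(256, 10), torch.randint(0, 3, (256,)))
    sampler = torch.utils.data.distributed.DistributedSampler(dataset)
    loader = torch.utils.data.DataLoader(dataset, batch_size=16, sampler=sampler)

    optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
    for epoch in range(2):
        sampler.set_epoch(epoch)  # reshuffle differently each epoch
        for x, y in loader:
            loss = torch.nn.functional.cross_entropy(model(x.cuda(local_rank)), y.cuda(local_rank))
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
    dist.destroy_process_group()

if __name__ == "__main__":
    n_gpus = torch.cuda.device_count()  # assumes at least one CUDA GPU
    mp.spawn(worker, args=(n_gpus,), nprocs=n_gpus)
```

In the two-node commands above, --world-size 2 and the ranks 0/1 play the same roles as world_size and rank here, with --dist-url pointing every process at the first node.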
- To track the training progress, go to the logs/ folder and launch TensorBoard:
cd logs/<saved_fn>/tensorboard/
tensorboard --logdir=./
- Then go to http://localhost:6006/
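The files under logs/<saved_fn>/tensorboard/ are ordinary TensorBoard event files; if you want to log extra scalars from your own experiments, the standard torch.utils.tensorboard pattern is (a minimal sketch, independent of the repo's training utilities):

```python
from torch.utils.tensorboard import SummaryWriter

# Writes event files that the tensorboard command above will pick up.
writer = SummaryWriter(log_dir="logs/my_run/tensorboard")
for step in range(100):
    fake_loss = 1.0 / (step + 1)  # placeholder value
    writer.add_scalar("train/total_loss", fake_loss, global_step=step)
writer.close()
```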
To benchmark inference speed and collect statistics (e.g., FPS, system stats) on Jetson devices, use the provided scripts:
# For detailed stats and benchmarking
python sfa/demo_2_sides_stats.py --jclock
python sfa/demo_front_stats.py --jclock
# For live video rendering
python sfa/demo_2_sides_live.py
# Check jtop stats API
python sfa/stats.py
- The --jclock flag enables Jetson maximum performance clocks for consistent benchmarking.
- Stats scripts print detailed FPS and runtime statistics, and can output all available jtop system stats (CPU, GPU, RAM, temperature, etc.) for full hardware monitoring.
- Sample stats outputs (e.g., results/fpn_resnet_18/2sides_jclock.csv) are available in the results/ directory for reference.
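If you want to reproduce FPS numbers outside the provided stats scripts, the usual approach is to synchronize the GPU around the timed region. Below is a minimal sketch with a placeholder network and a BEV-sized input (not the repo's fpn_resnet pipeline):

```python
import time
import torch

def benchmark(model, example_input, warmup=20, iters=100):
    """Return average latency (ms) and FPS for a forward pass on a CUDA device."""
    model.eval()
    with torch.no_grad():
        for _ in range(warmup):          # warm-up: stabilize clocks and kernel selection
            model(example_input)
        torch.cuda.synchronize()         # flush queued GPU work before starting the timer
        start = time.perf_counter()
        for _ in range(iters):
            model(example_input)
        torch.cuda.synchronize()
        elapsed = time.perf_counter() - start
    latency_ms = elapsed / iters * 1000.0
    return latency_ms, 1000.0 / latency_ms

# Example with a placeholder network and a BEV-sized input
model = torch.nn.Conv2d(3, 64, 3, padding=1).cuda()
latency, fps = benchmark(model, torch.randn(1, 3, 608, 608).cuda())
print(f"{latency:.2f} ms/frame, {fps:.1f} FPS")
```

The warm-up iterations matter especially on embedded devices, where the first forward passes can be much slower than steady state.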
Tested Jetson Environment:
- PyTorch v2.1.0
- JetPack 6.0 DP (L4T R36.2.0)
- Python 3.10
- torch-2.1.0-cp310-cp310-linux_aarch64.whl (USE_DISTRIBUTED=on)
For best performance on Jetson devices, install the official NVIDIA PyTorch wheel:
Download PyTorch for Jetson (Python 3.8, CUDA 10.2)
Install with:
pip install https://nvidia.box.com/shared/static/0h6tk4msrl9xz3evft9t0mpwwwkw7a32.whl
If you think this work is useful, please give me a star!
If you find any errors or have any suggestions, please contact me (Email: [email protected]).
Thank you!
@misc{Super-Fast-Accurate-3D-Object-Detection-PyTorch,
author = {Nguyen Mau Dung},
title = {{Super-Fast-Accurate-3D-Object-Detection-PyTorch}},
howpublished = {\url{https://github.com/maudzung/Super-Fast-Accurate-3D-Object-Detection}},
year = {2020}
}
[1] CenterNet: Objects as Points paper, PyTorch Implementation
[2] RTM3D: PyTorch Implementation
[3] Libra_R-CNN: PyTorch Implementation
The YOLO-based models with the same BEV maps input:
[4] Complex-YOLO: v4, v3, v2
3D LiDAR point cloud pre-processing:
[5] VoxelNet: PyTorch Implementation
${ROOT}
├── checkpoints/
│   └── fpn_resnet_18/
│       └── fpn_resnet_18_epoch_300.pth
├── dataset/
│   └── kitti/
│       ├── ImageSets/
│       │   ├── test.txt
│       │   ├── train.txt
│       │   └── val.txt
│       ├── training/
│       │   ├── image_2/ (left color camera)
│       │   ├── calib/
│       │   ├── label_2/
│       │   └── velodyne/
│       ├── testing/
│       │   ├── image_2/ (left color camera)
│       │   ├── calib/
│       │   └── velodyne/
│       └── classes_names.txt
├── sfa/
│   ├── config/
│   │   ├── train_config.py
│   │   └── kitti_config.py
│   ├── data_process/
│   │   ├── kitti_dataloader.py
│   │   ├── kitti_dataset.py
│   │   └── kitti_data_utils.py
│   ├── models/
│   │   ├── fpn_resnet.py
│   │   ├── resnet.py
│   │   └── model_utils.py
│   ├── utils/
│   │   ├── demo_utils.py
│   │   ├── evaluation_utils.py
│   │   ├── logger.py
│   │   ├── misc.py
│   │   ├── torch_utils.py
│   │   ├── train_utils.py
│   │   └── visualization_utils.py
│   ├── demo_2_sides.py
│   ├── demo_front.py
│   ├── test.py
│   └── train.py
├── README.md
└── requirements.txt