StereoSA

Mahmoud Tahmasebi* ([email protected]), Saif Huq, Kevin Meehan, Marion McAfee

Paper

Performance on KITTI raw dataset

Note: for inference on KITTI raw refer to this repo https://github.com/M2219/ESMStereo

Performace with confidence branch

The confidence branch generates a map that detects and suppresses unreliable disparity estimates, incurring only a slight reduction in processing speed.

SOTA results on KITTI dataset.

Method	KITTI 2012 (3-noc)	KITTI 2012 (3-all)	KITTI 2015 (D1-bg)	KITTI 2015 (D1-fg)	KITTI 2015 (D1-all)	Runtime (ms)
CFNet	1.23 %	1.58 %	1.54 %	3.56 %	1.94 %	180
IGEV-Stereo	1.55 %	1.93 %	1.79 %	3.82 %	2.13 %	180
ACVNet	1.62 %	2.03 %	1.81 %	4.09 %	2.19 %	200
LEAStereo	1.62 %	2.03 %	1.81 %	4.09 %	2.19 %	300
EdgeStereo-V2	1.41 %	1.89 %	1.74 %	3.20 %	1.98 %	320
CREStereo	1.45 %	1.85 %	1.70 %	3.53 %	2.01 %	410
SegStereo	4.11 %	4.65 %	2.21 %	6.16 %	4.43 %	600
SSPCVNet	1.91 %	2.42 %	1.99 %	5.39 %	2.55 %	900
CSPN	1.64 %	2.11 %	1.91 %	4.47 %	2.33 %	1000
StereoSA	1.30 %	1.67 %	1.60 %	3.33 %	1.89 %	67 (RTX 4070S)

Results on SceneFlow dataset.

Method (Real-Time)	EPE [px]	Runtime (ms)	GPU
DCVSMNet	0.60	67	RTX 3080
ACVNet	0.48	200	RTX 3090
Selective-IGEV	0.44	240	RTX 3090
IGEV++	0.43	280	RTX 3080
DLNR	0.47	297	A100
MoCha-Stereo	0.41	340	2 x A6000
DiffuVolume	0.46	360	RTX 3090
IGEV-Stereo	0.47	370	RTX 3090
StereoSA	0.41	64	RTX 4070S

Integration into visual-inertial odometry

This video was recorded during a straight-line navigation test on the Hunter V2 Robot. For more details, please visit the project repository: https://github.com/M2219/ACNMR

You may also be interested in the following related repository, which is a fork of OpenVINS: https://github.com/M2219/open_vins This fork modifies the original code to accept disparity maps for finding keypoint correspondences in images and includes configuration files for the OAK-D Pro camera.

ATE

How to use

Environment

NVIDIA RTX 4070S
Python 3.10
Pytorch 2.0.0

Install

Create a virtual environment and activate it.

conda create -n StereoSA python=3.10
conda activate StereoSA

Dependencies

conda install pytorch torchvision torchaudio cudatoolkit=11.8 -c pytorch -c nvidia
pip install opencv-python
pip install scikit-image
pip install tensorboard
pip install matplotlib 
pip install tqdm
pip install timm=1.0.11

Data Preparation

Train

Use the following command to train StereoSA on SceneFlow. First training,

python train_sceneflow.py --logdir ./checkpoints/sceneflow/first/

Second training,

python train_sceneflow.py --logdir ./checkpoints/sceneflow/second/ --loadckpt ./checkpoints/sceneflow/first/checkpoint_000059.ckpt

Use the following command to finetune StereoSA on KITTI using the pretrained model on SceneFlow,

python train_kitti.py --logdir ./checkpoints/kitti/ --loadckpt ./checkpoints/sceneflow/second/checkpoint_000059.ckpt

Evaluation on SceneFlow and KITTI

Pretrained Model

StereoSA

Generate disparity images of KITTI test set,

python save_disp.py

Citation

If you find this project helpful in your research, welcome to cite the paper.

Acknowledgements

Thanks to open source works: OpenVINS.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
datasets		datasets
filenames		filenames
imgs		imgs
kitti_publisher		kitti_publisher
kitti_publisher_conf		kitti_publisher_conf
models		models
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
onnx_transformed.py		onnx_transformed.py
onnx_transformed_conf.py		onnx_transformed_conf.py
save_disp.py		save_disp.py
save_vid.py		save_vid.py
test_eth3d.py		test_eth3d.py
test_kitti.py		test_kitti.py
test_mid.py		test_mid.py
train_kitti.py		train_kitti.py
train_sceneflow.py		train_sceneflow.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

StereoSA

Paper

Performance on KITTI raw dataset

Performace with confidence branch

SOTA results on KITTI dataset.

Results on SceneFlow dataset.

Integration into visual-inertial odometry

ATE

How to use

Environment

Install

Create a virtual environment and activate it.

Dependencies

Data Preparation

Train

Evaluation on SceneFlow and KITTI

Pretrained Model

Citation

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

M2219/StereoSA

Folders and files

Latest commit

History

Repository files navigation

StereoSA

Paper

Performance on KITTI raw dataset

Performace with confidence branch

SOTA results on KITTI dataset.

Results on SceneFlow dataset.

Integration into visual-inertial odometry

ATE

How to use

Environment

Install

Create a virtual environment and activate it.

Dependencies

Data Preparation

Train

Evaluation on SceneFlow and KITTI

Pretrained Model

Citation

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages