Proceeding Paper
An Overview of Object Detection and Tracking Algorithms †
Kehao Du * and Alexander Bobkov
Department of Informatics and Control Systems, Bauman Moscow State Technical University,
105005 Moscow, Russia; [email protected]
* Correspondence: [email protected]; Tel.: +7-925-793-0588
† Presented at the 15th International Conference “Intelligent Systems” (INTELS’22), Moscow, Russia,
14–16 December 2022.
Abstract: With the development of information technology, the vision-based detection and tracking
of moving objects is gradually finding its way into many aspects of everyday life, and its importance is
becoming more prominent, attracting more and more scientists and research institutions worldwide
to work in this field. As research into vision-based object detection and tracking has deepened, a
variety of high-performing algorithms have appeared in recent years. In this article, we compare
some of the classic detection and tracking algorithms that have appeared recently. The article
examines and summarizes two areas: the detection and the tracking of moving objects. First, we
divide object detection into one-stage and two-stage algorithms, depending on whether a region
proposal step is required, and we outline some commonly used object detection algorithms
accordingly. Second, we discuss object tracking through the KCF and SORT algorithms, which differ
in their underlying approaches.
Keywords: region proposal; R-CNN; Fast R-CNN; Faster R-CNN; YOLO; MOSSE; KCF; SORT
1. Object Detection Algorithms
Object detection algorithms can be divided into one-stage algorithms and two-stage
algorithms. Two-stage object detection algorithms appeared earlier, so we start with them.
1.1. Two-Stage Object Detection Algorithms
The task of object detection in an image usually involves drawing bounding boxes
and labels for each object, i.e., performing classification and localization for every object in the
scene, not merely the main one.
If we use a sliding window for image classification and object localization, we would
need to apply a convolutional neural network to many different regions of the image. Since
convolutional neural networks will recognize every object in the image as either an object
or the background, the network would have to be run at a large number of positions and scales,
which is computationally intensive.
To solve this problem, researchers proposed using the concept of regions: we identify
"blobs" in the image that are likely to contain objects, which can be done much faster. The
first such model is regions with CNN features (R-CNN) [1], and the
principles of its algorithm are shown in Figure 1.
In R-CNN, the input image is first scanned for candidate object regions using a selective
search algorithm, resulting in roughly 2000 region proposals; then, a convolutional neural
network is run on each of these region proposals; finally, the output of each network is fed
into a support vector machine (SVM) for classification, while a linear regression refines each
proposal's bounding box.
Essentially, we transform object detection into an image classification task. However,
this approach has several problems: training is slow, it requires a large amount of disk
space, and inference is also slow.
Figure 1. R-CNN architecture.
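To make the pipeline concrete, the following sketch (not the original implementation) illustrates the proposal stage of an R-CNN-style detector using the selective search implementation from the opencv-contrib package; the CNN features, SVM classifiers, and box regressor that would consume each crop are only indicated in comments, and the file name is a placeholder.

# Sketch of the R-CNN proposal stage (assumes opencv-contrib-python is installed).
import cv2

image = cv2.imread("frame.jpg")         # hypothetical input image

ss = cv2.ximgproc.segmentation.createSelectiveSearchSegmentation()
ss.setBaseImage(image)
ss.switchToSelectiveSearchFast()        # trade some proposal quality for speed
proposals = ss.process()                # roughly 2000 (x, y, w, h) boxes

for (x, y, w, h) in proposals[:2000]:
    crop = cv2.resize(image[y:y + h, x:x + w], (224, 224))
    # In R-CNN, each 'crop' is passed through a CNN to extract features,
    # then scored by per-class SVMs, and its box is refined by a linear regressor.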
The first updated version of R-CNN is Fast R-CNN [2], which significantly improves
both speed and average precision through two changes, shown in Figure 2.
Figure 2. Fast R-CNN architecture.
Feature extraction is performed before the region proposals are created, so the convolutional
neural network needs to be run only once for the entire image. In addition, instead of creating
a new model, the prediction network is extended by replacing the SVM with a SoftMax layer.
Fast R-CNN is much faster than R-CNN because it runs the CNN only once per im-
age. However, the selective search algorithm still requires a long time to generate region
proposals.
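The key idea of Fast R-CNN, running the backbone once and then pooling each proposal from the shared feature map, can be sketched with torchvision's roi_pool; the backbone choice, dummy image, and proposal boxes below are placeholders rather than the configuration used in the original paper.

# Sketch of feature sharing in Fast R-CNN (PyTorch/torchvision assumed).
import torch
import torchvision

backbone = torchvision.models.resnet18(weights=None)            # placeholder backbone
feature_extractor = torch.nn.Sequential(*list(backbone.children())[:-2])

image = torch.randn(1, 3, 224, 224)                              # dummy image
features = feature_extractor(image)                              # run the CNN once: 1 x 512 x 7 x 7

# Two hypothetical proposals in image coordinates (x1, y1, x2, y2).
proposals = [torch.tensor([[0.0, 0.0, 112.0, 112.0],
                           [56.0, 56.0, 224.0, 224.0]])]

# Pool a fixed-size feature for every proposal from the shared feature map.
pooled = torchvision.ops.roi_pool(features, proposals,
                                  output_size=(3, 3),
                                  spatial_scale=7 / 224)         # feature stride of 32
print(pooled.shape)   # torch.Size([2, 512, 3, 3]) -> fed to FC layers, SoftMax, box regressor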
Faster R-CNN [3] is a typical example of object detection based on deep learning. The
algorithm replaces the slow selective search algorithm with a fast neural network: by
inserting a region proposal network (RPN), it predicts proposals directly from the extracted
features. The RPN decides "where" to look, which reduces the amount of computation for the
entire inference process. Once we have region proposals, we feed them directly into Fast R-CNN,
together with a pooling layer, several fully connected layers, a SoftMax classification
layer, and a bounding box regressor, as shown in Figure 3. As a result, Faster
R-CNN is both faster and more accurate.
Figure 3. Faster R-CNN architecture.
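For reference, this is roughly how a pretrained Faster R-CNN (with its built-in RPN) is run for inference in recent torchvision versions; the image file and score threshold are illustrative.

# Sketch: running a pretrained Faster R-CNN (RPN included) with torchvision.
import torch
import torchvision
from torchvision.io import read_image

model = torchvision.models.detection.fasterrcnn_resnet50_fpn(weights="DEFAULT")
model.eval()

img = read_image("street.jpg").float() / 255.0     # hypothetical test image
with torch.no_grad():
    predictions = model([img])                     # list with one dict per image

det = predictions[0]
for box, label, score in zip(det["boxes"], det["labels"], det["scores"]):
    if score > 0.5:                                # keep only confident detections
        print(label.item(), score.item(), box.tolist())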
1.2. One-Stage Object Detection Algorithms
Although Faster R-CNN has advanced greatly in terms of computational speed, it
cannot meet real-time detection requirements, so some researchers have proposed
regression-based methods that directly regress the position and category of each object from the
image. Two classic methods are YOLO [4] and SSD [5]. This article discusses only YOLO
as a representative one-stage algorithm.
Unlike the two-stage detection algorithms represented by the R-CNN series, YOLO
discards region proposals and completes feature extraction, bounding box regression, and
classification directly in a single, unbranched convolutional network, which makes the network
structure simpler and the detection speed almost 10 times higher than that of Faster R-CNN.
This allows deep-learning object detection algorithms to meet the needs of real-time detection
tasks given sufficient processing power.
The specific steps of the algorithm are shown in Figure 4. We divide the input image
into an S × S grid; each grid cell is responsible for determining what objects fall inside it.
For each cell, we output the parameters of the bounding boxes around the objects and a
confidence score. The confidence score reflects whether an object is contained in the cell
and how accurate the predicted box is.
Figure 4. You Only Look Once (YOLO) model.
Since YOLO only looks at the image once, we do not use sliding windows. The feature
map extracted by the convolutional layers is divided into S × S blocks, each block is
classified, and the resulting boxes are processed with non-maximum suppression to remove
overlapping detections; finally, we obtain the result. The whole process is very simple: there is
no need for region proposals to find the object, and direct regression determines both the
position and the category.
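A minimal sketch of this post-processing, assuming an S × S grid with B boxes per cell and using torchvision's nms; the tensor shapes and thresholds are illustrative rather than taken from the YOLO paper.

# Sketch of YOLO-style post-processing: an S x S grid of box predictions
# followed by non-maximum suppression.
import torch
from torchvision.ops import nms

S, B = 7, 2                                     # grid size and boxes per cell
pred = torch.rand(S, S, B * 5)                  # dummy network output: (x, y, w, h, conf) per box

boxes, scores = [], []
for i in range(S):
    for j in range(S):
        for b in range(B):
            x, y, w, h, conf = pred[i, j, 5 * b:5 * b + 5]
            # Convert cell-relative center/size to corner coordinates in [0, 1].
            cx, cy = (j + x) / S, (i + y) / S
            boxes.append(torch.stack([cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2]))
            scores.append(conf)

boxes, scores = torch.stack(boxes), torch.stack(scores)
keep = nms(boxes, scores, iou_threshold=0.5)    # remove overlapping boxes
keep = keep[scores[keep] > 0.25]                # then apply a confidence threshold
print(boxes[keep].shape)                        # final detections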
2. Object Tracking Algorithms
The next category concerns object tracking algorithms. It is generally considered that
visual object tracking falls into two categories: generative model methods and discrimina-
tive model methods. Currently, the most popular is the discriminative method, also called
tracking by detection. All the algorithms presented in this article are related to tracking by
detection.
Apart from classical tracking algorithms, object tracking algorithms can be divided into
two categories: correlation-filter-based trackers such as MOSSE [6] and trackers built on
deep-learning detectors such as SORT [7].
The notion of correlation first appeared in the field of signal processing as a measure of the
relationship between signals, including autocorrelation and cross-correlation. Researchers
introduced cross-correlation into computer vision to measure how well two images match
locally in a certain area. Correlation filtering is a method of searching for objects in an image
frame based on the principle of cross-correlation.
Before the advent of correlation filtering and deep learning, research on traditional visual
tracking algorithms progressed slowly, and their tracking performance was poor. MOSSE
contributed to the development of correlation-filter tracking and demonstrated the potential
of correlation-filter technology in the tracking domain for the first time. The
frame rate of MOSSE is shown in Table 1.
Table 1. Frame rates of MOSSE and other tracking algorithms.
Algorithm        Frame Rate   CPU
Frag Track       real time    Unknown
GBDL             real time    3.4 GHz Pent. 4
IVTL             7.5 fps      2.8 GHz CPU
MILTrack         25 fps       Core 2 Quad
MOSSE Filters    669 fps      2.4 GHz Core 2 Duo
Based on the principle of correlation filtering, which measures the similarity of local
pixels, MOSSE learns a filter that minimizes the output sum of squared errors. After being
initialized on the first frame of a video sequence, MOSSE produces a relatively stable filter
that serves as a matching template for measuring similarity in subsequent video frames. The
tracking speed can reach 669 fps, and with this advantage of high speed the method also
shows good reliability. MOSSE uses the Fast Fourier Transform (FFT) for image and filter
processing, converting the correlation of image and filter in the spatial domain into an
element-wise product in the frequency domain. After filtering the image, the point with the
highest correlation value, i.e., the center point of the object, is found.
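The frequency-domain localization step can be sketched in a few lines of NumPy; this shows only the correlation and peak search, not the MOSSE filter training or online update, and the patch and template below are dummies.

# Sketch of correlation-filter localization in the frequency domain (NumPy).
import numpy as np

def correlate(patch, filter_spatial):
    """Cross-correlate an image patch with a filter via the FFT and
    return the location of the correlation peak (the object center)."""
    P = np.fft.fft2(patch)
    H = np.fft.fft2(filter_spatial, s=patch.shape)
    # Spatial-domain correlation becomes an element-wise product in the frequency domain.
    response = np.real(np.fft.ifft2(P * np.conj(H)))
    return np.unravel_index(np.argmax(response), response.shape)

patch = np.random.rand(64, 64)       # dummy search window
template = patch[16:48, 16:48]       # dummy filter initialized from the first frame
print(correlate(patch, template))    # row, column of the correlation peak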
Because a correlation filter that uses raw pixels as features is too simple, its accuracy is
limited. The CSK algorithm was therefore proposed; it is also based on correlation filtering
and provides higher accuracy while maintaining the speed advantage of correlation filtering.
KCF [8] is an improvement of CSK [9], and CSK is in turn an improvement of and addition
to MOSSE.
In terms of tracking, to reduce the computational complexity of the model, KCF uses a
circulant matrix to represent the cyclic shifts of the training samples, and the Fourier
transform is used to switch the calculation between the frequency domain and the spatial
domain, so that expensive matrix multiplication becomes inexpensive element-wise
multiplication. During the KCF tracking process, the detection model also learns to measure
the similarity between the predicted object and the tracked object: for each object predicted
by the tracking model, the detection model first measures this similarity and then decides
whether to update the track.
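In practice, KCF is most easily tried through OpenCV's tracking API; the sketch below assumes opencv-contrib-python (the factory function may live under cv2.legacy in some versions) and a placeholder video file.

# Sketch: tracking a manually selected box with OpenCV's KCF tracker.
import cv2

cap = cv2.VideoCapture("video.mp4")       # hypothetical input video
ok, frame = cap.read()

tracker = cv2.TrackerKCF_create()
bbox = cv2.selectROI("init", frame)       # draw the initial bounding box by hand
tracker.init(frame, bbox)

while True:
    ok, frame = cap.read()
    if not ok:
        break
    found, bbox = tracker.update(frame)   # per-frame correlation-filter update
    if found:
        x, y, w, h = map(int, bbox)
        cv2.rectangle(frame, (x, y), (x + w, y + h), (0, 255, 0), 2)
    cv2.imshow("KCF", frame)
    if cv2.waitKey(1) == 27:              # Esc to quit
        break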
Another algorithm, SORT, differs from correlation-filter algorithms in that it does not use
any appearance features of the tracked object during tracking; it uses only the position and
size of the detection boxes to estimate motion and associate data. It does not implement any
re-detection, focusing instead on frame-to-frame association.
SORT mainly consists of three parts: object detection, a Kalman filter, and the Hungarian
algorithm. Object detection typically uses algorithms such as YOLO or R-CNN. Kalman
filtering combines the value predicted by a motion model with the measured observation to
produce an "optimal" estimate, where optimal means the smallest estimation error.
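The predict/update cycle can be illustrated with a constant-velocity Kalman filter for a single coordinate; SORT applies the same structure to a seven-dimensional bounding-box state, so the matrices and measurements below are simplified placeholders.

# Sketch: constant-velocity Kalman filter for one coordinate (NumPy).
import numpy as np

F = np.array([[1.0, 1.0],      # state transition: position += velocity
              [0.0, 1.0]])
H = np.array([[1.0, 0.0]])     # we only measure position
Q = np.eye(2) * 1e-2           # process noise
R = np.array([[1.0]])          # measurement noise

x = np.array([[0.0], [0.0]])   # state: position, velocity
P = np.eye(2)                  # state covariance

for z in [1.0, 2.1, 2.9, 4.2]:             # noisy position measurements
    # Predict with the motion model.
    x = F @ x
    P = F @ P @ F.T + Q
    # Update with the measurement (the Kalman gain weighs model vs. observation).
    y = np.array([[z]]) - H @ x
    S = H @ P @ H.T + R
    K = P @ H.T @ np.linalg.inv(S)
    x = x + K @ y
    P = (np.eye(2) - K @ H) @ P
    print(float(x[0]), float(x[1]))        # filtered position and velocity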
The Hungarian algorithm is a data association algorithm; in essence, the tracking
algorithm must solve a data association problem. Suppose that there are two sets S and T,
where S has m elements and T has n elements (m and n need not be equal). The Hungarian
algorithm seeks to match elements of S with elements of T in pairs. In the tracking scenario,
S and T correspond to frame t and frame t − 1, and the task of the Hungarian algorithm is to
match the detections in frame t with the tracks from frame t − 1.
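A common way to implement this matching step is to build an IoU cost matrix and solve it with SciPy's linear_sum_assignment (an implementation of the Hungarian method); the boxes and threshold below are illustrative.

# Sketch: matching detections in frame t to tracks from frame t-1 with the
# Hungarian algorithm, using negative IoU as the assignment cost (SciPy assumed).
import numpy as np
from scipy.optimize import linear_sum_assignment

def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter + 1e-9)

tracks = [(10, 10, 50, 50), (60, 60, 90, 90)]        # predicted boxes from frame t-1
detections = [(58, 62, 92, 88), (12, 8, 48, 52)]     # detector output for frame t

cost = np.array([[-iou(t, d) for d in detections] for t in tracks])
rows, cols = linear_sum_assignment(cost)             # optimal one-to-one matching
for r, c in zip(rows, cols):
    if -cost[r, c] > 0.3:                            # reject weak matches
        print(f"track {r} matched to detection {c}, IoU = {-cost[r, c]:.2f}")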
SORT, as reflected in its name, is simple and runs online in real time. On the
one hand, its principle is straightforward and easy to implement; on the other hand, it
is extremely fast because of this simplicity, as shown in Figure 5. At the same time, thanks
to the use of modern object detectors, the tracking accuracy is also significantly improved,
which means that the method achieves a good compromise between accuracy and speed.
Figure 5. Benchmark performance of SORT.
3. Conclusions
In this article, modern algorithms for object detection and tracking are considered.
Object detection is presented in terms of one-stage algorithms and two-stage algorithms.
Object tracking via the SORT and KCF algorithms is introduced. Algorithms for object detec-
tion and tracking based on different concepts are compared, the unique and advantageous
qualities of each concept are identified, and a promising direction for the development of
future detection and tracking algorithms is explored. For example, self-driving cars always
require improvements in processing speed and accuracy.
Author Contributions: Conceptualization, A.B.; methodology, A.B.; software, K.D.; validation, K.D.;
writing—original draft preparation, K.D.; writing—review and editing, K.D. All authors have read
and agreed to the published version of the manuscript.
Funding: This research received no external funding.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: Not applicable.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. In
Proceedings of the 2014 Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA, 23–28 June 2014; pp. 580–587.
2. Girshick, R. Fast R-CNN. In Proceedings of the 2015 International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13
December 2015; pp. 1440–1448.
3. Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. In
Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada, 7–12 December 2015; Volume 28.
4. Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. In Proceedings of the
2016 Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788.
5. Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.Y.; Berg, A.C. SSD: Single shot multibox detector. In Proceedings of
the 2016 European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands, 11–14 October 2016; pp. 21–37.
6. Bolme, D.S.; Beveridge, J.R.; Draper, B.A.; Lui, Y.M. Visual object tracking using adaptive correlation filters. In Proceedings of the
2010 Computer Vision and Pattern Recognition (CVPR), San Francisco, CA, USA, 13–18 June 2010; pp. 2544–2550.
7. Bewley, A.; Ge, Z.; Ott, L.; Ramos, F.; Upcroft, B. Simple online and realtime tracking. In Proceedings of the 2016 International
Conference on Image Processing (ICIP), Phoenix, AZ, USA, 25–28 September 2016; pp. 3464–3468.
8. Henriques, J.F.; Caseiro, R.; Martins, P.; Batista, J. High-speed tracking with kernelized correlation filters. IEEE Trans. Pattern Anal.
Mach. Intell. 2014, 37, 583–596. [CrossRef] [PubMed]
9. Henriques, J.F.; Caseiro, R.; Martins, P.; Batista, J. Exploiting the circulant structure of tracking-by-detection with kernels. In
Proceedings of the 2012 European Conference on Computer Vision (ECCV), Florence, Italy, 7–13 October 2012; pp. 702–715.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.