0% found this document useful (0 votes)

6 views21 pages

Object Detection

Uploaded by

jay4pelican

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views21 pages

Object Detection

Uploaded by

jay4pelican

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

Object Detection : Only

Convolution Based Models

Copyright 2019 RESTRICTED CIRCULATION

Object Localisation & Detection ( single object)

Source:https://towardsdatascience.com/evolution-of-object-detection-and-localization-algorithms-e241021d8bad

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Multiple Objects with Sliding Window

• Sliding window using simple CNN for object detection that we built earlier
• Strides can vary
• Window size can vary
• Computation cost is huge ( slow models )

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Issues

• Multiple aspect ratios

• Multiple bounding boxes for same object
• Object overlapping is not handled properly
• Overlapping bounding boxes go through repeated
convolutions instead of sharing features

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Localisation and detection as single convolution

• Usual CNN layers

• Image is divided into a grid • Output is 3X3X8 tensor

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Evaluate=>IOU: intersection Over Union

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Non Max Suppression
• Multiple instances of same object must be brought down to
one
• Discard all bounding boxes with Pc < 0.6
• Pick the one with highest Pc, discard all boxes which have
IOU > 0.5 with that box
• Do this until you have either all high Pc box or discarded
them
• For multiple classes , NMS needs to be done separately for
each class

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Anchor Boxes

• One grid area might need to output multiple bounding

boxes for multiple classes
• We can simply output multiple instances for each grid
• Number of such outputs are called number of anchor
boxes

Copyright 2019 RESTRICTED CIRCULATION ‹#›

YOLO

• CNN with tensor output is used to build the model ( input

needs to be prepared according to the grid size )
• Output is : nXnXAX(1+4+C)
• n= grid size , A = number of anchor boxes , 1 = probability
for background vs object , 4 = for bounding box coordinates,
C = number of classes being considered
• Use NMS for better bounding boxes while predictions
( separately for each class )

Copyright 2019 RESTRICTED CIRCULATION ‹#›

SSD: Single Shot Detection

• Issue with YOLO: can not detect at different scales very well
• SSD has convolutions of multiple scales on top features
created by VGG16
• Prediction is facilitated at different convolution output.
• Early layers output help predict objects at finer scale due to
their receptive field being limited to smaller areas in the
image
• As we move forward , layers receptive fields grow larger and
they favour predicting larger objects
• Unlike YOLO, SSD does not split the image into grids of
arbitrary size but predicts offset of predefined anchor boxes
(this is called “default boxes” in the paper) for every location
of the feature map.

Copyright 2019 RESTRICTED CIRCULATION ‹#›

SSD Architechture

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Object Detection : Region
Proposal Based Models

Copyright 2019 RESTRICTED CIRCULATION

What is Region Proposal

• Region Proposal is a process of identifying parts of images

[ rectangles ] which have high chances of having an object
instead of background
• Selective Search is a common approach for coming up with
region proposals
• Its pretty fast with high recall [ many of the region proposals
might not have any object, but all the objects will be
contained in proposed regions ]
• Its not part of the network being built
• For deep dive in selective search :
• https://www.learnopencv.com/selective-search-for-object-detection-cpp-python/

Copyright 2019 RESTRICTED CIRCULATION ‹#›

R-CNN

Issues with R-CNN

• Very slow training due to large number of convent usage

across region proposal
• Prediction is also very slow , 47 seconds/image

Fast R-CNN

• Instead of using multiple instances of

convnet feature extraction , Region
proposals are projected on the convnet
feature map
• Linear part from fully connected layer is
used for bounding box regression
• Actually loss is a composite one ,
containing both classification and
regression losses . We can use weighted
some [ wt as a hyper parameter ] instead
of simple sum .
• Removal of multiple application of
convnet gives huge reduction in
prediction time as well as training time

R-CNN Vs Fast R-CNN

Notice that during prediction, most of the time in Fast R-CNN is being
taken by external Region Proposal Process. Faster R-CNN, makes the
Region Proposals also part of Network and further speed up things

Faster R-CNN

Pixel Level Masks : Mask R-CNN

Semantic Segmentation

• Pixel Level classification

• Doesn’t Differentiate between two objects of same
class if they are adjacent [ no mask boundaries ]

Mask R-CNN

• Upper Branch is essentially

doing what Faster R-CNN does
• Lower branch is for semantic
segmentation for each bounding
box for each class . This
combination eventually gives
instance segmentation

R3 - To Build A Fire
100% (1)
R3 - To Build A Fire
20 pages
Topic 7 - Challenge Risk and Safety
No ratings yet
Topic 7 - Challenge Risk and Safety
83 pages
Finite Element Method For Electromagnetics
No ratings yet
Finite Element Method For Electromagnetics
360 pages
Real Time Object Detection System
No ratings yet
Real Time Object Detection System
31 pages
Report 34
No ratings yet
Report 34
22 pages
Object Detection Using You Only Look Once (YOLO) Algorithm in Convolution Neural Network (CNN)
No ratings yet
Object Detection Using You Only Look Once (YOLO) Algorithm in Convolution Neural Network (CNN)
5 pages
Object Detection
No ratings yet
Object Detection
57 pages
cv2021 Lec6 Object Detection - 1600 - PDF - Gdrive.vip
No ratings yet
cv2021 Lec6 Object Detection - 1600 - PDF - Gdrive.vip
60 pages
Yolo Vs RCNN
No ratings yet
Yolo Vs RCNN
5 pages
Autonomous Vehicle Object Detection
No ratings yet
Autonomous Vehicle Object Detection
20 pages
Du 2018 J. Phys. Conf. Ser. 1004 012029
No ratings yet
Du 2018 J. Phys. Conf. Ser. 1004 012029
9 pages
Object Detection and Identification
67% (3)
Object Detection and Identification
20 pages
1 ObjectDetection
No ratings yet
1 ObjectDetection
46 pages
Presentation (Theoretical Evaluation)
No ratings yet
Presentation (Theoretical Evaluation)
107 pages
Advanced Object Detection Guide
No ratings yet
Advanced Object Detection Guide
90 pages
CSE4261 Lecture-12
No ratings yet
CSE4261 Lecture-12
24 pages
Generalized R-CNN for Researchers
No ratings yet
Generalized R-CNN for Researchers
127 pages
Convnets 4
No ratings yet
Convnets 4
22 pages
Real-Time Object Detection App
No ratings yet
Real-Time Object Detection App
6 pages
MV cs4243 2024 Amir 6 p2
No ratings yet
MV cs4243 2024 Amir 6 p2
95 pages
RCNN
No ratings yet
RCNN
25 pages
Object Detection Using CNN-RCNN.-1
No ratings yet
Object Detection Using CNN-RCNN.-1
14 pages
L10 Lecture Detection - Segmentation v2.5
No ratings yet
L10 Lecture Detection - Segmentation v2.5
35 pages
R-CNN, Fast R-CNN, Faster R-CNN, YOLO - Object Detection Algorithms
No ratings yet
R-CNN, Fast R-CNN, Faster R-CNN, YOLO - Object Detection Algorithms
11 pages
Advanced Topics in CNN and RNN
No ratings yet
Advanced Topics in CNN and RNN
72 pages
Objectdetection
No ratings yet
Objectdetection
7 pages
Object Detection & Segmentation Guide
No ratings yet
Object Detection & Segmentation Guide
38 pages
Obstacle Detection and Classification Using Deep Learning For Tracking in High-Speed Autonomous Driving
No ratings yet
Obstacle Detection and Classification Using Deep Learning For Tracking in High-Speed Autonomous Driving
6 pages
L7 Detection
No ratings yet
L7 Detection
54 pages
Final Report - Removed
No ratings yet
Final Report - Removed
43 pages
Region-Based Object Detection and Classification Using Faster R-CNN
No ratings yet
Region-Based Object Detection and Classification Using Faster R-CNN
6 pages
Object Detection
No ratings yet
Object Detection
76 pages
Object Detection Security System Report
No ratings yet
Object Detection Security System Report
13 pages
R-CNN vs Fast R-CNN Analysis
No ratings yet
R-CNN vs Fast R-CNN Analysis
4 pages
Real Time Object Detection in Surveillance Cameras With 2xjeq74wam
No ratings yet
Real Time Object Detection in Surveillance Cameras With 2xjeq74wam
8 pages
机器学习读书会嘉宾分享计算机视觉目标检测
No ratings yet
机器学习读书会嘉宾分享计算机视觉目标检测
52 pages
Image and Video Analytics Unit 3
No ratings yet
Image and Video Analytics Unit 3
18 pages
Fast Methods For Deep Learning Based Object Detection
No ratings yet
Fast Methods For Deep Learning Based Object Detection
43 pages
Yolo
No ratings yet
Yolo
24 pages
Lenc 15 RCNN
No ratings yet
Lenc 15 RCNN
12 pages
Od Segment
No ratings yet
Od Segment
53 pages
Deep Learning Algorithms For Object Detection
No ratings yet
Deep Learning Algorithms For Object Detection
43 pages
CVR FDP
No ratings yet
CVR FDP
37 pages
A Comprehensive Survey of The R-CNN Family For Object Detection
No ratings yet
A Comprehensive Survey of The R-CNN Family For Object Detection
6 pages
M10 - Introduction To TensorFlow, Deep Learning and Application
No ratings yet
M10 - Introduction To TensorFlow, Deep Learning and Application
25 pages
Unit 3
No ratings yet
Unit 3
45 pages
BTP Report Faster R CNN Compressed
No ratings yet
BTP Report Faster R CNN Compressed
32 pages
Deep Learning for Daily Object Detection
No ratings yet
Deep Learning for Daily Object Detection
6 pages
Object Detect
No ratings yet
Object Detect
12 pages
Week 5 - Fast RCNN
No ratings yet
Week 5 - Fast RCNN
17 pages
10 1109@access 2019 2932731
No ratings yet
10 1109@access 2019 2932731
9 pages
Literature Survey For Robotics
No ratings yet
Literature Survey For Robotics
6 pages
Object Detection Using Adaptive Mask RCNN
No ratings yet
Object Detection Using Adaptive Mask RCNN
12 pages
10 R CNN
No ratings yet
10 R CNN
28 pages
The Ultimate Guide To Object Detection
No ratings yet
The Ultimate Guide To Object Detection
16 pages
RCNN: Pros, Cons, and Applications
No ratings yet
RCNN: Pros, Cons, and Applications
6 pages
A Review of Object Detection Based On Convolutional Neural Network
No ratings yet
A Review of Object Detection Based On Convolutional Neural Network
6 pages
An Improved Rotation Invariant CNN-based Detector With Rotatable Bounding Boxes For Aerial Image Detection
No ratings yet
An Improved Rotation Invariant CNN-based Detector With Rotatable Bounding Boxes For Aerial Image Detection
5 pages
A Brief Review and Challenges of Object 2020
No ratings yet
A Brief Review and Challenges of Object 2020
17 pages
YOLOv2: Real-Time Object Detection
No ratings yet
YOLOv2: Real-Time Object Detection
5 pages
HR Software Market - Research Report 1
No ratings yet
HR Software Market - Research Report 1
99 pages
3 Months Bank Statement
No ratings yet
3 Months Bank Statement
3 pages
Mentor TYUs Week-8 - Prompt Engineering at Scale - MLS - 05 - 20 - 24
No ratings yet
Mentor TYUs Week-8 - Prompt Engineering at Scale - MLS - 05 - 20 - 24
14 pages
Assignment 1 2024
No ratings yet
Assignment 1 2024
3 pages
Rubrics
No ratings yet
Rubrics
2 pages
Pandas
No ratings yet
Pandas
11 pages
DM GTU Study Material E-Notes Unit-4 29012022085557AM
No ratings yet
DM GTU Study Material E-Notes Unit-4 29012022085557AM
12 pages
Greek Architecture
No ratings yet
Greek Architecture
13 pages
ANZ J. Surg. 2008 78 (Suppl. 1) A68-A80
No ratings yet
ANZ J. Surg. 2008 78 (Suppl. 1) A68-A80
13 pages
Physical Properties of Metals
No ratings yet
Physical Properties of Metals
4 pages
Project Topics On Law of Evidence
No ratings yet
Project Topics On Law of Evidence
5 pages
Sunny Days For Silicon
No ratings yet
Sunny Days For Silicon
5 pages
Preboard Exam in Ee 2
No ratings yet
Preboard Exam in Ee 2
14 pages
Education, Arts, and Sciences
No ratings yet
Education, Arts, and Sciences
1 page
Secure Stock 2081-0709
No ratings yet
Secure Stock 2081-0709
3 pages
PCC-2000 Reference Manual V1.42
No ratings yet
PCC-2000 Reference Manual V1.42
26 pages
Flipkart Sample Opposition
100% (1)
Flipkart Sample Opposition
76 pages
Technical Vocational Education: Quarter 1-Week4-Module 4
No ratings yet
Technical Vocational Education: Quarter 1-Week4-Module 4
20 pages
Lesson 5 Freedom of The Human Person
No ratings yet
Lesson 5 Freedom of The Human Person
16 pages
Runge-Kutta Method: Consider First Single First-Order Equation: Classic High-Order Scheme Error (4th Order)
No ratings yet
Runge-Kutta Method: Consider First Single First-Order Equation: Classic High-Order Scheme Error (4th Order)
17 pages
HW 683608 1answe
No ratings yet
HW 683608 1answe
4 pages
Student Animal Research Booklets
100% (1)
Student Animal Research Booklets
45 pages
Steel Squares: Specifications
No ratings yet
Steel Squares: Specifications
1 page
Selling Task % Weight of Task in Sales Process % Advertising Contribution To Task Advertising's Contribution To Sales Estimated Estimated Projected
100% (1)
Selling Task % Weight of Task in Sales Process % Advertising Contribution To Task Advertising's Contribution To Sales Estimated Estimated Projected
2 pages
Reto 4
No ratings yet
Reto 4
5 pages
Lecture O03: ENGR90024 Computational Fluid Dynamics
No ratings yet
Lecture O03: ENGR90024 Computational Fluid Dynamics
43 pages
Chest Freezer: User Manual
No ratings yet
Chest Freezer: User Manual
31 pages
Construction Professionals' Epoxy Guide
No ratings yet
Construction Professionals' Epoxy Guide
3 pages
WiFi, Working, Elements of WiFi
100% (2)
WiFi, Working, Elements of WiFi
67 pages
STCMB 1
No ratings yet
STCMB 1
59 pages
Traction Alternator Type Ta10106cy
No ratings yet
Traction Alternator Type Ta10106cy
64 pages
Scientific Notation Unit Test
100% (1)
Scientific Notation Unit Test
3 pages
Secure HTTP: A Historical Overview
No ratings yet
Secure HTTP: A Historical Overview
1 page

Object Detection

Uploaded by

Object Detection

Uploaded by

Object Detection : Only

Convolution Based Models

Copyright 2019 RESTRICTED CIRCULATION

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Copyright 2019 RESTRICTED CIRCULATION ‹#›

• Multiple aspect ratios

Copyright 2019 RESTRICTED CIRCULATION ‹#›

• Usual CNN layers

• Image is divided into a grid • Output is 3X3X8 tensor

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Copyright 2019 RESTRICTED CIRCULATION ‹#›

• One grid area might need to output multiple bounding

Copyright 2019 RESTRICTED CIRCULATION ‹#›

• CNN with tensor output is used to build the model ( input

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Copyright 2019 RESTRICTED CIRCULATION

• Region Proposal is a process of identifying parts of images

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Copyright 2019 RESTRICTED CIRCULATION ‹#›

• Very slow training due to large number of convent usage

Copyright 2019 RESTRICTED CIRCULATION ‹#›

• Instead of using multiple instances of

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Copyright 2019 RESTRICTED CIRCULATION ‹#›

Copyright 2019 RESTRICTED CIRCULATION

• Pixel Level classification

Copyright 2019 RESTRICTED CIRCULATION ‹#›

• Upper Branch is essentially

Copyright 2019 RESTRICTED CIRCULATION ‹#›

You might also like