Deep Learning For Object Detection - 131124

The document discusses deep learning techniques for object detection, highlighting its applications in areas such as self-driving cars, robotics, and facial recognition. It details various models like YOLO, Faster R-CNN, and SSD, emphasizing the advantages of YOLO for real-time detection. Additionally, it covers dataset creation, model training, and performance evaluation metrics essential for developing effective object detection systems.

Uploaded by

Sanynita Kiskindy

DEEP LEARNING FOR OBJECT DETECTION

• Djoko Purwanto
• Artificial Intelligence and Health Technology Research Center
• Institut Teknologi Sepuluh Nopember (ITS)
OBJECT DETECTION
Introduction

Object detection is a computer vision task that involves identifying and locating objects within
images. Deep learning has significantly advanced this field, leveraging neural networks to improve
accuracy and efficiency.

[Figure: a model and its parameters producing the labels "dog" and "cat" for an input image]

Object Detection Applications
• Optical character recognition (OCR): OCR is the recognition of handwritten, printed, or typed characters
in an image. It is used to scan printed books into digital documents. Other applications include data
entry and traffic sign recognition.
• Self-driving cars: These cars drive themselves. One of their major capabilities is detecting pedestrians,
cars, trucks, traffic signs, and so on. These detections are essential for self-driving cars to work properly.
• Verification using face and iris codes: Face and iris verification and authentication are used on iPhone
and Android phones. The device is unlocked only if the detected face or iris matches the enrolled one.
• Robotics: Object detection has many applications in robotics. A common one is bin picking and sorting:
object detection tells the robot where the objects are, and the robot uses that information to pick each
object and sort it.
• Object tracking and counting: Object detection can be used to track objects and to count them, for
example how many cars crossed a junction or how many people entered a shopping mall.
• Other applications
Object Detection Models

Model Name | Note
YOLO (You Only Look Once) | Processes images in real time by predicting bounding boxes and class probabilities simultaneously.
Faster R-CNN | Combines region proposal networks with CNNs to enhance detection speed and accuracy.
SSD (Single Shot MultiBox Detector) | Balances speed and accuracy by detecting objects at multiple scales in a single pass.

You Only Look Once (YOLO)
YOLO is one of the deep learning-based approaches to object detection. Object detection algorithms
that use deep learning can be classified into two groups:

1. Classification-based algorithms: These work in two stages. In the first stage, the algorithm selects
a set of Regions of Interest (ROIs) in the image where objects are likely to be found. In the second
stage, it applies a Convolutional Neural Network to these regions to detect the presence of an object.
One problem with this method is that the detector has to run on every ROI, which makes it slow and
computationally expensive. One example of this type of algorithm is R-CNN.
2. Regression-based algorithms: These do not select ROIs in the image. Instead, they predict the classes
and bounding boxes for the entire image at once, which makes detection faster than with
classification-based algorithms. One well-known regression-based algorithm is YOLO ("You Only Look
Once"). The YOLO detector is very fast, so it is used in self-driving cars and other applications
where real-time object detection is required.

The YOLO detector predicts the class of each object, its bounding box, and the probability of the
object’s class in the bounding box.

Each bounding box has the following parameters:

• center position of the bounding box in the image $(b_x, b_y)$
• width of the box $(b_w)$
• height of the box $(b_h)$
• class of the object $(c)$
• probability of the object’s class $(p_c)$
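The bounding box parameters listed above can be held in a small data structure. The sketch below is only an illustration: the `Detection` class, its field names, and the choice of pixel units are assumptions, not part of YOLO itself. It shows how a center-based box converts to corner coordinates, which is the form usually needed for drawing or for overlap calculations.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class Detection:
    """One predicted box, using the parameters listed above (hypothetical container)."""
    bx: float   # x of the box center, in pixels (assumed unit)
    by: float   # y of the box center, in pixels (assumed unit)
    bw: float   # box width
    bh: float   # box height
    c: str      # predicted class label
    pc: float   # probability of that class, between 0 and 1

    def to_corners(self) -> Tuple[float, float, float, float]:
        """Return (x_min, y_min, x_max, y_max)."""
        half_w, half_h = self.bw / 2, self.bh / 2
        return (self.bx - half_w, self.by - half_h,
                self.bx + half_w, self.by + half_h)


det = Detection(bx=100.0, by=80.0, bw=40.0, bh=20.0, c="dog", pc=0.9)
print(det.to_corners())  # (80.0, 70.0, 120.0, 90.0)
```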
YOLO in Darknet Framework

Darknet is an open-source neural network framework written primarily in C and CUDA. It is designed
for high performance and is particularly well known for its implementation of the YOLO (You Only
Look Once) object detection system.

• YOLOv4: Known for its balance of speed and accuracy.
• YOLOv5: Developed separately by Ultralytics in PyTorch rather than in Darknet, but widely used for its ease of use.
• YOLOv7: Further optimizations for speed and accuracy.

YOLO in PyTorch Framework

Ultralytics is a company focused on advancing artificial intelligence, particularly in computer vision.

They are best known for their work on the YOLO (You Only Look Once) series of models, which are
widely used for real-time object detection and image segmentation.
Ultralytics provides a comprehensive ecosystem for working with object detection models, from
dataset preparation to training and deployment. This makes it a powerful tool for both researchers
and developers looking to implement AI solutions in various domains.

As of this presentation, the latest YOLO model from Ultralytics is YOLOv11 (officially named YOLO11), released in September 2024.

Object Detection Illustration

Mask detection
Mask detection technology enhances public health and
safety by monitoring compliance in crowded places like
airports and malls.

Person recognition
Person recognition accurately identifies individuals,
enhancing security and efficiency. In smart office
environments, this technology can be leveraged for staff
authentication, ensuring secure access while also enabling
personalized settings that cater to individual preferences.

Ship detection
Ship detection using maritime drones is increasingly
utilized for effective surveillance and monitoring of
waterways. Drones equipped with advanced imaging
technology can identify and track vessels in real-time,
enhancing maritime safety and security.

Ping-pong ball detection

Ping-pong ball detection is used in sports training and
robotics to track the ball's position and trajectory in real-time.
This technology enhances training by providing analytics on
ball movement, helping athletes improve their techniques.

DATASET
Dataset on the Internet
Datasets are used in the learning process of models for specific applications. Some datasets are
freely available on the internet, for example on:

• Roboflow
• Kaggle

Custom Dataset Builder for Object Detection
Custom datasets for object detection can be created with an image labeling tool such as labelImg.

A dataset can also be created efficiently with freely available or commercial web-based tools
such as Roboflow.

Dataset Criteria for Object Detection

• Number of images:
As a rule of thumb, aim for at least 1,000 to 5,000 images; more complex tasks may need 10,000 to 100,000.
• Number of instances:
  • Each image should ideally contain 5-10 instances of the target objects.
  • Ensure a balanced representation of the different classes.
• Annotation quality:
Use accurate bounding boxes and correct class labels for every object.
• Data augmentation:
Apply techniques such as rotation and flipping to increase dataset size and diversity.
• Dataset split:
Divide the data into training (70-80%), validation (10-15%), and testing (10-15%) sets.

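Two of the criteria above, flipping as augmentation and the dataset split, can be sketched in a few lines. This is a minimal illustration under stated assumptions: labels are taken to be in the normalized YOLO text format (class id, x center, y center, width, height, all between 0 and 1), the 80/10/10 ratio is one choice inside the ranges given above, and the function names and file names are hypothetical. The point about flipping is that the box must be mirrored together with the image.

```python
import random
from typing import List, Sequence, Tuple

# (class_id, x_center, y_center, width, height), coordinates normalized to 0..1 (assumed format)
YoloBox = Tuple[int, float, float, float, float]


def hflip_box(box: YoloBox) -> YoloBox:
    """Mirror a normalized box horizontally; only x_center changes."""
    class_id, x, y, w, h = box
    return (class_id, 1.0 - x, y, w, h)


def split_dataset(items: Sequence[str], train: float = 0.8, val: float = 0.1,
                  seed: int = 0) -> Tuple[List[str], List[str], List[str]]:
    """Shuffle reproducibly, then cut into train / validation / test lists."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_train = int(len(shuffled) * train)
    n_val = int(len(shuffled) * val)
    return (shuffled[:n_train],
            shuffled[n_train:n_train + n_val],
            shuffled[n_train + n_val:])   # whatever remains is the test set


images = [f"img_{i:04d}.jpg" for i in range(1000)]   # hypothetical file names
train_set, val_set, test_set = split_dataset(images)
print(len(train_set), len(val_set), len(test_set))   # 800 100 100
print(hflip_box((0, 0.25, 0.5, 0.2, 0.4)))           # (0, 0.75, 0.5, 0.2, 0.4)
```

Fixing the seed keeps the split reproducible, so the validation and test images stay the same between training runs.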
TRAIN THE MODEL
Train using Roboflow

Train using Google Colab

Train on a PC using the Darknet Framework

To run the Darknet framework on your PC, start by installing the prerequisites: Visual Studio on
Windows, or build-essential and git on Linux. Next, clone the Darknet repository from GitHub and, if
you want GPU and OpenCV support, enable them in the Makefile. Build the project with make on Linux
or with Visual Studio on Windows. Prepare the dataset and the YOLO model files and place them in the
Darknet directory. Finally, run the appropriate command in the terminal or command prompt to train
your model or to detect objects in an image.

Train on a PC using the PyTorch Framework

Model Performance Evaluation
• Intersection over Union (IoU): measures the overlap between the predicted and ground-truth bounding
boxes, as the area of their intersection divided by the area of their union.
• Precision: the fraction of the model's positive predictions that are correct, TP / (TP + FP).
• Recall: the fraction of all ground-truth objects that the model detects, TP / (TP + FN).
Here TP, FP, and FN are the numbers of true positives, false positives, and false negatives.
• Mean Average Precision (mAP): summarizes how well a model detects and locates objects in images.
It is calculated by averaging the Average Precision (AP) scores over the object classes and, in
some variants, over several Intersection over Union (IoU) thresholds.

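The first three metrics above are simple enough to compute by hand. The sketch below assumes boxes are given as corner coordinates (x_min, y_min, x_max, y_max) and that the counts of true positives, false positives, and false negatives have already been obtained by matching predictions to ground truth at some IoU threshold. The function names are illustrative only.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max), assumed format


def iou(a: Box, b: Box) -> float:
    """Intersection over Union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def precision_recall(tp: int, fp: int, fn: int) -> Tuple[float, float]:
    """Precision = TP / (TP + FP); recall = TP / (TP + FN)."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


predicted = (0.0, 0.0, 10.0, 10.0)
truth = (5.0, 0.0, 15.0, 10.0)
print(iou(predicted, truth))                 # intersection 50, union 150, so about 0.333
print(precision_recall(tp=8, fp=2, fn=8))    # (0.8, 0.5)
```

A prediction is typically counted as a true positive only when its IoU with a ground-truth box of the same class exceeds a chosen threshold, 0.5 being a common choice.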
OBJECT DETECTION PROGRAMMING
Framework
• PyTorch: An open-source deep learning library that provides the foundation for building and training
neural networks. It is known for its flexibility and dynamic computation graphs.
• Ultralytics: A company that specializes in computer vision, particularly through the development of
YOLO (You Only Look Once) models. It focuses on creating user-friendly tools and frameworks for
object detection.
• YOLO: A series of real-time object detection models that can identify multiple objects in images
quickly and accurately. The latest version as of this presentation, YOLOv11, is implemented in
PyTorch and benefits from its efficient training and inference capabilities.

Inference using YOLOv11 Pre-trained Model
• The YOLOv11 pre-trained model can be downloaded from https://github.com/ultralytics/ultralytics
• The pre-trained YOLOv11 model was trained on the MS COCO (Common Objects in Context) dataset

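Inference with the pre-trained model might look like the sketch below. It assumes the `ultralytics` package is installed (`pip install ultralytics`) and uses the nano weights file `yolo11n.pt`. The function name, the default confidence threshold, and the image path in the usage note are illustrative choices, not taken from the slides.

```python
from typing import Dict, List


def detect_objects(image_path: str, weights: str = "yolo11n.pt",
                   min_confidence: float = 0.25) -> List[Dict]:
    """Run a COCO-pretrained detector on one image and return plain dictionaries."""
    # Imported lazily so this module loads even where ultralytics is not installed.
    from ultralytics import YOLO

    model = YOLO(weights)  # weights are downloaded on first use if not found locally
    results = model.predict(image_path, conf=min_confidence)

    detections = []
    for result in results:            # one Results object per input image
        for box in result.boxes:      # one entry per detected object
            class_id = int(box.cls[0])
            detections.append({
                "label": result.names[class_id],
                "confidence": float(box.conf[0]),
                "xyxy": box.xyxy[0].tolist(),  # [x_min, y_min, x_max, y_max] in pixels
            })
    return detections
```

Calling `detect_objects("street.jpg")`, where `street.jpg` is a hypothetical image, returns one dictionary per detected object with its COCO class name, confidence, and box corners.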
Train YOLOv11 Model using Transfer Learning
[Figures: directory structure; YAML file]

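The original slide showed the directory structure and YAML file as images, which are not recoverable here. As a stand-in, the sketch below generates a dataset YAML in the layout Ultralytics documents for detection datasets: a root `path`, `train` and `val` image folders relative to it, and a `names` mapping from class index to class name. The root directory, folder names, and class names are all hypothetical. Label files are assumed to sit in a parallel `labels/` tree that mirrors `images/`.

```python
from typing import List


def make_dataset_yaml(root: str, class_names: List[str]) -> str:
    """Build the text of a dataset YAML file for a custom detection dataset."""
    lines = [
        f"path: {root}",          # dataset root directory
        "train: images/train",    # training images, relative to path
        "val: images/val",        # validation images, relative to path
        "names:",
    ]
    lines += [f"  {index}: {name}" for index, name in enumerate(class_names)]
    return "\n".join(lines) + "\n"


# Hypothetical two-class dataset stored under datasets/masks/
yaml_text = make_dataset_yaml("datasets/masks", ["mask", "no_mask"])
print(yaml_text)
```

Saving `yaml_text` to a file such as `data.yaml` gives the training step a single entry point that describes where the images are and what the classes are called.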
Python program for training the model
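The program on the original slide was an image and is not recoverable, so the following is only a sketch of what transfer learning with the Ultralytics Python API typically looks like, not the author's program. It assumes `ultralytics` is installed and that a dataset YAML named `data.yaml` exists. The epoch count, image size, and device are placeholder values to adjust.

```python
def train_detector(data_yaml: str = "data.yaml", weights: str = "yolo11n.pt",
                   epochs: int = 50, image_size: int = 640, device: str = "cpu"):
    """Fine-tune a pre-trained YOLO model on a custom dataset (transfer learning)."""
    # Lazy import: requires the ultralytics package to be installed.
    from ultralytics import YOLO

    model = YOLO(weights)   # start from pre-trained weights instead of random ones
    model.train(data=data_yaml, epochs=epochs, imgsz=image_size, device=device)
    return model
```

Passing `device="0"` instead of `"cpu"` selects the first CUDA GPU when one is available, which usually shortens training considerably.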

Class distribution

Training results obtained using a GPU accelerator

Training results obtained using CPU

Model performance evaluation
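Evaluation on the validation split can be sketched as follows, assuming `ultralytics` is installed, a dataset YAML named `data.yaml` exists, and a trained weights file is available. The function name and the dictionary keys are illustrative, and the weights path in the usage note is only the location Ultralytics commonly writes to by default.

```python
from typing import Dict


def evaluate_detector(weights: str, data_yaml: str = "data.yaml") -> Dict[str, float]:
    """Validate a trained detector and return its headline box metrics."""
    # Lazy import: requires the ultralytics package to be installed.
    from ultralytics import YOLO

    model = YOLO(weights)
    metrics = model.val(data=data_yaml)   # runs on the 'val' split named in the YAML
    return {
        "precision": float(metrics.box.mp),     # mean precision over classes
        "recall": float(metrics.box.mr),        # mean recall over classes
        "mAP50": float(metrics.box.map50),      # mAP at IoU threshold 0.5
        "mAP50-95": float(metrics.box.map),     # mAP averaged over IoU 0.5 to 0.95
    }
```

A call such as `evaluate_detector("runs/detect/train/weights/best.pt")` would report the metrics for the best checkpoint of a training run, if that is where the run saved it.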

Labeling of validation data
Prediction results for the validation data

Prediction result for the test image

THANK YOU
