Deep Segmentation Networks
1. DeepLab v1, v2, v3
2. U-Nets
Introduction to DeepLab
What is semantic image segmentation?
Partitioning an image into regions of meaningful objects.
Assigning each region an object category label.
Introduction
DCNN and image segmentation
[Figure: a DCNN produces class prediction scores for each pixel; the class with the maximal score is selected per pixel.]
What happens in each standard DCNN layer?
Striding
Pooling
Introduction
DCNN and image segmentation
Pooling advantages:
Invariance to small translations of the input.
Helps avoid overfitting.
Computational efficiency.
Striding advantages:
Fewer applications of the filter.
Smaller output size.
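To make the resolution cost of pooling concrete, here is a minimal NumPy sketch (illustrative, not from the slides) showing how a 2×2 max pool with stride 2 halves each spatial dimension:

```python
import numpy as np

def max_pool_2x2(x):
    """2x2 max pooling with stride 2: keep the max of each 2x2 block."""
    h, w = x.shape
    # Crop to even dimensions, group into 2x2 blocks, take the block maxima.
    return x[:h - h % 2, :w - w % 2].reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))

x = np.arange(16, dtype=float).reshape(4, 4)
y = max_pool_2x2(x)
print(x.shape, '->', y.shape)  # (4, 4) -> (2, 2): three of every four values are discarded
```

Each pooling step keeps only one value per block, which is exactly the information loss the next slide discusses.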
Introduction
DCNN and image segmentation
What are the disadvantages for semantic
segmentation?
x Down-sampling causes loss of information.
x Invariance to input translations harms pixel-perfect accuracy.
DeepLab addresses these issues with:
Atrous convolution (the ‘holes’ algorithm).
CRFs (Conditional Random Fields).
Up-Sampling
Addressing the reduced-resolution problem
Possible solution:
‘Deconvolutional’ layers (backwards convolution).
x Additional memory and computation time.
x Additional parameters to learn.
Suggested solution:
Atrous (‘holes’) convolution.
DeepLab v2
Atrous (‘Holes’) Algorithm
Remove the down-sampling from the last pooling layers.
Up-sample the original filter by a factor of the stride: introduce zeros between filter values.
Atrous convolution for a 1-D signal with rate r:
y[i] = \sum_{k=1}^{K} x[i + r \cdot k] \, w[k]
Note: standard convolution is the special case r = 1.
Chen, Liang-Chieh, et al. "DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs." arXiv preprint arXiv:1606.00915 (2016).
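A minimal NumPy sketch of the 1-D definition above (function names are my own). It also checks the equivalence the slide describes: atrous convolution with rate r equals standard convolution with a filter that has r−1 zeros inserted between its values.

```python
import numpy as np

def atrous_conv1d(x, w, r):
    """Atrous 1-D convolution: y[i] = sum_k x[i + r*k] * w[k], valid positions only."""
    K = len(w)
    n_out = len(x) - r * (K - 1)
    return np.array([sum(x[i + r * k] * w[k] for k in range(K)) for i in range(n_out)])

def upsample_filter(w, r):
    """'Holes': insert r-1 zeros between consecutive filter values."""
    wu = np.zeros(r * (len(w) - 1) + 1)
    wu[::r] = w
    return wu

x = np.arange(10, dtype=float)
w = np.array([1.0, 2.0, 3.0])
# Rate-2 atrous convolution == standard (rate-1) convolution with the zero-upsampled filter.
assert np.allclose(atrous_conv1d(x, w, 2), atrous_conv1d(x, upsample_filter(w, 2), 1))
print(atrous_conv1d(x, w, 1))  # rate 1 recovers standard convolution
```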
Atrous (‘Holes’) Algorithm
[Figure: standard convolution vs. atrous convolution applied to a feature map.]
Atrous (‘Holes’) Algorithm
Filter field-of-view:
Small field-of-view → accurate localization.
Large field-of-view → context assimilation.
‘Holes’: introduce zeros between filter values.
The effective filter size increases (enlarging the filter's field-of-view):
k_{eff} = k + (k - 1)(r - 1)
However, only the non-zero filter values are taken into account:
The number of filter parameters is the same.
The number of operations per position is the same.
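The field-of-view growth is plain arithmetic; a tiny sketch (helper name is my own) makes the trade-off explicit:

```python
def effective_kernel_size(k, r):
    """Effective size of a k-tap filter dilated by rate r: k + (k - 1) * (r - 1)."""
    return k + (k - 1) * (r - 1)

# A 3x3 filter with rate 2 covers a 5x5 field of view, with rate 4 a 9x9 one,
# while still using only 9 parameters and 9 multiply-adds per position.
print(effective_kernel_size(3, 1), effective_kernel_size(3, 2), effective_kernel_size(3, 4))  # 3 5 9
```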
Atrous (‘Holes’) Algorithm
[Figure: original filter (standard convolution) vs. zero-padded filter (atrous convolution).]
Boundary recovery
DCNN trade-off:
Classification accuracy ↔ localization accuracy.
DCNN score maps successfully predict classification and rough position.
x Less effective for exact outlines.
Boundary recovery
Possible solution: super-pixel representation.
Suggested Solution: fully connected CRFs.
L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille, “Semantic image segmentation with deep convolutional nets and fully connected CRFs,” in ICLR, 2015.
https://www.researchgate.net/figure/225069465_fig1_Fig-1-Images-segmented-using-SLIC-into-superpixels-of-size-64-256-and-1024-pixels
Conditional Random Fields
Problem statement
X – random field of input observations (images) of size N.
L = {l_1, ..., l_M} – set of labels.
Y – random field of pixel labels.
X_j – color vector of pixel j.
Y_j – label assigned to pixel j.
CRFs are usually used to model connections between different images.
Here we use them to model connections between image pixels!
P. Krähenbühl and V. Koltun, “Efficient inference in fully connected CRFs with Gaussian edge potentials,” in NIPS, 2011.
Probabilistic Graphical Models
Factorization – a distribution over many variables represented as a product of local functions, each depending on a smaller subset of variables:
p(x, y) = \frac{1}{Z} \prod_{a \in F} \Psi_a(x_a, y_a)
C. Sutton and A. McCallum, “An introduction to Conditional Random Fields”, Foundations and Trends in Machine Learning, vol. 4, No. 4 (2011) 267–373
Probabilistic Graphical Models
Undirected vs. Directed
G(V, F, E)
Undirected: p(y_1, y_2, y_3) \propto \Psi(y_1, y_2) \, \Psi(y_2, y_3) \, \Psi(y_1, y_3)
Directed (naive Bayes): p(y, x) = p(y) \prod_{k=1}^{K} p(x_k \mid y)
Conditional Random Fields
Fully connected CRFs
Definition:
P(Y \mid X) = \frac{1}{Z(X)} \prod_{a=1}^{A} \Psi_a(Y_a \mid X)
Z(X) is an input-dependent normalization factor.
Factorization (energy function):
E(y \mid X) = \sum_{i=1}^{N} \theta_i(y_i \mid X) + \sum_{i < j} \theta_{ij}(y_i, y_j \mid X)
y is the label assignment for the pixels.
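The energy function can be sketched directly in NumPy. This is a toy illustration, not the paper's inference code: the pixel count, label count, and the Potts-style pairwise term below are my own placeholder choices.

```python
import numpy as np

def crf_energy(y, unary, pairwise):
    """E(y) = sum_i theta_i(y_i) + sum_{i<j} theta_ij(y_i, y_j).

    unary:    (N, M) array, unary[i, l] = theta_i(l), e.g. -log p(l | X) from the DCNN.
    pairwise: function (i, j, y_i, y_j) -> float, over all pixel pairs (fully connected).
    """
    N = len(y)
    e = sum(unary[i, y[i]] for i in range(N))
    e += sum(pairwise(i, j, y[i], y[j]) for i in range(N) for j in range(i + 1, N))
    return e

# Toy example: 3 pixels, 2 labels, uniform Potts penalty of 0.5 per disagreeing pair.
unary = np.array([[0.1, 2.0], [0.2, 1.5], [1.8, 0.3]])
potts = lambda i, j, yi, yj: 0.5 if yi != yj else 0.0
print(crf_energy([0, 0, 1], unary, potts))  # 0.1 + 0.2 + 0.3 + 0.5 + 0.5 ≈ 1.6
```

Inference seeks the labeling y that minimizes this energy; here the unaries already favor [0, 0, 1], and the pairwise term only adds pressure toward agreement.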
Conditional Random Fields
Potential functions in our case
Unary potential:
\theta_i(y_i \mid X) = -\log p(y_i \mid X)
p(y_i \mid X) is the label assignment probability for pixel i computed by the DCNN.
Pairwise potential:
\theta_{ij}(y_i, y_j \mid X) = \mu(y_i, y_j) \left[ w_1 \exp\!\left( -\frac{\|s_i - s_j\|^2}{2\sigma_\alpha^2} - \frac{\|x_i - x_j\|^2}{2\sigma_\beta^2} \right) + w_2 \exp\!\left( -\frac{\|s_i - s_j\|^2}{2\sigma_\gamma^2} \right) \right]
The first term is the ‘bilateral’ kernel; the second is the smoothness kernel.
s_i – position of pixel i.
x_i – intensity (color) vector of pixel i.
w_1, w_2 – learned parameters (weights).
\sigma_\alpha, \sigma_\beta, \sigma_\gamma – hyperparameters (what is considered “near” / “similar”).
Chen, Liang-Chieh, et al. "DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs." arXiv preprint arXiv:1606.00915 (2016).
Conditional Random Fields
Potential functions in our case
The ‘bilateral’ kernel, w_1 \exp\!\left( -\frac{\|s_i - s_j\|^2}{2\sigma_\alpha^2} - \frac{\|x_i - x_j\|^2}{2\sigma_\beta^2} \right), combines pixel “nearness” with pixel color similarity:
nearby pixels with similar color are likely to be in the same class.
\sigma_\alpha, \sigma_\beta – what is considered “near” / “similar”.
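A small NumPy sketch of this pairwise potential; the weights and sigmas below are illustrative placeholders, not the learned/tuned values from the paper.

```python
import numpy as np

def pairwise_potential(s_i, s_j, x_i, x_j, y_i, y_j,
                       w1=1.0, w2=1.0, sa=10.0, sb=20.0, sg=3.0):
    """mu(y_i, y_j) * [w1 * bilateral kernel + w2 * smoothness kernel].
    Weights w1, w2 and sigmas sa, sb, sg are illustrative, not learned values."""
    if y_i == y_j:                    # Potts compatibility: penalize only disagreements
        return 0.0
    ds2 = np.sum((s_i - s_j) ** 2)    # squared positional distance
    dx2 = np.sum((x_i - x_j) ** 2)    # squared color distance
    bilateral = np.exp(-ds2 / (2 * sa**2) - dx2 / (2 * sb**2))
    smoothness = np.exp(-ds2 / (2 * sg**2))
    return w1 * bilateral + w2 * smoothness

# Nearby, similarly colored pixels with different labels receive the largest penalty.
near_similar = pairwise_potential(np.array([0., 0.]), np.array([1., 0.]),
                                  np.array([10., 10., 10.]), np.array([12., 10., 10.]), 0, 1)
far_different = pairwise_potential(np.array([0., 0.]), np.array([50., 0.]),
                                   np.array([10., 10., 10.]), np.array([200., 10., 10.]), 0, 1)
print(near_similar > far_different)  # True
```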
Conditional Random Fields
Potential functions in our case
The label compatibility term \mu(y_i, y_j) = 1 \text{ if } y_i \neq y_j (else 0) applies a uniform penalty to nearby pixels with different labels.
x Insensitive to the compatibility between particular label pairs!
Boundary recovery
[Figure: DCNN score map vs. belief map after fully connected CRF inference.]
DeepLab
Group:
CCVL (Center for Cognition, Vision, and Learning).
Base networks (pre-trained on ImageNet):
VGG-16 (Oxford Visual Geometry Group, ILSVRC 2014 localization winner).
ResNet-101 (Microsoft Research Asia, ILSVRC 2015 1st place).
Code: https://bitbucket.org/deeplab/deeplab-public/
U-Net
What does a U-Net do?
Learns segmentation: input image → output segmentation map.
U-Net Architecture
“Contraction” phase:
- Increases field of view.
- Loses spatial information.
Ronneberger et al. (2015), U-Net architecture.
U-Net Architecture
“Expansion” phase:
- Creates a high-resolution mapping.
U-Net Architecture
Concatenate with high-resolution feature maps from the Contraction phase.
U-Net Summary
• Contraction phase
– Reduces the spatial dimensions, but builds up the “what.”
• Expansion phase
– Recovers object details and the spatial dimensions: the “where.”
• Concatenating feature maps from the contraction phase helps the expansion phase recover the “where” information.
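The shape bookkeeping behind this summary can be sketched with plain NumPy (shapes only, no learned layers; the channel counts follow the common 64→128→256 convention but are my own choice here):

```python
import numpy as np

# Tensors are (channels, height, width); values are placeholders, shapes are the point.
def down(x):
    """Contraction step: halve spatial size, double channels."""
    c, h, w = x.shape
    return np.zeros((c * 2, h // 2, w // 2))

def up_and_concat(x, skip):
    """Expansion step: double spatial size, halve channels, then concatenate
    the matching high-resolution feature map from the contraction phase."""
    c, h, w = x.shape
    upsampled = np.zeros((c // 2, h * 2, w * 2))
    return np.concatenate([skip, upsampled], axis=0)

x0 = np.zeros((64, 128, 128))   # after the initial convolutions
x1 = down(x0)                   # (128, 64, 64)
x2 = down(x1)                   # (256, 32, 32) -- bottleneck: most "what", least "where"
u1 = up_and_concat(x2, x1)      # skip connection restores high-resolution "where"
print(u1.shape)                 # (256, 64, 64)
```

The concatenation is why channel counts double on the way back up: half the channels carry upsampled context, half carry the preserved high-resolution detail.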
Author Results
Ronneberger et al. (2015), ISBI cell tracking challenge.