Structured outputs,
Data types
Mr. Sivadasan E T
Associate Professor
Vidya Academy of Science and Technology, Thrissur
Structured outputs
• A "structured object" in the context of
convolutional neural networks (CNNs) refers to
outputs that go beyond simple classification or
regression values.
• These outputs have complex, meaningful
relationships between their components and
typically represent high-dimensional data with
intricate patterns or structures.
Structured outputs
Convolutional networks can be used to output a high-
dimensional, structured object, rather than just
predicting a class label for a classification task or a
real value for a regression task.
High-Dimensional Tensor Output:
CNNs often emit a tensor as output.
A tensor can be seen as a multi-dimensional grid of
numbers representing probabilities, pixel intensities, or
other information.
Structured outputs
Example - Pixel-Level Classification:
Suppose a CNN produces a tensor S where S_{i,j,k} represents
the probability that pixel (j, k) belongs to class i (like "car"
or "person").
This enables pixel-wise classification rather than
predicting just a single class for the entire image.
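A minimal sketch of such an output tensor, assuming NumPy and an illustrative 3-class problem (the class count, image size, and random scores are assumptions, not from the slides): a softmax over the class axis turns raw network scores into per-pixel class probabilities.

import numpy as np

# Hypothetical raw scores ("logits") from a network head,
# with shape (num_classes, height, width) for one input image.
num_classes, height, width = 3, 4, 5
logits = np.random.randn(num_classes, height, width)

# Softmax over the class axis gives the tensor S, where
# S[i, j, k] is the probability that pixel (j, k) belongs to class i.
S = np.exp(logits) / np.exp(logits).sum(axis=0, keepdims=True)

print(S.shape)           # (3, 4, 5)
print(S[:, 0, 0].sum())  # probabilities at each pixel sum to 1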
Structured outputs
Image Segmentation:
By assigning a class to each pixel, CNNs can create
precise masks that outline individual objects in an
image.
Use Case: Identifying and isolating cars, roads, and
pedestrians in autonomous driving images.
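Continuing the same illustrative setup, a segmentation mask can be read off by taking the most likely class at every pixel (the Dirichlet sampling below merely fabricates a valid probability tensor for the demo):

import numpy as np

# Fabricate a probability tensor S of shape (num_classes, height, width)
# whose entries sum to 1 over the class axis, then take the argmax.
num_classes, height, width = 3, 4, 5
S = np.random.dirichlet(np.ones(num_classes), size=(height, width)).transpose(2, 0, 1)

mask = S.argmax(axis=0)  # shape (height, width); mask[j, k] is a class index
print(mask.shape)        # (4, 5)
print(mask)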
Structured outputs
• Once a prediction for each pixel is made,
various methods can be used to further process
these predictions in order to obtain a
segmentation of the image into regions.
Structured outputs
• The general idea is to assume that large groups
of contiguous pixels tend to be associated with
the same label.
• Graphical models can describe the probabilistic
relationships between neighboring pixels.
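The slides point to graphical models for this step; as a much simpler, hedged illustration of the same idea that neighboring pixels tend to share a label, the sketch below smooths a label mask with a local majority vote (the function name and window size are my own, not a method from the slides):

import numpy as np

def majority_smooth(mask, radius=1):
    # Replace each pixel's label with the most common label in its
    # (2*radius+1) x (2*radius+1) neighborhood -- a crude stand-in for
    # graphical-model smoothing, not the method described in the slides.
    h, w = mask.shape
    padded = np.pad(mask, radius, mode="edge")
    out = np.empty_like(mask)
    for j in range(h):
        for k in range(w):
            window = padded[j:j + 2 * radius + 1, k:k + 2 * radius + 1]
            labels, counts = np.unique(window, return_counts=True)
            out[j, k] = labels[counts.argmax()]
    return out

noisy = np.array([[1, 1, 0, 1],
                  [1, 1, 1, 1],
                  [1, 0, 1, 1],
                  [1, 1, 1, 1]])
print(majority_smooth(noisy))  # isolated 0-pixels are absorbed into the 1-region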
Data Types
The data used with a convolutional network usually
consists of several channels, each channel being the
observation of a different quantity at some point in
space or time.
Data Types
• One advantage of convolutional networks is that
they can also process inputs with varying spatial
extents.
• These kinds of input simply cannot be represented
by traditional, matrix multiplication-based neural
networks.
• This provides a compelling reason to use
convolutional networks even when computational
cost and overfitting are not significant issues.
Data Types
• For example, consider a collection of images,
where each image has a different width and
height.
• It is unclear how to model such inputs with a
weight matrix of fixed size.
Data Types
• Convolution is straightforward to apply; the
kernel is simply applied a different number of
times depending on the size of the input, and the
output of the convolution operation scales
accordingly.
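A small sketch of this behaviour (using scipy.signal.convolve2d purely for illustration; the image sizes are arbitrary): the same 3x3 kernel applied to differently sized inputs yields outputs whose sizes scale with the input.

import numpy as np
from scipy.signal import convolve2d

kernel = np.random.randn(3, 3)

small_image = np.random.randn(8, 8)
large_image = np.random.randn(32, 20)

# The kernel is simply applied a different number of times,
# so the output size follows the input size.
print(convolve2d(small_image, kernel, mode="valid").shape)  # (6, 6)
print(convolve2d(large_image, kernel, mode="valid").shape)  # (30, 18)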
Data Types
1-D Single Channel:
• Audio waveform: The axis we convolve over
corresponds to time.
• We discretize time and measure the amplitude
of the waveform once per time step.
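For example (a hedged sketch; the sample rate, tone frequency, and moving-average kernel are assumptions): a discretized waveform is a 1-D array with one amplitude per time step, and a 1-D convolution slides a small kernel along that time axis.

import numpy as np

sample_rate = 8000                        # samples per second (assumed)
t = np.arange(sample_rate) / sample_rate  # one second of time steps
waveform = np.sin(2 * np.pi * 440 * t)    # a 440 Hz tone, one amplitude per step

kernel = np.ones(5) / 5                   # simple moving-average kernel
smoothed = np.convolve(waveform, kernel, mode="valid")
print(waveform.shape, smoothed.shape)     # (8000,) (7996,)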
Data Types
1-D Multi-Channel:
• Skeleton animation data: 3-D characters are animated
by changing their joint angles over time.
• Each frame records the angles of the different joints,
describing the character's pose.
• In convolutional models, each data channel represents
the angle of one joint about a specific axis.
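A minimal sketch of a multi-channel 1-D convolution over such data (the joint count, frame count, and kernel width are assumed; as in most deep-learning code, the operation is implemented as cross-correlation): one filter row per joint channel, summed into a single output channel.

import numpy as np

num_joints, num_frames = 12, 100
angles = np.random.randn(num_joints, num_frames)    # (channels, time): one joint angle per channel

kernel_width = 5
kernel = np.random.randn(num_joints, kernel_width)  # one filter row per input channel

out_len = num_frames - kernel_width + 1
output = np.array([(angles[:, i:i + kernel_width] * kernel).sum()
                   for i in range(out_len)])
print(output.shape)  # (96,): a single output channel over time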
Data Types
2-D Single Channel:
• Audio data that has been preprocessed with a
Fourier transform:
• We can transform the audio waveform into a 2D
tensor with different rows corresponding to different
frequencies and different columns corresponding to
different points in time.
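As an illustration (using scipy.signal.spectrogram as one common way to do this; the sample rate and test tone are assumptions), the waveform becomes a 2-D array whose rows are frequencies and whose columns are time frames:

import numpy as np
from scipy.signal import spectrogram

sample_rate = 8000
t = np.arange(2 * sample_rate) / sample_rate
waveform = np.sin(2 * np.pi * 440 * t)  # two seconds of a 440 Hz tone

freqs, times, Sxx = spectrogram(waveform, fs=sample_rate)
print(Sxx.shape)  # (num_frequencies, num_time_frames): rows = frequencies, columns = times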
Data Types
2-D Multi-Channel:
Color image data:
• One channel contains the red pixels, one the green
pixels, and one the blue pixels.
• The convolution kernel moves over both the
horizontal and vertical axes of the image, conferring
translation equivariance in both directions.
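A small sketch of a multi-channel 2-D convolution (again using scipy.signal.convolve2d only for illustration; the image and kernel sizes are assumed): one 2-D filter per color channel, with the three channel responses summed into a single output map.

import numpy as np
from scipy.signal import convolve2d

image = np.random.rand(3, 64, 48)   # (channels, height, width): red, green, blue
kernel = np.random.randn(3, 3, 3)   # one 3x3 filter per input channel

# Convolve each channel with its own filter and sum the results.
output = sum(convolve2d(image[c], kernel[c], mode="valid") for c in range(3))
print(output.shape)                 # (62, 46)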
Data Types
3-D Single Channel:
Volumetric data: A common source of this kind of data
is medical imaging technology, such as CT scans.
Data Types
3-D Multi-Channel:
Color video data: One axis corresponds
to time, one to the height of the video frame, and one
to the width of the video frame.
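For completeness, a shapes-only sketch of these two 3-D cases (the particular dimensions are made up): a single-channel CT volume and a multi-channel color video.

import numpy as np

# 3-D single channel: a CT volume indexed by (depth, height, width).
ct_volume = np.random.rand(1, 128, 256, 256)  # (channels, depth, height, width)

# 3-D multi-channel: color video indexed by (time, height, width),
# with one channel per color.
video = np.random.rand(3, 30, 240, 320)       # (channels, frames, height, width)

print(ct_volume.shape, video.shape)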
Thank You!