Deep Learning
Introduction
• Deep learning is a subset of machine learning.
Difference between DL and ML
• https://www.youtube.com/watch?v=6M5VXKLf4D4
Applications of DL
• Virtual Assistants: Siri, Alexa, Cortana etc.
• Recommendation Engines
• News aggregator and Fake news detector
• Music Composition
• Image Captioning
• Self Driving Cars
• Language Translators
Types of DL Models
• Supervised Models
• CNN (Convolutional Neural Network)
• Image Classification
• RNN (Recurrent Neural Network)
• To Predict Sequences
• LSTM is one of the most popular techniques.
Disadvantages of Feed-Forward NNs
• Cannot handle sequential data.
• Considers only the current input.
• Cannot memorize previous inputs.
• Solution: RNN.
RNN
• Designed to work with time-series data, or data that involves
sequences.
• Sequence: One data point is dependent upon the previous data point.
• Concept of Memory involved.
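The recurrence behind this "memory" can be sketched in a few lines of NumPy. The weight names, sizes, and `rnn_step` function below are illustrative, not from any particular library; the point is that the new hidden state depends on both the current input and the previous hidden state, which a feed-forward net lacks.

```python
import numpy as np

# One RNN step: the hidden state h carries information forward in time.
def rnn_step(x_t, h_prev, W_xh, W_hh, b_h):
    return np.tanh(W_xh @ x_t + W_hh @ h_prev + b_h)

rng = np.random.default_rng(42)
input_dim, hidden_dim = 3, 4
W_xh = rng.standard_normal((hidden_dim, input_dim)) * 0.1   # input weights
W_hh = rng.standard_normal((hidden_dim, hidden_dim)) * 0.1  # recurrent weights
b_h = np.zeros(hidden_dim)

# Unroll over 5 time steps, reusing the SAME weights at every step.
h = np.zeros(hidden_dim)
for t in range(5):
    x_t = rng.standard_normal(input_dim)
    h = rnn_step(x_t, h, W_xh, W_hh, b_h)

print(h.shape)  # (4,)
```

Because `h` is fed back in at every step, each output depends on the whole history of inputs, which is exactly what the sequence applications below need.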
Applications of RNN
• Image Captioning
• Text Prediction
• Machine Translation
• Time Series Prediction
Types of RNN
• One to One RNN (Vanilla)
• One to Many RNN (e.g., Image Captioning)
• Many to One RNN (e.g., Sentiment Analysis)
• Many to Many RNN (e.g., Machine Translation)
Problems with RNN
• Vanishing Gradient
• Occurs when the activation function is sigmoid.
• The derivative of the sigmoid function lies between 0 and 0.25.
• Multiplying many such derivatives during backpropagation makes the gradient smaller and smaller.
• Over a period of time, the old and new weights become (nearly) the same, so learning stalls.
• Exploding Gradient
• When the gradient becomes very large and the model is not able to converge.
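Both problems can be sketched numerically. The snippet below is illustrative: it shows the best-case sigmoid derivative (0.25) shrinking the gradient over 20 time steps, and gradient clipping, a common remedy for the exploding case (clipping is a standard technique, though not mentioned above).

```python
import numpy as np

# Vanishing: backprop through time multiplies one sigmoid-derivative
# factor per step, and that factor is at most 0.25.
def sigmoid_derivative(z):
    s = 1.0 / (1.0 + np.exp(-z))
    return s * (1.0 - s)

grad_prod = 1.0
for _ in range(20):                       # 20 time steps
    grad_prod *= sigmoid_derivative(0.0)  # 0.25 even in the best case (z = 0)
print(grad_prod)                          # 0.25**20, effectively zero

# Exploding is the mirror image: repeated factors > 1. Gradient clipping
# rescales the gradient whenever its norm exceeds a threshold.
grad = np.array([3.0, -4.0])              # gradient with norm 5
max_norm = 1.0
norm = np.linalg.norm(grad)
if norm > max_norm:
    grad = grad * (max_norm / norm)       # clipped to norm 1
```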
Way to deal with Gradient Problem
• LSTM (Long Short-Term Memory)
• LSTMs are capable of learning long-term
dependencies by remembering
information for long periods of time.
LSTM Basics
• NLP task: sentence auto-completion
• "Today, due to a health issue, I …" should complete as "need medication".
• "Last week, due to a health issue, I …" should complete as "had taken medication".
• The tense depends on words far back in the sentence, so the model needs to remember long history…
• But a plain RNN has only short-term memory.
Another Example
• Maya loves eating samosas every day. Her favorite cuisine is ……….
Memory Cell
Building Long Term Memory
Only Keywords are stored.
Long Term Memory
• Maya loves samosa, her favorite cuisine is Indian. Rahul loves pizza
and pasta, his favorite cuisine is …….
• Forget some things and remember others.
• This is the forget gate.
• Input Gate
• Adds new memory, e.g. "pasta".
Output Gate
LSTM Architecture
• Consists of 3 parts, called gates:
1. Forget Gate
2. Input Gate
3. Output Gate
3 Step Process of LSTM
• Decide how much past data should be
remembered (FORGET GATE)
• Let the output h(t-1) be "Rita is good at Maths.
Jimmy, on the other hand, is good at Biology."
• Let the input at x(t) be "Jimmy called me yesterday. He
loves to play football. He is selected as the captain
of his team."
• The forget gate realizes there might be a change in
context after encountering the first full stop.
• It compares this with the current input at
x(t). The next sentence talks about Jimmy, so the
information on Rita is deleted.
• The position of the subject is vacated and assigned
to Jimmy.
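The forget gate itself is a small computation. The sketch below uses illustrative weight names (W_f, b_f) and sizes: a sigmoid over the previous hidden state and current input produces a vector of values in (0, 1), and multiplying the old cell state by it keeps or erases each remembered piece of information.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
hidden_dim, input_dim = 4, 3
W_f = rng.standard_normal((hidden_dim, hidden_dim + input_dim))
b_f = np.zeros(hidden_dim)

h_prev = rng.standard_normal(hidden_dim)  # previous hidden state h(t-1)
x_t = rng.standard_normal(input_dim)      # current input x(t)
C_prev = rng.standard_normal(hidden_dim)  # previous cell state (the "Rita" info)

# f_t = sigmoid(W_f . [h_prev, x_t] + b_f); every entry lies in (0, 1):
# near 1 means "keep this", near 0 means "forget this".
f_t = sigmoid(W_f @ np.concatenate([h_prev, x_t]) + b_f)
C_after_forget = f_t * C_prev  # positions gated toward 0 are dropped
```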
Step 2
• What information do we need to store in the
cell state?
• First, a sigmoid layer (the "input gate layer")
decides which values will be updated.
• Next, a tanh layer creates a vector of new
candidate values that could be added to
the state.
• In the next step, we combine these two to
create an update to the state.
• With the current input at x(t), the input gate analyzes
the important information in "Jimmy called me yesterday. He
loves to play football. He is selected as the captain of his team."
• The information "He loves to play football. He is
selected as the captain of his team" is remembered.
• "He called me yesterday" is less important; hence it is
forgotten. This process of adding new information
is done via the input gate.
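Step 2 can be sketched the same way (again with illustrative weight names and sizes): a sigmoid input gate i_t picks which positions to update, a tanh layer proposes candidate values, and their product is added to the cell state left over from the forget gate.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(1)
hidden_dim, input_dim = 4, 3
concat = rng.standard_normal(hidden_dim + input_dim)  # [h_prev, x_t]

W_i = rng.standard_normal((hidden_dim, hidden_dim + input_dim))
W_c = rng.standard_normal((hidden_dim, hidden_dim + input_dim))

i_t = sigmoid(W_i @ concat)      # which values to update, each in (0, 1)
C_tilde = np.tanh(W_c @ concat)  # candidate values, each in (-1, 1)

# Combine with the state produced by the forget gate: the gated new
# information is ADDED, so memories accumulate rather than overwrite.
C_after_forget = rng.standard_normal(hidden_dim)
C_t = C_after_forget + i_t * C_tilde
```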
Step 3
• Decide What Part of the Current Cell State Makes It to
the Output
• First, we run a sigmoid layer, which decides what parts
of the cell state make it to the output.
• Then, we put the cell state through tanh to push the
values to be between -1 and 1 and multiply it by the
output of the sigmoid gate.
• E.g.: "Jimmy played well against the opponent. Brave
_____ was awarded player of the match."
• For the empty place there are many choices.
• "Brave" is an adjective, so the blank calls for a noun.
• Given the current cell state, the best choice is "Jimmy".
• https://colah.github.io/posts/2015-08-Understanding-LSTMs/
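Step 3 completes the cell. In the sketch below (illustrative weights, same shapes as the earlier snippets), a sigmoid layer o_t chooses which parts of the cell state to expose, tanh squashes the state into (-1, 1), and their product is the new hidden state h_t.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(2)
hidden_dim, input_dim = 4, 3
concat = rng.standard_normal(hidden_dim + input_dim)  # [h_prev, x_t]

W_o = rng.standard_normal((hidden_dim, hidden_dim + input_dim))
o_t = sigmoid(W_o @ concat)  # which parts of the state make it to the output

C_t = rng.standard_normal(hidden_dim)  # cell state from steps 1 and 2
h_t = o_t * np.tanh(C_t)     # output: gated, squashed cell state
```

h_t is both the cell's output at this step and the hidden state carried into the next step, which is how the "Jimmy" context reaches the blank.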
Deep Learning Frameworks
• TensorFlow
• Keras
• PyTorch
• Theano
• Caffe
• Microsoft CNTK
TensorFlow
• Google’s Brain team developed the TensorFlow Deep Learning Framework.
• Primary API in Python; interfaces also exist for other languages (e.g., R via RStudio’s package).
• Uses dataflow graphs to process data.
• Makes it easy to build robust models.
• TensorBoard is available for visualization.
Keras
• François Chollet developed Keras, and it is now one of the fastest
growing Deep Learning frameworks.
• A high-level neural network API, written in Python.
• Can run on top of TensorFlow, Theano, and CNTK.
• User-friendly, as the API is simple.
• Extensible, as new modules are simple to add.
• TensorFlow has adopted Keras as its official high-level API.
• Used by companies like Netflix, Uber, etc.
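As a taste of that API, here is a minimal sketch of a many-to-one LSTM model in Keras. The layer sizes and the time-series framing are arbitrary choices for illustration, not a prescribed architecture.

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

# Sequences of 10 time steps with 1 feature each go in; one value comes out.
model = keras.Sequential([
    layers.Input(shape=(10, 1)),  # (time steps, features)
    layers.LSTM(32),              # 32 LSTM units; returns the last hidden state
    layers.Dense(1),              # e.g. the next value in a time series
])
model.compile(optimizer="adam", loss="mse")

x = np.random.rand(4, 10, 1)      # a batch of 4 dummy sequences
y = model.predict(x, verbose=0)
print(y.shape)  # (4, 1)
```

Swapping the `LSTM` layer for `SimpleRNN` gives the vanilla RNN discussed earlier; the rest of the model is unchanged, which is the kind of simplicity the bullet points above refer to.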