NLP with Deep
Learning
Deep Learning
● Let’s explore how to work with text data in
conjunction with deep learning!
● This is a natural extension of the time series
and recurrent neural network topics we
just discussed.
Deep Learning
● We will create a neural network that will
generate new text based on a corpus of
text data.
● Check out “The Unreasonable Effectiveness
of RNNs” by Andrej Karpathy
● So how will this work?
Deep Learning
● Given an input string sequence, predict the
sequence shifted forward by one character.
○ ["h", "e", "l", "l"]
○ ["e", "l", "l", "o"]
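A minimal sketch of this input/target shift in plain Python (the variable names are illustrative):

```python
# Shift a character sequence forward by one to create an input/target pair
text_chunk = ["h", "e", "l", "l", "o"]

input_seq = text_chunk[:-1]    # ["h", "e", "l", "l"]
target_seq = text_chunk[1:]    # ["e", "l", "l", "o"]

print(input_seq, "->", target_seq)
```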
Deep Learning
● The character-based RNN will actually learn
the structure of the text.
● In our example we will use the works of
William Shakespeare.
● We will see the network clearly learn
playwriting structure and spacing, purely
from the character level!
Deep Learning
● Step 1: Read in Text Data
○ We can use basic built-in Python
commands to read in a corpus of text
as string data (see the sketch below).
○ Note: you should have a large dataset
for this, at least 1 million characters, for
realistic results.
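A minimal sketch of this step, assuming a local plain-text file (the file name is illustrative):

```python
# Read the entire corpus into a single Python string.
# "shakespeare.txt" is an illustrative file name.
with open("shakespeare.txt", "r", encoding="utf-8") as f:
    text = f.read()

print(f"Corpus length: {len(text):,} characters")
```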
Deep Learning
● Step 2: Text Processing and Vectorization
○ The neural network can’t take in raw
strings, so we will encode each
character as an integer.
■ A:1
■ B:2
■ C:3
■ ? : 55
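One common way to build that mapping, sketched with illustrative variable names (`text` is the corpus string from Step 1):

```python
import numpy as np

vocab = sorted(set(text))         # unique characters in the corpus
vocab_size = len(vocab)

char_to_ind = {ch: i for i, ch in enumerate(vocab)}   # encode: char -> int
ind_to_char = np.array(vocab)                         # decode: int -> char

# The full corpus as an array of integers
encoded_text = np.array([char_to_ind[ch] for ch in text])
```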
Deep Learning
● Step 3: Creating Batches
○ We’ll use TensorFlow’s dataset object
to easily create batches of text
sequences, as sketched below.
■ ["h", "e", "l", "l", "o", " ", "m"]
■ ["e", "l", "l", "o", " ", "m", "y"]
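A sketch of the batching step with tf.data (the sequence length and batch size are illustrative choices; `encoded_text` comes from Step 2):

```python
import tensorflow as tf

seq_len = 120      # illustrative sequence length
batch_size = 128   # illustrative batch size

# Slice the encoded corpus into (seq_len + 1)-character chunks,
# then split each chunk into an input sequence and a shifted target sequence.
char_ds = tf.data.Dataset.from_tensor_slices(encoded_text)
sequences = char_ds.batch(seq_len + 1, drop_remainder=True)

def split_input_target(seq):
    return seq[:-1], seq[1:]

dataset = (sequences
           .map(split_input_target)
           .shuffle(10_000)
           .batch(batch_size, drop_remainder=True))
```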
Deep Learning
● Step 3: Creating Batches
○ We’ll want to use sequence lengths
that are long enough to capture
structure and previous words.
○ But not so long that the sequence is
just historical noise.
Deep Learning
● Step 4: Creating the Model
○ We’ll use 3 layers
■ Embedding
■ GRU
■ Dense
Deep Learning
● Step 4: Creating the Model
○ The Embedding layer turns positive
integers (indices) into dense vectors
of fixed size, e.g. [[4], [20]] -> [[0.25,
0.1, 0.3], [0.6, -0.2, 0.9]].
○ It’s up to the user to choose the
number of embedding dimensions.
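A quick sketch of that behavior with the Keras Embedding layer (the vocabulary size and embedding dimension are illustrative):

```python
import numpy as np
from tensorflow.keras.layers import Embedding

vocab_size = 84   # illustrative number of unique characters
embed_dim = 64    # illustrative embedding dimension (user's choice)

embedding = Embedding(input_dim=vocab_size, output_dim=embed_dim)

# Integer indices in, dense float vectors out
sample = np.array([[4], [20]])
print(embedding(sample).shape)   # (2, 1, 64)
```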
Deep Learning
● Step 4: Creating the Model
○ GRU
■ The Gated Recurrent Unit is a special
type of recurrent neuron unit.
■ The GRU is like a long short-term
memory (LSTM) cell with a forget gate,
but has fewer parameters than an
LSTM because it lacks an output gate.
Gated Recurrent Unit (GRU)
[Diagram of a GRU cell: the input x_t and previous hidden state h_{t-1} pass through a reset gate r_t and an update gate z_t (sigmoid activations) and a tanh candidate state to produce the new hidden state h_t.]
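For reference, one common formulation of the update equations the diagram depicts (note that sources differ on which branch carries z_t versus 1 - z_t):

```latex
\begin{aligned}
r_t &= \sigma(W_r x_t + U_r h_{t-1} + b_r) &&\text{(reset gate)} \\
z_t &= \sigma(W_z x_t + U_z h_{t-1} + b_z) &&\text{(update gate)} \\
\tilde{h}_t &= \tanh\!\big(W_h x_t + U_h (r_t \odot h_{t-1}) + b_h\big) &&\text{(candidate state)} \\
h_t &= (1 - z_t) \odot h_{t-1} + z_t \odot \tilde{h}_t &&\text{(new hidden state)}
\end{aligned}
```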
Deep Learning
● Step 4: Creating the Model
○ Dense Layer
■ One neuron per character in the
vocabulary.
■ Character labels will be one-hot
encoded, so the final Dense layer
produces a probability for each
character.
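A minimal sketch of the three-layer model in Keras (the layer sizes, the stateful/batch_input_shape pattern, and the softmax output are illustrative assumptions in the TF 2 style):

```python
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, GRU, Dense

def build_model(vocab_size, embed_dim, rnn_units, batch_size):
    # Embedding -> GRU -> Dense, one output probability per character
    return Sequential([
        Embedding(vocab_size, embed_dim,
                  batch_input_shape=[batch_size, None]),
        GRU(rnn_units,
            return_sequences=True,   # predict a character at every time step
            stateful=True),          # carry hidden state across batches
        Dense(vocab_size, activation="softmax"),
    ])

model = build_model(vocab_size=84, embed_dim=64,
                    rnn_units=1024, batch_size=128)
model.summary()
```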
Deep Learning
● Step 4: Creating the Model
○ Dense Layer
■ Having a probability per character
means we can play around with the
sampling “temperature”:
● choosing less probable
characters more or less often.
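A sketch of temperature-based sampling (the `probs` argument is assumed to be the model's softmax output for a single step):

```python
import numpy as np

def sample_with_temperature(probs, temperature=1.0):
    # temperature < 1.0 -> sharper distribution, safer / more repetitive text
    # temperature > 1.0 -> flatter distribution, riskier / more surprising text
    probs = np.asarray(probs, dtype="float64")
    logits = np.log(probs + 1e-9) / temperature
    scaled = np.exp(logits)
    scaled /= scaled.sum()
    return np.random.choice(len(scaled), p=scaled)
```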
Deep Learning
● Step 5: Training the Model
○ We’ll set up our batches and make
sure to one-hot encode our character
labels.
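A sketch of this step, assuming one-hot targets as described above (`dataset`, `vocab_size`, and `model` come from the earlier sketches; the epoch count and file name are illustrative):

```python
import tensorflow as tf

# One-hot encode the integer targets so they match the softmax output shape
def to_one_hot(input_seq, target_seq):
    return input_seq, tf.one_hot(target_seq, depth=vocab_size)

train_data = dataset.map(to_one_hot)

model.compile(optimizer="adam", loss="categorical_crossentropy")
model.fit(train_data, epochs=30)          # illustrative epoch count

model.save_weights("shakespeare_gen.h5")  # illustrative file name
```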
Deep Learning
● Step 6: Generating new text
○ We’ll save our model’s weights and
show you how to reload them into a
model built with a different batch size
in order to pass in single examples.
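A sketch of that generation step, reusing the helpers from the earlier sketches (the seed text, weights file name, and character count are illustrative):

```python
import tensorflow as tf

# Rebuild the same architecture with batch_size=1, then reload the trained weights
gen_model = build_model(vocab_size=84, embed_dim=64,
                        rnn_units=1024, batch_size=1)
gen_model.load_weights("shakespeare_gen.h5")
gen_model.reset_states()

def generate_text(model, start_seed, num_chars=1000, temperature=1.0):
    # Encode the seed, then repeatedly predict and sample the next character
    input_ids = tf.expand_dims([char_to_ind[ch] for ch in start_seed], 0)
    generated = []
    for _ in range(num_chars):
        probs = model(input_ids)[0, -1].numpy()      # softmax for the last step
        next_id = sample_with_temperature(probs, temperature)
        generated.append(ind_to_char[next_id])
        input_ids = tf.expand_dims([next_id], 0)     # feed the prediction back in
    return start_seed + "".join(generated)

print(generate_text(gen_model, start_seed="JULIET: "))
```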
Let’s get started!
Text Generation
With Python and Keras
Part One
Deep Learning
● Part 1: The Data
○ Import main libraries
○ Importing Text
○ Understanding Characters
Text Generation
With Python and Keras
Part Two
Deep Learning
● Part 2: Text Processing
○ Vectorize the text
○ Create encoding dictionary
Text Generation
With Python and Keras
Part Three
Deep Learning
● Part 3: Creating Batches
○ Understand text sequences
○ Use TensorFlow datasets to generate
batches
○ Shuffle batches
Text Generation
With Python and Keras
Part Four
Deep Learning
● Part 4: Creating the Model
○ Set up loss function
○ Create Model
■ Embedding
■ GRU
■ Dense
Text Generation
With Python and Keras
Part Five
Deep Learning
● Part 5: Training the Model
○ We’ll quickly show an example of how to
train the model.
○ We’ll also show you how to load our
provided saved model file.
Text Generation
With Python and Keras
Part Six
Deep Learning
● Part 6: Generating Text
○ We’ll load our model
○ Adjust batch size to 1
○ Run a loop that generates new text