Word Embeddings Topics

NLP Session 2-Vector semantics and Embedding by Dr. Chetana Gavankar [Autosaved].pdf

List all topics covered in this pdf

The topics covered in the document are as follows:

1. Vector Semantics and Word Embedding:
   - Lexical semantics
   - Vector semantics
   - Word vectors
   - TF-IDF
   - Word2Vec (Skip-gram, CBOW)
   - GloVe
   - Visualizing embeddings
2. Lexical Semantics:
   - Morphology
   - Language modeling
   - Distributional hypothesis
   - Synonymy, similarity, and relatedness
   - Antonymy and connotation
3. Word Similarity and Relatedness:
   - Semantic fields
   - Applications in question answering, plagiarism detection, etc.
4. Computational Models of Word Meaning:
   - Defining words by usage
   - Word embeddings as vectors
   - Importance of context and distribution
5. Types of Word Embeddings:
   - Frequency-based (count vectors, TF-IDF, co-occurrence)
   - Prediction-based (Word2Vec, GloVe)
6. Techniques for Embedding:
   - Word2Vec (Skip-gram, CBOW)
   - Training embeddings (positive and negative examples)
   - Stochastic Gradient Descent (SGD)
7. Embedding Properties:
   - Sparse vs. dense vectors
   - Contextual embeddings (e.g., ELMo, BERT)
   - Historical and cultural bias in embeddings
8. Applications of Word Embeddings:
   - Information retrieval
   - Analogical reasoning
   - Sentiment analysis
9. Advanced Topics:
   - Diachronic word embeddings
   - Bias quantification in embeddings
   - Optimization of computational efficiency in embeddings
10. Comparative Methods:
    - Skip-gram vs. CBOW
    - Use cases and advantages of each method
11. Visualization and Interpretation:
    - Visualizing word embeddings
    - Analogical relations using embeddings
12. GloVe Specifics:
    - Word-word co-occurrence matrix
    - Ratios of conditional probabilities
    - Intuition and advantages of GloVe over Word2Vec
13. References and Resources:
    - Links to related literature, tutorials, and videos for further understanding.

If you need detailed insights or explanations for any specific topic, feel free to ask!

Explain each topic in detail with a numerical example wherever required. Also show a flow
diagram or chart wherever necessary

This is a comprehensive task, and I will break down each topic from the document in detail, with
numerical examples, flow diagrams, and charts where necessary. Here's the plan:

1. Topic Explanation: Each topic will be explained conceptually.


2. Numerical Examples: Relevant topics will include numerical examples for better understanding.
3. Flow Diagrams or Charts: Topics involving processes or workflows will be complemented with
flow diagrams or charts.

Let’s start with Vector Semantics and Word Embedding:

1. Vector Semantics and Word Embedding

Concept:

Vector Semantics: Words are represented as points in a multi-dimensional space, where the
distances between points indicate semantic similarity. For example, words like "king" and "queen"
are closer in this space than "king" and "cat."

Word Embedding: A technique to map words into dense vectors of real numbers which capture
semantic meanings. Word embeddings are used in NLP to analyze and predict text-based tasks.

Numerical Example:

Suppose we have the following sentences:

1. "I love apples."


2. "Apples are delicious."

The co-occurrence matrix for the words could look like this:

| Word      | I | love | apples | are | delicious |
|-----------|---|------|--------|-----|-----------|
| I         | 0 | 1    | 1      | 0   | 0         |
| love      | 1 | 0    | 1      | 0   | 0         |
| apples    | 1 | 1    | 0      | 1   | 1         |
| are       | 0 | 0    | 1      | 0   | 1         |
| delicious | 0 | 0    | 1      | 1   | 0         |

Each row or column represents a vector for that word. For example:

Vector for "apples": [1, 1, 0, 1, 1]

2. Lexical Semantics

Concept:

The study of word meanings and their relationships, such as synonymy (similar meanings),
antonymy (opposite meanings), and polysemy (multiple meanings).
Distributional Hypothesis: Words that appear in similar contexts have similar meanings.

Numerical Example:

Consider the sentences:

1. "The cat sat on the mat."


2. "The dog lay on the carpet."

The co-occurrence matrix for "cat" and "dog" might show similar contexts like "sat" and "lay":

| Word | cat | dog | sat | lay | mat | carpet |
|------|-----|-----|-----|-----|-----|--------|
| cat  | 0   | 0   | 1   | 0   | 1   | 0      |
| dog  | 0   | 0   | 0   | 1   | 0   | 1      |

Here, "cat" and "dog" are related because they co-occur with similar words ("mat" and "carpet").

3. TF-IDF

Concept:

Term Frequency (TF): Frequency of a word in a document.

Inverse Document Frequency (IDF): A measure of how unique a word is across all documents.

Formula:

TF-IDF(t, d) = TF(t, d) × IDF(t)

Where:

TF(t, d) = count(t, d) / (total words in d),  IDF(t) = log(N / DF(t))

N: Total number of documents
DF(t): Number of documents containing t

Numerical Example:

Suppose we have two documents:

1. Doc1: "Apples are sweet."


2. Doc2: "Apples are tasty."

TF of "apples" in Doc1: TF= 13 = 0.33 ​

IDF of "apples": IDF = log 22 = 0 (since it appears in both documents)


TF-IDF of "apples": TF − IDF = 0.33 × 0 = 0

4. Word2Vec

Concept:

Two key architectures:


Skip-gram: Predict surrounding words (context) given a target word.
CBOW: Predict the target word given surrounding words.

Flow Diagram:

Input: One-hot encoding of "apple"
    ↓
Hidden Layer: Word embedding vector
    ↓
Output Layer: Softmax probabilities for context words

Numerical Example:

For the sentence: "I love apples," the Skip-gram model might create training pairs:

("I", "love"), ("love", "apples")

The model learns embeddings to maximize the similarity between pairs.
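
As a sketch, training pairs like these can be generated with a simple sliding window (the window size of 1 and keeping both directions are assumptions, chosen to roughly match the pairs listed above):

```python
def skipgram_pairs(tokens, window=1):
    # for each target word, emit (target, context) pairs within the window
    pairs = []
    for i, target in enumerate(tokens):
        for j in range(max(0, i - window), min(len(tokens), i + window + 1)):
            if i != j:
                pairs.append((target, tokens[j]))
    return pairs

print(skipgram_pairs(["I", "love", "apples"]))
# [('I', 'love'), ('love', 'I'), ('love', 'apples'), ('apples', 'love')]
```

The exact set of pairs depends on the chosen window size and on whether both directions of each pair are kept.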

5. GloVe

Concept:

Combines global co-occurrence statistics and local context for word embedding.

Example:

Using the co-occurrence matrix:

| Word   | apple | orange | fruit |
|--------|-------|--------|-------|
| apple  | 10    | 3      | 20    |
| orange | 3     | 8      | 15    |
| fruit  | 20    | 15     | 50    |

The ratio of co-occurrence probabilities is used to compute embeddings:

Ratio = P(i | j) / P(k | j)

6. Word Similarity

Concept:

Dot Product: Measures similarity between word vectors.


Cosine Similarity: Normalized dot product to account for vector length.

Formula:

Cosine Similarity = (A · B) / (||A|| · ||B||)

Numerical Example:

Given two vectors:

A = [1, 2, 3]
B = [4, 5, 6]

Dot Product = 1·4 + 2·5 + 3·6 = 32

Cosine Similarity = 32 / (√(1² + 2² + 3²) · √(4² + 5² + 6²)) = 32 / (√14 · √77) ≈ 0.9746
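
The same computation as a small Python sketch:

```python
import math

def cosine_similarity(a, b):
    # dot product divided by the product of the vector lengths
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

print(cosine_similarity([1, 2, 3], [4, 5, 6]))  # ≈ 0.9746
```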


7. Applications

Question Answering: Using embeddings to retrieve relevant answers.


Plagiarism Detection: Semantic similarity measures to compare texts.

This detailed explanation covers the first few topics; I will continue elaborating on the
remaining topics in subsequent parts. Let me know if you'd like me to proceed with the next topics.

please proceed

Continuing with the explanation of topics:

8. Skip-gram and CBOW in Word2Vec

Concept:

Skip-gram: Predicts context words for a given center word.


CBOW (Continuous Bag of Words): Predicts the center word using surrounding context words.

Differences:

| Feature           | Skip-gram                  | CBOW                |
|-------------------|----------------------------|---------------------|
| Prediction Target | Context words              | Center word         |
| Training Data     | Works well with small data | Requires large data |
| Focus             | Rare words                 | Frequent words      |
| Training Speed    | Slower                     | Faster              |

Numerical Example:

Sentence: "I enjoy learning NLP."

Skip-gram:
Context window = 2
Training pairs: ("I", "enjoy"), ("I", "learning"), ("enjoy", "I"), ("enjoy", "learning"), etc.
CBOW:
Input: ["I", "learning"]
Output: "enjoy"

Flow Diagram:

Skip-gram:

Input: "NLP"
    ↓
Hidden Layer: Embedding vector for "NLP"
    ↓
Output: Predicts words like "learning" and "enjoy"

CBOW:

Input: Words ["I", "learning"]
    ↓
Hidden Layer: Aggregated embeddings
    ↓
Output: Predicts "enjoy"

9. GloVe

Concept:

Focuses on the global word co-occurrence matrix and uses ratios of conditional probabilities for
embedding computation.

Numerical Example:

Suppose the co-occurrence matrix X is:

| Word  | apple | fruit | sweet |
|-------|-------|-------|-------|
| apple | 50    | 30    | 20    |
| fruit | 30    | 40    | 10    |
| sweet | 20    | 10    | 25    |

Compute P (word∣context):

P(apple | fruit) = 30/40 = 0.75,  P(apple | sweet) = 20/25 = 0.8

Ratio:

Ratio = P(apple | fruit) / P(apple | sweet) = 0.75 / 0.8 = 0.9375

This ratio is used to learn embeddings.
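
A tiny sketch reproducing these numbers from the table above. Note that, to match the example, each probability is computed by dividing by the context word's diagonal entry; the GloVe paper itself normalizes by the context word's total co-occurrence count.

```python
# Co-occurrence counts copied from the table above
X = {
    "apple": {"apple": 50, "fruit": 30, "sweet": 20},
    "fruit": {"apple": 30, "fruit": 40, "sweet": 10},
    "sweet": {"apple": 20, "fruit": 10, "sweet": 25},
}

def p(word, context):
    # following the example: divide the co-occurrence count by the
    # context word's diagonal entry (used here as its total count)
    return X[word][context] / X[context][context]

ratio = p("apple", "fruit") / p("apple", "sweet")
print(p("apple", "fruit"), p("apple", "sweet"), ratio)  # 0.75 0.8 0.9375
```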

10. Word Embedding Visualization

Concept:

Techniques like t-SNE or PCA reduce high-dimensional embeddings to 2D or 3D for visualization.

Example:

Words such as "king," "queen," "man," "woman" may cluster based on gender and royalty.

Visualization Chart:

Using t-SNE, a 2D plot might show:

woman--------queen
  |              |
  |              |
man----------king
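
A minimal sketch of such a 2D projection with scikit-learn and matplotlib. The four 3-dimensional "embeddings" are made-up toy vectors (real ones would come from a trained model), and perplexity must be smaller than the number of points:

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

words = ["king", "queen", "man", "woman"]
vectors = np.array([[0.8, 0.6, 0.1],   # toy vectors for illustration only
                    [0.7, 0.9, 0.2],
                    [0.9, 0.2, 0.1],
                    [0.8, 0.5, 0.3]])

# t-SNE requires perplexity < n_samples, so 2 is used for these 4 points
coords = TSNE(n_components=2, perplexity=2, random_state=0).fit_transform(vectors)

plt.scatter(coords[:, 0], coords[:, 1])
for word, (x, y) in zip(words, coords):
    plt.annotate(word, (x, y))
plt.show()
```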

11. Semantic Relations in Word Embeddings

Concept:

Analogies: king − man + woman = queen


Relations like synonyms, antonyms, or analogical reasoning.

Numerical Example:

Given vectors:

king = [0.8, 0.6], man = [0.7, 0.4], woman = [0.6, 0.8]

Compute:

queen = king − man + woman

queen = [0.8, 0.6] − [0.7, 0.4] + [0.6, 0.8] = [0.7, 1.0]
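
The same arithmetic as a Python sketch, using the toy 2-D vectors above (including an assumed vector for "queen" placed where the analogy predicts); real systems return the vocabulary word whose embedding is closest to the computed point:

```python
import math

vectors = {
    "king":  [0.8, 0.6],
    "man":   [0.7, 0.4],
    "woman": [0.6, 0.8],
    "queen": [0.7, 1.0],   # assumed, for illustration
}

def sub(a, b): return [x - y for x, y in zip(a, b)]
def add(a, b): return [x + y for x, y in zip(a, b)]
def cos(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

target = add(sub(vectors["king"], vectors["man"]), vectors["woman"])
print(target)  # ≈ [0.7, 1.0]

# pick the closest word that is not part of the query
best = max((w for w in vectors if w not in {"king", "man", "woman"}),
           key=lambda w: cos(vectors[w], target))
print(best)    # "queen"
```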

12. Bias in Word Embeddings

Concept:

Word embeddings can reflect societal biases present in training data.


Example: "man is to computer programmer as woman is to homemaker."

Visualization:

Biased embeddings:

man---programmer
 |
 |
woman---homemaker
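
One common way to quantify such bias is to project word vectors onto a "gender direction", e.g. the difference between the vectors for "man" and "woman". The sketch below uses made-up 2-D vectors purely for illustration:

```python
import math

# toy vectors for illustration only; real analyses use trained embeddings
vecs = {
    "man":        [0.9, 0.1],
    "woman":      [0.1, 0.9],
    "programmer": [0.8, 0.3],
    "homemaker":  [0.2, 0.8],
}

def sub(a, b): return [x - y for x, y in zip(a, b)]
def cos(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

gender_direction = sub(vecs["man"], vecs["woman"])

# a positive score leans toward "man", a negative score toward "woman"
for word in ("programmer", "homemaker"):
    print(word, round(cos(vecs[word], gender_direction), 3))
```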

13. TF-IDF vs Word2Vec

| Aspect            | TF-IDF             | Word2Vec               |
|-------------------|--------------------|------------------------|
| Vector Type       | Sparse             | Dense                  |
| Basis             | Term frequency     | Learned embeddings     |
| Context Awareness | No                 | Yes                    |
| Use Case          | Document retrieval | Semantic understanding |

14. Applications of Word Embeddings

1. Question Answering:
Use embeddings to retrieve the most relevant answers.
2. Plagiarism Detection:
Compare text similarity using cosine similarity.
3. Sentiment Analysis:
Use embeddings to identify sentiments in a document.

15. Skip-gram Training Workflow

Flow Diagram:

1. Input Sentence: "I like apples."


2. Training Window: Context size = 2
3. Training Pairs:
Positive: ("I", "like"), ("like", "I"), etc.
Negative: ("I", "bananas"), etc.

Steps:

Compute dot product for each word pair.


Apply sigmoid to get probabilities.
Adjust embeddings via SGD.
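
A minimal sketch of one such update step for skip-gram with negative sampling (the embedding size, learning rate, and single negative sample are assumptions for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
dim, lr = 8, 0.05   # assumed embedding size and learning rate

# separate target and context embedding tables, as in skip-gram with negative sampling
vocab = ["i", "like", "apples", "bananas"]
W_target  = {w: rng.normal(scale=0.1, size=dim) for w in vocab}
W_context = {w: rng.normal(scale=0.1, size=dim) for w in vocab}

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sgd_step(target, context, label):
    """label = 1 for a positive (observed) pair, 0 for a negative (sampled) pair."""
    t, c = W_target[target], W_context[context]
    pred = sigmoid(np.dot(t, c))   # probability that the pair is "real"
    grad = pred - label            # derivative of the logistic loss
    W_target[target]   = t - lr * grad * c   # adjust both embeddings
    W_context[context] = c - lr * grad * t

sgd_step("i", "like", label=1)      # positive pair from the sentence
sgd_step("i", "bananas", label=0)   # negative (randomly sampled) pair
```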

16. Final Summary

Key Concepts:

1. Vector Semantics: Words are vectors in space.


2. Word Embeddings: Represent words in dense vectors.
3. Word2Vec and GloVe: Different techniques for embedding learning.
4. Applications: From question answering to bias analysis.

If you need further expansion on specific sections or additional diagrams, let me know!