UNIT 5: LANGUAGE MODELING
1. Introduction to Language Modeling
A Language Model (LM) in NLP is a probabilistic model that assigns a probability to a
sequence of words. It predicts the next word in a sentence from the context provided by the
previous words.
Applications:
- Predictive text input
- Speech recognition
- Spelling correction
- Machine translation
- Chatbots
Example: "I love reading history..." -> next word: "books"
2. N-Gram Models
N-gram = sequence of N words.
- Unigram: "I", "love", "reading"
- Bigram: "I love", "love reading"
- Trigram: "I love reading"
Formula using Chain Rule:
P(W) = P(w1) * P(w2|w1) * P(w3|w1, w2) * ... * P(wn|w1, ..., wn-1)
Approximation (Markov assumption): P(wn | w1, ..., wn-1) ≈ P(wn | wn-N+1, ..., wn-1)
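As a minimal sketch of these formulas, the following Python builds bigram MLE estimates from a tiny toy corpus (the corpus, the <s>/</s> sentence markers, and the function names are illustrative, not part of the notes):

from collections import defaultdict

# Tiny illustrative corpus; <s> and </s> mark sentence boundaries.
corpus = [
    ["<s>", "i", "love", "reading", "history", "books", "</s>"],
    ["<s>", "i", "love", "reading", "novels", "</s>"],
]

bigram_counts = defaultdict(int)
unigram_counts = defaultdict(int)
for sentence in corpus:
    for prev, curr in zip(sentence, sentence[1:]):
        bigram_counts[(prev, curr)] += 1
        unigram_counts[prev] += 1

def bigram_prob(prev, curr):
    """MLE estimate: P(curr | prev) = count(prev, curr) / count(prev)."""
    if unigram_counts[prev] == 0:
        return 0.0
    return bigram_counts[(prev, curr)] / unigram_counts[prev]

print(bigram_prob("love", "reading"))    # 1.0 (both "love" tokens are followed by "reading")
print(bigram_prob("reading", "history")) # 0.5 ("reading" occurs twice, once before "history")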
3. Language Model Evaluation
i. Coverage Rate: percentage of n-grams in the test data that were seen in training.
ii. Perplexity: measures how well the model predicts the test data; lower is better.
PP(W) = (1/P(w1 ... wN))^(1/N) for a test set of N words, equivalently 2^H where H is the cross-entropy per word.
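A short sketch of computing perplexity in log space, assuming a per-bigram probability function such as the one in the previous sketch (all names are illustrative):

import math

def perplexity(sentence, prob):
    """PP(W) = P(w1 ... wN) ** (-1/N), computed in log space to avoid underflow."""
    log_prob = 0.0
    n = 0
    for prev, curr in zip(sentence, sentence[1:]):
        p = prob(prev, curr)
        if p == 0.0:
            return float("inf")  # one unseen bigram makes the whole sequence impossible
        log_prob += math.log2(p)
        n += 1
    return 2 ** (-log_prob / n)  # 2 raised to the average negative log2-probability per word

# e.g. perplexity(["<s>", "i", "love", "reading", "history", "books", "</s>"], bigram_prob)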
4. Parameter Estimation
i. MLE: P(wi | wi-2, wi-1) = count(wi-2, wi-1, wi) / count(wi-2, wi-1)
ii. Smoothing: Assigns small probabilities to unseen n-grams.
Backoff: Uses lower-order n-grams when data is sparse.
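A hedged sketch of add-one (Laplace) smoothing and a crude unigram backoff, reusing the illustrative bigram_counts and unigram_counts from the earlier sketch (this is a simplification, not full Katz backoff):

def smoothed_bigram_prob(prev, curr, vocab_size):
    """Add-one (Laplace) smoothing: every bigram gets a pseudo-count of 1,
    so unseen bigrams receive a small but non-zero probability."""
    return (bigram_counts[(prev, curr)] + 1) / (unigram_counts[prev] + vocab_size)

def backoff_bigram_prob(prev, curr, total_tokens):
    """Crude backoff: if the bigram was never seen, fall back to the
    unigram relative frequency of curr."""
    if bigram_counts[(prev, curr)] > 0:
        return bigram_counts[(prev, curr)] / unigram_counts[prev]
    return unigram_counts[curr] / total_tokens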
5. Language Model Adaptation
Used when applying models to new domains.
Techniques:
- Interpolation: Mix in-domain and general models (see the sketch after this list)
- Topic-based adaptation: Cluster documents into topics
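A minimal sketch of linear interpolation between an in-domain and a general model; the two probability functions and the weight lam are placeholders to be tuned on held-out in-domain data:

def interpolated_prob(prev, curr, in_domain_prob, general_prob, lam=0.7):
    """Linear interpolation: mix the in-domain estimate with the general-domain
    estimate; the weight lam is chosen on held-out data."""
    return lam * in_domain_prob(prev, curr) + (1 - lam) * general_prob(prev, curr)

# e.g. interpolated_prob("love", "reading", medical_bigram_prob, news_bigram_prob)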
6. Types of Language Models
i. Class-Based: Group words (e.g., cities, animals)
ii. Variable-Length: Use n-gram contexts of varying length rather than a fixed N
iii. Discriminative: Trained directly to separate correct word sequences from incorrect ones
iv. Topic-Based (LDA): Discover hidden topics in docs
v. Neural Network Models: Use deep learning (Word2Vec, BERT)
7. Language-Specific Modeling Problems
i. Morphologically Rich: Model morphemes (sub-word units) instead of full words to keep the vocabulary manageable
ii. No Word Segmentation: Languages such as Chinese and Japanese lack explicit word boundaries, so segmentation is needed first
iii. Spoken vs Written: Spoken language differs from written text (disfluencies, no punctuation) and requires transcription
8. Multilingual and Crosslingual Modeling
i. Multilingual: Handle multiple languages & code-switching
Example: "I need to tell her que no voy a poder ir."
ii. Crosslingual: Use one language's data for another
(Translate data, or share representations across languages, e.g., with LSA)
Conclusion:
Language modeling is essential in NLP for understanding and generating human language. It
ranges from simple n-gram models to advanced neural models.