NLP
One mark Questions with answers
UNIT - I
1.) List out a few applications of NLP.
• Question answering
• Spam detection
• Machine translation
• Spelling correction
• Chatbots
• Speech recognition
2.) Components of NLP
• NLU (Natural Language Understanding)
• NLG (Natural Language Generation)
3.) Name five phases involved in NLP.
• Lexical and morphological analysis
• Syntactic analysis
• Semantic analysis
• Discourse integration
• Pragmatic analysis
4.) Differentiate lexeme and lemma
Aspect | Lexeme | Lemma
Definition | The base unit of meaning; an abstract unit representing all inflected forms of a word. | The dictionary or canonical form of a lexeme.
Represents | All the inflectional variants (e.g., walk, walks, walked, walking). | A single standard form, typically used as a headword in dictionaries.
Example | Lexeme: RUN → run, runs, ran, running | Lemma: run
Used in | Linguistic analysis, corpus studies, NLP | Dictionaries, NLP, morphological parsing
Nature | Abstract and general | Specific and representative
5.) Define Morphology
Morphology is the branch of linguistics that studies the structure and formation of words. It
analyzes how morphemes (the smallest units of meaning) combine to form words, including roots,
prefixes, and suffixes.
Example: In the word “unhappiness”, un- (prefix), happy (root), and -ness (suffix) are all morphemes.
6.) What is typology?
Typology in linguistics is the study and classification of languages based on their structural features,
such as word order, sentence structure, or morphological patterns. It helps identify similarities and
differences among languages, regardless of their historical or genetic relationships.
Example: English follows SVO (Subject-Verb-Object) word order, while Hindi follows SOV (Subject-
Object-Verb).
7.) What is a fusional language?
Fusional languages are defined by a feature-per-morpheme ratio higher than one: a single affix can encode several grammatical features at once (as in Arabic, Czech, Latin, Sanskrit, German, etc.).
Ex: Latin amō ("I love"): the single ending -ō simultaneously marks person (first), number (singular), tense (present), mood (indicative), and voice (active).
8.) Features of NLTK
• Tokenization
• Lowercasing
• Removing stopwords
• Punctuation removal
• Stemming
• Lemmatization
• POS tagging
• Named Entity Recognition (NER)
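The listed steps can be chained into one preprocessing pass. A minimal sketch (note: resource names such as "punkt" can vary slightly across NLTK versions, e.g., newer releases may ask for "punkt_tab"):

import string
import nltk
from nltk.tokenize import word_tokenize
from nltk.corpus import stopwords

nltk.download("punkt")       # tokenizer model (one-time download)
nltk.download("stopwords")   # stopword lists (one-time download)

text = "NLTK makes text preprocessing easy!"
tokens = [t.lower() for t in word_tokenize(text)]                    # tokenization + lowercasing
tokens = [t for t in tokens if t not in stopwords.words("english")]  # stopword removal
tokens = [t for t in tokens if t not in string.punctuation]          # punctuation removal
print(tokens)   # ['nltk', 'makes', 'text', 'preprocessing', 'easy']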
9.) Define stemming
Stemming is the process of reducing a word to its base or root form, called a "stem."
It helps group related words together so they can be analyzed as a single item, regardless of tense or
form.
Ex: helping → help
studying → studi
flying → fli
helper → help
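A minimal sketch with NLTK's PorterStemmer (exact outputs vary between stemmers, e.g., Porter vs. Lancaster):

from nltk.stem import PorterStemmer

stemmer = PorterStemmer()
for word in ["helping", "studying", "flying"]:
    print(word, "->", stemmer.stem(word))   # help, studi, fli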
10.) Define Lemmatizing
Lemmatizing is the process of reducing a word to its lemma, or base form. Unlike stemming, it
produces a valid English word that makes sense on its own.
Stemming:
→ "caring" → "car" (not meaningful in context)
→ Fast but less accurate.
Lemmatizing:
→ "caring" → "care" (meaningful root word)
→ Slower but context-aware and grammatically correct.
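A minimal sketch with NLTK's WordNetLemmatizer (requires the wordnet corpus; the pos argument tells the lemmatizer the word class):

import nltk
from nltk.stem import WordNetLemmatizer

nltk.download("wordnet")   # one-time corpus download

lemmatizer = WordNetLemmatizer()
print(lemmatizer.lemmatize("caring", pos="v"))   # care (treated as a verb)
print(lemmatizer.lemmatize("feet"))              # foot (default pos is noun)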
11.) List out the libraries commonly imported when working with NLTK.
import contractions                       # expands contractions ("don't" → "do not")
import nltk                               # core NLTK toolkit
import re                                 # regular expressions for text cleanup
from nltk.tokenize import word_tokenize   # tokenization
from nltk.corpus import stopwords         # stopword lists
from nltk.stem import PorterStemmer, WordNetLemmatizer  # stemming and lemmatization
from nltk import pos_tag                  # part-of-speech tagging
12.) Differentiate chunking and chinking
Chunking: The process of identifying and grouping phrases in a sentence, such as noun phrases and
verb phrases.
Chinking: The process of removing specific patterns from within a chunk (like verbs or adverbs that don't belong).
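A minimal sketch using NLTK's RegexpParser on a pre-tagged sentence ({...} defines a chunk pattern, }...{ defines a chink):

from nltk import RegexpParser

grammar = r"""
NP: {<.*>+}      # chunk everything into one NP
    }<VBD|IN>{   # chink: remove verbs and prepositions from the chunk
"""
sentence = [("the", "DT"), ("little", "JJ"), ("yellow", "JJ"), ("dog", "NN"),
            ("barked", "VBD"), ("at", "IN"), ("the", "DT"), ("cat", "NN")]
print(RegexpParser(grammar).parse(sentence))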
13.) Define NER (Named Entity Recognition)
The process of identifying and classifying named entities in a given sentence, such as:
Ex: Person names (e.g., Mahatma Gandhi)
Organizations (e.g., MRCET)
Locations (e.g., Hyderabad, India)
Dates (e.g., 20 June 2025)
Monetary values (e.g., ₹500, $1000)
Time, percentages, events, etc.
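A minimal sketch with NLTK's built-in chunker (resource names may differ slightly across NLTK versions):

import nltk

for pkg in ["punkt", "averaged_perceptron_tagger", "maxent_ne_chunker", "words"]:
    nltk.download(pkg)   # one-time model/corpus downloads

sentence = "Mahatma Gandhi was born in Porbandar, India."
tree = nltk.ne_chunk(nltk.pos_tag(nltk.word_tokenize(sentence)))
print(tree)   # labels entities such as PERSON and GPE (geo-political entity)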
UNIT-II
1.) Define Parsing/Syntax Analysis.
A. The process of analyzing a sentence's grammatical structure according to the rules of a formal
grammar. It identifies the syntactic structure of a sentence and determines how the words relate to
each other.
2.) Applications of Syntactic analysis
• Grammar checking (e.g., Grammarly)
• Question answering systems
• Chatbots
• Machine translation
• Text summarization
3.) List out Approaches to Syntax Analysis.
Top-Down Parsing – Starts from the start symbol and tries to derive the sentence.
Bottom-Up Parsing – Builds the parse tree from the input up to the start symbol.
Chart Parsing – Uses dynamic programming to store intermediate parsing results.
Shift-Reduce Parsing – A bottom-up method using a stack to shift and reduce tokens.
Recursive Descent Parsing – A top-down parser using recursive functions for grammar rules.
Dependency Parsing – Focuses on word-to-word relations (head-dependent).
Constituency Parsing – Breaks sentences into phrase structures (like NP, VP).
Probabilistic Parsing – Uses probabilities to select the most likely parse tree.
4.) Define Treebanks
Treebanks are annotated text corpora that include syntactic or grammatical structure (usually in
the form of parse trees) for each sentence. They are used in Natural Language Processing (NLP) and
linguistics to train and evaluate parsers and grammar models.
Example: A sentence like "The cat sat on the mat." would be annotated to show how words group
into phrases (like noun phrases and verb phrases).
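NLTK ships a sample of the Penn Treebank, which can be loaded directly; a minimal sketch:

import nltk
from nltk.corpus import treebank

nltk.download("treebank")          # Penn Treebank sample (one-time download)
tree = treebank.parsed_sents()[0]  # first annotated sentence as a parse tree
print(tree)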
5.) Types of Syntax trees and what are they?
There are two main types of syntax trees in linguistics:
1. Constituency Tree (Phrase Structure Tree):
Shows how words group into phrases (like noun phrases or verb phrases) based on grammar
rules.
Example: [NP The cat] [VP sat [PP on [NP the mat]]]
2. Dependency Tree:
Shows word-to-word relationships, where one word (the "head") governs the others (its
"dependents").
Example: In "The cat sat," "sat" is the main verb, and "cat" is its subject dependent.
These trees help analyze sentence structure and grammatical relationships.
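A constituency tree like the one above can be built from its bracketed form with nltk.Tree; a minimal sketch:

from nltk import Tree

t = Tree.fromstring(
    "(S (NP (DT The) (NN cat)) (VP (VBD sat) (PP (IN on) (NP (DT the) (NN mat)))))"
)
t.pretty_print()   # draws the tree as ASCII art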
6.) Uses of Treebanks.
• Training parsers (e.g., probabilistic context-free grammar parsers, neural parsers)
• Evaluating parsing algorithms
• Linguistic research
• Building tools for translation, sentiment analysis, etc.
7.) Write about data driven approach
A data-driven approach in linguistics and NLP relies on large annotated datasets (corpora) to learn
patterns and make predictions. Instead of using fixed grammar rules, this approach uses statistical
models or machine learning algorithms trained on real language data.
Example: A machine translation system trained on parallel corpora learns how to translate based on
patterns in the data, not predefined rules.
8.) Define dependency graph
A. A dependency graph shows how the words in a sentence are connected based on their
grammatical roles, with each word linked to the "head" word it depends on.
Ex: "Don't drink and drive."
9.) Where are dependency graphs used?
• NLP parsers (like spaCy, Stanford NLP)
• Grammar checking tools
• Machine translation
• Information extraction
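A minimal spaCy sketch (assumes the small English model is installed via: python -m spacy download en_core_web_sm):

import spacy

nlp = spacy.load("en_core_web_sm")
doc = nlp("Don't drink and drive.")
for token in doc:
    # each token points to its head word, forming the dependency graph
    print(f"{token.text:8} --{token.dep_}--> {token.head.text}")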
10.) List out the tools used to build Phrase structure trees.
• NLTK (Natural Language Toolkit) — Python
• Stanford Parser / CoreNLP
• spaCy + Benepar (Berkeley Neural Parser)
• RSyntaxTree (Web GUI Tool)
• SyntaxNet
11.) Write about types of Parsing algorithms.
• Shift-Reduce Parsing
• Chart Parsing (CYK Algorithm)
• Hypergraph-based Parsing
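A minimal chart-parsing sketch with NLTK and a toy grammar:

import nltk

grammar = nltk.CFG.fromstring("""
S -> NP VP
NP -> DT NN
VP -> VBD PP
PP -> IN NP
DT -> 'The' | 'the'
NN -> 'cat' | 'mat'
VBD -> 'sat'
IN -> 'on'
""")
parser = nltk.ChartParser(grammar)   # stores partial parses in a chart (dynamic programming)
for tree in parser.parse("The cat sat on the mat".split()):
    print(tree)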
12.) Define Hypergraph.
A hypergraph is a type of graph in which an edge, called a hyperedge, can connect more than two
vertices. It is used to represent multi-way relationships between elements.
Ex: Vertices: A, B, C, D
Hyperedge E1 = {A, B, C, D} (a single edge connecting all four vertices at once)
13.) Write about Probabilistic Context-Free Grammar.
A. Probabilistic Context-Free Grammar (PCFG) is an extension of CFG (Context-Free Grammar) where:
• Each production rule has an associated probability.
• These probabilities help choose the most likely parse tree when a sentence has multiple
possible meanings.
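A minimal sketch with NLTK's PCFG and Viterbi parser (toy probabilities; the probabilities for each left-hand side must sum to 1):

import nltk

pcfg = nltk.PCFG.fromstring("""
S -> NP VP [1.0]
NP -> DT NN [0.7] | 'I' [0.3]
VP -> VB NP [1.0]
DT -> 'the' [1.0]
NN -> 'man' [0.5] | 'telescope' [0.5]
VB -> 'saw' [1.0]
""")
parser = nltk.ViterbiParser(pcfg)   # selects the most probable parse tree
for tree in parser.parse("I saw the man".split()):
    print(tree)   # tree printed with its probability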
14.) List out Types of Generative models.
• PCFG (Probabilistic Context-Free Grammar)
• Lexicalized PCFG
• Generative Neural Parsers
• Data-Oriented Parsing (DOP)
• Bayesian Generative Models
• Stochastic Tree-Substitution Grammars (STSG)
• Generative Dependency Parsers
• Minimalist Grammars (generative, theoretical)
15.) What are the advantages of discriminative models for parsing?
• Can use rich and overlapping features (lexical, syntactic, semantic).
• Do not rely on the strong independence assumptions that generative models make.
• Generally provide higher parsing accuracy.
UNIT-III
1.) How many types of n-gram models are there? What are they?
Types of n-gram models:
• Unigram
• Bigram
• Trigram
• Higher-order N-gram Models
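A minimal sketch with NLTK's ngrams helper:

from nltk.util import ngrams

tokens = "I love natural language processing".split()
print(list(ngrams(tokens, 1)))   # unigrams
print(list(ngrams(tokens, 2)))   # bigrams
print(list(ngrams(tokens, 3)))   # trigrams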
2.) What is the purpose of language model evaluation?
• The accuracy of word predictions
• The fluency and naturalness of generated text
• How well the model captures language structure and meaning
3.) Define perplexity.
Perplexity is a measurement of how well a language model predicts a sequence of words.
It tells the user how “confused” or “surprised” the model is when it sees the actual text.
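Formally, for a test sequence W = w1 w2 ... wN, perplexity is the inverse probability of the text normalized by its length:
PP(W) = P(w1 w2 ... wN)^(-1/N)
A lower perplexity means the model predicts the text better.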
4.) Types of Smoothing techniques.
• Add-One (Laplace) Smoothing
• Add-k Smoothing
• Good-Turing Discounting
• Backoff and Interpolation
5.) Describe the role of smoothing in N-gram models. Why is it necessary?
Answer:
Smoothing helps when some N-grams in the test sentence do not appear in the training corpus,
resulting in zero probabilities.
Example: If "I enjoy mango" never appeared in training, then:
P("mango" | "enjoy") = 0 → Whole sentence probability = 0
Solution:
Laplace Smoothing: Adds 1 to all counts to avoid zeros.
Backoff Models: Fall back to smaller N-grams if higher ones are missing.
Smoothing ensures the model assigns non-zero probabilities to unseen sequences.
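A minimal add-one (Laplace) smoothing sketch for bigram probabilities, using made-up counts and vocabulary size for illustration:

# toy counts from a hypothetical training corpus
bigram_count = {("enjoy", "coding"): 3}   # ("enjoy", "mango") was never seen
unigram_count = {"enjoy": 5}
V = 1000   # assumed vocabulary size

def laplace_bigram(w1, w2):
    # P(w2 | w1) = (count(w1, w2) + 1) / (count(w1) + V)
    return (bigram_count.get((w1, w2), 0) + 1) / (unigram_count[w1] + V)

print(laplace_bigram("enjoy", "mango"))   # non-zero even though the bigram is unseen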
6.) What are the limitations of N-gram models and how can they be addressed?
Limitations:
Data sparsity: Many possible word sequences may not appear in training data.
Limited context: N-gram models only look at a few previous words.
High memory: Storing large N-gram tables is resource-heavy.
Solutions:
Smoothing: Adjusts probabilities of unseen N-grams (e.g., Laplace Smoothing).
Backoff and Interpolation: Uses lower-order N-grams when higher-order ones are
unavailable.