0% found this document useful (0 votes)

5 views15 pages

Interaction Between Computers and Human Language

Uploaded by

imviswanthanss

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views15 pages

Interaction Between Computers and Human Language

Uploaded by

imviswanthanss

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

1. Define the main focus of Natural Language Processing.

a) Image recognition
b) Signal processing
c) Interaction between computers and human language
d) Circuit design

2. Describe the two broad categories of NLP.

a) Symbolic and Analog
b) Rule-based and Statistical
c) Linear and Nonlinear
d) Sequential and Parallel

3. Identify which component deals with sentence meaning.

a) Syntax
b) Semantics
c) Morphology
d) Phonology

4. Classify the applications of NLP.

a) Data mining, Sorting
b) Machine translation, Chatbots, Sentiment analysis
c) Hardware optimization, Storage
d) Circuit evaluation, Compiling

5. Examine which is a subfield of NLP.

a) Compiler design
b) Information Retrieval
c) Operating systems
d) Database indexing

6. Locate the earliest milestone in NLP history.

a) Google Translate
b) ELIZA (1966)
c) Siri
d) Alexa
7. Recall the first stage of NLP pipeline.
a) Lexical analysis
b) Syntax analysis
c) Semantic analysis
d) Pragmatics

8. Enumerate challenges of NLP.

a) Ambiguity, Context, Sarcasm
b) Sorting, Searching, Indexing
c) Multiplication, Addition
d) Compiling, Linking

9. Identify the stage that checks grammar.

a) Lexical analysis
b) Syntax analysis
c) Pragmatics
d) Information retrieval

10.Distinguish between syntactic and semantic analysis.

a) Syntax deals with meaning, semantics with structure
b) Syntax deals with structure, semantics with meaning
c) Both deal with phonetics
d) Both are about speech recognition

11.Classify the biggest challenge in NLP.

a) Large memory
b) Ambiguity
c) Parallel computation
d) Indexing speed

12.Explain the role of pragmatics.

a) Meaning of individual words
b) Meaning in context of conversation
c) Sound recognition
d) Data mining

13.Define regular expression.

a) Random text
b) A sequence of characters defining a search pattern
c) Binary search tree
d) Language compiler

14.Identify which symbol matches zero or more repetitions.

a) +
b) ?
c) *
d) ^

15.Match the symbol with its use: “^”.

a) End of string
b) Any digit
c) Whitespace
d) Start of string

16.Recall the regex for matching digits.

a) [a-z]
b) \s
c) \d
d) \w

17.Compare greedy vs non-greedy matching.

a) Greedy takes longest match, non-greedy shortest match
b) Both take same length
c) Greedy is faster
d) Non-greedy ignores regex rules

18.Examine practical use of regex.

a) Compiler optimization
b) Email validation
c) Machine learning training
d) File compression

19.Define text normalization.

a) Transforming text into standard format
b) Compressing text
c) Encrypting text
d) Tokenizing text

20.Identify which is not part of normalization.

a) Encryption
b) Lowercasing
c) Removing punctuation
d) Expanding contractions

21.Describe stemming.
a) Removing suffixes/prefixes to reach root form
b) Converting to lowercase
c) Adding tokens
d) Encoding

22.Distinguish stemming from lemmatization.

a) Both return random roots
b) Lemmatization uses dictionary, stemming cuts off suffixes
c) Lemmatization is faster
d) Stemming uses POS tags
23.Recall the step applied before tokenization.
a) Parsing
b) Cleaning text (punctuation removal, lowercasing)
c) Compiling
d) POS tagging

24.Explain why normalization is necessary.

a) To make text encrypted
b) To reduce file size
c) To make text consistent for processing
d) To identify stopwords only

25.Define minimum edit distance.

a) Number of operations to convert one word into another
b) Number of sentences in a paragraph
c) Steps in parsing
d) Syllables in speech

26.Identify the three operations in edit distance.

a) Merge, Delete, Sort
b) Insert, Delete, Substitute
c) Copy, Replace, Divide
d) Tokenize, Encode, Decode

27.Recall the edit distance between “kitten” and “sitting”.

a) 2
b) 3
c) 4
d) 1

28.Describe the algorithm commonly used.

a) Merge Sort
b) Quick Sort
c) Dynamic Programming (Wagner-Fischer)
d) BFS

29.Distinguish Levenshtein distance from Hamming distance.

a) Both require equal length strings
b) Hamming is for equal-length strings only, Levenshtein allows different lengths
c) Levenshtein is faster
d) Hamming allows insertions

30.Explain application of edit distance.

a) POS tagging
b) Parsing
c) Spell correction
d) Tokenization

31.Define an n-gram.
a) Random set of n tokens
b) Sequence of n words
c) Sequence of n characters
d) Sentence structure

32.Identify bigram model.

a) Probability of word given previous word
b) Probability of sentence length
c) Word embedding method
d) Grammar parser

33.Recall unigram model assumption.

a) Words occur independently
b) Words depend on previous two words
c) Words are random noise
d) Word order is preserved

34.Compare trigram vs bigram.

a) Trigram considers two previous words, bigram one
b) Trigram is faster
c) Both ignore history
d) Bigram uses three words

35.Examine main problem of n-grams.

a) Tokenization
b) Data sparsity
c) Large vocabulary
d) Lowercasing

36.Classify the type of model n-grams belong to.

a) Neural models
b) Statistical models
c) Rule-based models
d) Machine translation
37. Identify the main evaluation metric.
a) Perplexity
b) Accuracy
c) Recall
d) BLEU

38.Describe held-out test data.

a) Data used for training
b) Data kept aside for evaluation
c) Validation set
d) Augmented data

39.Recall the purpose of cross-validation.

a) Reduce vocabulary size
b) Ensure generalization
c) Improve syntax
d) Normalize text

40.Distinguish intrinsic vs extrinsic evaluation.

a) Intrinsic: direct measure of model; Extrinsic: task-based
b) Both are task-based
c) Intrinsic uses BLEU
d) Extrinsic ignores accuracy

41.Explain why log probability is used.

a) To speed up compilation
b) To avoid underflow and simplify multiplication
c) To reduce grammar rules
d) To create embeddings

42.Examine application of BLEU score.

a) Sentiment analysis
b) Machine translation
c) Speech tagging
d) Syntax checking
43.Identify the problem of zeros in n-grams.
a) Negative probabilities
b) Unseen events get probability zero
c) Overflow in computation
d) Division by zero

44.Recall why generalization is needed.

a) To reduce file size
b) To avoid ambiguity
c) To assign probabilities to unseen words/sequences
d) To improve tokenization
45.Compare open vs closed vocabulary.
a) Both handle infinite words
b) Closed has fixed vocabulary, open allows unseen words
c) Open ignores OOV
d) Closed allows infinite

46.Describe the solution for unseen words.

a) Drop them
b) Introduce unknown (UNK) token
c) Ignore them
d) Encode them

47.Distinguish OOV problem from ambiguity.

a) OOV: unseen word; Ambiguity: multiple meanings
b) Both are same
c) OOV deals with multiple senses
d) Ambiguity deals with spelling errors

48.Explain why zero probabilities are harmful.

a) They improve speed
b) They make sentence probability zero
c) They reduce perplexity
d) They simplify models
49.Define smoothing.
a) Technique to handle zero probabilities
b) Removing stopwords
c) Lowercasing text
d) Tokenizing text

50.Identify a simple smoothing method.

a) Add-one (Laplace) smoothing
b) Regex
c) POS tagging
d) Parsing

51.Compare Laplace vs Good-Turing.

a) Both same
b) Good-Turing estimates probability of unseen events better
c) Laplace is advanced
d) Good-Turing ignores unseen events

52.Recall the problem with add-one smoothing.

a) Too fast
b) Overestimates unseen events
c) Ignores seen events
d) Reduces vocabulary

53.Describe backoff smoothing.

a) Uses lower-order n-grams when higher-order is unavailable
b) Ignores unseen words
c) Only uses unigrams
d) Normalizes text

54.Distinguish interpolation from backoff.

a) Both drop higher n-grams
b) Interpolation combines probabilities; Backoff falls back
c) Both are same
d) Backoff is faster
55.Define perplexity.
a) Random guessing
b) Measure of how well a model predicts test data
c) Grammar rule
d) Probability of sentence length

56.Identify relation between perplexity and entropy.

a) Perplexity = 2^(Entropy)
b) Entropy = Perplexity²
c) Both are unrelated
d) Perplexity = Entropy/2

57.Recall lower perplexity means.

a) Worse model
b) Better predictive model
c) Random model
d) No effect

58.Explain entropy in NLP.

a) Word embeddings
b) Average information content per word
c) Tokenization
d) Syntax rule

59.Distinguish perplexity from accuracy.

a) Accuracy is probabilistic, perplexity is binary
b) Accuracy measures correctness, perplexity measures uncertainty
c) Both same
d) Perplexity uses F1-score
60.Describe why perplexity is exponential.
a) To simplify
b) Because it is derived from entropy measured in bits
c) To normalize data
d) To reduce vocabulary

61.Define morphology in NLP.

a) Syntax analysis
b) Study of word structure and formation
c) Sentence meaning
d) Pragmatics

62.Identify the smallest unit of meaning.

a) Phoneme
b) Morpheme
c) Grapheme
d) Token

63.Classify “unhappiness” into morphemes.

a) un + happy + ness
b) unhappy + ness
c) un + happiness
d) happiness

64.Distinguish inflectional morphemes from derivational.

a) Both change meaning
b) Inflection changes tense/number; derivation changes category/meaning
c) Derivational changes tense only
d) Inflectional creates new words

65.Describe the type of morphology in English.

a) Agglutinative
b) Inflectional
c) Polysynthetic
d) Isolating

66.Recall example of an inflectional suffix.

a) un-
b) -ed
c) re-
d) mis-
67.Identify the word class of “quickly”.
a) Adjective
b) Adverb
c) Noun
d) Pronoun

68.Define open word classes.

a) Closed set of function words
b) Classes that accept new members (nouns, verbs, adjectives, adverbs)
c) Classes that never change
d) Prepositions only

69.Recall which is a closed class.

a) Verb
b) Preposition
c) Adjective
d) Adverb

70.Classify “the” in word class.

a) Verb
b) Determiner
c) Adjective
d) Pronoun

71.Distinguish noun vs pronoun.

a) Both are identical
b) Noun names things; pronoun replaces noun
c) Pronoun is descriptive
d) Noun is functional

72.Describe interjections.
a) Complex phrases
b) Exclamatory expressions (Oh!, Wow!)
c) Helping verbs
d) Closed class
73.Define POS tagging.
a) Tokenizing text
b) Assigning word classes to tokens
c) Removing stopwords
d) Normalizing text

76.Recall the Penn Treebank tag for plural noun.

a) NN
b) NNS
c) VB
d) JJ

77.Distinguish supervised from unsupervised tagging.

a) Both require labeled data
b) Supervised uses labeled corpora; unsupervised uses clustering
c) Unsupervised is faster always
d) Both use rules only

78.Examine application of POS tagging.

a) Speech synthesis
b) Parsing and information extraction
c) Image recognition
d) Sorting words
79.Define HMM.
a) Statistical model with hidden states and observed outputs
b) Rule-based grammar model
c) Embedding model
d) Parsing algorithm

80.Identify hidden states in POS tagging.

a) Words
b) POS tags
c) Sentences
d) Morphemes

81.Recall observable sequence in HMM tagging.

a) Words in a sentence
b) POS tags
c) Morphemes
d) Syntax tree

82.Describe transition probabilities.

a) Probability of tag given previous tag
b) Probability of word given tag
c) Probability of morpheme
d) Probability of sentence length

83.Distinguish emission vs transition.

a) Both same
b) Emission: word given tag; Transition: tag given previous tag
c) Transition is word-based
d) Emission ignores probabilities

84.Explain limitation of HMM in tagging.

a) Always accurate
b) Cannot handle long dependencies well
c) Ignores syntax
d) Uses neural networks

85.Define Viterbi algorithm.

a) Sorting method
b) Dynamic programming algorithm for most probable sequence
c) Neural embedding method
d) Parsing algorithm

86.Identify what Viterbi computes in POS tagging.

a) Lexicon
b) Best sequence of tags
c) Syntax tree
d) Lemmas

87.Recall Viterbi initialization step.

a) Probability = 1 for all tags
b) Start probabilities assigned to first word
c) Transition matrix only
d) Zero for all

88.Distinguish forward vs Viterbi algorithm.

a) Both same
b) Forward sums probabilities; Viterbi chooses maximum
c) Forward ignores states
d) Viterbi ignores probabilities

89.Describe backtracking in Viterbi.

a) Recovering best tag sequence
b) Building syntax tree
c) Tokenizing
d) Counting words

90.Examine time complexity of Viterbi.

a) O(n)
b) O(n × T²) (n = words, T = tags)
c) O(T^n)
d) O(1)
91.Define Named Entity Recognition (NER).
a) Identifying proper nouns like person, location, organization
b) Tokenization
c) POS tagging
d) Parsing

92.Identify the entity in “Google was founded in California”.

a) Founded
b) Google = Organization, California = Location
c) Organization only
d) Action word

93.Recall common NER categories.

a) Pronoun, Verb, Adjective
b) Person, Location, Organization, Date
c) Root, Stem, Affix
d) Syntax, Pragmatics

94.Distinguish NER from POS tagging.

a) Both same
b) NER detects named entities; POS tags word classes
c) POS is for parsing
d) NER ignores text

95.Describe BIO tagging scheme.

a) Bigram model
b) Begin-Inside-Outside notation for entities
c) Binary index operator
d) Bag-of-words

96.Explain application of NER.

a) Information extraction in text (e.g., news, resumes)
b) Syntax analysis
c) Tokenization
d) Lowercasing
97.Define CRFs.
a) Neural networks
b) Probabilistic sequence models discriminatively trained
c) Rule-based grammar
d) Embedding models

98.Identify difference between HMM and CRF.

a) Both are generative
b) HMM is generative; CRF is discriminative
c) Both discriminative
d) HMM ignores probabilities

99.Recall why CRFs are better for NER.

a) Faster
b) They capture overlapping, global features
c) Use fewer labels
d) No probabilities needed

100. Describe feature function in CRF.

a) Maps input sequence and label sequence to real values
b) Tokenizes text
c) Embeds words
d) Parses grammar

101. Distinguish linear-chain CRF.

a) Specialized for sequential data like text
b) Ignores sequence order
c) Used for parsing trees
d) Random clustering

102. Explain training challenge of CRF.

a) Easy optimization
b) High computational cost
c) Small data requirement
d) No labeling needed
103. Define the standard metrics for NER evaluation.
a) BLEU, Perplexity
b) Precision, Recall, F1-score
c) Accuracy only
d) Word error rate

104. Identify what precision measures.

a) Correct entities out of predicted entities
b) Correct entities out of total entities
c) Predicted entities out of all tokens
d) Errors in tagging

105. Recall recall formula.

a) TP / (TP+FP)
b) TP / (TP+FN)
c) FP / (TP+FN)
d) FN / (TP+FP)

106. Distinguish micro vs macro evaluation.

a) Micro averages over all instances; Macro averages over classes
b) Both same
c) Micro ignores recall
d) Macro ignores precision

107. Describe effect of high recall but low precision.

a) Few entities detected
b) Many false positives included
c) Many entities missed
d) Perfect accuracy

108. Explain CoNLL evaluation metric.

a) F1 score for entity-level evaluation
b) Word error rate
c) BLEU score
d) Entropy
109.

Model QP NLP DrChandiniAG
No ratings yet
Model QP NLP DrChandiniAG
4 pages
Listening Forecast Tháng 2 Quan Trong 4
100% (2)
Listening Forecast Tháng 2 Quan Trong 4
77 pages
BAI601 All Modules VTU 10 Mark Complete
No ratings yet
BAI601 All Modules VTU 10 Mark Complete
18 pages
VND Openxmlformats-Officedocument Wordprocessingml Document&rendition 1
No ratings yet
VND Openxmlformats-Officedocument Wordprocessingml Document&rendition 1
5 pages
NLP 2K22 DEC CS3EA06 - IT3EA06 Natural Language Processing
No ratings yet
NLP 2K22 DEC CS3EA06 - IT3EA06 Natural Language Processing
4 pages
MTE Practice Set
No ratings yet
MTE Practice Set
4 pages
NLP 2marks IAE 1 PDF
No ratings yet
NLP 2marks IAE 1 PDF
1 page
Quest NLP
No ratings yet
Quest NLP
13 pages
9783293-CLASS10 AI Worksheet PART B UNIT6 Natural Language Processing
No ratings yet
9783293-CLASS10 AI Worksheet PART B UNIT6 Natural Language Processing
3 pages
IT3EA06 Natural Language Processing
No ratings yet
IT3EA06 Natural Language Processing
3 pages
Lucas Paquetta Raw NLP
No ratings yet
Lucas Paquetta Raw NLP
12 pages
Comprehensive NLP Practice Assignment
No ratings yet
Comprehensive NLP Practice Assignment
2 pages
NLP 2K22 MAY CS3EA06 Natural Language Processing
No ratings yet
NLP 2K22 MAY CS3EA06 Natural Language Processing
2 pages
It3ea06 Natural Lanuage Processing
No ratings yet
It3ea06 Natural Lanuage Processing
4 pages
NLP Basics: Key Concepts and Processes
No ratings yet
NLP Basics: Key Concepts and Processes
15 pages
Viva Q&a
No ratings yet
Viva Q&a
5 pages
NLP Question
No ratings yet
NLP Question
4 pages
NLP QB
No ratings yet
NLP QB
5 pages
Important Questions and Answer NLP
No ratings yet
Important Questions and Answer NLP
10 pages
NLP Assignment
No ratings yet
NLP Assignment
8 pages
Question Bank
No ratings yet
Question Bank
3 pages
Computer 2
No ratings yet
Computer 2
13 pages
NLP Endsem 2016
No ratings yet
NLP Endsem 2016
2 pages
CM3060 NLP Mock Exam Oct2021
No ratings yet
CM3060 NLP Mock Exam Oct2021
4 pages
NLP Question Bank
No ratings yet
NLP Question Bank
7 pages
NLP Study Material
No ratings yet
NLP Study Material
8 pages
NLP 2K19 MAY CS3EA06-IT3EA06 Natural Language Processing
No ratings yet
NLP 2K19 MAY CS3EA06-IT3EA06 Natural Language Processing
3 pages
Question Bank - NLP
No ratings yet
Question Bank - NLP
3 pages
NLP Previous Sem
No ratings yet
NLP Previous Sem
5 pages
Module 1
No ratings yet
Module 1
5 pages
Practice Set NLP
No ratings yet
Practice Set NLP
5 pages
Eti 3111
No ratings yet
Eti 3111
28 pages
NLP QB
No ratings yet
NLP QB
4 pages
NLP Q&A1a Text Processing
No ratings yet
NLP Q&A1a Text Processing
16 pages
Qns
No ratings yet
Qns
6 pages
NLP Sample QB
No ratings yet
NLP Sample QB
12 pages
Ai Unit - 5
No ratings yet
Ai Unit - 5
12 pages
NLP Worksheet for Students
No ratings yet
NLP Worksheet for Students
10 pages
Artificial Intelligence Class X Unit 7: Natural Language Processing
No ratings yet
Artificial Intelligence Class X Unit 7: Natural Language Processing
10 pages
CM3060 Past Paper September 2024
No ratings yet
CM3060 Past Paper September 2024
5 pages
NLP Exam Guide for Students
No ratings yet
NLP Exam Guide for Students
8 pages
NLP Comprehensive Study Guide Pokhara University Fall 2025
No ratings yet
NLP Comprehensive Study Guide Pokhara University Fall 2025
50 pages
NLP New QB
No ratings yet
NLP New QB
3 pages
OneRead 20250903 1119
No ratings yet
OneRead 20250903 1119
4 pages
NLP Quiz Seg 1 To 4
No ratings yet
NLP Quiz Seg 1 To 4
9 pages
Mock Interview Question - NLP
No ratings yet
Mock Interview Question - NLP
3 pages
NLP Notes
No ratings yet
NLP Notes
3 pages
NLP 2
No ratings yet
NLP 2
45 pages
Long Answer Qs
No ratings yet
Long Answer Qs
2 pages
NLP Previous Sem-1-3
No ratings yet
NLP Previous Sem-1-3
3 pages
Bai601 Simp
No ratings yet
Bai601 Simp
4 pages
Unit-I QB
No ratings yet
Unit-I QB
5 pages
Ch-6 Natural Language Processing Q&A's
No ratings yet
Ch-6 Natural Language Processing Q&A's
8 pages
Ajaz Ahmad 101203540
No ratings yet
Ajaz Ahmad 101203540
7 pages
NLP Question Bank
No ratings yet
NLP Question Bank
10 pages
NLP Question Bank: Chapter-Wise Practice Problems With Solutions
No ratings yet
NLP Question Bank: Chapter-Wise Practice Problems With Solutions
45 pages
X - AI-NLP Worksheet
No ratings yet
X - AI-NLP Worksheet
2 pages
Lemmatization Is The Grouping Together of Different Forms of The Same Word. in Search
No ratings yet
Lemmatization Is The Grouping Together of Different Forms of The Same Word. in Search
11 pages
Authentication UMTS GSM
No ratings yet
Authentication UMTS GSM
8 pages
Legal Dispute on DOJ Order in Murder Case
No ratings yet
Legal Dispute on DOJ Order in Murder Case
14 pages
Appendix A 2
No ratings yet
Appendix A 2
7 pages
GeM Bidding Corr 7124359 3
No ratings yet
GeM Bidding Corr 7124359 3
3 pages
Lesson 6 Uts
No ratings yet
Lesson 6 Uts
53 pages
Israeli Special Forces Selection
100% (1)
Israeli Special Forces Selection
14 pages
Pulmonary Function Tests
No ratings yet
Pulmonary Function Tests
65 pages
Ucchista Ganapati Mantra Guide
No ratings yet
Ucchista Ganapati Mantra Guide
19 pages
Quitoy Feature
No ratings yet
Quitoy Feature
2 pages
Exercise (S-1) : Definitions of Kinematics Variables
No ratings yet
Exercise (S-1) : Definitions of Kinematics Variables
5 pages
The Holy Spirit in Christian Life
No ratings yet
The Holy Spirit in Christian Life
14 pages
Human Resource Management Thesis Philippines
100% (2)
Human Resource Management Thesis Philippines
6 pages
Construction and Standardization of Psychology Aptitude Test For Incoming College Psychology Students
No ratings yet
Construction and Standardization of Psychology Aptitude Test For Incoming College Psychology Students
7 pages
Van Buiten
No ratings yet
Van Buiten
8 pages
Earthquake Resistance Structure Experiment No. - : I-Objectives
No ratings yet
Earthquake Resistance Structure Experiment No. - : I-Objectives
2 pages
NURS 1112 Health Promotion Course Outline
No ratings yet
NURS 1112 Health Promotion Course Outline
7 pages
G8-Pilot Humility Class Insights
No ratings yet
G8-Pilot Humility Class Insights
4 pages
Sabrimalai Ayyappan
No ratings yet
Sabrimalai Ayyappan
2 pages
JK Tyre Industries LTD
No ratings yet
JK Tyre Industries LTD
15 pages
Myocardial Infarction
No ratings yet
Myocardial Infarction
37 pages
The 2019 2020 Tokushima Prefecture ALT Skill Development Conference
No ratings yet
The 2019 2020 Tokushima Prefecture ALT Skill Development Conference
9 pages
1 Amartya Sen
No ratings yet
1 Amartya Sen
5 pages
Gashadokuro
No ratings yet
Gashadokuro
1 page
Cambridge International AS & A Level: Global Perspectives and Research 9239/12
No ratings yet
Cambridge International AS & A Level: Global Perspectives and Research 9239/12
25 pages
Land Laws: Rights & Legislation
No ratings yet
Land Laws: Rights & Legislation
25 pages
SL - No 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46
No ratings yet
SL - No 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46
56 pages
Jose Rizal's Love Affairs
100% (1)
Jose Rizal's Love Affairs
31 pages
The Present Simple and Present Continuous in Engli Activities Promoting Classroom Dynamics Group Form - 94392
No ratings yet
The Present Simple and Present Continuous in Engli Activities Promoting Classroom Dynamics Group Form - 94392
2 pages
Architecture Books Overview 2016
No ratings yet
Architecture Books Overview 2016
8 pages

Interaction Between Computers and Human Language

Uploaded by

Interaction Between Computers and Human Language

Uploaded by

1.​ Define the main focus of Natural Language Processing.

2.​ Describe the two broad categories of NLP.​

3.​ Identify which component deals with sentence meaning.​

4.​ Classify the applications of NLP.​

5.​ Examine which is a subfield of NLP.​

6.​ Locate the earliest milestone in NLP history.​

8.​ Enumerate challenges of NLP.​

9.​ Identify the stage that checks grammar.​

10.​Distinguish between syntactic and semantic analysis.​

11.​Classify the biggest challenge in NLP.​

12.​Explain the role of pragmatics.​

13.​Define regular expression.​

14.​Identify which symbol matches zero or more repetitions.​

15.​Match the symbol with its use: “^”.​

16.​Recall the regex for matching digits.​

17.​Compare greedy vs non-greedy matching.​

18.​Examine practical use of regex.​

19.​Define text normalization.​

20.​Identify which is not part of normalization.​

22.​Distinguish stemming from lemmatization.​

24.​Explain why normalization is necessary.​

25.​Define minimum edit distance.​

26.​Identify the three operations in edit distance.​

27.​Recall the edit distance between “kitten” and “sitting”.​

28.​Describe the algorithm commonly used.​

29.​Distinguish Levenshtein distance from Hamming distance.​

30.​Explain application of edit distance.​

32.​Identify bigram model.​

33.​Recall unigram model assumption.​

34.​Compare trigram vs bigram.​

35.​Examine main problem of n-grams.​

36.​Classify the type of model n-grams belong to.​

38.​Describe held-out test data.​

39.​Recall the purpose of cross-validation.​

40.​Distinguish intrinsic vs extrinsic evaluation.​

41.​Explain why log probability is used.​

42.​Examine application of BLEU score.​

44.​Recall why generalization is needed.​

46.​Describe the solution for unseen words.​

47.​Distinguish OOV problem from ambiguity.​

48.​Explain why zero probabilities are harmful.​

50.​Identify a simple smoothing method.​

51.​Compare Laplace vs Good-Turing.​

52.​Recall the problem with add-one smoothing.​

53.​Describe backoff smoothing.​

54.​Distinguish interpolation from backoff.​

56.​Identify relation between perplexity and entropy.​

57.​Recall lower perplexity means.​

58.​Explain entropy in NLP.​

59.​Distinguish perplexity from accuracy.​

61.​Define morphology in NLP.​

62.​Identify the smallest unit of meaning.​

63.​Classify “unhappiness” into morphemes.​

64.​Distinguish inflectional morphemes from derivational.​

65.​Describe the type of morphology in English.​

66.​Recall example of an inflectional suffix.​

68.​Define open word classes.​

69.​Recall which is a closed class.​

70.​Classify “the” in word class.​

71.​Distinguish noun vs pronoun.​

74.​Identify the POS tag for “run” in “I will run fast”.​

76.​Recall the Penn Treebank tag for plural noun.​

77.​Distinguish supervised from unsupervised tagging.​

78.​Examine application of POS tagging.​

80.​Identify hidden states in POS tagging.​

81.​Recall observable sequence in HMM tagging.​

82.​Describe transition probabilities.​

83.​Distinguish emission vs transition.​

84.​Explain limitation of HMM in tagging.​

85.​Define Viterbi algorithm.​

86.​Identify what Viterbi computes in POS tagging.​

87.​Recall Viterbi initialization step.​

88.​Distinguish forward vs Viterbi algorithm.​

89.​Describe backtracking in Viterbi.​

90.​Examine time complexity of Viterbi.​

92.​Identify the entity in “Google was founded in California”.​

1. Define the main focus of Natural Language Processing.

2. Describe the two broad categories of NLP.

3. Identify which component deals with sentence meaning.

4. Classify the applications of NLP.

5. Examine which is a subfield of NLP.

6. Locate the earliest milestone in NLP history.

8. Enumerate challenges of NLP.

9. Identify the stage that checks grammar.

10.Distinguish between syntactic and semantic analysis.

11.Classify the biggest challenge in NLP.

12.Explain the role of pragmatics.

13.Define regular expression.

14.Identify which symbol matches zero or more repetitions.

15.Match the symbol with its use: “^”.

16.Recall the regex for matching digits.

17.Compare greedy vs non-greedy matching.

18.Examine practical use of regex.

19.Define text normalization.

20.Identify which is not part of normalization.

22.Distinguish stemming from lemmatization.

24.Explain why normalization is necessary.

25.Define minimum edit distance.

26.Identify the three operations in edit distance.

27.Recall the edit distance between “kitten” and “sitting”.

28.Describe the algorithm commonly used.

29.Distinguish Levenshtein distance from Hamming distance.

30.Explain application of edit distance.

32.Identify bigram model.

33.Recall unigram model assumption.

34.Compare trigram vs bigram.

35.Examine main problem of n-grams.

36.Classify the type of model n-grams belong to.

38.Describe held-out test data.

39.Recall the purpose of cross-validation.

40.Distinguish intrinsic vs extrinsic evaluation.

41.Explain why log probability is used.

42.Examine application of BLEU score.

44.Recall why generalization is needed.

46.Describe the solution for unseen words.

47.Distinguish OOV problem from ambiguity.

48.Explain why zero probabilities are harmful.

50.Identify a simple smoothing method.

51.Compare Laplace vs Good-Turing.

52.Recall the problem with add-one smoothing.

53.Describe backoff smoothing.

54.Distinguish interpolation from backoff.

56.Identify relation between perplexity and entropy.

57.Recall lower perplexity means.

58.Explain entropy in NLP.

59.Distinguish perplexity from accuracy.

61.Define morphology in NLP.

62.Identify the smallest unit of meaning.

63.Classify “unhappiness” into morphemes.

64.Distinguish inflectional morphemes from derivational.

65.Describe the type of morphology in English.

66.Recall example of an inflectional suffix.

68.Define open word classes.

69.Recall which is a closed class.

70.Classify “the” in word class.

71.Distinguish noun vs pronoun.

74.Identify the POS tag for “run” in “I will run fast”.

76.Recall the Penn Treebank tag for plural noun.

77.Distinguish supervised from unsupervised tagging.

78.Examine application of POS tagging.

80.Identify hidden states in POS tagging.

81.Recall observable sequence in HMM tagging.

82.Describe transition probabilities.

83.Distinguish emission vs transition.

84.Explain limitation of HMM in tagging.

85.Define Viterbi algorithm.

86.Identify what Viterbi computes in POS tagging.

87.Recall Viterbi initialization step.

88.Distinguish forward vs Viterbi algorithm.

89.Describe backtracking in Viterbi.

90.Examine time complexity of Viterbi.

92.Identify the entity in “Google was founded in California”.

93.Recall common NER categories.

94.Distinguish NER from POS tagging.

95.Describe BIO tagging scheme.

96.Explain application of NER.