Vidyavardhini’s College of Engineering & Technology
Department of Computer Engineering
Academic Year : 2025-26
Assignment - 2
Course Code/Name: CSDC7013 Natural Language Processing Year/Sem: BE/ VII
Date: Submission Date:
Course Outcome:
CSDC7013.2: To apply Morphological analysis approaches on the given data.
Q. Question Bloom
No Level
.
1. You are developing a preprocessing pipeline for a document Apply
classification system used in analyzing editorials from multiple sources.
The task requires effective word normalization, while also considering
the linguistic accuracy of transformations.
Statements:
Batch A:"The researchers were analyzing the effectiveness of different
algorithms in predicting outcomes accurately."
Batch B:"Citizens protested against increasing taxes, arguing that the
decision was unfair and unreasonable."
Batch C:"Journalists have been reporting various developments related
to economic reforms and political strategies."
1. Apply the Porter Stemmer and Lemmatizer with POS tags to
the Statement given above Batchwise.
2. Prepare a table comparing the original words, stemmed words,
and lemmatized words.
3. Identify 3 words where stemming leads to incorrect or
ambiguous results, and explain why it could be problematic.
4. Analyze and illustrate technique (stemming or lemmatization)
would be better suited for your NLP pipeline, justifying your
answer with linguistic and functional reasoning.