EXP 1:
PROGRAM:
import re
text = "The quick brown fox jumps over the lazy dog"
pattern = r'\b[a-zA-Z]*[qQ][a-zA-Z]*\b' # Words containing the letter 'q' or 'Q'
matches = re.findall(pattern, text)
print(matches) # Output: ['quick']
OUTPUT:
['quick']
PROGRAM:
import re
text = "The quick brown fox"
tokens = re.split(r'\s+', text)
print(tokens) # Output: ['The', 'quick', 'brown', 'fox']
OUTPUT:
['The', 'quick', 'brown', 'fox']
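Note: the [qQ] character class can equivalently be replaced by a case-insensitive flag. A minimal sketch using the standard re API (not part of the original experiment):

import re
text = "The quick brown fox jumps over the lazy dog"
# re.IGNORECASE lets a lowercase-only pattern match both 'q' and 'Q'
pattern = r'\b[a-z]*q[a-z]*\b'
print(re.findall(pattern, text, flags=re.IGNORECASE))  # ['quick']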
EXP 2:
PROGRAM:
import re
from collections import Counter
text = """
Natural Language Processing (NLP) is a fascinating field of Artificial Intelligence.
It helps computers understand, interpret, and generate human language.
With NLP, we can build chatbots, translators, sentiment analyzers, and more.
"""
# 1. Tokenization (simple split using regex)
# --------------------------
tokens = re.findall(r'\b\w+\b', text.lower())
# --------------------------
# 2. Search in Text
# --------------------------
search_word = "nlp"
occurrences = [i for i, token in enumerate(tokens) if token == search_word]
print(f"Positions of '{search_word}':", occurrences)
# --------------------------
# 3. Vocabulary Size
# --------------------------
vocab = set(tokens)
print("Vocabulary Size:", len(vocab))
print("Vocabulary:", sorted(vocab))
# --------------------------
# 4. Frequency Distribution
# --------------------------
fdist = Counter(tokens)
print("\nTop 5 Frequent Words:")
for word, freq in fdist.most_common(5):
    print(word, "->", freq)
# --------------------------
# 5. Bigrams
# --------------------------
bigrams_list = [(tokens[i], tokens[i+1]) for i in range(len(tokens)-1)]
print("\nBigrams List:", bigrams_list)
OUTPUT:
Positions of 'nlp': [3, 21]
Vocabulary Size: 28
Vocabulary: ['a', 'analyzers', 'and', 'artificial', 'build', 'can', 'chatbots', 'computers', 'fascinating',
'field', 'generate', 'helps', 'human', 'intelligence', 'interpret', 'is', 'it', 'language', 'more', 'natural',
'nlp', 'of', 'processing', 'sentiment', 'translators', 'understand', 'we', 'with']
Top 5 Frequent Words:
language -> 2
nlp -> 2
and -> 2
natural -> 1
processing -> 1
Bigrams List: [('natural', 'language'), ('language', 'processing'), ('processing', 'nlp'), ('nlp', 'is'), ('is',
'a'), ('a', 'fascinating'), ('fascinating', 'field'), ('field', 'of'), ('of', 'artificial'), ('artificial',
'intelligence'), ('intelligence', 'it'), ('it', 'helps'), ('helps', 'computers'), ('computers', 'understand'),
('understand', 'interpret'), ('interpret', 'and'), ('and', 'generate'), ('generate', 'human'), ('human',
'language'), ('language', 'with'), ('with', 'nlp'), ('nlp', 'we'), ('we', 'can'), ('can', 'build'), ('build',
'chatbots'), ('chatbots', 'translators'), ('translators', 'sentiment'), ('sentiment', 'analyzers'),
('analyzers', 'and'), ('and', 'more')]
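Note: if NLTK is installed, the manual list comprehension for bigrams can be replaced by nltk.bigrams, and bigram frequencies counted with the same Counter pattern used for words. A minimal sketch under that assumption:

import nltk
from collections import Counter
tokens = ['natural', 'language', 'processing', 'nlp', 'is', 'a']  # sample tokens
bigrams_list = list(nltk.bigrams(tokens))   # same pairs as the list comprehension above
bigram_freq = Counter(bigrams_list)         # frequency of each adjacent pair
print(bigram_freq.most_common(3))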
EXP 3:
PROGRAM:
import nltk
from nltk.corpus import gutenberg, brown, wordnet
# Download corpora if not already downloaded
nltk.download('gutenberg')
nltk.download('brown')
nltk.download('wordnet')
# Accessing the Gutenberg Corpus
print("=== Gutenberg Corpus ===")
print("Available files:", gutenberg.fileids())
emma_text = gutenberg.raw('austen-emma.txt')[:500] # Extracting first 500 characters
print("Sample text from 'Emma' by Jane Austen:")
print(emma_text)
# Accessing the Brown Corpus
print("\n=== Brown Corpus ===")
print("Available categories (genres):", brown.categories())
news_text = brown.raw(categories='news')[:500] # Extracting first 500 characters
print("Sample text from the 'news' category:")
print(news_text)
# Accessing the WordNet Corpus
print("\n=== WordNet Corpus ===")
car_synsets = wordnet.synsets('car') # Synsets for the word 'car'
print("Synsets for the word 'car':", car_synsets)
print("Definitions of the synsets:")
for synset in car_synsets:
    print("-", synset.definition())
OUTPUT:
=== Gutenberg Corpus ===
Available files: ['austen-emma.txt', 'austen-persuasion.txt', 'austen-sense.txt', 'bible-kjv.txt',
'blake-poems.txt', 'bryant-stories.txt', 'burgess-busterbrown.txt', 'carroll-alice.txt',
'chesterton-ball.txt', 'chesterton-brown.txt', 'chesterton-thursday.txt', 'edgeworth-parents.txt',
'melville-moby_dick.txt', 'milton-paradise.txt', 'shakespeare-caesar.txt', 'shakespeare-hamlet.txt',
'shakespeare-macbeth.txt', 'whitman-leaves.txt']
Sample text from 'Emma' by Jane Austen:
[Emma by Jane Austen 1816]
VOLUME I
CHAPTER I
Emma Woodhouse, handsome, clever, and rich, with a comfortable home
and happy disposition, seemed to unite some of the best blessings
of existence; and had lived nearly twenty-one years in the world
with very little to distress or vex her.
She was the youngest of the two daughters of a most affectionate,
indulgent father; and had, in consequence of her sister's marriage,
been mistress of his house from a very early period. Her mother
had died t
=== Brown Corpus ===
Available categories (genres): ['adventure', 'belles_lettres', 'editorial', 'fiction', 'government',
'hobbies', 'humor', 'learned', 'lore', 'mystery', 'news', 'religion', 'reviews', 'romance',
'science_fiction']
Sample text from the 'news' category:
The/at Fulton/np-tl County/nn-tl Grand/jj-tl Jury/nn-tl said/vbd Friday/nr an/at
investigation/nn of/in Atlanta's/np$ recent/jj primary/nn election/nn produced/vbd ``/`` no/at
evidence/nn ''/'' that/cs any/dti irregularities/nns took/vbd place/nn ./.
The/at jury/nn further/rbr said/vbd in/in term-end/nn presentments/nns that/cs the/at
City/nn-tl Executive/jj-tl Committee/nn-tl ,/, which/wdt had/hvd over-all/jj charge/nn of/in the/at
election/nn ,/, ``/`` deserves/vbz the/at praise/nn and/c
=== WordNet Corpus ===
Synsets for the word 'car': [Synset('car.n.01'), Synset('car.n.02'), Synset('car.n.03'),
Synset('car.n.04'), Synset('cable_car.n.01')]
Definitions of the synsets:
- a motor vehicle with four wheels; usually propelled by an internal combustion engine
- a wheeled vehicle adapted to the rails of railroad
- the compartment that is suspended from an airship and that carries personnel and the cargo and
the power plant
- where passengers ride up and down
- a conveyance for passengers or freight on a cable railway
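Note: each WordNet synset also exposes lemmas (synonyms) and hypernyms (more general concepts), a natural follow-up to listing definitions. A small sketch using the same corpus reader:

from nltk.corpus import wordnet
car = wordnet.synsets('car')[0]   # Synset('car.n.01'), the motor-vehicle sense
print(car.lemma_names())          # synonym lemmas for this sense, e.g. 'car', 'auto'
print(car.hypernyms())            # parent concepts, e.g. Synset('motor_vehicle.n.01')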
EXP 4:
PROGRAM:
import nltk
from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize
from nltk.probability import FreqDist
# Download required resources
nltk.download('punkt') # Tokenizer
nltk.download('punkt_tab') # Tokenizer tables required by newer NLTK versions
nltk.download('stopwords') # Stop words
def most_frequent_words(text):
    # Tokenize the text
    tokens = word_tokenize(text)
    # Filter out stop words and non-alphabetic tokens
    stop_words = set(stopwords.words('english'))
    filtered_tokens = [word for word in tokens
                       if word.lower() not in stop_words and word.isalpha()]
    # Calculate frequency distribution
    fdist = FreqDist(filtered_tokens)
    # Get the 50 most frequent words
    most_frequent = fdist.most_common(50)
    return most_frequent
# Example usage
text = """This is a sample text. It contains some words that will be counted.
The words in this text will be analyzed to find the most frequent ones.
This text is here to demonstrate frequency counting in NLP using NLTK."""
result = most_frequent_words(text)
print("50 most frequently occurring words (excluding stop words):")
print(result)
OUTPUT:
[nltk_data] Downloading package punkt to /root/nltk_data...
[nltk_data] Package punkt is already up-to-date!
[nltk_data] Downloading package punkt_tab to /root/nltk_data...
50 most frequently occurring words (excluding stop words):
[('text', 3), ('words', 2), ('sample', 1), ('contains', 1), ('counted', 1), ('analyzed', 1), ('find', 1),
('frequent', 1), ('ones', 1), ('demonstrate', 1), ('frequency', 1), ('counting', 1), ('NLP', 1), ('using',
1), ('NLTK', 1)]
[nltk_data] Unzipping tokenizers/punkt_tab.zip.
[nltk_data] Downloading package stopwords to /root/nltk_data...
[nltk_data] Package stopwords is already up-to-date!
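Note: the sample text is too short for the 50-word limit to matter; the same function can be pointed at a larger corpus such as a Gutenberg text from EXP 3. A sketch assuming the gutenberg corpus has been downloaded:

import nltk
from nltk.corpus import gutenberg
nltk.download('gutenberg')                 # as in EXP 3
emma = gutenberg.raw('austen-emma.txt')
top_words = most_frequent_words(emma)      # reuses the function defined above
print(top_words[:10])                      # first 10 of the 50 most frequent content words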
EXP 5:
PROGRAM:
import nltk
from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize
import gensim
from gensim.models import Word2Vec
# Download required NLTK resources
nltk.download('punkt')
nltk.download('punkt_tab') # Tokenizer tables required by newer NLTK versions
nltk.download('stopwords')
# -------------------------
# 1. Sample corpus
# -------------------------
corpus = """
Natural Language Processing (NLP) is a field of Artificial Intelligence (AI).
It helps computers understand, interpret, and generate human language.
Word2Vec is a popular algorithm for word embeddings.
It represents words in vector space, capturing semantic meaning.
"""
# -------------------------
# 2. Preprocessing
# -------------------------
stop_words = set(stopwords.words('english'))
# Tokenize and clean text
tokens = word_tokenize(corpus.lower())
tokens = [word for word in tokens if word.isalpha() and word not in stop_words]
# Word2Vec expects a list of sentences (list of list of tokens)
sentences = [tokens]
# -------------------------
# 3. Train Word2Vec model
# -------------------------
model = Word2Vec(sentences, vector_size=50, window=3, min_count=1, sg=1)
# vector_size: dimension of vectors
# window: context size
# min_count: ignores words with total frequency below this value (1 keeps all words)
# sg=1: skip-gram model (sg=0 for CBOW)
# -------------------------
# 4. Explore the model
# -------------------------
print("\nVector for 'language':\n", model.wv['language'])
print("\nMost similar words to 'language':\n", model.wv.most_similar('language'))
# Save model (optional)
model.save("word2vec_model.model")
OUTPUT:
Most similar words to 'language':
[('embeddings', 0.2707250714302063), ('human', 0.21214450895786285), ('capturing',
0.18699145317077637), ('understand', 0.16841934621334076), ('space', 0.16121222078800201),
('semantic', 0.15025976300239563), ('intelligence', 0.1321220099925995), ('interpret',
0.12795543670654297), ('helps', 0.10077444463968277), ('words', 0.07131274044513702)]
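Note: besides most_similar, the trained model can score the similarity of any two in-vocabulary words, and the saved file can be reloaded with Word2Vec.load. A minimal sketch using the same model object:

# Cosine similarity between two word vectors (roughly -1 to 1)
print(model.wv.similarity('language', 'human'))
# Reload the model saved above
from gensim.models import Word2Vec
loaded = Word2Vec.load("word2vec_model.model")
print(loaded.wv['language'][:5])   # first 5 of the 50 vector dimensions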
EXP 6:
PROGRAM:
# Simple Rule-Based Chatbot
def chatbot_response(user_input):
    user_input = user_input.lower()
    # Rule-based responses
    if "hello" in user_input or "hi" in user_input:
        return "Hello! How can I help you today?"
    elif "how are you" in user_input:
        return "I'm just a bot, but I'm doing great! How about you?"
    elif "your name" in user_input:
        return "I am ChatBot 1.0, your friendly assistant."
    elif "weather" in user_input:
        return "I can't check the weather right now, but it looks sunny in my world."
    elif "bye" in user_input or "goodbye" in user_input:
        return "Goodbye! Have a nice day."
    else:
        return "I'm not sure how to respond to that. Could you rephrase?"
# Main loop
print("ChatBot 1.0: Hello! Type 'bye' to end the chat.")
while True:
    user_text = input("You: ")
    response = chatbot_response(user_text)
    print("Bot:", response)
    if "bye" in user_text.lower():
        break
OUTPUT:
ChatBot 1.0: Hello! Type 'bye' to end the chat.
You: HI HOW CAN I HELP YOU
Bot: Hello! How can I help you today?
You: HELO
Bot: I'm not sure how to respond to that. Could you rephrase?
You: I DON'T UNDERSAND THIS
Bot: Hello! How can I help you today?
You: THANK U
Bot: I'm not sure how to respond to that. Could you rephrase?
You: GOOD BYE
Bot: Goodbye! Have a nice day.
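Note: the transcript shows a pitfall of substring matching: "I DON'T UNDERSAND THIS" triggers the greeting because "hi" occurs inside "this". Matching on word boundaries avoids this; a minimal sketch of the adjusted rule (not part of the original program):

import re
def is_greeting(user_input):
    # \b makes "hi" match only as a whole word, not inside "this"
    return re.search(r'\b(hello|hi)\b', user_input.lower()) is not None
print(is_greeting("HI HOW CAN I HELP YOU"))    # True
print(is_greeting("I DON'T UNDERSTAND THIS"))  # False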
EXP 7:
PROGRAM:
# Install required libraries
!pip install gTTS speechrecognition pydub jiwer
from gtts import gTTS
import speech_recognition as sr
from jiwer import wer
import os
# --------------------
# Step 1: Text to Speech
# --------------------
text = "Hello, welcome to the NLP Lab. We are testing text to speech
accuracy."
tts = gTTS(text=text, lang='en')
tts.save("output.mp3")
print("✅ Audio file saved as 'output.mp3'")
# --------------------
# Step 2: Speech to Text (recognition)
# --------------------
recognizer = sr.Recognizer()
# Convert MP3 to WAV for recognition
from pydub import AudioSegment
sound = AudioSegment.from_mp3("output.mp3")
sound.export("output.wav", format="wav")
# Load audio file
with sr.AudioFile("output.wav") as source:
    audio_data = recognizer.record(source)
try:
    recognized_text = recognizer.recognize_google(audio_data)
    print("🔹 Recognized Text:", recognized_text)
except sr.UnknownValueError:
    print("Speech Recognition could not understand the audio.")
    recognized_text = ""
except sr.RequestError as e:
    print(f"Could not request results; {e}")
    recognized_text = ""
# --------------------
# Step 3: Accuracy Calculation
# --------------------
error_rate = wer(text.lower(), recognized_text.lower())
accuracy = (1 - error_rate) * 100
print(f"📊 Word Error Rate (WER): {error_rate:.2f}")
print(f"✅ Accuracy: {accuracy:.2f}%")
OUTPUT:
Collecting gTTS
  Downloading gTTS-2.5.4-py3-none-any.whl.metadata (4.1 kB)
Collecting speechrecognition
  Downloading speechrecognition-3.14.3-py3-none-any.whl.metadata (30 kB)
Requirement already satisfied: pydub in /usr/local/lib/python3.11/dist-packages (0.25.1)
Collecting jiwer
  Downloading jiwer-4.0.0-py3-none-any.whl.metadata (3.3 kB)
Requirement already satisfied: requests<3,>=2.27 in /usr/local/lib/python3.11/dist-packages (from gTTS) (2.32.3)
Collecting click<8.2,>=7.1 (from gTTS)
  Downloading click-8.1.8-py3-none-any.whl.metadata (2.3 kB)
Requirement already satisfied: typing-extensions in /usr/local/lib/python3.11/dist-packages (from speechrecognition) (4.14.1)
Collecting rapidfuzz>=3.9.7 (from jiwer)
  Downloading rapidfuzz-3.13.0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (12 kB)
Requirement already satisfied: charset-normalizer<4,>=2 in /usr/local/lib/python3.11/dist-packages (from requests<3,>=2.27->gTTS) (3.4.2)
Requirement already satisfied: idna<4,>=2.5 in /usr/local/lib/python3.11/dist-packages (from requests<3,>=2.27->gTTS) (3.10)
Requirement already satisfied: urllib3<3,>=1.21.1 in /usr/local/lib/python3.11/dist-packages (from requests<3,>=2.27->gTTS) (2.5.0)
Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.11/dist-packages (from requests<3,>=2.27->gTTS) (2025.8.3)
Downloading gTTS-2.5.4-py3-none-any.whl (29 kB)
Downloading speechrecognition-3.14.3-py3-none-any.whl (32.9 MB)
  32.9/32.9 MB 27.8 MB/s eta 0:00:00
Downloading jiwer-4.0.0-py3-none-any.whl (23 kB)
Downloading click-8.1.8-py3-none-any.whl (98 kB)
  98.2/98.2 kB 6.4 MB/s eta 0:00:00
Downloading rapidfuzz-3.13.0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.1 MB)
  3.1/3.1 MB 83.2 MB/s eta 0:00:00
Installing collected packages: speechrecognition, rapidfuzz, click, jiwer, gTTS
  Attempting uninstall: click
    Found existing installation: click 8.2.1
    Uninstalling click-8.2.1:
      Successfully uninstalled click-8.2.1
Successfully installed click-8.1.8 gTTS-2.5.4 jiwer-4.0.0 rapidfuzz-3.13.0 speechrecognition-3.14.3
✅ Audio file saved as 'output.mp3'
🔹 Recognized Text: hello welcome to the NLP lab we are testing text to speech accuracy
📊 Word Error Rate (WER): 0.23
✅ Accuracy: 76.92%
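Note: jiwer computes the word error rate as WER = (S + D + I) / N, where S, D, and I are word substitutions, deletions, and insertions against the N reference words. Here the reference has 13 words and the recognized text differs only in punctuation ("Hello," vs "hello", "Lab." vs "lab", "accuracy." vs "accuracy"), which jiwer's default word comparison counts as 3 substitutions: 3 / 13 ≈ 0.23, matching the printed accuracy of 76.92%.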