0% found this document useful (0 votes)

31 views5 pages

Python Lab: Levenshtein Distance

This document outlines three tasks for a CS370 Artificial Intelligence lab assignment on calculating Levenshtein distance between strings using Python. The first task involves writing a program to calculate the edit distance between two input strings. The second task modifies this to take two text files as input and output the word-level distance between sentences. The third task further modifies this to ignore common words when calculating the distance. Students are to submit their Python programs to calculate Levenshtein distance between strings and text files in various ways.

Uploaded by

mahadm.bscs21seecs

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views5 pages

Python Lab: Levenshtein Distance

Uploaded by

mahadm.bscs21seecs

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

Department of Computing

CS370: Artificial Intelligence

Class: BSCS-11C
Lab 01: Introduction to Python

Date: 15-09-2023

Lab Engineer: Ms Shakeela

Mahad Mohtashim
379889
BSCS-11-C

Page 1
Task #1

Write down a python program which takes two strings as input and calculate the
Levenshtein/Edit distance between the two strings.

Explanation:-

Levenshtein/Edit distance gives us a measure of similarity between two strings/sequences.

Going by formal definition it is minimum number of single character edits required to transform
one string into another.

Single character edits include:-

 Insertion
 Deletion
 Substitution

Mathematically:-

Mathematically Levenshtein/Edit distance between two strings ‘a’ and ‘b’ is defined as:-

For further understanding of the formula you may read this blog as it explains it in great depth
or you may get back to me wherever/whenever you stuck.

https://medium.com/@ethannam/understanding-the-levenshtein-distance-equation-for-beginners-
c4285a5604f0

Page 2
But it does not explain how to count the edit operations while calculating overall Levenshtein
distance.

The output of your program should somewhat look like:-

Task #2

Now modify the above written program in such a way that it takes two text files containing
single- line and lowercase English sentences named as reference.txt and hypothesis.txt, and
outputs the file result.txt containing Levenshtein distance of these two files as below. The
distance should be word level and not character level.

Page 3
**********reference.txt***************

this is some text and we would like to see if it has been identified correctly by speech recognition system

***************************************

**********hypothesis.txt*************

this is a text and we would like to check what has been identified by the speech recognition

***************************************

*********result.txt*******************

Levenshtein distance is 7

Insertions 1

Deletions 3

Substitutions 3

***************************************

Hint:-

In this case we can treat words as characters in previous case, right?

Task #3

Now modify the above program so that it ignores 10 common words in such a way:-

 Insertions and deletions involving these common words are ignored

 Substitutions are ignored when both initial and final word are one of 10 common words

List of 10 common words:

the, of, and, a, be, this, there, an, been, some

Now the result2.txt should look like :-

Page 4
*********result2.txt*******************

Levenshtein distance is 5

Insertions 0

Deletions 3

Substitutions 2

***************************************

Submission Guidelines:-

Deliverables and Deadline:

Please add as per your convenience

Page 5

CS601 Assignment 1 Solution
100% (1)
CS601 Assignment 1 Solution
3 pages
d2161r5-ATAATAPI Command Set - 3 PDF
No ratings yet
d2161r5-ATAATAPI Command Set - 3 PDF
577 pages
19 - Xcode Build (Signed)
No ratings yet
19 - Xcode Build (Signed)
3,540 pages
Installing and Configuring FreeNAS
No ratings yet
Installing and Configuring FreeNAS
29 pages
Problem Set 5 Instructions
No ratings yet
Problem Set 5 Instructions
8 pages
Vue.js Guide for Developers
100% (7)
Vue.js Guide for Developers
19 pages
Levenshtein Distance Explained
No ratings yet
Levenshtein Distance Explained
3 pages
Python File Handling Tasks
100% (1)
Python File Handling Tasks
14 pages
Practical File
No ratings yet
Practical File
49 pages
NLP Similarity Distance Metrics
No ratings yet
NLP Similarity Distance Metrics
16 pages
Python File Handling Exercises
No ratings yet
Python File Handling Exercises
2 pages
Class XII Computer Science Exam
No ratings yet
Class XII Computer Science Exam
3 pages
Assignments 4 FILE in Python Programming BCC402
No ratings yet
Assignments 4 FILE in Python Programming BCC402
6 pages
Hardware Features of The Cisco ASR 1001-X Router
No ratings yet
Hardware Features of The Cisco ASR 1001-X Router
8 pages
Damerau-Levenshtein Algorithm and Bayes Theorem For Spell Checker Optimization
No ratings yet
Damerau-Levenshtein Algorithm and Bayes Theorem For Spell Checker Optimization
6 pages
File Handling
No ratings yet
File Handling
42 pages
Levenshtein Algorithm 1 PDF
No ratings yet
Levenshtein Algorithm 1 PDF
10 pages
Worksheet 5 CS
No ratings yet
Worksheet 5 CS
2 pages
Problem Set 3: Document Distance: Pset Buddy
No ratings yet
Problem Set 3: Document Distance: Pset Buddy
7 pages
Files: For Multiple-Choice and Essay Questions
No ratings yet
Files: For Multiple-Choice and Essay Questions
6 pages
PDF Scanned & Optical Character Recognition (OCR)
No ratings yet
PDF Scanned & Optical Character Recognition (OCR)
47 pages
Trace
No ratings yet
Trace
34 pages
Class 12 Cs Final Prac
No ratings yet
Class 12 Cs Final Prac
68 pages
Duo Lingo
0% (1)
Duo Lingo
24 pages
Oracle 11GR2 High Availability Guide
No ratings yet
Oracle 11GR2 High Availability Guide
19 pages
SQL Server DBA Professional Profile
No ratings yet
SQL Server DBA Professional Profile
4 pages
3G Wireless Technology Overview
0% (1)
3G Wireless Technology Overview
28 pages
File Handling Worksheet
No ratings yet
File Handling Worksheet
5 pages
File Handing Practical
No ratings yet
File Handing Practical
18 pages
Text File Practice Questions
No ratings yet
Text File Practice Questions
3 pages
Network Upgrade Status Report
No ratings yet
Network Upgrade Status Report
3 pages
Introduction To Algorithms Lecture Notes (MIT 6 - 006) - It-eBooks - It-eBooks-2017, 2017 - IBooker It-eBooks - Anna's Archive
No ratings yet
Introduction To Algorithms Lecture Notes (MIT 6 - 006) - It-eBooks - It-eBooks-2017, 2017 - IBooker It-eBooks - Anna's Archive
150 pages
Information Security Transformation-Nahil Mahmood-Lecture 7
No ratings yet
Information Security Transformation-Nahil Mahmood-Lecture 7
5 pages
File Handling
No ratings yet
File Handling
23 pages
Lecture 10
No ratings yet
Lecture 10
7 pages
Ch-5 - File Handling
No ratings yet
Ch-5 - File Handling
15 pages
Syserr
No ratings yet
Syserr
2 pages
Google Dorks
No ratings yet
Google Dorks
3 pages
Assignment Textfile 20230525210733459 22052024 083944
No ratings yet
Assignment Textfile 20230525210733459 22052024 083944
6 pages
CS Practical File
No ratings yet
CS Practical File
47 pages
Lab 04 B
No ratings yet
Lab 04 B
2 pages
Practical File by Aksh Jaiswal
No ratings yet
Practical File by Aksh Jaiswal
48 pages
12 Practical Python Part-1
No ratings yet
12 Practical Python Part-1
11 pages
BioInfor Assignment
No ratings yet
BioInfor Assignment
4 pages
Cours de Bi Heig-Vd 9
No ratings yet
Cours de Bi Heig-Vd 9
19 pages
File Handling-Text Files
No ratings yet
File Handling-Text Files
4 pages
Tamil Nadu - Esanjeevani National Summit
No ratings yet
Tamil Nadu - Esanjeevani National Summit
23 pages
F 33813024
No ratings yet
F 33813024
3 pages
Help Line No: 18003455384 (Toll Free) : State Name District Name Block/Municipality Municipality Name Ward No. Select by
No ratings yet
Help Line No: 18003455384 (Toll Free) : State Name District Name Block/Municipality Municipality Name Ward No. Select by
1 page
File Handling Questions 2
No ratings yet
File Handling Questions 2
4 pages
Homework 4-1
No ratings yet
Homework 4-1
4 pages
12 Practical - Python
No ratings yet
12 Practical - Python
11 pages
PPL Experiment No-8
No ratings yet
PPL Experiment No-8
7 pages
Xii Comp Practical Journal
No ratings yet
Xii Comp Practical Journal
45 pages
Change Log
No ratings yet
Change Log
69 pages
Data File Handling - Worksheet 1 - 3 Marks
No ratings yet
Data File Handling - Worksheet 1 - 3 Marks
6 pages
For Video Explanation of This Topic, Please Click On The Following Link
No ratings yet
For Video Explanation of This Topic, Please Click On The Following Link
8 pages
AISSAT M2PRIME BROSUR-compressed
No ratings yet
AISSAT M2PRIME BROSUR-compressed
3 pages
Customer Support (Resume)
No ratings yet
Customer Support (Resume)
2 pages
CL 12 Worksheet 2 Programs On File Handling
No ratings yet
CL 12 Worksheet 2 Programs On File Handling
4 pages
Anshika's Project Do Not Touch!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
No ratings yet
Anshika's Project Do Not Touch!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
15 pages
Class Xii Text Files Assignment 1
No ratings yet
Class Xii Text Files Assignment 1
2 pages
Text Files Workbook
No ratings yet
Text Files Workbook
8 pages
Etliolu2019 Chapter AStudyAboutAf 241125 063601
No ratings yet
Etliolu2019 Chapter AStudyAboutAf 241125 063601
18 pages
Python Programming Laboratory
No ratings yet
Python Programming Laboratory
35 pages
Edit Distance
No ratings yet
Edit Distance
5 pages
Group 12 PPT Software Programing & Development
No ratings yet
Group 12 PPT Software Programing & Development
24 pages
2-Zkbio Cvsecurity Mobile App Zkbio Cvconnect Platform 20240109
No ratings yet
2-Zkbio Cvsecurity Mobile App Zkbio Cvconnect Platform 20240109
16 pages
PLC Extraexp-1 - 12
No ratings yet
PLC Extraexp-1 - 12
6 pages
ProductPrice Oracle資料庫百寶箱
No ratings yet
ProductPrice Oracle資料庫百寶箱
8 pages
SnapMirror ActiveSync
No ratings yet
SnapMirror ActiveSync
2 pages
12 B2 QP (70) Answer Key
No ratings yet
12 B2 QP (70) Answer Key
6 pages
1.home Work-Text Files
No ratings yet
1.home Work-Text Files
1 page
Xii Cs Text File Ws
No ratings yet
Xii Cs Text File Ws
5 pages
Data File Handling-Text File
No ratings yet
Data File Handling-Text File
1 page
Class Xii Text File Handling Assignment
No ratings yet
Class Xii Text File Handling Assignment
3 pages
Quizzes and Explanations Merged Full
No ratings yet
Quizzes and Explanations Merged Full
33 pages
CH 4 - File Handling Material For Board Exam
No ratings yet
CH 4 - File Handling Material For Board Exam
2 pages
Mostafa Ali Ismail Morsy Original
No ratings yet
Mostafa Ali Ismail Morsy Original
3 pages
File Handling - Questions
No ratings yet
File Handling - Questions
7 pages
05 02 2025 - 185534 Text File Handling Programs
No ratings yet
05 02 2025 - 185534 Text File Handling Programs
3 pages
File Handling in Python Text File
No ratings yet
File Handling in Python Text File
10 pages
Text Files Programs III
No ratings yet
Text Files Programs III
11 pages
Practical Document From Ravita Pathak 12 CS
No ratings yet
Practical Document From Ravita Pathak 12 CS
66 pages
Text Files Programs II
No ratings yet
Text Files Programs II
6 pages
Measure Distance Between 2 Words by Simple Calculation
No ratings yet
Measure Distance Between 2 Words by Simple Calculation
7 pages
MA Spring 2025 Assignment 1
No ratings yet
MA Spring 2025 Assignment 1
6 pages

Python Lab: Levenshtein Distance

Uploaded by

Python Lab: Levenshtein Distance

Uploaded by

Department of Computing

CS370: Artificial Intelligence

Lab Engineer: Ms Shakeela

Levenshtein/Edit distance gives us a measure of similarity between two strings/sequences.

Single character edits include:-

The output of your program should somewhat look like:-

In this case we can treat words as characters in previous case, right?

 Insertions and deletions involving these common words are ignored

List of 10 common words:

the, of, and, a, be, this, there, an, been, some

Now the result2.txt should look like :-

Deliverables and Deadline:

Please add as per your convenience

You might also like