Exact String Matching Using Suffix Trees

Uploaded by

pavithra.r

Exact string matching with a suffix tree finds all occurrences of a pattern within a text using a pre-built suffix tree of the text. Once the tree exists, deciding whether the pattern occurs takes O(m) time, where m is the length of the pattern; reporting all k occurrences takes O(m + k) time.

Steps for Exact String Matching with Suffix Tree

1. Build the Suffix Tree:
   - Create a suffix tree for the given text T.
   - Append a special character (e.g., $) that does not occur in T, so that no suffix is a prefix of another and every suffix ends at its own leaf.
2. Search for the Pattern:
   - Start at the root of the suffix tree and attempt to match the pattern P along the edges.
   - Follow edges in the tree based on the characters of P. If all of P is matched (the match may end at a node or in the middle of an edge), return the suffix start positions stored at all leaf nodes beneath that point.
   - If a character mismatches or no edge continues the match, P does not occur in T: terminate and return no matches.
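
The two steps above can be sketched in Python as follows. This is a minimal illustration, not the linear-time construction: it builds the tree by inserting each suffix in turn, which takes O(n^2) time in the worst case. The names Node, build_suffix_tree and find_all are hypothetical and chosen only for this example.

```python
class Node:
    """A suffix-tree node; outgoing edges are keyed by their first character."""

    def __init__(self, start=None):
        self.children = {}   # first char -> (edge label, child Node)
        self.start = start   # suffix start index for leaves, None for internal nodes


def build_suffix_tree(text, terminator="$"):
    """Naive construction (assumption: terminator does not occur in text)."""
    assert terminator not in text
    s = text + terminator
    root = Node()
    for i in range(len(s)):
        node, rest = root, s[i:]
        while True:
            entry = node.children.get(rest[0])
            if entry is None:
                # No edge starts with this character: hang a new leaf here.
                node.children[rest[0]] = (rest, Node(start=i))
                break
            label, child = entry
            # Length of the common prefix of the edge label and the remaining suffix.
            k = 0
            while k < len(label) and k < len(rest) and label[k] == rest[k]:
                k += 1
            if k == len(label):
                # Whole edge matched: descend and keep going.
                node, rest = child, rest[k:]
                continue
            # Mismatch inside the edge: split it at position k.
            mid = Node()
            mid.children[label[k]] = (label[k:], child)
            mid.children[rest[k]] = (rest[k:], Node(start=i))
            node.children[label[0]] = (label[:k], mid)
            break
    return root


def find_all(root, pattern):
    """Return the sorted start indices of pattern (assumed non-empty)."""
    node, rest = root, pattern
    while rest:
        entry = node.children.get(rest[0])
        if entry is None:
            return []
        label, child = entry
        k = min(len(label), len(rest))
        if label[:k] != rest[:k]:
            return []
        # The pattern may end in the middle of this edge; the leaves below
        # the edge's child are the matches either way.
        node, rest = child, rest[k:]
    # Collect the suffix start index of every leaf below the match point.
    positions, stack = [], [node]
    while stack:
        n = stack.pop()
        if n.start is not None:
            positions.append(n.start)
        stack.extend(child for _, child in n.children.values())
    return sorted(positions)


tree = build_suffix_tree("BANANA")
print(find_all(tree, "ANA"))  # [1, 3]
```

The final sort is only there to make the output stable; it adds an O(k log k) term that a production implementation would not need.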

Example

Input:

- Text: BANANA
- Pattern: ANA

Step 1: Build the Suffix Tree

For the text BANANA$, the suffixes, listed with their 0-based starting indices, are:

- 0: BANANA$
- 1: ANANA$
- 2: NANA$
- 3: ANA$
- 4: NA$
- 5: A$
- 6: $

The suffix tree looks like this (edge labels are compressed; the number in brackets at each leaf is the starting index of its suffix):

(root)
├── BANANA$ [0]
├── A
│   ├── NA
│   │   ├── NA$ [1]
│   │   └── $ [3]
│   └── $ [5]
├── NA
│   ├── NA$ [2]
│   └── $ [4]
└── $ [6]

Step 2: Search for the Pattern ANA

1. Start at the root and follow the edges labeled with the characters of ANA.
   - Match A: follow the edge labeled A from the root.
   - Match NA: follow the edge labeled NA from the node reached.
2. After matching the entire pattern ANA:
   - The path ends at an internal node.
   - Collect all leaf nodes below this node to get the starting positions of matches.

Result:

- The leaf nodes below the matched node correspond to the suffixes ANANA$ and ANA$, which start at indices 1 and 3.
- Matches are found at positions 1 and 3 (0-based index) in the text BANANA. The two occurrences overlap.
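
The result can be cross-checked without a suffix tree. The sketch below swaps in a plain brute-force scan using Python's built-in str.find; it is not the suffix-tree method. The helper name naive_occurrences is hypothetical. It advances by one position after each hit so that overlapping occurrences are reported.

```python
def naive_occurrences(text, pattern):
    """Brute-force check: every start index of pattern in text, overlaps included."""
    positions = []
    i = text.find(pattern)
    while i != -1:
        positions.append(i)
        # Restart one character later so overlapping matches are not skipped.
        i = text.find(pattern, i + 1)
    return positions


print(naive_occurrences("BANANA", "ANA"))  # [1, 3]
```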

Advantages

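1. Efficient Search: Once the suffix tree is built, deciding whether a pattern occurs takes O(m) time, where m is the pattern length, independent of the length of the text.
2. Multiple Matches: Every occurrence of the pattern corresponds to a leaf below the point where the match ends, so all k occurrences can be reported in O(m + k) time.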

Applications

1. Text Search: Quickly locate substrings in large texts.
2. Plagiarism Detection: Find repeated or matching segments across documents.
3. Bioinformatics: Search for DNA or protein sequences in genomic data.
4. Data Compression: Detect repeated patterns for compression algorithms.

Time Complexity

1. Suffix Tree Construction: O(n), where n is the length of the text (using Ukkonen's algorithm, assuming a constant-size alphabet).
2. Search: O(m), where m is the length of the pattern, plus O(k) to report k occurrences.

Suffix trees provide a robust method for string matching, especially in applications that run many queries against the same large text.
