0% found this document useful (0 votes)

630 views38 pages

RE to DFA Conversion Guide

This document describes regular expressions and how to convert a regular expression (RE) to a deterministic finite automaton (DFA). It defines REs recursively and covers the operators of union, concatenation, and Kleene star. An example RE is given for strings starting with "a" followed by "a" or "b", or the string "c". Steps are provided to convert this RE to a nondeterministic finite automaton (NFA) and then to an equivalent DFA. The formal process guarantees converting any RE to an equivalent NFA and DFA.

Uploaded by

Jesus Pinillos

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

630 views38 pages

RE to DFA Conversion Guide

Uploaded by

Jesus Pinillos

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 38

Regular Expressions and Converting an RE to a DFAJP

● Prerequisite knowledge:
Deterministic Finite Automata
Nondeterministic Finite Automata
Conversion of NFA to DFA
Regular Languages
Set Theory
JFLAP Tutorial

Description of Regular Expressions

Regular expressions provide a relatively compact representation for regular languages.

Definition of Regular Expressions

Regular expressions are made up of sets of strings and operations over those sets. Formally, regular
expressions are defined recursively as follows.

Given a finite alphabet Σ, the following are defined as regular expressions:

● the “empty set” { } is a set containing no strings, denoted by
● the “empty string” { ε } is a set containing only the empty string, denoted by ε
● “literal characters” { α } a set for every α Σ, denoted by α

Given regular expressions R and S, the following operations over them produce regular
expressions:
● “union” is the set union of R and S, denoted by R | S
● “concatenation” includes sequences of two regular expressions, denoted by RS
● “Kleene star” is a sequence of zero or more instances of a regular expression, denoted by
R* and by S*

JFLAP Notation
JFLAP uses a slightly different notation. When entering regular expressions, the empty string is
represented by the character “!” which is commonly known as “exclamation mark”, “exclamation point”,
or “bang”. The union operator is represented by the character “+” which is commonly known as “plus
sign” or “plus”.

Examples of Regular Operators

Union: R | S denotes the set union of sets described by R and S. For example, if R = {"a", "c"} and S =
{"b", "d"}, then R | S = {"a", "c", "b", "d"}.

Concatenation: RS denotes the set of strings obtained by concatenating a string in R and a string in S. For
example, if R = {"a", "c"} and S = {"b", "d"}, then RS = {"ab", "ad", "cb", "cd"}.

Kleene star: R* denotes the set that contains ε and all strings formed by concatenating any finite number
of strings from R. For example, if R = {"a", "c"}, then R* = {ε, "a", "c", "aa", "ac", "ca", "cc", "aaa",
"aac", "aca", "acc", "caa", ... }.
Precedence
Parentheses have highest precedence overall. Of the RE operators, Kleene star has highest precedence,
then concatenation, and finally union.

Example: RE for a Regular Language

Consider the regular language L over the alphabet Σ = {a, b, c} comprised of (1) all strings that begin
with “a” followed by an arbitrary length sequence made up of of “a” and “b” symbols and (2) the string
“c”.

Since L is being described as the union of (1) and (2), let’s first look at defining each of those components
of L, then apply the union operator.

Strings beginning with “a” that is followed by another string can be represented as the concatenation of
“a” with another string. The concatenation operator is simply ordered contiguous placement. The regular
expression could thus begin with “a” followed by the set of arbitrary length strings made up of “a” and
“b” symbols. The latter may be defined as the Kleene star of the union of “a” and “b”, whose operator
symbol is the vertical bar, and written as: (a|b)*. Thus this first set of strings may be represented by the
regular expression a(a|b)*

Creating the union of this set with the set consisting of the string “c” only, we can produce the regular
expression: a(a|b)*|c

Note that operator precedence renders a unique interpretation of this regular expression.

Entering this into JFLAP is simply a matter of typing the sequence of characters into the Regular
Expression editor. (See: regex_abc.jff) Recall that JFLAP uses + as its union operator and ! to
represent the empty string.
Examples of strings in this language include: “a” “c” “ab” “aa” “aab” “aba” “abb” “aaaba”

Examples of strings not in this language include: ε, "ac", “ca”, “b”, “ba”, “abc”, “bac”, “baa”

At present, JFLAP does not provide a means to directly assess which strings are and are not in the
language represented by a specific RE. However, JFLAP does support converting a RE into an NFA
which can then be used to test acceptance and rejection of specific strings.

Example: Converting an RE to a NFA by Inspection

Again consider the regular language L over the alphabet Σ = {a, b, c} comprised of the string “c” and all
strings that begin with “a” followed by an arbitrary length sequence made up of of “a” and “b” symbols,
which we just saw could be represented by the RE a(a|b)*|c. Here is an example of converting that RE to
an equivalent NFA using an informal approach. Note that this example was done using the Finite
Automaton feature of JFLAP rather than the Regular Expression feature. As such, JFLAP did not
interpret the transition labels as regular expressions!

1. Start by creating a two-state machine with transition labeled using the RE.

Recall that this is for illustrative purposes only and that JFLAP will not interpret the label on the
transition as a regular expression.

2. Noticing that this is the union of a(a+b)* and c, which means either can be chosen, split the transition
into two parallel transitions representing that union.
3. Since a(a+b)* represents the concatenation of a with (a+b)*, we can implement that transition by a
sequence.

4. The Kleene star indicates zero or more repetitions of (a+b), which we can show using ε-transitions.

5. The remaining union operation, a+b, can again be decomposed into parallel transitions.

6. Since all transitions are valid, using only symbols from Σ*, this is a valid NFA and can be run to test
acceptance and rejection of candidate strings.
7. We can now also apply the known NFA to DFA conversion process.

For simple cases, such as this example, it may be feasible to proceed intuitively from RE to NFA. In
general, however, it may not be obvious how best to proceed and since there are an infinite number of
equivalent NFA, this heuristic approach is not guaranteed to halt with a solution. Fortunately, there is a
formal process that guarantees finding an equivalent NFA and DFA for any given RE.

Description of Formal RE DFA Conversion Process

For any regular language L there exists one or more regular expressions (RE) as well as one or more
deterministic finite automata (DFA) that represent L. The following algorithm facilitates the conversion
from a RE for language L to a DFA for language L by first converting the RE to a nondeterministic finite
automaton (NDA) then using converting the NFA into an equivalent DFA (as described in unit “Explain
algorithm and Convert NFA to DFA”).

The conversion of a regular expression to a nondeterministic finite automaton proceeds by considering

each case in the definition provided previously.

● The language comprised of the empty string ε is represented by a two-state NFA with transition
for ε.
ε

● The language comprised of a literal character α is represented by a two-state NFA with transition
for α.
a

● The language comprised of the union of two languages, R and S, is represented by an NFA whose
initial state is attached by ε-transitions to the initial states of R and S, and whose accepting state is
attached by ε-transitions from the accepting states of R and S.
a|b

● The language comprised of a concatenation (sequence) of two languages, R and S, is an NFA

with the accepting state of R attached to the initial state of S by an ε-transition.

● The language comprised of the Kleene star of a language R is an NFA with an ε-transition from
its initial state to the initial state of R, an ε-transition to its accepting state from the accepting state
of R, and ε-transitions between its initial and accepting states.

Since each of these results in a valid NFA, each can be subjected to the NFA to DFA conversion
algorithm described in unit “Explain algorithm and Convert NFA to DFA”. For example, here are
equivalent DFA for the previously developed NFA.

ε
a

a|b

a*
Example: Conversion from RE to DFA (RE NFA DFA)
Consider the RE developed in the previous example: a(a|b)*|c

1. Specifying the RE
First, enter the RE into the Regular Expression Editor in JFLAP using “+” for the union operator. (See:
regex_abc.jff)
2. Converting RE to NFA
We begin the RE to NFA conversion process by creating a state diagram with an initial state, an accepting
state, and a transition labeled with the specified RE. The JFLAP Convert:Convert to NFA menu item
produces such a diagram.
Note that this diagram does not represent an executable automaton within JFLAP and serves only as an
intermediary representation. Thus, for example, you cannot run this machine or save this representation.
Once conversion is complete, you will be able to export and use the resulting NFA (see:
regex_abc_nfa.jff).

There are an infinite number of NFA for a given regular language. JFLAP restricts the space by adhering
to a specific set of conversion steps. You can always choose to develop an equivalent FA in a separate
JFLAP window to compare with JFLAP’s conversion results.

There are three options within JFLAP for assisting with the conversion from RE to NFA.

Option 1: Have JFLAP do the conversion all at once

Option 2: Follow along as JFLAP demonstrates step-by-step conversion

Option 3: Manually select resolutions and add ε-transitions during step-by-step conversion
Option 1: Choose “Do All” from the initial diagram

The result is a well-defined NFA that can be exported (choose “Export”; see: regex_abc_nfa.jff)
and then used for conversion to DFA.
Option 2: Follow along as JFLAP demonstrates step-by-step conversion
Simply choose “Do Step” repeatedly and observe as JFLAP goes through the steps of conversion from
RE to NFA.
You now have a well-defined NFA that can be exported (choose “Export”; see: regex_abc_nfa.jff)
and then used for conversion to DFA.
Option 3: Select resolutions and add ε-transitions manually during conversion
JFLAP provides a “(D)e-expressionify Transition” button that enables you to select which transition
is to be converted next. Having chosen a transition, you then indicate all required ε-transitions associated
with that conversion step. Once all associated ε-transitions have been specified, you can again use the
(D)e-expressionify Transition button. This process continues until the result is a valid NFA.

Here are the steps in the conversion of the example RE.

You now have a well-defined NFA that can be exported (choose “Export”; see: regex_abc_nfa.jff)
and then used for conversion to DFA.
3. Convert NFA to DFA
However you arrived at the equivalent NFA for the RE, you can now apply the transformation process to
convert the exported NFA to a DFA described in unit “Explain algorithm and Convert NFA to DFA”
(see: regex_abc_dfa.jff).

Questions to Think About

1. What is a regular expression for the following language L?
L = { w | w {a, b}* where every occurrence of a is followed by a b }
Answer: b*(ab)*b*

2. How many strings are in the language defined by the regular expression?
(a|b)a(a|b)b
Answer: 4 {aaab, baab, aabb, babb}

3. Specify a DFA for the language defined by the following regular expression.
a*(b|ab)*

Answer:
(see DFA_astar_babstar.jff)

References
Wikipedia, Regular Expression
http://en.wikipedia.org/wiki/Regular_expression
[13 June 2014; Accessed on 17 June 2014]

Brown, Barry, Convert Regular Expression to DFA

https://www.youtube.com/watch?v=dlH2pIndNrU
[14 May 2011; Accessed on 26 June 2014]
JFLAP Tutorial, Regular Expressions and Converting to a NFA
http://www.jflap.org/tutorial/regular/index.html
[Accessed on 27 June 2014]

Chapter 2 MMW
83% (6)
Chapter 2 MMW
4 pages
(Ebook) Logic: The Basics by Beall, JC Logan, Shay A. ISBN 9781317528609, 1317528603 All Chapters Available
No ratings yet
(Ebook) Logic: The Basics by Beall, JC Logan, Shay A. ISBN 9781317528609, 1317528603 All Chapters Available
106 pages
Compiler Design: RE to DFA
No ratings yet
Compiler Design: RE to DFA
23 pages
VTU 21CS51 ATC Module 1 Automata Part
No ratings yet
VTU 21CS51 ATC Module 1 Automata Part
35 pages
Madcom
No ratings yet
Madcom
12 pages
LPL Textbook PDF
No ratings yet
LPL Textbook PDF
621 pages
A Study of Modal Logic With Semantics Based On Rough Set Theory
No ratings yet
A Study of Modal Logic With Semantics Based On Rough Set Theory
26 pages
Uml Case Study Questions
100% (1)
Uml Case Study Questions
3 pages
Automata Theory Exam Paper
No ratings yet
Automata Theory Exam Paper
4 pages
TIC 2151 - Theory of Computation: Decidability
100% (1)
TIC 2151 - Theory of Computation: Decidability
14 pages
Cse384 Compiler Design Laboratory Lab Manual
No ratings yet
Cse384 Compiler Design Laboratory Lab Manual
55 pages
Vision 2023 Toc Chapter 5 Context Free Grammar 12
No ratings yet
Vision 2023 Toc Chapter 5 Context Free Grammar 12
25 pages
Lecture 1-2 (CMS)
No ratings yet
Lecture 1-2 (CMS)
27 pages
Solution1 10
No ratings yet
Solution1 10
5 pages
HW3 Solutions 2017 Spring
100% (1)
HW3 Solutions 2017 Spring
4 pages
Unit 123 (NLP)
No ratings yet
Unit 123 (NLP)
3 pages
Automata Theory Exam Guide
No ratings yet
Automata Theory Exam Guide
5 pages
Non-Deterministic Finite Automata
100% (1)
Non-Deterministic Finite Automata
36 pages
Java Enum
100% (1)
Java Enum
21 pages
Complete
No ratings yet
Complete
4 pages
TOC Syllabus
No ratings yet
TOC Syllabus
2 pages
Gonzalez Asenjo. A Calculos of Antinomies
No ratings yet
Gonzalez Asenjo. A Calculos of Antinomies
3 pages
Mastering The Formal Geometry Proof: Mark Ryan Geometry For Dummies, 2nd Edition
No ratings yet
Mastering The Formal Geometry Proof: Mark Ryan Geometry For Dummies, 2nd Edition
2 pages
Rakudo
No ratings yet
Rakudo
17 pages
Programming Paradigms CSI2120: Jochen Lang EECS, University of Ottawa Canada
No ratings yet
Programming Paradigms CSI2120: Jochen Lang EECS, University of Ottawa Canada
19 pages
16 Decidable Cfgs
No ratings yet
16 Decidable Cfgs
26 pages
Pumping Lemma For Regular Languages
No ratings yet
Pumping Lemma For Regular Languages
60 pages
CD - CO - PO - MAPPING (1) & Justification
No ratings yet
CD - CO - PO - MAPPING (1) & Justification
5 pages
Introduction To Logic
No ratings yet
Introduction To Logic
19 pages
CD-30 Questions With Solution
No ratings yet
CD-30 Questions With Solution
43 pages
Chapter 7 Logical Agents
No ratings yet
Chapter 7 Logical Agents
40 pages
Automata & Compiler Design Handout
No ratings yet
Automata & Compiler Design Handout
59 pages
FALLSEM2020-21 CSI1003 TH VL2020210103426 Reference Material I 17-Jul-2020 TOC-Chapter2
No ratings yet
FALLSEM2020-21 CSI1003 TH VL2020210103426 Reference Material I 17-Jul-2020 TOC-Chapter2
38 pages
Unit 2 MCQ
No ratings yet
Unit 2 MCQ
24 pages
Week 03 A Regular Expressions Examples
No ratings yet
Week 03 A Regular Expressions Examples
45 pages
Lesson # 3 - DFA
No ratings yet
Lesson # 3 - DFA
25 pages
Propositions
No ratings yet
Propositions
25 pages
The Coq Proof Assistant
No ratings yet
The Coq Proof Assistant
53 pages
Re To DFA
No ratings yet
Re To DFA
6 pages
Object Oriented Modeling and Design (9166) - Sample Paper of MSBTE For Sixth Semester Final Year Computer Engineering Diploma (80 Marks)
0% (1)
Object Oriented Modeling and Design (9166) - Sample Paper of MSBTE For Sixth Semester Final Year Computer Engineering Diploma (80 Marks)
2 pages
Compiler Design
No ratings yet
Compiler Design
85 pages
Recognition of Tokens
No ratings yet
Recognition of Tokens
34 pages
Chapter 2 RegularExpressions
No ratings yet
Chapter 2 RegularExpressions
95 pages
Epsilon NFA Into NFA Into DFA
No ratings yet
Epsilon NFA Into NFA Into DFA
18 pages
Theory of Automata Chapter 4
No ratings yet
Theory of Automata Chapter 4
24 pages
Automata Project Proposal Milestone 1
No ratings yet
Automata Project Proposal Milestone 1
1 page
Automata Chapter 3 Regular Expression PDF
0% (1)
Automata Chapter 3 Regular Expression PDF
3 pages
FLAT PYQs
No ratings yet
FLAT PYQs
9 pages
Class 18 Context Free Grammar
No ratings yet
Class 18 Context Free Grammar
35 pages
Compiler Design Unit 1 Notes
No ratings yet
Compiler Design Unit 1 Notes
21 pages
Lecture 3
No ratings yet
Lecture 3
30 pages
002chapter 2 - Lexical Analysis
No ratings yet
002chapter 2 - Lexical Analysis
114 pages
The Uniqueness of Software Quality Assurance - The Environments For Which SQA Methods
No ratings yet
The Uniqueness of Software Quality Assurance - The Environments For Which SQA Methods
21 pages
Automata Conversion Explained
No ratings yet
Automata Conversion Explained
3 pages
Compiler Design
No ratings yet
Compiler Design
48 pages
2 Syntax Directed Transiation
No ratings yet
2 Syntax Directed Transiation
9 pages
Arid Agriculture University, Rawalpindi: (Theory)
No ratings yet
Arid Agriculture University, Rawalpindi: (Theory)
6 pages
Database Schemas for Students & Companies
No ratings yet
Database Schemas for Students & Companies
2 pages
Chapter 2
No ratings yet
Chapter 2
56 pages
A Ad - A - Ab - Abc - B: Generate The SLR Parsing Table For The Following Grammar
0% (1)
A Ad - A - Ab - Abc - B: Generate The SLR Parsing Table For The Following Grammar
7 pages
Intro to Finite Automata Concepts
100% (1)
Intro to Finite Automata Concepts
12 pages
Question Solved TCS
No ratings yet
Question Solved TCS
15 pages
16CS517-Formal Languages and Automata Theory
No ratings yet
16CS517-Formal Languages and Automata Theory
8 pages
Context-Free Grammar Basics
100% (1)
Context-Free Grammar Basics
68 pages
TOC Assignment
No ratings yet
TOC Assignment
7 pages
Kuk B.tech Cse Automata Theory
No ratings yet
Kuk B.tech Cse Automata Theory
237 pages
Problem
No ratings yet
Problem
4 pages
Automata Theory Lec-02
No ratings yet
Automata Theory Lec-02
31 pages
Nfa Epsilon Defined
No ratings yet
Nfa Epsilon Defined
11 pages
Compiler Syntax Analysis Guide
No ratings yet
Compiler Syntax Analysis Guide
41 pages
MITP CH 3 Lecture Notes
No ratings yet
MITP CH 3 Lecture Notes
25 pages
Kleene Star
No ratings yet
Kleene Star
34 pages
Name - Debraj Saha Reg - No. - D222307730 Sub.-Theory of Automata
No ratings yet
Name - Debraj Saha Reg - No. - D222307730 Sub.-Theory of Automata
11 pages
Compiler Design Code Generation
No ratings yet
Compiler Design Code Generation
4 pages
Full Notes
No ratings yet
Full Notes
152 pages
Lexical Analyzer
No ratings yet
Lexical Analyzer
56 pages
Automata Assignments
No ratings yet
Automata Assignments
24 pages
Lab Manual: Department of Computer Science & Engineering
No ratings yet
Lab Manual: Department of Computer Science & Engineering
80 pages
Context-Free Language Properties
No ratings yet
Context-Free Language Properties
25 pages
Pumping Lemma For RG
No ratings yet
Pumping Lemma For RG
13 pages
Parsing Techniques Homework
No ratings yet
Parsing Techniques Homework
14 pages
Chapter 08 Finite Automata With Output
No ratings yet
Chapter 08 Finite Automata With Output
23 pages
Theory of Computation-Lecture 1
No ratings yet
Theory of Computation-Lecture 1
78 pages
CD PPTS 2
No ratings yet
CD PPTS 2
27 pages
Chapter 3 - Syntax Analysis Part One
No ratings yet
Chapter 3 - Syntax Analysis Part One
17 pages
IS 7118 Unit-2 Regular Expressions
No ratings yet
IS 7118 Unit-2 Regular Expressions
69 pages
Unit 4 PDF
No ratings yet
Unit 4 PDF
52 pages
Parallel Algorithm Design Guide
No ratings yet
Parallel Algorithm Design Guide
35 pages
Pumping Lemma
No ratings yet
Pumping Lemma
64 pages
Automata Theory for CS Students
No ratings yet
Automata Theory for CS Students
19 pages