0% found this document useful (0 votes)

7 views30 pages

Module 4

The document states that the training data is current only up to October 2023. It implies that any developments or information beyond that date are not included. This limitation should be considered when referencing the data.

Uploaded by

dhanushreebv0

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views30 pages

Module 4

Uploaded by

dhanushreebv0

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 30

Module 4

Database Design
Relational DB Design:
 It's
the process of designing a database that stores data in the
form of tables (relations).
The goal is to design it in a way that's:
 Efficient
 Easy to maintain
 Easy to retrieve data from

Design Stages:
When designing a relational database, there are 4 main stages:
 Define Relations – Decide what tables you need.
 Define Primary Keys (PK) – Choose the main unique
identifier for each table (e.g., ID).
 Define Relationships (Rship) – Set up how tables are
connected (e.g., customer → orders).
 Normalization – Improve table structure by removing
duplication and organizing data.
Two Major Design Approaches
There are 2 ways you can design a database:
1. Top-Down Design
a) Develop a Conceptual Model using something
like an ER diagram (Entity-Relationship model).
b) Map the model to tables (convert entities to
tables).
c) Normalize the tables (to improve structure and
remove redundancy).
2. Bottom-Up Design
a) Design by Decomposition – Break one big
table into smaller meaningful ones.
b) Use Normalization to improve these tables
Measuring Quality of DB Design (2 Levels)
There are 2 ways to look at the quality of the design:
1. Logical Level Design
Focus: Database structure
Involves deciding what tables, fields, and relationships
you need.
Example: Designing tables for “Customers”, “Orders”, etc.

2. Physical Level Design

Focus: How data is stored and accessed
Involves storage formats, indexes, and access paths.

Features of a Good Relational Database Design

A good design should:
Be easy to modify and maintain
Make it easy to retrieve data
Allow developers to easily build apps that use it
Informal design guidelines for relation
schemas
The four informal measures of quality for relation
schema
Semantics of the attributes
Reducing the redundant values in tuples
Reducing the null values in tuples
Disallowing the possibility of generating spurious
tuples
Semantics of the attributes
Semantics refers to the meaning of attributes in a
relation. It specifies how to interpret attribute
values in a tuple and how they relate to each other.
Guideline 1:
Design a relation schema so that it is easy to explain

its meaning.
Do not combine attributes from multiple entity types

and relationship types into a single relation.

If a relation includes attributes from multiple
entities or relationships, it can lead to semantic
ambiguity, making it hard to explain or understand
the meaning of the relation.
Examples:
Emp_dept Relation
Ename SSN DOB Addr Dno Dname Mgrssn

This mixes employee-related attributes (Ename, SSN,

DOB, etc.) with department-related attributes (Dno,
Dname, Mgrssn).
Emp_proj Relation
SSN Pno Hrs Enam Pnam | Ploc
e e
This mixes employee (SSN, Ename), project (Pno,
Pname, Ploc), and relationship (Hrs) attributes.
Reducing the redundant values in tuples:
Good schema design minimizes storage space and
avoids redundancy.
Storing the same information repeatedly across
tuples (rows) wastes space and can cause anomalies.
This happens when attributes from multiple
entities are grouped into a single relation.
Problems Due to Redundancy and Anomalies:
 Insert Anomaly
To insert a new employee tuple into Emp_dept,
you must include either:
Department details (even if the employee doesn't
work in a department yet), or
NULL for unrelated info (which is bad practice).
Example Table: Emp_dept
MgrSS
Ename SSN Addr Dno Dname
N
A 11 X 1 HR 22
B 22 Y 2 Fin 11
C 33 Z 3 Acc 22
D 44 W NULL NULL NULL
NULL NULL NULL 4 Testing 33

Problems Illustrated:
 Row 4 (D): An employee without department info → must use
NULLs for department fields.
 Row 5: Trying to insert a department without employees → must
use NULLs for employee fields.
This leads to:
 Wasted space due to NULLs.
 Insert anomaly: Can't add departments without employees unless
using NULLs (bad design).
 SSN as a primary key cannot be NULL, making it hard to insert
 Deletion Anomaly:
A deletion anomaly occurs when deleting a tuple (row) that
contains information about one entity also causes loss of
important information about another, unrelated entity.
Example:Emp_dept
Ename SSN Addr Dno Dname MgrSSN
A 11 X 1 HR 22
B 22 Y 2 Fin 11
C 33 Z 2 Acc 22

Now, if we run:
DELETE FROM Emp_dept WHERE SSN = 11;

We're deleting Employee A, but:

 Employee A is the only one working in department 1 (HR).
So, deleting that row removes all info about Dept 1, including:
 Dname = HR
 Dno = 1

Good Design: Normalized Tables
Instead of storing both employee and department
data in one table, split them:
Table 1: Emp
Ename SSN Addr Dno
A 11 X 1
B 22 Y 2
C 33 Z 2
Table 2: Dept
Dno Dname MgrSSN
1 HR 22
2 Fin 11
3 Acc 22

Now, if you delete an employee, department info

stays safe.
 Modification Anomalies
A modification anomaly occurs when:
 A change in one piece of data requires changes in
multiple rows.
 If these changes aren't made everywhere consistently, the
database becomes inconsistent.
Example: EMP_DEPT
 If department data like the department name (Dname) or
manager SSN (MgrSSN) is repeated in multiple rows (i.e.,
for each employee), you face problems during updates.
 Scenario Given:
 Suppose we want to change department name "Acc" to
"Accounts".
 If multiple rows contain "Acc" as department name, we
must update all of them.
 If even one row is missed, it leads to inconsistency.
DNO DNAME MGRSSN
1 HR 22
2 Fin 11
3 Accounts 22
4 Acc 22

 Problem: "Acc" and "Accounts" refer to the same department, but are
now inconsistent.
Note beside the table:
 "Causes inconsistency – we have to change everywhere.“

Guideline 2:
 Design the base relation schema so that update anomalies are
not present.
This means:
 Separate entity data into distinct tables.
 Avoid duplication of the same data in multiple rows.
 If update anomalies are unavoidable:
◦ Document them.
◦ Ensure that any application updating the database does so correctly and
Reducing the null values in tuples :
Design tables so that most attributes have values in
most rows—avoid columns that are NULL in the
majority of tuples.”
Why it matters:
Wastes storage
Complex or unpredictable queries
NULL meaning is ambiguous

Guideline 3:
Avoid placing attributes in a base relation whose
values are mostly null. Disallowing spurious tuples.
Problematic Table:
Passport LicenseN
EmpID Name Email
Number umber
1 Alice [email protected] P1234 (NULL)
m
2 Bob [email protected] (NULL) L5678
m
3 Charlie charlie@x. (NULL) (NULL)
com

Passport Number and License Number are mostly NULL.

Redesign idea: Move them to separate tables.

Better Design:
EMPLOYEE(EmpID, Name, Email)
EMP_PASSPORT(EmpID, Passport Number)
EMP_LICENSE(EmpID, License Number)

No NULLs in the main table, and optional data is

stored only when available
Generating spurious tuples :
 Decompose tables so that rejoining them by key
attributes guarantees no spurious rows.”
 A natural join on non-key attributes may produce false
tuples.
 Use only primary-key ↔ foreign-key joins to ensure a
lossless join.
Spurious Tuples Example
Ssn
OriginalPno
Table: EMP_PROJ
Hours Ename Pname Ploc
101 1 20 Alice ISRO Bng
102 2 15 Bob IISc Bng

Decomposed:
 EMP_LOCS(Ename, Ploc)
 EMP_PROJ1(Ssn, Pno, Hours, Pname, Ploc)
Bad Join on Ploc:
SELECT *
FROM EMP_LOCS NATURAL JOIN EMP_PROJ1;
Produces:
Ename Ploc Ssn Pno Hours Pname
Alice Bng 101 1 20 ISRO
Alice Bng 102 2 15 IISc
Bob Bng 101 1 20 ISRO
Bob Bng 102 2 15 IISc

Alice is wrongly associated with the IISc project, and

Bob with ISRO—these are spurious tuples
Fix – Use Lossless Decomposition
 Normalized Tables:
 EMPLOYEE(Ssn, Ename)
 PROJECT(Pno, Pname, Ploc)
 WORKS_ON(Ssn, Pno, Hours)

Rejoin properly:

⋈ WORKS_ON USING(Ssn)
EMPLOYEE

⋈ PROJECT USING(Pno)
Join on primary-key ↔ foreign-key ensures no
spurious tuples and lossless reconstruction.
Ename Ploc Ssn Pno Hours Pname
Alice Bng 101 1 20 ISRO
Alice Bng 102 2 15 IISc
Bob Bng 101 1 20 ISRO
Summary:

Guideline What It Prevents How to Fix It

Move rarely-used
Storage waste, NULL-
Guideline 3 attributes into separate
related confusion
tables
False data from bad Decompose only on PK–
Guideline 4
JOINs FK; enforce lossless join
Functional Dependency:
A functional dependency, written as X → Y, means:
Whenever two rows have the same values for
attributes in set X, they must also have the same
values for attributes in set Y. Formally:
Given relation schema R, subsets X, Y ⊆ R,
X is called the determinant.(LHS)
Y is the dependent.(RHS)

For instance, in a student table:

StudentID Name Semester
1234 Alice 4
1235 Bob 6

We see StudentID → Name, Semester, because each

StudentID corresponds to one unique Name & Semester
Why FDs Are Important
 Normalization: Identify which attributes belong
together logically.
 Avoid anomalies: If dependencies are improperly
placed, updating or deleting data may lead to
inconsistencies.
 Schema structure: Helps decide how to split data into
well-structured tables.
Definition
For relation schema R(A₁,…,Aₙ), subsets X, Y ⊆ R,
we say X → Y if, in every legal instance of R,
whenever two tuples t₁ and t₂ agree on all attributes in
X, they must also agree on all attributes in Y
Key points:
 LHS (X) = determinant; RHS (Y) = dependent.
Example with Table
Consider this table R(A, B, C)

A B C
1 2 3
4 2 3
5 3 3

A → B: 1-2,2-2,5-3.
B → C:2-3,3-3
BC → A : NO 2 3-1,4 3 3-5
AC → B: 1 3-2,4 3-2,5 3- 3
Therefore for same values 2 different values
Application of FD:
These are the 4 main applications listed in your
image. Let's understand each:
 To find additional FDs

Using known FDs, we can find new FDs using rules (like
Armstrong’s Axioms).
Example: If we know:
A→B
B→C
Then we can say: A → C (by transitivity rule)
 To identify the key
Functional Dependencies help us find:
o Primary Key (PK): Main unique identifier
o Super Key (SK): Bigger set that uniquely identifies
o Candidate Key (CK): Minimum key with no extra attributes.
Example: If A → B and A → C, then A is a key for table (it can
uniquely find all other attributes).
 To find equivalent FDs

Sometimes, two sets of FDs mean the same thing, even if they look
different.
These are called equivalent sets of FDs.
Example: FD set 1:
A → B
A → C

FD set 2:
 A → BC
 To find minimal FDs
We try to simplify FDs — remove unnecessary attributes
and write the smallest set of FDs that still describe the
same data.
This process is called Finding the Minimal Cover or
Canonical Cover.
Example: From A → BC, we can split to:
A → B
A → C

This is the minimal version.

Classification of FD:
1.Trivial Functional Dependency:
 Occurs when the RHS (dependent) is a subset of the
LHS (determinant).
 These always hold but aren’t informative for design.
 Notation: X→Y is trivial if Y⊆X.

Example:

RollNo Name
1 Alice
2 Bob

{RollNo, Name} → Name is trivial because Name is

already in the determinant set
2. Non‑trivial Functional Dependency:
Here, RHS is not a subset of LHS.
These are meaningful constraints like primary keys
determining other attributes.
Example: RollNo→Name
RollNo Name Age
1 A 17
2 B 18

RollNo→Name and RollNo→Age are non‑trivial.

3. Partial Functional Dependency.
Occurs in relations with composite keys. A non‑key
attribute depends on only part of the composite key
—not the whole.
Leads to 2NF violation

Example:
StudentN
StudentID CourseID Grade
ame
101 C1 Alice A
101 C2 Alice B

Composite key = {StudentID, CourseID} → Grade.

StudentName is functionally dependent on
StudentID alone → partial dependency.
4. Full (Complete) Functional Dependency
A non‑key attribute depends on the entire composite
key, and not just a part.
No proper subset of the key determines the
dependent attribute.
Required to satisfy 2NF.

Example:
StudentID CourseID Grade
101 C1 A
101 C2 B

{StudentID, CourseID} → Grade is a full FD (neither

StudentID nor CourseID alone suffice).
5. Transitive Functional Dependency
Occurs when:
X→Y and Y→Z ⇒ X→Z
The dependency is indirect. Violates 3NF if Z is a
non‑prime attribute.
Example:
DeptNam
EmpID DeptID
e
E1 D1 HR
E2 D2 IT
EmpID→DeptID
DeptID→DeptName
Therefore, transitively: EmpID→DeptName

DBMS Unit 2
No ratings yet
DBMS Unit 2
276 pages
Unit 3 Normalization
No ratings yet
Unit 3 Normalization
67 pages
Functionald Dependencies and Normalization
No ratings yet
Functionald Dependencies and Normalization
66 pages
Dbms - Unit - III
No ratings yet
Dbms - Unit - III
91 pages
CH - 5 FD and Normalization
No ratings yet
CH - 5 FD and Normalization
49 pages
DBMS Module4 Notes
No ratings yet
DBMS Module4 Notes
124 pages
DBMS Module4
No ratings yet
DBMS Module4
124 pages
Fundamentals of Database System Chapter 14
No ratings yet
Fundamentals of Database System Chapter 14
70 pages
5.1 5.2 Relational Database Design
No ratings yet
5.1 5.2 Relational Database Design
62 pages
Module 3
No ratings yet
Module 3
41 pages
Normalization 1
No ratings yet
Normalization 1
25 pages
DBMS Module-4 Notes
No ratings yet
DBMS Module-4 Notes
38 pages
DBMS - Module 3
No ratings yet
DBMS - Module 3
49 pages
Normalization
No ratings yet
Normalization
175 pages
Normalisation 2025
No ratings yet
Normalisation 2025
74 pages
Chapter14 - Revised
No ratings yet
Chapter14 - Revised
60 pages
Focus 4 Test 1 GR A
80% (5)
Focus 4 Test 1 GR A
4 pages
DBMS Module 3
No ratings yet
DBMS Module 3
27 pages
Chapter 4 - Database Design - (Normalization)
No ratings yet
Chapter 4 - Database Design - (Normalization)
43 pages
4-Database Design Theory-Without Inclass Exercises
No ratings yet
4-Database Design Theory-Without Inclass Exercises
121 pages
Operation Strategy
100% (1)
Operation Strategy
22 pages
Functional Dependencies and Normalization For Relational Databases
100% (2)
Functional Dependencies and Normalization For Relational Databases
11 pages
M3 Imp
No ratings yet
M3 Imp
13 pages
Relational Database Design - Features of Good Relational Designs
100% (1)
Relational Database Design - Features of Good Relational Designs
27 pages
Module - III
No ratings yet
Module - III
38 pages
Module 4 - Normalization
No ratings yet
Module 4 - Normalization
141 pages
Module 5 - Relational Database Design-JT-HP 2
No ratings yet
Module 5 - Relational Database Design-JT-HP 2
91 pages
Kelly Strategy for Investors
50% (2)
Kelly Strategy for Investors
7 pages
05 - Relational Database Design - Week 05
No ratings yet
05 - Relational Database Design - Week 05
37 pages
MM 3
No ratings yet
MM 3
14 pages
DBMS M4 - Ktunotes - in
No ratings yet
DBMS M4 - Ktunotes - in
114 pages
Unit-2 Relational Model & Normalization (1NF 2NF 3NF BCNF)
No ratings yet
Unit-2 Relational Model & Normalization (1NF 2NF 3NF BCNF)
42 pages
DBMS Module - 04
No ratings yet
DBMS Module - 04
33 pages
Chapter 4-Functional Dependancy and Normalization
No ratings yet
Chapter 4-Functional Dependancy and Normalization
86 pages
CH - 5 FD and Normalization
No ratings yet
CH - 5 FD and Normalization
44 pages
1 - Dbms Module 4 PPT 1
No ratings yet
1 - Dbms Module 4 PPT 1
64 pages
Schema Refinement (Normalization) in DBMS
No ratings yet
Schema Refinement (Normalization) in DBMS
39 pages
Module 3 Notes - 20250709 - 091039 - 0000
No ratings yet
Module 3 Notes - 20250709 - 091039 - 0000
14 pages
This Approach Is Not Very Popular in Practice Because It Suffers From The
No ratings yet
This Approach Is Not Very Popular in Practice Because It Suffers From The
6 pages
20240628152931D6667 - 006. Schema Refinement
No ratings yet
20240628152931D6667 - 006. Schema Refinement
30 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
56 pages
FDMS - Chapter Four
No ratings yet
FDMS - Chapter Four
62 pages
Chapter Five
No ratings yet
Chapter Five
35 pages
RDBMS Unit3 Informaldesign Guidelines
No ratings yet
RDBMS Unit3 Informaldesign Guidelines
27 pages
5-Review of DBMS Techniques - Normalization-09-01-2024
No ratings yet
5-Review of DBMS Techniques - Normalization-09-01-2024
62 pages
Chap 5 Dbms
No ratings yet
Chap 5 Dbms
6 pages
15 05 Normalisasi
No ratings yet
15 05 Normalisasi
48 pages
Part4 - Ch9 - Functional Dependencies and Normalization
No ratings yet
Part4 - Ch9 - Functional Dependencies and Normalization
26 pages
4 DBMS Module-IV
No ratings yet
4 DBMS Module-IV
12 pages
Criminology MCQs
100% (1)
Criminology MCQs
4 pages
Chapter# 14 Database Design Theory and Normalization
No ratings yet
Chapter# 14 Database Design Theory and Normalization
54 pages
DBMS - Unit 4
No ratings yet
DBMS - Unit 4
27 pages
Normalization
No ratings yet
Normalization
65 pages
Unit 4 Relational Database Design
No ratings yet
Unit 4 Relational Database Design
22 pages
Relational Database Design
No ratings yet
Relational Database Design
17 pages
Relational Database Design Guide
No ratings yet
Relational Database Design Guide
30 pages
Unit 9 Functional Dependencies and Normalization For Relational Databases
No ratings yet
Unit 9 Functional Dependencies and Normalization For Relational Databases
20 pages
Module-4 Normalization: Database Design Theory DBMS (18CS53)
No ratings yet
Module-4 Normalization: Database Design Theory DBMS (18CS53)
24 pages
NORMALIZATION
No ratings yet
NORMALIZATION
51 pages
Schema Design Essentials
No ratings yet
Schema Design Essentials
41 pages
Unit - Ii
No ratings yet
Unit - Ii
45 pages
Unit 6 - Normalization
No ratings yet
Unit 6 - Normalization
10 pages
Unit Iv Data Normalization: Semantics of Attributes Should Be Easy To Interpret
No ratings yet
Unit Iv Data Normalization: Semantics of Attributes Should Be Easy To Interpret
14 pages
Avasthas of Planets
No ratings yet
Avasthas of Planets
13 pages
Devotional Insights of Gaura-kiçora
No ratings yet
Devotional Insights of Gaura-kiçora
95 pages
RRB Alp Xam: Study Material For Quantative Aptitude
No ratings yet
RRB Alp Xam: Study Material For Quantative Aptitude
12 pages
Tomato Processing Guide by Mynampati Sreenivasa Rao
No ratings yet
Tomato Processing Guide by Mynampati Sreenivasa Rao
4 pages
COC III Set Up Computer Server
No ratings yet
COC III Set Up Computer Server
77 pages
Heimdal The Gjallarhorn The Horn Resounding and Ragnarok by Ormungandr Melchizedek
100% (1)
Heimdal The Gjallarhorn The Horn Resounding and Ragnarok by Ormungandr Melchizedek
4 pages
A General Theory of Domination and Justice 1st Edition Lovett Instant Download
No ratings yet
A General Theory of Domination and Justice 1st Edition Lovett Instant Download
145 pages
Personal Dynamics Part A
No ratings yet
Personal Dynamics Part A
20 pages
B1 Booster v1
No ratings yet
B1 Booster v1
32 pages
Understanding Kohlberg's Moral Stages
No ratings yet
Understanding Kohlberg's Moral Stages
43 pages
Faircode Technologies Private Limited - Home
No ratings yet
Faircode Technologies Private Limited - Home
1 page
Reto 4
No ratings yet
Reto 4
5 pages
Lifting Eye Bolts B18.15
No ratings yet
Lifting Eye Bolts B18.15
2 pages
Chapter 4 (Answers)
No ratings yet
Chapter 4 (Answers)
5 pages
Design and Analysis of A High Gain Rail To Rail Operational Amplifier
No ratings yet
Design and Analysis of A High Gain Rail To Rail Operational Amplifier
5 pages
Greek Architecture
No ratings yet
Greek Architecture
13 pages
CIE IGNITE Season 01 Ideathon Idea Submissions
No ratings yet
CIE IGNITE Season 01 Ideathon Idea Submissions
255 pages
Science Quiz Bee
No ratings yet
Science Quiz Bee
5 pages
2006-12-31: Overall Conclusion For The Year of 'Arise and Shine'
No ratings yet
2006-12-31: Overall Conclusion For The Year of 'Arise and Shine'
6 pages
Action Plan For NLC
No ratings yet
Action Plan For NLC
9 pages
Partial Derivatives Quiz Analysis
No ratings yet
Partial Derivatives Quiz Analysis
8 pages
Three-Dimensional Printing (3D Printing) : by Dr. Vineet Srivastava
No ratings yet
Three-Dimensional Printing (3D Printing) : by Dr. Vineet Srivastava
9 pages
FAINT YET PURSUING by KELLY JOEL
No ratings yet
FAINT YET PURSUING by KELLY JOEL
13 pages
PCC-2000 Reference Manual V1.42
No ratings yet
PCC-2000 Reference Manual V1.42
26 pages
6089202f4e466 The Amorphous Nature of Agile No One Size Fits All
No ratings yet
6089202f4e466 The Amorphous Nature of Agile No One Size Fits All
42 pages

Module 4

Uploaded by

Module 4

Uploaded by

Module 4

2. Physical Level Design

Features of a Good Relational Database Design

and relationship types into a single relation.

This mixes employee-related attributes (Ename, SSN,

We're deleting Employee A, but:

Now, if you delete an employee, department info

Passport Number and License Number are mostly NULL.

No NULLs in the main table, and optional data is

Alice is wrongly associated with the IISc project, and

Guideline What It Prevents How to Fix It

For instance, in a student table:

We see StudentID → Name, Semester, because each

This is the minimal version.

{RollNo, Name} → Name is trivial because Name is

RollNo→Name and RollNo→Age are non‑trivial.

Composite key = {StudentID, CourseID} → Grade.

{StudentID, CourseID} → Grade is a full FD (neither

You might also like