0% found this document useful (0 votes)

120 views9 pages

Indexing Structures For Files: Database Design Database Design

The document outlines different types of single-level indexes for database files, including: 1. Primary indexes which have one index entry per data block and point to the first record in each block. 2. Clustering indexes which index on a non-key field and group records with the same field value into blocks. 3. Secondary indexes which provide an additional access path and include one index entry per data record, making them dense indexes.

Uploaded by

sanjay.careear3267

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

120 views9 pages

Indexing Structures For Files: Database Design Database Design

Uploaded by

sanjay.careear3267

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Chapter Outline

Database Design

Types of Single-level Ordered Indexes

Chapter 14

Primary Indexes
Clustering Indexes
Indexing Structures for Files Secondary Indexes
Multilevel Indexes
Dynamic Multilevel Indexes Using B-Trees and B+-
Trees
CS 6360.501 (Fall 2009)
Instructor: Sunan Han
The University of Texas at Dallas

Slide 14- 2

Indexes as Access Paths Indexes as Access Paths

Assume a file of records already exists with some primary The index file usually occupies considerably less
organization described in Ch13 (ordered, unordered or disk blocks than the data file because its entries are
hashed) much smaller (can be easily stored in memory)
Indexes are additional auxiliary access structures that are
A binary search on the index yields a pointer to the
used to speed up the retrieval of records
file record
In a database system, index structures provide an efficient
secondary access path without affecting the physical The index is usually specified on one field of the file
placement of records on disk (although it could be specified on several fields)
Indexes can also be characterized as dense or sparse One form of an index is a file of entries <field value,
A dense index has an index entry for every search key value pointer to record>, which is ordered by field value
(and hence every record) in the data file.
A sparse (or nondense) index, on the other hand, has index
The index is called an access path on the field
entries for only some of the search values
Slide 14- 3 Slide 14- 4
Single-Level Indexes: Primary Index
Primary index
Defined on an ordered data file on the ordered
The data file is ordered on a key field key field
Includes one index entry for each block in the data file;
the index entry has the key field value for the first record
in the block, which is called the block anchor
A similar scheme can use the last record in a block.
A primary index is a nondense (sparse) index, since it
includes an entry for each disk block of the data file and
the keys of its anchor record rather than for every search
value.

Slide 14- 5 Slide 14- 6

Example-1 Single-Level Indexes: Clustering Index

Suppose that records are key-field ordered in the file and that:
fixed record size R = 100 bytes, block size B = 1024 bytes, number of
records r = 30,000 Defined on an ordered data file
The records are unspanned. Then, we get: The data file is ordered on a non-key field unlike primary
blocking factor Bfr = B/R = 1024/100 = 10 records/block
index, which requires that the ordering field of the data
number of file blocks b =  r/Bfr = 30000/10 = 3000 blocks
For an index on the key field, assume the field size V = 9 bytes,
file have a distinct value for each record.
assume the record pointer size P = 6 bytes. Then: Includes one index entry for each distinct value of the
index entry size Ri = (V + P) = (9 + 6) = 15 bytes field; the index entry points to the first data block that
index blocking factor Bfri = B/Ri  = 1024/15  = 68 entries/block contains records with that field value.
number of index blocks bi = b/ Bfri = 3000/68 = 45 blocks
(One index entry corresponds to one file block for sparse indexing) It is another example of nondense index where Insertion
binary search needs log2bi = log245 = 6 block accesses. It requires an and Deletion is relatively straightforward with a
additional block access to the data file => total block access is 7 clustering index.
Without index, the binary search cost on the file itself would be:
log2b = log23000 = 12 block accesses
Slide 14- 7 Slide 14- 8
A Clustering
Index Another Clustering
Example Index Example:
Records of same
clustering field value
are in separate blocks

Slide 14- 9 Slide 14- 10

Single-Level Indexes: Secondary Index

A secondary index provides a secondary means of accessing
a file for which some primary access already exists.
The secondary index may be on a field which is a candidate Example of a
key and has a unique value in every record, or a non-key with Dense
duplicate values. They are non-ordering fields
The index is an ordered file with two fields.
Secondary Index
The first field is of the same data type as some non-ordering for a Key Field
field of the data file that is an indexing field.
The second field is either a block pointer or a record pointer.
There can be many secondary indexes (and hence, indexing
fields) for the same file, for different records fields
Includes one entry for each record in the data file; hence, it is
a dense index
Slide 14- 11 Slide 14- 12
Example-2
Suppose that:
fixed record size R = 100 bytes, block size B = 1024 bytes, number of
records r = 30,000
The records are unspanned. Then, we get:
blocking factor Bfr = B/R = 1024/100 = 10 records/block Example of a
number of file blocks b =  r/Bfr = 30000/10 = 3000 blocks
We construct a secondary index on a nonordering candidate key
Secondary Index
field, assume the field size V = 9 bytes, assume the record pointer for a Nonkey
size P = 6 bytes. Then:
index entry size Ri = (V + P) = (9 + 6) = 15 bytes
Field
index blocking factor Bfri = B/Ri  = 1024/15 = 68 entries/block
number of index blocks bi = r/ Bfri = 30000/68 = 442 blocks
(Each index entry corresponds to one file record for dense indexing)
binary search needs log2bi = log2442 = 9 block accesses (10 is the
total for the final data file block access)
This is compared to an average linear search cost (w/o index) of:
(b/2) = 3000/2 = 1500 block accesses
Slide 14- 13 Slide 14- 14

Summary of Single-Level Indexing Multi-Level Indexes

Primary indexing reduces the search cost of the original file

search on an ordering field
Because a single-level index is an ordered file, we can create
a primary index to the index itself to further reduce the
search cost
In this case, the original index file is called the first-level index
and the index to the index is called the second-level index.
We can repeat the process, creating a third, fourth, ..., top
level until all entries of the top level fit in one disk block
A multi-level index can be created for any type of first-level
index (primary, secondary, clustering) as long as the first-
level index consists of more than one disk block
Slide 14- 15 Slide 14- 16
Example-3
In example 1 (sparse primary index)
Two-level Primary fixed record size R = 100 bytes, block size B = 1024 bytes, number of
records r = 30,000
Index blocking factor Bfr = B/R = 1024/100 = 10 records/block
number of file blocks b = r/Bfr = 30000/10 = 3000 blocks
index entry size Ri = (V + P) = (9 + 6) = 15 bytes
index blocking factor Bfri = B/Ri  = 1024/15  = 68 entries/block
(This is called the fan-out factor of the multi-level index)
number of index blocks b1 = b/ Bfri = 3000/68 = 45 blocks
binary search needs log2b1 = log245 = 6 block accesses (7 is the total
for an additional access to the data file block)
For the second-level index to the 45 first-level index file blocks:
number of index blocks b2 = b1 / Bfri = 45/68 = 1 block
Total file block access is 1 (2nd-level) + 1 (1st-level) + 1 (data file) =
3
Slide 14- 17 Slide 14- 18

Multi-Level Indexes Search Trees

Such a multi-level index is a form of search tree A search tree of order p is a tree such that each
node contains at most p-1 search values and p
However, insertion and deletion of new index entries
pointers in the order <P1,K1,P2,K2, …, Pq-1,Kq-1,Pq>,
is a severe problem because every level of the index
where q ≤ p, each Pi is a pointer to a child node, or
is an ordered file
null and each Ki is a unique search value from some
This leads to dynamic multi-level indexes ordered set of values, and the following must hold:
Dynamic multi-level indexing leaves some additional 1. Within each node, K1 < K2 < …, < Kq-1
space in each block for inserting new entries 2. For all values X in the subtree pointed at by Pi,
Ki-1 < X < Ki, for 1<i<q and X < Ki if i=1, and Ki-1 < X
if i=q

Slide 14- 19 Slide 14- 20

A Node in a Search Tree with Pointers to FIGURE 14.9
Subtrees below It A search tree of order p = 3.

Slide 14- 21 Slide 14- 22

Dynamic Multilevel Indexes Using B-Trees and Dynamic Multilevel Indexes Using B-Trees and
B+-Trees B+-Trees
In B-Tree and B+-Tree data structures, each node An insertion into a node that is not full is quite
corresponds to a disk block efficient
Most multi-level indexes use B-tree or B+-tree data If a node is full the insertion causes a split into two
structures because of the insertion and deletion problem nodes
Space has to be reserved in each tree node to allow for Splitting may propagate to other tree levels
new index entries A deletion is quite efficient if a node does not
Each node is kept between half-full and completely full become less than half full
If a deletion causes a node to become less than half
full, it must be merged with neighboring nodes

Slide 14- 23 Slide 14- 24

Difference between B-tree and B+-tree B-Trees
When used as an access structure on a key field in a data
file, a B-Tree of order p can be defined as follows
In a B-tree, pointers to data records exist at all levels 1. Each internal node in the B-tree is of the form
of the tree <P1,<K1,Pr1>,P2,<K2,Pr2>, …, Pq-1,<Kq-1,Prq>,Pq>, where q ≤ p,
each Pi is a tree pointer and Pri is a data pointer to the record
In a B+-tree, all pointers to data records exists at the whose search key field value is Ki (or the block containing the
leaf-level nodes record)
A B+-tree can have less levels (or higher capacity of 2. Within each node, K1 < K2 < …, < Kq-1
search values) than the corresponding B-tree 3. For all values X in the subtree pointed at by Pi,
Ki-1 < X < Ki, for 1<i<q and X < Ki if i=1, and Ki-1 < X if i=q
4. Each internal node has at least p/2 tree pointers
5. A node with q tree pointers (q ≤ p) has q-1 search key field
values (and q-1 data pointers)
6. All leaf nodes are at the same level and have their tree
pointers to be null
Slide 14- 25 Slide 14- 26

B-tree Structures B-Trees Insertion

It starts with a single root node at level 0
When it’s full with p-1 key values and an insertion occurs, two
nodes at level 1 are created and all values except the middle
one are evenly distributed in the two new nodes
The root keeps the middle value and adds two tree pointers
to the new split nodes
When any node in the B-tree is full, it undergoes the same
process to split into two node at the next level
When a node used up all its tree pointers (can not be split
any more) the split will propagate upwards
If it happens at the root, the root is split and a new root and
therefore a new tree level is added
Slide 14- 27 Slide 14- 28
B-Trees Deletion B+-Trees
The internal nodes are similar a search tree defined
When deletion of a record causes two neighboring earlier (<P1,K1,P2,K2, …, Pq-1,Kq-1,Pq>) except that
nodes to be less than half full, a merge will happen Ki-1 < X ≤ Ki, for 1<i<q and X ≤ Ki if i=1, and Ki-1 < X if i=q
Each internal node has at least p/2 tree pointers
This merge may cause a reduction of a tree level
The leaf nodes are define as follows
<<K1,Pr1>,<K2,Pr2>, …, ,<Kq-1,Prq>,Pnext>, where q ≤ p, each
Pri is a data pointer to the record whose search key field
value is Ki, or to a file block containing the record. Pnext
points to the next leaf node
K1 < K2 < …, < Kq-1
Each leaf node has at least p/2 values, or a
redistribution/deletion is needed
All leaf nodes are at the same level
Slide 14- 29 Slide 14- 30

The Nodes of a B+-tree

An Example
of an Insertion
in a B+-tree

Internal nodes form paths

to the leaf nodes that point
to the actual data

2 levels => up to 9 leaves

Slide 14- 31 Slide 14- 32

An Example of a
Deletion in a B+- Summary
tree
Types of Single-level Ordered Indexes
Primary Indexes
Causes a redistribution
at the same level Clustering Indexes
Secondary Indexes
Multilevel Indexes
Dynamic Multilevel Indexes Using B-Trees and B+-
Causes a redistribution Trees
at higher levels

Slide 14- 33 Slide 14- 34

Assignment #12

Page 545: 14.14 a, b, c, d, e

Due date 11/23/09

Slide 14- 35

Week 15 Physical Database Design Index - CH 17 Updated
No ratings yet
Week 15 Physical Database Design Index - CH 17 Updated
35 pages
Final Updates - Lec 2
No ratings yet
Final Updates - Lec 2
40 pages
CO3-Session-09 & 10
No ratings yet
CO3-Session-09 & 10
41 pages
Apple iPhone 6S Plus Invoice Receipt
No ratings yet
Apple iPhone 6S Plus Invoice Receipt
5 pages
Indexing
No ratings yet
Indexing
89 pages
Indexing
No ratings yet
Indexing
53 pages
Chapter - 3 - Indexing Structures For Files
No ratings yet
Chapter - 3 - Indexing Structures For Files
83 pages
CS2202 IndexingHashing
No ratings yet
CS2202 IndexingHashing
83 pages
File Organizations and Indexes
No ratings yet
File Organizations and Indexes
51 pages
Indexing
No ratings yet
Indexing
27 pages
Indexing
No ratings yet
Indexing
41 pages
FALLSEM2024-25 BCSE302L TH VL2024250101553 2024-09-02 Reference-Material-I
No ratings yet
FALLSEM2024-25 BCSE302L TH VL2024250101553 2024-09-02 Reference-Material-I
48 pages
Week 7 - Indexing Structures
No ratings yet
Week 7 - Indexing Structures
25 pages
Elmasri - 6e - Ch18
No ratings yet
Elmasri - 6e - Ch18
53 pages
DBMS Indexing 5
No ratings yet
DBMS Indexing 5
63 pages
File Organization and Indexing
No ratings yet
File Organization and Indexing
38 pages
20-M4-File Organization - Single Level Indexing-09-09-2024
No ratings yet
20-M4-File Organization - Single Level Indexing-09-09-2024
28 pages
قواعد معطيات 2 (النظري) - 7
No ratings yet
قواعد معطيات 2 (النظري) - 7
27 pages
Co3 Session 21
No ratings yet
Co3 Session 21
53 pages
Index Structures
No ratings yet
Index Structures
34 pages
Lec 09
No ratings yet
Lec 09
52 pages
Chapter 3 File Organization Indexed Methods
No ratings yet
Chapter 3 File Organization Indexed Methods
31 pages
Index 2
No ratings yet
Index 2
24 pages
Indexation 1
No ratings yet
Indexation 1
24 pages
Chapter 3
No ratings yet
Chapter 3
50 pages
Lec06-Indexing in Dbms
No ratings yet
Lec06-Indexing in Dbms
21 pages
CNG351 Lecture 12 A
No ratings yet
CNG351 Lecture 12 A
21 pages
Index 3
No ratings yet
Index 3
21 pages
Indexing Dbms
No ratings yet
Indexing Dbms
22 pages
Index 1
No ratings yet
Index 1
25 pages
08 File Handling
No ratings yet
08 File Handling
18 pages
9 Files, Indices and Database Tuning
No ratings yet
9 Files, Indices and Database Tuning
17 pages
Database Management System-203105251: Assistant Professor Computer Science & Engineering
No ratings yet
Database Management System-203105251: Assistant Professor Computer Science & Engineering
35 pages
Weekly Exercises 01
No ratings yet
Weekly Exercises 01
16 pages
CH 14
No ratings yet
CH 14
6 pages
Indexing
No ratings yet
Indexing
62 pages
Indexing - II
No ratings yet
Indexing - II
57 pages
4 Chapter17 Index
No ratings yet
4 Chapter17 Index
41 pages
Module 4 Indexing
No ratings yet
Module 4 Indexing
20 pages
Indexing Structures For Files
No ratings yet
Indexing Structures For Files
25 pages
Indexing Lecture Nov 2023 Detailed
No ratings yet
Indexing Lecture Nov 2023 Detailed
37 pages
DBMS Indexing Methods
No ratings yet
DBMS Indexing Methods
33 pages
IN3020/4020 - Database Systems Spring 2020, Week 3.1 Indexing
No ratings yet
IN3020/4020 - Database Systems Spring 2020, Week 3.1 Indexing
44 pages
Indexing
No ratings yet
Indexing
6 pages
Index and Hashing 2017 Combined
No ratings yet
Index and Hashing 2017 Combined
60 pages
Memoryhierarchy Indexing
No ratings yet
Memoryhierarchy Indexing
9 pages
CO3 Notes Indexing
No ratings yet
CO3 Notes Indexing
11 pages
Database Indexing Essentials
No ratings yet
Database Indexing Essentials
110 pages
51 Cutover Templates
100% (2)
51 Cutover Templates
13 pages
Indexing Structures & Database Design
No ratings yet
Indexing Structures & Database Design
39 pages
Indexing Structures For Files
No ratings yet
Indexing Structures For Files
23 pages
Unit Iv Indexing and Hashing: Basic Concepts
No ratings yet
Unit Iv Indexing and Hashing: Basic Concepts
35 pages
SM-A305F.FN Galaxy A30 PDF
No ratings yet
SM-A305F.FN Galaxy A30 PDF
1 page
02 - Indices
No ratings yet
02 - Indices
208 pages
FALLSEM2019-20 ITE1003 ETH VL2019201002592 Reference Material I 06-Nov-2019 Indexing
No ratings yet
FALLSEM2019-20 ITE1003 ETH VL2019201002592 Reference Material I 06-Nov-2019 Indexing
32 pages
SHS Grade 11 MIL Q4W6 FINAL
No ratings yet
SHS Grade 11 MIL Q4W6 FINAL
19 pages
Indexing Files: Last Time
No ratings yet
Indexing Files: Last Time
5 pages
Chapter 11: Indexing and Hashing
No ratings yet
Chapter 11: Indexing and Hashing
47 pages
Human Relations in Organizations Applications and Skill Building 10th Edition Lussier Test Bank 1
100% (72)
Human Relations in Organizations Applications and Skill Building 10th Edition Lussier Test Bank 1
26 pages
Nour Abdelhafiz CV
No ratings yet
Nour Abdelhafiz CV
2 pages
Indexing in Database
No ratings yet
Indexing in Database
33 pages
SingleLevelIndexing Examples
No ratings yet
SingleLevelIndexing Examples
24 pages
Artificial Intelligence Questions
No ratings yet
Artificial Intelligence Questions
15 pages
Indexing Structures For Files
No ratings yet
Indexing Structures For Files
30 pages
Java Solve
No ratings yet
Java Solve
28 pages
Database Indexing Techniques Guide
No ratings yet
Database Indexing Techniques Guide
8 pages
S1-K12 Laser Service Manual
No ratings yet
S1-K12 Laser Service Manual
10 pages
Computer Applications in Hydraulic Engineering Tutorials 2020-Jul-21
No ratings yet
Computer Applications in Hydraulic Engineering Tutorials 2020-Jul-21
100 pages
Sony hcd-gtr6 gtr6b gtr7 gtr8 gtr8b Ver.1.2 PDF
No ratings yet
Sony hcd-gtr6 gtr6b gtr7 gtr8 gtr8b Ver.1.2 PDF
92 pages
Jwt-Auth: Pacote: Tymon/Jwt-Auth Github: Documentação: 1. Instalar O Pacote
No ratings yet
Jwt-Auth: Pacote: Tymon/Jwt-Auth Github: Documentação: 1. Instalar O Pacote
3 pages
MBIST (Memory Built-In Self Test) - 5
No ratings yet
MBIST (Memory Built-In Self Test) - 5
5 pages
Week 4 Cyber Attacks On Online Learning Platforms Transcript
No ratings yet
Week 4 Cyber Attacks On Online Learning Platforms Transcript
3 pages
KIDNAPPERS AND ROBBERS THREAT-ALERT INTELLIGENT SYSTEM 2 Unical Conference
No ratings yet
KIDNAPPERS AND ROBBERS THREAT-ALERT INTELLIGENT SYSTEM 2 Unical Conference
13 pages
John Locke Essays
100% (2)
John Locke Essays
5 pages
Target Hardware Debugging Boundary Scan
No ratings yet
Target Hardware Debugging Boundary Scan
13 pages
BlackBelt Plus Roadmap - 23 - v2
No ratings yet
BlackBelt Plus Roadmap - 23 - v2
6 pages
Sinhgad Institute of Management, Pune-41: Assignment No.4
No ratings yet
Sinhgad Institute of Management, Pune-41: Assignment No.4
2 pages
Weighbridge Integration With Sap
No ratings yet
Weighbridge Integration With Sap
10 pages
Here Is The Placeholder For Three Lines Title Create Social Media Accounts For Your Business
No ratings yet
Here Is The Placeholder For Three Lines Title Create Social Media Accounts For Your Business
21 pages
333 High Frequency GRE Words With Meanings
No ratings yet
333 High Frequency GRE Words With Meanings
7 pages
Fractal Geometry and Superformula To Model Natural Shapes Over The World
No ratings yet
Fractal Geometry and Superformula To Model Natural Shapes Over The World
15 pages
Magnetically Levitated Ball
No ratings yet
Magnetically Levitated Ball
4 pages
Porn Site Block List for Parents
0% (1)
Porn Site Block List for Parents
97 pages
Bluetooth Communication Using A Touchscreen Interface With The Raspberry Pi
No ratings yet
Bluetooth Communication Using A Touchscreen Interface With The Raspberry Pi
4 pages
Sales Performance Report
No ratings yet
Sales Performance Report
4 pages
Dell Latitude E5400 and E5500 Spec Sheet
100% (1)
Dell Latitude E5400 and E5500 Spec Sheet
2 pages
System On Chip
No ratings yet
System On Chip
12 pages
7 Ways To Optimize Jenkins
No ratings yet
7 Ways To Optimize Jenkins
15 pages

Indexing Structures For Files: Database Design Database Design

Uploaded by

Indexing Structures For Files: Database Design Database Design

Uploaded by

Chapter Outline

Types of Single-level Ordered Indexes

Indexes as Access Paths Indexes as Access Paths

Slide 14- 5 Slide 14- 6

Example-1 Single-Level Indexes: Clustering Index

Slide 14- 9 Slide 14- 10

Single-Level Indexes: Secondary Index

Summary of Single-Level Indexing Multi-Level Indexes

Primary indexing reduces the search cost of the original file

Multi-Level Indexes Search Trees

Slide 14- 19 Slide 14- 20

Slide 14- 21 Slide 14- 22

Slide 14- 23 Slide 14- 24

B-tree Structures B-Trees Insertion

The Nodes of a B+-tree

Internal nodes form paths

2 levels => up to 9 leaves

Slide 14- 31 Slide 14- 32

Slide 14- 33 Slide 14- 34

Page 545: 14.14 a, b, c, d, e

You might also like