CH 12

The document discusses various concepts related to database indexing, including the reasons for not keeping multiple indices, the distinction between clustering and secondary indices, and the implications of using dense versus sparse indices. It also covers hashing techniques, bucket overflow causes, and the efficiency of B+-trees for range queries. Additionally, it addresses methods for optimizing B+-tree structures and computing existence bitmaps while considering null values.

// vim: spl=en spell tw=80 encoding=utf-8:

12.1 Since indices speed query processing, why might they not be kept on
several search keys? List as many reasons as possible.

----

* indices take up space, and keeping them up to date adds overhead on every
insert, update, and delete, so an index may not be worth maintaining
* on search keys with many entries but few distinct values, an index is of
little help: a full table scan is often more efficient, since it reads
sequentially with far fewer seeks
* when several indices could answer a query, it may not be clear to the
optimizer which one is the best choice

12.2 Is it possible in general to have two clustering indices on the same
relation for different search keys? Explain your answer.

----

No. A clustering index defines the on-disk ordering of the relation. Since
different search keys generally impose different orderings, it is in general
not possible to have two clustering indices on the same relation.

12.13 When is it preferable to use a dense index rather than a sparse index?
Explain your answer.

----

A dense index is preferable when most queries are equality queries; when we
want to combine several indices in an "in-memory bitmap scan", which needs one
index entry per record; and whenever the index is a secondary index, since a
secondary index cannot be sparse (the file is not ordered on its search key).
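The in-memory bitmap scan mentioned above can be sketched as follows. This is an illustrative toy, not a real access method: the record positions and the `to_bitmap` helper are hypothetical, and a real system would use compressed bitmaps over record IDs.

```python
def to_bitmap(matching_positions, n_records):
    """Build a bitmap with a 1 at each matching record position."""
    bm = [0] * n_records
    for p in matching_positions:
        bm[p] = 1
    return bm

# Hypothetical: a dense index on A says records {0, 2, 5} match its
# condition, and a dense index on B says records {2, 3, 5} match its.
a = to_bitmap([0, 2, 5], 8)
b = to_bitmap([2, 3, 5], 8)

# AND the bitmaps in memory; only the surviving records are fetched.
both = [x & y for x, y in zip(a, b)]
print([i for i, bit in enumerate(both) if bit])  # -> [2, 5]
```

This only works because both indices are dense: a sparse index has no entry for most records, so it cannot produce a per-record bitmap.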

12.14 What is the difference between a clustering index and a secondary index?

----

A clustering index is closely tied to the ordering of the tuples on disk; in
fact, it defines that ordering. A secondary index is a separate side structure
that gives us pointers to the tuples we want, without constraining their
physical order.

12.16 The solution presented in Section 12.5.3 to deal with non-unique search
keys added an extra attribute to the search key. What effect does this change
have on the height of the B+-tree?

----

A larger search key means that fewer keys fit into each node, so the fanout
decreases and the height of the tree (which is logarithmic in the number of
records, with the fanout as the base) can increase. In practice the effect is
often small: the extra attribute is generally short compared to the rest of
the key, so the overhead may not be significant.
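A rough back-of-the-envelope check of the claim above, using the standard height estimate h ≈ ceil(log_fanout(n)). The entry count and fanout values are hypothetical; the point is that a modest drop in fanout often leaves the height unchanged, and only a large drop adds a level.

```python
import math

def bptree_height(n_entries, fanout):
    """Approximate B+-tree height: ceil(log base fanout of n_entries)."""
    return math.ceil(math.log(n_entries, fanout))

N = 10_000_000  # hypothetical number of index entries

print(bptree_height(N, 100))  # original fanout        -> 4
print(bptree_height(N, 90))   # slightly wider keys    -> 4 (unchanged)
print(bptree_height(N, 50))   # much wider keys        -> 5 (one more level)
```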

12.17 Explain the distinction between closed and open hashing. Discuss the
relative merits of each technique in database applications.

----

In closed hashing, when a bucket is full, new tuples that hash to it are
placed in overflow buckets linked off the full bucket (chaining: a linked list
of overflow buckets is created). In open hashing, when a bucket is full,
tuples that hash to it are placed in other, already existing buckets, found by
some kind of probing for a free slot.

Open hashing has the principal advantage that even when the hash function
distributes values poorly across the buckets, no space beyond the fixed hash
table is required. On the other hand, it does not allow deletes or updates in
an easy manner, since removing an entry can break a probe chain.

Closed hashing allows deletes and updates, but requires extra space for
overflow buckets even while some primary buckets are not full, or are empty.
It also has the disadvantage that following an overflow chain usually costs an
additional disk access per overflow bucket, whereas with linear-probing open
hashing one can fetch two or three adjacent buckets in a single sequential
read, which almost guarantees the required entry is found in that one access.
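The two schemes can be sketched in a few lines. This is an in-memory Python toy, not the disk-based bucket-of-tuples structure the answer describes; the class names and fixed table sizes are my own assumptions.

```python
class ChainedHash:
    """Closed hashing: each bucket keeps an overflow chain (here, a list)."""
    def __init__(self, n_buckets):
        self.buckets = [[] for _ in range(n_buckets)]

    def insert(self, key):
        # a full primary bucket simply grows its chain
        self.buckets[hash(key) % len(self.buckets)].append(key)

    def contains(self, key):
        return key in self.buckets[hash(key) % len(self.buckets)]


class ProbedHash:
    """Open hashing with linear probing: on collision, scan for a free slot."""
    def __init__(self, n_slots):
        self.slots = [None] * n_slots

    def insert(self, key):
        i = hash(key) % len(self.slots)
        while self.slots[i] is not None:   # assumes the table is not full
            i = (i + 1) % len(self.slots)
        self.slots[i] = key

    def contains(self, key):
        i = hash(key) % len(self.slots)
        for _ in range(len(self.slots)):
            if self.slots[i] == key:
                return True
            if self.slots[i] is None:      # a hole: key cannot be past here
                return False
            i = (i + 1) % len(self.slots)
        return False
```

Note how `ProbedHash.contains` stops at the first empty slot: this is exactly why naive deletion breaks open hashing, since it can create a hole in the middle of a probe chain.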

12.18 What are the causes of bucket overflow in a hash file organization? What
can be done to reduce the occurrence of bucket overflows?

----

(this question is a bit silly...) Bucket overflows can occur because the
number of buckets is insufficient, or because the hash function is badly
chosen for the expected data, producing skew. To reduce them, one can size the
hash table properly (with some headroom) or use dynamic hashing.

With dynamic hashing, the hash table grows its number of buckets as needed,
without rehashing the entire file.

12.19 Why is a hash structure not the best choice for a search key on which
range queries are likely?

----

While B+-trees keep similar key values clustered together (in order), hash
functions, by design, spread values across the buckets. A range query on a
hash table would therefore touch buckets scattered over different areas of
the disk, requiring many random I/Os to process.
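The scatter is easy to see. A small toy, using md5 only so the result is deterministic across runs; the particular bucket numbers are meaningless, the point is that consecutive keys do not land in consecutive buckets.

```python
import hashlib

def bucket(key, n_buckets=8):
    """Deterministic toy hash: map a key to one of n_buckets buckets."""
    digest = hashlib.md5(str(key).encode()).digest()
    return int.from_bytes(digest[:4], "big") % n_buckets

# Keys 20..25 form a tiny range query; their buckets are scattered,
# so each one may sit in a different area of the disk.
print([bucket(k) for k in range(20, 26)])
```

A B+-tree would store these six keys in one or two adjacent leaf pages instead.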

12.20 Suppose there is a relation R(A,B,C), with a B+-Tree index with search key
(A,B).

a. What is the worst case cost of finding records satisfying 10 < A < 50 using
this index, in terms of the number of records retrieved n1 and the height h of
the tree?

b. What is the worst case cost of finding records satisfying 10 < A < 50 AND
5 < B < 10 using this index, in terms of the number of records n2 that satisfy
this selection, as well as n1 and h defined above?

c. Under what conditions on n1 and n2 would the index be an efficient way of
finding records satisfying 10 < A < 50 AND 5 < B < 10?

----

a. h + 1 + n1 seeks and block transfers: h to descend the tree, one for the
first leaf, and one random access per qualifying record.

b. The same traversal as in (a); all n1 index entries with 10 < A < 50 must
still be scanned, but in the worst case the n2 records satisfying both
conditions are also fetched, so roughly h + 1 + n1 + n2.

c. When the index is a clustering index: then the n1 + n2 accesses become
sequential block transfers instead of seeks, which is far cheaper.
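The arithmetic above can be written out explicitly. This is a hypothetical cost model of my own in which seeks and transfers are collapsed into a single access count, as in the answers; the numbers plugged in are made up.

```python
def worst_case_a(h, n1):
    # descend the tree (h), read the first leaf (+1), then fetch
    # each of the n1 records satisfying 10 < A < 50 individually
    return h + 1 + n1

def worst_case_b(h, n1, n2):
    # all n1 index entries with 10 < A < 50 are still scanned, and the
    # n2 records also satisfying 5 < B < 10 are fetched on top of that
    return h + 1 + n1 + n2

# hypothetical tree of height 3, with n1 = 100 and n2 = 10
print(worst_case_a(3, 100))      # -> 104
print(worst_case_b(3, 100, 10))  # -> 114
```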

12.21 Suppose that you have to create a B+-tree index on a large number of
names, where the maximum size of a name may be quite large (say 40 characters)
and the average name is itself long (say 10 characters). Explain how prefix
compression can be used to maximize the average fanout of internal nodes.

----

Since a B+-tree is ordered, it is expected that, at least in lower-level
internal nodes, most keys share a common prefix. An internal node does not
need to store full names: it only needs enough of each key to separate the
subtrees on either side of it. Storing just such a shortest distinguishing
prefix makes each key smaller, leaving room for more keys per node; the larger
fanout in turn leads to shorter trees, which are more efficient to query.
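One common form of this idea is to pick, as the internal-node key, the shortest prefix of the right neighbour that still separates it from the left neighbour. A minimal sketch (the function name and example names are my own):

```python
def shortest_separator(low, high):
    """Shortest prefix `sep` of `high` with low < sep <= high.
    Any such sep can replace the full key in an internal node."""
    assert low < high
    for i in range(1, len(high) + 1):
        sep = high[:i]
        if low < sep <= high:
            return sep
    return high

# instead of storing the 9-character "silverman", store 4 characters:
print(shortest_separator("silberschatz", "silverman"))  # -> 'silv'
print(shortest_separator("brandt", "califieri"))        # -> 'c'
```

With 10-character average names, separators of one to four characters can easily double or triple the number of keys that fit in a node.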

12.22 Why might the leaf nodes of a B+-tree file organization lose
sequentiality? Suggest how the file organization may be reorganized to restore
sequentiality.

---- (yes, my answer to this question is somewhat dubious)

When leaf nodes split, new pages must be allocated, and a newly allocated page
may not be adjacent to the existing leaves. Over many splits and deletions,
leaf pages end up scattered across the file, interleaved with internal-node
pages, so following the leaf chain is no longer a sequential scan.
Sequentiality can be restored by rebuilding the file organization: scan the
leaves in key order and rewrite the records into a fresh, sequentially
allocated run of pages.

12.24 Show how to compute existence bitmaps from other bitmaps. Make sure that
your technique works even in the presence of null values, by using a bitmap for
the value null.

---- (that simple? where is the trick up the sleeve?)

Just OR together the bitmaps for every value of the attribute, plus the bitmap
for the value null, and the result has a 1 for every existing tuple. The null
bitmap is what makes this correct: a tuple whose attribute is null has a 0 in
every per-value bitmap and would otherwise be missed.
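As a sanity check, the computation in full. The attribute values, tuple positions, and function name below are hypothetical.

```python
def existence_bitmap(value_bitmaps, null_bitmap):
    """OR one bitmap per attribute value plus the null-value bitmap:
    a tuple exists iff some bitmap has a 1 in its position."""
    exists = [0] * len(null_bitmap)
    for bm in list(value_bitmaps) + [null_bitmap]:
        exists = [e | b for e, b in zip(exists, bm)]
    return exists

# Hypothetical attribute with values {1, 2} over 4 tuple slots:
# tuples 0-2 exist (tuple 2 has a null attribute), slot 3 is unused.
b1   = [1, 0, 0, 0]   # tuples where the attribute = 1
b2   = [0, 1, 0, 0]   # tuples where the attribute = 2
null = [0, 0, 1, 0]   # tuples where the attribute is null

print(existence_bitmap([b1, b2], null))  # -> [1, 1, 1, 0]
```

Leaving out the `null` bitmap would wrongly mark tuple 2 as nonexistent, which is exactly the trap the exercise warns about.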
