Problem Solving 2

This document outlines the homework assignment for CS411 Database Systems, due on November 3, 2009, and includes instructions for submission and grading. It consists of two parts with various problems related to database concepts such as block and record addresses, variable-length data, index structures, B-trees, and hash tables. Each problem includes specific questions and calculations to demonstrate understanding of the material covered in the course.

CS411 Database Systems

Fall 2009

HW#3
Due: 3:15pm CST, November 3, 2009

Note: Print your name and NetID in the upper right corner of every page of your submission.
Hand in your stapled homework to Donna Coleman in 2106 SC. If Donna is not in her office,
slide your homework under the door.
To speed up grading, the homework is partitioned into two parts. Please submit
each part separately, and write your name and NetID on each part.
Handwritten submissions will be graded, but they will take longer to grade. For clarity,
machine-formatted text is preferable: expect to lose points if your handwritten answer is
unclear or misread by the grader.
The two parts are as follows:

• Part 1: Problem 1 - Problem 3

• Part 2: Problem 4 - Problem 5
Part 1
Problem 1 Representing Block and Record Addresses (18 points)
Suppose that we have 4096-byte blocks in which we store records of 100 bytes. The block
header consists of an offset table, as in the figure below, using 2-byte pointers to records
within the block.

Figure 1: Block Header

a) What kind of information is stored in the block header in Figure 1 besides the offset
table? (2 points)
The block header also contains information such as the schema, length, and timestamp.

b) Explain why we prefer to keep the unused area in the middle of the block. (2 points)
To accommodate growth: records are placed starting at the end of the block,
while the offset table grows from the front. If records were placed just after
the offset table, we would have to move all existing records to make space
for a new offset-table entry every time we insert a new record.

c) On an average day, two records per block are inserted, and one record is deleted. A
deleted record must have its pointer replaced by a "tombstone", because there may be
dangling pointers to it. For specificity, assume the deletion on any day always occurs
before the insertions. If the block is initially empty, after how many days will there be
no room to insert any more records? (10 points)
Each inserted record requires 100 bytes plus a 2-byte pointer, 102 bytes in total.
When a record is deleted, its 100 bytes are freed but the 2-byte pointer remains
as a tombstone.
First day (the block is empty, so there is nothing to delete):
two insertions: 2(102) = 204 bytes
Second day:
one deletion followed by two insertions: 204-100+2(102) = 308 bytes
Third day:
one deletion followed by two insertions: 308-100+2(102) = 412 bytes
Fourth day:
one deletion followed by two insertions: 412-100+2(102) = 516 bytes
After the first day, the block grows by a net 104 bytes per day. We need
204+n(104) ≤ 4096, where n is the number of days after the first. This gives
n ≤ 37.4, so n = 37.
Answer: After 38 days, 4052 bytes are in use and the remaining 44 bytes
cannot hold another record.
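
The day count above can be checked with a short simulation. This is only a sketch under the same accounting as the solution: the offset table and records are the only contents counted, a deleted record's 100 bytes are reused, and its 2-byte tombstone stays. The function and variable names are illustrative.

```python
BLOCK_SIZE = 4096    # bytes per block (from the problem statement)
RECORD_SIZE = 100    # bytes per record
POINTER_SIZE = 2     # bytes per offset-table entry


def full_days(block=BLOCK_SIZE, record=RECORD_SIZE, pointer=POINTER_SIZE):
    """Count the days on which the whole daily workload still fits in the block."""
    used = 0   # bytes taken by live records, live pointers, and tombstones
    live = 0   # number of live records
    days = 0
    while True:
        # The deletion happens first; the tombstone keeps its pointer bytes.
        after_delete = used - record if live > 0 else used
        # Then two records are inserted, each with a new pointer.
        after_inserts = after_delete + 2 * (record + pointer)
        if after_inserts > block:
            return days
        if live > 0:
            live -= 1
        live += 2
        used = after_inserts
        days += 1


print(full_days())
```

Under these assumptions the simulation agrees with the closed form 204 + n(104) ≤ 4096.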

d) Redesign the layout of a block to store records more efficiently, considering the fact that the
block stores fixed-length records. (4 points)
Because the block stores fixed-length records, the offset table and record
headers are not needed: the position of each record can be computed from its
index. The block then consists only of the block header and the actual records.

Figure 2: Fixed length records

Problem 2 Variable-length Data and Records (15 points)
Suppose blocks have 1000 bytes available for the storage of records, and we wish to store
on them fixed-length records of length r, where 500 < r ≤ 1000. The value of r includes
the record header, but a record fragment requires an additional 16 bytes for the fragment
header. For what values of r can we improve space utilization by spanning records?

To support spanned records, every record and record fragment requires a
fragment header.
Since 500 < r ≤ 1000, we can store only a single record per block when we don't
support spanned records, so 1000 - r bytes of each block go unused. If we support
them, a block can hold one record and one record fragment, at a cost of
32 bytes for the two fragment headers. Therefore, spanning improves space usage
when r + 32 < 1000, that is, for 500 < r < 968.
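
The comparison above can be written out as a small check. This is a sketch that follows the solution's accounting (one block per record without spanning, r plus two 16-byte fragment headers with spanning); the function names are made up for illustration.

```python
BLOCK_BYTES = 1000      # bytes available for records in a block
FRAGMENT_HEADER = 16    # extra bytes per record fragment


def cost_unspanned(r):
    """Bytes consumed per record when records may not span blocks."""
    # With 500 < r <= 1000 only one record fits, so each record uses a whole block.
    return BLOCK_BYTES


def cost_spanned(r):
    """Bytes consumed per record when it is split into two fragments."""
    return r + 2 * FRAGMENT_HEADER


def spanning_helps(r):
    if not 500 < r <= 1000:
        raise ValueError("the problem only considers 500 < r <= 1000")
    return cost_spanned(r) < cost_unspanned(r)


# Largest record length for which spanning still helps.
print(max(r for r in range(501, 1001) if spanning_helps(r)))
```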
Problem 3 Index structure basics (20 points)
Consider an indexed sequential file consisting of 10,000 blocks. Each block contains 10
fixed-size records. Each key value found in the file is unique. For this problem, assume that:
• Pointers to blocks are 10 bytes long.
• Pointers to records are 20 bytes long.
• Index blocks are 5000 bytes (in addition to the header).
• Search keys for file records are 10 bytes long.

(a) How many blocks do we need to hold a sparse one-level, primary index? (5 points).

ceil(10000*(10+10) / 5000) = 40

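Explanation: A sparse index needs one entry per data block. Each entry holds
a search key (10 bytes) and a block pointer (10 bytes), so the index takes
10000 * 20 = 200,000 bytes, which fills 40 index blocks.
Alternatively, if the data blocks are contiguous, per-block pointers are not
required: a single block pointer (10 bytes) to the first block suffices, and
every other block pointer can be computed as an offset from it. Storing only
the keys then takes ceil((10000*10 + 10) / 5000) = 21 blocks.
Number of blocks: 40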
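
As a rough check of the arithmetic, the sketch below computes the sparse index size under both readings: one (key, block pointer) pair per data block, or keys only plus a single pointer to the first block when the data blocks are contiguous. The function name and the `pointer_per_entry` flag are illustrative, not part of the assignment.

```python
import math


def sparse_index_blocks(data_blocks=10_000, key_bytes=10, block_ptr_bytes=10,
                        index_block_bytes=5000, pointer_per_entry=True):
    """Number of index blocks for a sparse one-level index."""
    if pointer_per_entry:
        # One (key, pointer) entry per data block.
        total_bytes = data_blocks * (key_bytes + block_ptr_bytes)
    else:
        # Keys only, plus one pointer to the first (contiguous) data block.
        total_bytes = data_blocks * key_bytes + block_ptr_bytes
    return math.ceil(total_bytes / index_block_bytes)


print(sparse_index_blocks())                         # entry = key + pointer
print(sparse_index_blocks(pointer_per_entry=False))  # keys only
```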
(b) In (a), how many disk I/Os do we need to find and retrieve a record with a given key
in the worst case? (5 points)

A sparse index has one pointer per block, not per record. Part (a) gives
40 index blocks. A binary search over the index takes about log2 40 I/Os,
plus a final I/O to retrieve the data block:
log2 40 + 1 = 6.32, which rounds up to 7 I/Os.
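
Since a disk I/O reads a whole block, the worst case is an integer. The sketch below counts the probes a binary search makes over 40 sorted index blocks and adds the final data-block read; it assumes nothing is cached, and the function name is illustrative.

```python
def worst_case_probes(n_blocks):
    """Worst-case number of blocks read by a binary search over n_blocks sorted blocks."""
    probes = 0
    while n_blocks > 0:
        probes += 1
        # After probing the middle block, the larger remaining half
        # contains at most n_blocks // 2 blocks.
        n_blocks //= 2
    return probes


index_blocks = 40
total_ios = worst_case_probes(index_blocks) + 1   # +1 to read the data block
print(total_ios)
```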
(c) Suppose you now construct a one-level, dense secondary index. Compute its minimum
size in blocks. (5 points)

ceil( 10000 * 10 / floor( 5000 / (10+20) ) ) = 603

Note: Taking the floor as above is necessary to make sure that index records do
not span blocks. However, students who assumed that records may span blocks
and computed the number of blocks as 10000 * 10 / ( 5000 / (10+20) ) = 600
were also considered correct.
Number of blocks: 603
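
The two accepted answers differ only in whether an index entry may straddle a block boundary. A minimal sketch of both computations follows; the function name and the `allow_spanning` flag are assumptions made for illustration.

```python
import math


def dense_index_blocks(records=10_000 * 10, key_bytes=10, record_ptr_bytes=20,
                       index_block_bytes=5000, allow_spanning=False):
    """Number of index blocks for a dense one-level secondary index."""
    entry_bytes = key_bytes + record_ptr_bytes
    if allow_spanning:
        # Entries are packed back to back and may cross block boundaries.
        return math.ceil(records * entry_bytes / index_block_bytes)
    # Whole entries only: floor(5000 / 30) = 166 entries per block.
    entries_per_block = index_block_bytes // entry_bytes
    return math.ceil(records / entries_per_block)


print(dense_index_blocks())
print(dense_index_blocks(allow_spanning=True))
```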
(d) Suppose that we introduce an additional second level index on the sparse index in (a).
How many disk I/Os do we need to find and retrieve a record with a given key in the
worst case? (5 points)

The second-level index has one entry per first-level index block: 40 entries
of 20 bytes each, or 800 bytes, which fits in a single 5000-byte block. Thus,
we need one I/O to read the second-level index, one I/O to read the
first-level index, and one I/O to read the actual data block. In total, we
need three I/Os.
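
The same reasoning generalizes to any number of levels: keep building a sparse index over the level below until a level fits in one block. The sketch below assumes 20-byte entries (10-byte key plus 10-byte block pointer) at every level; the function name is illustrative.

```python
import math


def index_level_sizes(data_blocks=10_000, entry_bytes=20, index_block_bytes=5000):
    """Sizes in blocks of each sparse index level, from the first level upward."""
    entries_per_block = index_block_bytes // entry_bytes   # 250 entries per block
    sizes = []
    blocks_below = data_blocks
    while True:
        blocks_below = math.ceil(blocks_below / entries_per_block)
        sizes.append(blocks_below)
        if blocks_below == 1:
            return sizes


levels = index_level_sizes()
# One I/O per index level (each lookup touches one block per level)
# plus one I/O for the data block.
print(levels, len(levels) + 1)
```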
Part 2
Problem 4 B-Tree (28 points, each part 7 points)
Consider a B-tree of degree d = 2, shown in Figure 3. Remember that each block has
space for 2d keys and 2d + 1 pointers. The textbook uses a different parameter n in Chapter
14.2.1, which is equal to 2d. Please consider the execution of each operation in the following
questions.

Figure 3: Block Header

(a) Look up the record with key 82. Please describe how to traverse the tree in detail.
Compare 82 against the key stored in the root node. Since 82 is larger, take
the right edge to the second node, which contains 77 and 83. Scan these keys
linearly: since 82 is greater than 77 but less than 83, take the edge from
the second pointer to the leaf node containing 79 and 82. Compare 82 against
the leaf's keys from left to right. The key is found, since the second key
in the node matches the value 82.

(b) Look up the records with keys in the range 51 to 80. Please describe how to traverse
the tree in detail.
Similarly, compare 51 against the key in the root node and traverse down to
the 4th leaf node. From that node, scan linearly, moving on to the next leaf
node as needed, until a key equal to 51 or the first key greater than 51 is
found. In this case, it is the key with value 53. Once this key is found,
continue scanning until a key equal to 80 or the last key less than 80 is
reached, which in this case is 79.
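
The two traversals can be sketched in code. The figure is not reproduced here, so the tree below is a hypothetical stand-in: only the internal node (77, 83) and the leaf (79, 82) come from the answers above, and the root key 70 and all other keys are made up. The sketch also assumes leaves are chained left to right, as the range lookup description implies.

```python
from bisect import bisect_right


class Leaf:
    def __init__(self, keys):
        self.keys = keys
        self.next = None          # right sibling, used for range scans


class Internal:
    def __init__(self, keys, children):
        self.keys = keys          # len(children) == len(keys) + 1
        self.children = children


def find_leaf(node, key):
    """Descend from the root to the leaf whose key range covers `key`."""
    while isinstance(node, Internal):
        # The child index equals the number of separator keys <= key.
        node = node.children[bisect_right(node.keys, key)]
    return node


def lookup(root, key):
    return key in find_leaf(root, key).keys


def range_scan(root, low, high):
    """Collect all keys k with low <= k <= high by walking the leaf chain."""
    leaf, found = find_leaf(root, low), []
    while leaf is not None:
        for k in leaf.keys:
            if k > high:
                return found
            if k >= low:
                found.append(k)
        leaf = leaf.next
    return found


# Hypothetical tree (NOT the one in the figure).
leaves = [Leaf(ks) for ks in ([10, 20], [30, 41], [53, 61], [70, 72], [79, 82], [83, 95])]
for left, right in zip(leaves, leaves[1:]):
    left.next = right
root = Internal([70], [Internal([30, 53], leaves[:3]), Internal([77, 83], leaves[3:])])

print(lookup(root, 82))
print(range_scan(root, 51, 80))
```

In this toy tree the range scan starts at the leaf that would hold 51, returns 53 as the first qualifying key, and stops after 79, mirroring the description in (b).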
Figure 4: c. after adding 5

Figure 5: d. after deleting 72

Problem 5 Hash Table (15 points)

Consider indexing the following key values using an extensible hash table. Keys are inserted
in the following order:
34, 60, 51, 73, 49, 84, 25

The hash function h(n) for key n is h(n) = n mod 16; that is, the hash function is the
remainder after the key value is divided by 16, giving the hash a 4-bit value. Assume that
each bucket can hold 2 data items.
a) 34 % 16 = 2 → 0010
60 % 16 = 12 → 1100
51 % 16 = 3 → 0011
73 % 16 = 9 → 1001
49 % 16 = 1 → 0001
84 % 16 = 4 → 0100
25 % 16 = 9 → 1001
Figure 6: a). hash table
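
Since the figure is not reproduced here, the insertions can be traced with a small extensible hash table. This is a sketch under stated assumptions, not necessarily the layout in the figure: the directory is indexed by the leading bits of the 4-bit hash, the table starts with a single bucket and a directory of size 1, and all class and method names are illustrative.

```python
class Bucket:
    def __init__(self, depth):
        self.depth = depth    # local depth: leading hash bits shared by all keys here
        self.keys = []


class ExtensibleHashTable:
    def __init__(self, hash_bits=4, capacity=2):
        self.hash_bits = hash_bits
        self.capacity = capacity
        self.global_depth = 0
        self.directory = [Bucket(0)]          # always 2**global_depth entries

    def _hash(self, key):
        return key % (1 << self.hash_bits)    # h(n) = n mod 16 for 4 bits

    def _index(self, key):
        # Directory index = leading global_depth bits of the hash.
        return self._hash(key) >> (self.hash_bits - self.global_depth)

    def insert(self, key):
        while True:
            bucket = self.directory[self._index(key)]
            if len(bucket.keys) < self.capacity:
                bucket.keys.append(key)
                return
            if bucket.depth == self.hash_bits:
                raise OverflowError("bucket full and no hash bits left to split on")
            if bucket.depth == self.global_depth:
                # Double the directory: old entry j becomes entries 2j and 2j + 1.
                self.directory = [b for b in self.directory for _ in (0, 1)]
                self.global_depth += 1
            self._split(bucket)

    def _split(self, bucket):
        new_depth = bucket.depth + 1
        zero, one = Bucket(new_depth), Bucket(new_depth)
        for k in bucket.keys:
            bit = (self._hash(k) >> (self.hash_bits - new_depth)) & 1
            (one if bit else zero).keys.append(k)
        shift = self.global_depth - new_depth
        for i, b in enumerate(self.directory):
            if b is bucket:
                self.directory[i] = one if (i >> shift) & 1 else zero


table = ExtensibleHashTable()
for key in [34, 60, 51, 73, 49, 84, 25]:
    print(key, format(key % 16, "04b"))       # hash value as a 4-bit string
    table.insert(key)
print(table.global_depth, [b.keys for b in table.directory])
```

Under these assumptions, inserting 49 forces two directory doublings in a row, because 34 and 51 share the prefix 00. A table that uses trailing bits or starts with two buckets would split differently.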

Figure 7: b). linear hash table
