
Lecture #03: Database Storage (Part I)

15-445/645 Database Systems (Spring 2025)


https://15445.courses.cs.cmu.edu/spring2025/
Carnegie Mellon University
Jignesh Patel

1 Storage
In this class, we focus on a “disk-oriented” DBMS architecture that assumes that the primary storage
location of the database is on non-volatile disk(s).
At the top of the storage hierarchy, you have the devices that are closest to the CPU. This is the fastest
storage, but it is also the smallest and most expensive. The further you get away from the CPU, the larger
but slower the storage devices get. These devices also get cheaper per GB.
Volatile Devices
• Volatile means that the device does not retain its state after power loss. Therefore, the data that is
stored in such devices can be lost.
• Volatile storage supports fast random access with byte-addressable locations. This means that the
program can jump to any byte address and get the data that is there (e.g. in DRAM).
• For our purposes, we will always refer to this volatile storage class as “memory.”
Non-Volatile Devices
• Non-volatile devices retain their state even when the machine is powered off or loses power. Therefore, the data that these devices store can be retrieved even after the machine shuts down and restarts (e.g., disk).
• Non-volatile devices are block/page addressable. This means that in order to read a value at a particular byte offset, the program first has to load into memory the 4 KB page that holds the value it wants to read.
• Non-volatile storage is traditionally better at sequential access (reading contiguous blocks of data) because of its architecture (e.g., a magnetic hard drive).
• We will refer to this as “disk.” We will not make a (major) distinction between solid-state storage
(SSD) and spinning hard drives (HDD).
There is also a relatively new class of storage devices that are becoming more popular called persistent
memory. These devices are designed to be the best of both worlds: almost as fast as DRAM with the
persistence of disk. We will not cover these devices in this course, and they are currently not in widespread
production use. Probably the most famous example is Optane; unfortunately Intel is winding down its
production as of summer 2022. Note that you may see older references to persistent memory as “non-
volatile memory”.
You may see references to NVMe SSDs, where NVMe stands for non-volatile memory express. These NVMe SSDs are not the same hardware as persistent memory modules. Rather, they are typical NAND flash drives that connect over an improved hardware interface, which allows for much faster transfers and leverages improvements in NAND flash performance.

Since our DBMS architecture assumes that the database is stored on disk, the components of the DBMS
are responsible for figuring out how to move data between non-volatile disk and volatile memory since
the system cannot operate on the data directly on disk.
During this semester, we will refer to DRAM storage as “memory” and anything below that as “disk.” We will also not worry about reading/writing data to CPU caches or individual CPU registers in this class. We will focus on hiding the latency of the disk rather than optimizations with registers and caches, since getting data from disk is the slowest part of the data movement.
In order to better understand the orders of magnitude of latency difference when reading from various devices, suppose that reading data from the L1 cache took one second; then reading from an SSD would take 4.4 hours, and reading from an HDD would take 3.3 weeks!

2 Disk-Oriented DBMS Overview


The database is stored on disk, and the data within the database files are organized into pages, with the first
page being the directory page. To operate on the data, the DBMS needs to bring the data into memory.
It does this by having a buffer pool that manages the data movement back and forth between disk and
memory. The DBMS also has an execution engine that will execute queries. The execution engine will ask
the buffer pool for a specific page, and the buffer pool will take care of bringing that page into memory
and giving the execution engine a pointer to that page in memory. The buffer pool manager will ensure
that the page is there while the execution engine operates on that part of memory.
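To make this interaction concrete, below is a minimal sketch of what the interface between the execution engine and the buffer pool could look like. The names (BufferPool, FetchPage, UnpinPage) and the 4 KB page size are illustrative assumptions, not the API of any particular system:

    #include <cstddef>
    #include <cstdint>

    using page_id_t = uint64_t;
    constexpr size_t PAGE_SIZE = 4096;  // assumed database page size

    struct Page {
        page_id_t id;
        char data[PAGE_SIZE];  // raw bytes; the execution engine interprets them
    };

    // Hypothetical buffer pool interface.
    class BufferPool {
    public:
        // Loads the page from disk into a memory frame if it is not already
        // cached, pins it so it cannot be evicted, and returns a pointer to it.
        Page *FetchPage(page_id_t page_id);

        // Signals that the caller is done with the page, so its frame may be
        // evicted (and written back to disk if it was modified).
        void UnpinPage(page_id_t page_id, bool is_dirty);
    };

The key point is that the execution engine never touches the disk directly; it only works with in-memory pointers handed out by the buffer pool.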

3 DBMS vs. OS
A high-level design goal of the DBMS is to support databases that exceed the amount of available memory.
Since reading/writing to disk is expensive, disk use must be carefully managed. We do not want large stalls
from fetching something from disk to slow down everything else. We want the DBMS to be able to process
other queries while it is waiting to get the data from disk.
This high-level design goal is like virtual memory, where there is a large address space and a place for the
OS to bring in pages from disk.
One way to achieve this virtual memory is by using mmap to map the contents of a file into a process's address space, which makes the OS responsible for moving pages back and forth between disk and memory. Unfortunately, this means that if mmap hits a page fault, the process will be blocked.
• You never want to use mmap in your DBMS if you need to write.
• The DBMS (almost) always wants to control things itself and can do a better job at it since it knows
more about the data being accessed and the queries being processed.
• The operating system is not your friend.
It is possible to use the OS by using:
• madvise: Tells the OS when you are planning on reading certain pages.
• mlock: Tells the OS to not swap memory ranges out to disk.
• msync: Tells the OS to flush memory ranges out to disk.
We do not advise using mmap in a DBMS for correctness and performance reasons.
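For illustration only (not a recommendation), the mmap-based approach and the calls listed above look roughly like this; the file name and sizes are hypothetical, and error handling is omitted:

    #include <fcntl.h>     // open
    #include <sys/mman.h>  // mmap, madvise, mlock, msync, munmap
    #include <unistd.h>    // close

    int main() {
        // Map a (hypothetical) database file into the process's address space.
        int fd = open("my.db", O_RDWR);
        size_t len = 1 << 20;  // assume the file is 1 MB
        void *base = mmap(nullptr, len, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);

        madvise(base, len, MADV_SEQUENTIAL);  // hint: we plan to scan sequentially
        mlock(base, 4096);                    // keep the first page resident in memory
        msync(base, len, MS_SYNC);            // flush dirty pages back to disk

        munmap(base, len);
        close(fd);
        return 0;
    }

Even with these hints, the OS still decides when pages actually move, which is exactly the control a DBMS wants to keep for itself.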
Even though the system will have functionalities that seem like something the OS can provide, having the
DBMS implement these procedures itself gives it better control and performance.


4 File Storage
In its most basic form, a DBMS stores a database as files on disk. Some may use a file hierarchy, others
may use a single file (e.g. SQLite).
The OS does not know anything about the contents of these files. Only the DBMS knows how to decipher
their contents, since they are encoded in a way that is specific to the DBMS.
The DBMS’s storage manager is responsible for managing a database’s files. It represents the files as a
collection of pages. It also keeps track of what data has been read from and written to pages, as well as how much free space there is in these pages.

5 Database Pages
The DBMS organizes the database across one or more files in fixed-size blocks of data called pages. Pages
can contain different kinds of data (tuples, indexes, etc). Most systems will not mix these types within
pages. Some systems will require that pages are self-contained, meaning that all the information needed to
read each page is on the page itself.
Each page is given a unique identifier (page ID). If the database is a single file, then the page ID can just be the file offset. A page ID could be unique per DBMS instance, per database, or per table. Most DBMSs have an indirection layer that maps a page ID to a file path and offset. The upper levels of the system will ask for a specific page number. Then, the storage manager will have to turn that page number into a file and an offset to find the page.
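As a sketch of the simplest case, a single database file with fixed-size pages and no indirection layer, turning a page ID into a disk read could look like this (the function name and page size are assumptions for illustration):

    #include <cstddef>
    #include <cstdint>
    #include <unistd.h>  // pread

    constexpr size_t DB_PAGE_SIZE = 4096;

    // Read page `page_id` from the single database file into `buf`.
    // With fixed-size pages, the page ID maps directly to a file offset.
    bool ReadPage(int fd, uint64_t page_id, char *buf) {
        off_t offset = static_cast<off_t>(page_id) * DB_PAGE_SIZE;
        return pread(fd, buf, DB_PAGE_SIZE, offset) ==
               static_cast<ssize_t>(DB_PAGE_SIZE);
    }

With an indirection layer, the storage manager would first look up which file and offset the page ID maps to, then issue the same kind of read.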
Most DBMSs use fixed-size pages to avoid the engineering overhead needed to support variable-sized
pages. For example, with variable-size pages, deleting a page could create a hole in files that the DBMS
cannot easily fill with new pages.
There are three concepts of pages in a DBMS:
1. Hardware page (usually 4 KB).
2. OS page (4 KB).
3. Database page (1-16 KB).
DBMSs that specialize in read-only workloads have larger page sizes.
The storage device guarantees an atomic write of the size of the hardware page. If the hardware page
is 4 KB and the system tries to write 4 KB to the disk, either all 4 KB will be written, or none of it will.
This means that if our database page is larger than our hardware page, the DBMS will have to take extra
measures to ensure that the data gets written out safely since the program can get partway through writing
a database page to disk when the system crashes.

6 Database Heap
There are a couple of ways to find the location of the page a DBMS wants on the disk, and heap file
organization is one of those ways. A heap file is an unordered collection of pages where tuples are stored
in random order.
The DBMS can locate a page on disk given a page id by using a linked list of pages or a page directory.
1. Linked List: Header page holds pointers to a list of free pages and a list of data pages. However, if
the DBMS is looking for a specific page, it has to do a sequential scan on the data page list until it
finds the page it is looking for.
2. Page Directory: The DBMS maintains special pages, called a page directory, to track the locations of data pages, the amount of free space on each page, a list of free/empty pages, and the page type. These special pages have one entry for each database object.
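A minimal sketch of what one page directory entry might contain (the struct and field names are illustrative assumptions, not any specific system's format):

    #include <cstdint>

    // One entry per data page tracked by the page directory.
    struct PageDirectoryEntry {
        uint64_t page_id;      // which page this entry describes
        uint64_t file_offset;  // where the page is located in the file
        uint16_t free_space;   // bytes of free space remaining on the page
        uint8_t  page_type;    // e.g., data page vs. free/empty page
    };

With entries like this, the DBMS can answer questions such as "which page has room for this tuple?" without scanning the data pages themselves, unlike the linked-list approach.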

7 Page Layout
Every page includes a header that records meta-data about the page’s contents:
• Page size.
• Checksum.
• DBMS version.
• Transaction visibility.
• Self-containment. (Some systems like Oracle require this.)
A strawman approach to laying out data is to keep track of how many tuples the DBMS has stored in a
page and then append to the end every time a new tuple is added. However, problems arise when tuples
are deleted or when tuples have variable-length attributes.
There are two main approaches to laying out data in pages: (1) slotted-pages and (2) log-structured.
Slotted Pages: Page maps slots to offsets.
• Most common approach used in DBMSs today.
• Header keeps track of the number of used slots, the offset of the starting location of the last used
slot, and a slot array, which keeps track of the location of the start of each tuple.
• To add a tuple, the slot array will grow from the beginning to the end, and the data of the tuples will
grow from end to the beginning. The page is considered full when the slot array and the tuple data
meet.
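A simplified sketch of a slotted page's on-page structures (field names and sizes are assumptions; real systems track additional metadata):

    #include <cstdint>

    // One slot per tuple; the slot array grows forward from the page header.
    struct Slot {
        uint16_t offset;  // where the tuple's data begins within the page
        uint16_t length;  // size of the tuple in bytes
    };

    // Header stored at the beginning of every slotted page.
    struct SlottedPageHeader {
        uint16_t num_slots;          // number of slots currently in use
        uint16_t free_space_offset;  // start of the most recently written tuple;
                                     // tuple data grows backward from the page end
    };
    // Page layout: [header][slot array ->] ... free space ... [<- tuple data]
    // The page is full when the slot array and the tuple data meet.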
Log-Structured: Covered in the next lecture.

8 Tuple Layout
A tuple is essentially a sequence of bytes (these bytes do not have to be contiguous). It is the DBMS’s job
to interpret those bytes into attribute types and values.
Tuple Header: Contains meta-data about the tuple.
• Visibility information for the DBMS’s concurrency control protocol (i.e., information about which
transaction created/modified that tuple, will be covered later in the semester).
• Bit Map for NULL values.
• Note that the DBMS does not need to store meta-data about the schema of the database here.
Tuple Data: Actual data for attributes.
• Attributes are typically stored in the order that you specify them when you create the table.
• Most DBMSs do not allow a tuple to exceed the size of a page.
Unique Identifier:
• Each tuple in the database is assigned a unique identifier.
• Most common: page id + (offset or slot).
• An application cannot rely on these ids.
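A rough sketch of a tuple header and record identifier with illustrative field names (the exact contents depend on the DBMS and its concurrency control protocol):

    #include <cstdint>

    // Per-tuple header stored in front of the attribute data.
    struct TupleHeader {
        uint64_t created_by_txn;  // visibility: transaction that created the tuple
        uint64_t deleted_by_txn;  // visibility: transaction that deleted it (if any)
        uint32_t null_bitmap;     // bit i set => attribute i is NULL
        // Note: no schema information is stored here; the catalog holds the schema.
    };

    // Record identifier: where the tuple lives. Applications should not rely on
    // these values, since the DBMS may move tuples around.
    struct RecordId {
        uint64_t page_id;
        uint16_t slot_num;
    };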
Denormalized Tuple Data: If two tables are related, the DBMS can “pre-join” them, so the tables end up
on the same page. This makes reads faster since the DBMS only has to load in one page rather than two
separate pages. However, it makes updates more expensive since the DBMS needs more space for each
tuple.
