Outline: File System Consistency Issues in The Presence of Failures

The document discusses transaction concepts and their implementation in file systems. It covers topics like write-ahead logging, two-phase locking, and log structured file systems, explaining how transactions provide consistency and durability even in the presence of failures.


Transactions & Log Structured FS
Arvind Krishnamurthy
Spring 2001

Outline

- File system consistency issues in the presence of failures:
  - Data structures become inconsistent after a crash.
- Solutions:
  - Synchronous operations
  - Ordering the synchronous operations
  - Run a program to fix mistakes: "fsck"
- Today:
  - Transactions for correctness
  - Log structured file systems for better performance

User data consistency

- For user data, Unix uses "write back" --- data is forced to disk every 30 seconds (or the user can call "sync" to force it to disk immediately).
- No guarantee that blocks are written to disk in any particular order.
- Sometimes meta-data consistency is good enough.
- Fsck checks:
  - The number of directory links to a file against the file's link count.
  - Whether an I-node contains a block that is marked free on the disk block map.
  - Whether I-nodes contain overlapping blocks/fragments.
  - It builds a graph data structure and does connectivity analysis.

Transaction concept

- Transactions: group actions together so that they are:
  - Atomic: either happens or it does not (no partial operations).
  - Serializable: transactions appear to happen one after the other.
  - Durable: once it happens, it stays happened.
- Critical sections are atomic and serializable, but not durable.
- Need two more items:
  - Commit --- when the transaction is done (durable).
  - Rollback --- if there is a failure during a transaction (means it didn't happen at all).
- Do a set of operations tentatively. If you get to commit, ok. Otherwise, roll back the operations as if the transaction never happened.
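The tentative-then-commit idea can be sketched in a few lines. This is a minimal illustration, not from the slides; the class and method names are my own:

```python
# Minimal sketch of tentative operations with commit/rollback:
# changes are staged in a scratch copy and only become visible
# at commit; rollback discards the scratch copy.

class Transaction:
    def __init__(self, store):
        self.store = store            # the "real" data
        self.scratch = dict(store)    # tentative copies of the values

    def write(self, key, value):
        self.scratch[key] = value     # tentative: not visible yet

    def commit(self):
        self.store.update(self.scratch)   # install all changes at once

    def rollback(self):
        self.scratch = dict(self.store)   # as if it never happened

accounts = {"x": 0, "y": 2}
t = Transaction(accounts)
t.write("x", accounts["x"] + 1)
t.write("y", accounts["y"] - 1)
t.commit()
```

After commit, `accounts` holds the transferred values; if `rollback()` had been called instead, `accounts` would be unchanged.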

Transaction implementation

- Key idea: fix the problem of making multiple updates to disk by turning the multiple updates into a single disk write!
- Example: money transfer from account y to account x:

    Begin transaction
        x = x + 1
        y = y - 1
    Commit

- Keep a "write-ahead" (or "redo") log on disk of all changes in the transaction.
  - A log is like a journal: never erased, a record of everything you've done.
- Once both changes are on the log, the transaction is committed.
- Then we can "write behind" the changes to disk --- if we crash after the commit, replay the log to make sure the updates get to disk.

Transaction implementation (cont'd)

- Setup: the memory cache and the disk both hold X: 0 and Y: 2; the write-ahead log (on disk, tape, or non-volatile RAM) will hold "X=1, Y=1, commit".
- Sequence of steps to execute the transaction:
  1. Write new value of X to log
  2. Write new value of Y to log
  3. Write commit
  4. Write x to disk
  5. Write y to disk
  6. Reclaim space on log
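The six-step sequence can be sketched as follows. This is an illustrative toy, assuming an in-memory list standing in for the on-disk log; a real implementation would force each record to stable storage:

```python
# Sketch of the six-step write-ahead (redo) logging sequence.
# `log` stands in for the write-ahead log; `disk` for the data blocks.

log = []
disk = {"x": 0, "y": 2}

def run_transaction(new_x, new_y):
    log.append(("x", new_x))    # 1. write new value of X to log
    log.append(("y", new_y))    # 2. write new value of Y to log
    log.append(("commit",))     # 3. write commit record
    disk["x"] = new_x           # 4. write x to disk
    disk["y"] = new_y           # 5. write y to disk
    log.clear()                 # 6. reclaim space on log

def recover():
    # After a crash: apply logged updates only if a commit record exists;
    # an uncommitted log is simply discarded.
    if ("commit",) in log:
        for rec in log:
            if rec[0] != "commit":
                disk[rec[0]] = rec[1]
    log.clear()

run_transaction(1, 1)
```

The `recover` function is what runs at reboot: replaying a committed log is idempotent, so crashing during recovery and recovering again is safe.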

Transaction implementation (cont'd)

Crash cases for the six-step sequence:

- What if we crash after step 1? There is no commit and nothing on disk, so just ignore the changes.
- What if we crash after step 2? Ditto.
- What if we crash after step 3, before 4 or 5? The commit is written to the log, so replay those changes back to disk.
- What if we crash while we are writing "commit"? As with concurrency, we need some primitive atomic operation or else we can't build anything. (E.g., writing a single sector on disk is atomic!)

Transaction implementation (cont'd)

- Can we write X back to disk before commit?
- Yes: keep an "undo log".
  - Save the old value along with the new value.
  - If the transaction does not commit, "undo" the change!
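The undo-log variant can be sketched the same way. Again an illustrative toy with made-up names, not the slides' code:

```python
# Sketch of an undo log: before a value reaches disk early (pre-commit),
# record its old value; if the transaction never commits, restore it.

undo_log = []
disk = {"x": 0, "y": 2}

def write_early(key, value):
    undo_log.append((key, disk[key]))  # save old value along with new
    disk[key] = value                  # write to disk before commit

def abort():
    # No commit happened: undo the changes in reverse order.
    for key, old in reversed(undo_log):
        disk[key] = old
    undo_log.clear()

write_early("x", 1)   # X reaches disk before any commit record exists
abort()               # crash / no commit: X rolls back to 0
```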

Transactions under multiple threads

- What if two threads run the same transaction at the same time? Use locks!

    Begin transaction
        Lock x, y
        x = x + 1
        y = y - 1
        Unlock x, y
    Commit

- Problematic scenario:
  - Thread A grabs the locks, modifies x and y, writes to the log, unlocks.
  - Before A commits, thread B grabs the locks, writes x and y, unlocks, and commits.
  - Before A commits, it crashes!
  - B has committed values for x and y that depend on A committing!

Two-phase locking

- Don't allow "unlock" before commit.
- First phase: only allowed to acquire locks (this avoids deadlock concerns).
- Second phase: all unlocks happen at commit.
- Thread B can't see any of A's changes until A commits and releases its locks. This provides serializability.
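A minimal sketch of the two-phase discipline with Python threads, assuming a fixed lock-acquisition order (the variable names are illustrative):

```python
# Two-phase locking sketch: acquire every needed lock up front
# (growing phase), release only at commit (shrinking phase), so no
# other thread observes uncommitted values.

import threading

locks = {"x": threading.Lock(), "y": threading.Lock()}
accounts = {"x": 0, "y": 2}

def transfer():
    # Growing phase: acquire all locks the transaction needs,
    # in a fixed (sorted) order to sidestep deadlock.
    for name in sorted(locks):
        locks[name].acquire()
    try:
        accounts["x"] += 1
        accounts["y"] -= 1
        # ... write redo records to the log, then commit ...
    finally:
        # Shrinking phase: unlocks happen only at commit.
        for name in sorted(locks):
            locks[name].release()

threads = [threading.Thread(target=transfer) for _ in range(2)]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

Because neither thread releases its locks mid-transaction, the two transfers serialize cleanly.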

Transactions in file systems

- Almost all file systems built after 1985 (NT, Solaris) use "write-ahead logging".
- Write all changes in a transaction to the log (update directory, allocate block, etc.) before sending any changes to disk.
- Example transactions: "Create file", "Delete file", "Move file".
- This eliminates any need for a file system check (fsck) after a crash.
- If we crash, read the log:
  - If the log is not complete, no change!
  - If the log is completely written, apply all changes to disk.
  - If the log is zero, then all updates have already gotten to disk.
- Pros: reliability. Cons: all data is written twice.

What is next?

- Wednesday: original Unix paper
- Next week: networking
- Followed by: remote procedure calls, distributed file systems
Log Structured File Systems

- A radical, different approach to designing file systems.
- The disk will run much faster if you use it like a tape drive!
- Technology motivations:
  - Some technologies are advancing much faster than others.
  - CPUs are getting faster every year (x2 every 1-2 years).
  - Everything else except the CPU will become a bottleneck (Amdahl's law).
  - Disks are not getting much faster.
    - The only good disk is an idle disk.
  - Memory is growing in size dramatically (x2 every 1.5 years).
    - For file systems, this means file caches are a good idea (they cut down on disk bandwidth).

Motivation (contd.)

- File system motivations:
  - File caches help reads a lot.
  - File caches do not help writes very much.
    - Delayed writes help, but cannot delay forever.
  - With file caches, disk writes become more frequent than disk reads.
  - Files are mostly small.
  - Too much synchronous I/O.
  - Disk geometries are not predictable.
- RAID: a whole bunch of disks with data striped across them.
  - Increases bandwidth, but does not change latency.
  - Does not help small files (more on this later).

Implementation

- Treat the disk like a tape:
  - Collect all information in memory (inodes, directories, data, ...).
  - Wait a little while.
  - Do one big I/O (say 1 MB).
- The log is append-only --- no overwrite in place.
- The log is the only thing on disk! It is the main storage structure.
  - No data other than the log.
- Two main problems:
  - How do you read stuff back from the log?
    - Add another level of indirection.
  - Wrap around? Reclaim free space.
    - How do you guarantee there is always free space?

Locating Data on the Disk

- Use the same structures as Unix:
  - Inodes, data blocks, indirect blocks, doubly indirect blocks.
- How to find inodes?
  - In Unix, an I-number maps to a fixed location on disk.
  - In LFS, inodes float.
    - When you rewrite an inode, it goes to the end of the log.
  - Just add another level of indirection: the inode map.
    - Maps I-numbers to disk positions.
    - The inode map itself gets written to the disk.
  - How do you find the inode map?
    - Add another level of indirection.
    - Now it fits in memory.
    - Write it into a checkpoint region.

Wrap Around Problem

- Two approaches:
  - Compaction:
    - Read in the information that is still alive and compact it.
  - Threading:
    - Leave live data in place and write new information around the old data.
    - Eventually fragments.
- Sprite LFS (first implementation of LFS):
  - A combination of the two: open up free segments & avoid copying.
  - Segmented log: statically partition the disk into segments (1 MB in LFS); pick a segment size large enough to amortize disk seek costs; compact within segments, thread in between.

Cleaner

- Write whole clean segments.
- Clean segments in the background.
- Write cost at utilization "u": 2/(1 - u).
  - FFS write cost: 10 (normal) to 4 (best).
  - The disk does not have to be 50% utilized; only segments have to be 50% utilized.
  - It would be great to have a bi-modal distribution.
- Simulations:
  - Tried lots of approaches.
  - Initial simulation: random.
  - Greedy cleaner.
  - Next: hot and cold data → results got worse!
  - Next: segregate new and old data → results didn't change.
  - Finally: do a cost-benefit analysis.
RAIDs and availability

- Suppose you need to store more data than fits on a single disk (e.g., a large database or file server). How should you arrange the data across disks?
- Option 1: treat the disks as a huge pool of disk blocks:
  - Disk1 has blocks 1, 2, ..., N
  - Disk2 has blocks N+1, N+2, ..., 2N
  - ...
- Option 2: RAID (Redundant Arrays of Inexpensive Disks). Stripe data across the disks; with k disks:
  - Disk1 has blocks 1, k+1, 2k+1, ...
  - Disk2 has blocks 2, k+2, 2k+2, ...
  - ...

More on RAIDs

- Benefits:
  - Load gets automatically balanced among the disks.
  - Can transfer a large file at the aggregate bandwidth of all the disks.
- Problem --- what if one disk fails?
  - Goal --- availability --- never lose access to data.
  - The system should continue to work even if some components are not working.
- Solution: dedicate one disk to hold bitwise parity for the other disks in the stripe.

Adding parity bits to RAID

- With k+1 disks:
  - Disk1 has blocks 1, k+1, 2k+1, ...
  - Disk2 has blocks 2, k+2, 2k+2, ...
  - ...
  - The parity disk has blocks parity(1..k), parity(k+1..2k), ...
- If we lose any disk, we can recover its data from the other disks plus the parity. Example:
  - Disk1 holds 1001
  - Disk2 holds 0101
  - Disk3 holds 1000
  - Parity disk: 0100
  - What if we lose Disk2? Its contents are the parity (bitwise XOR) of the remaining disks: 1001 xor 1000 xor 0100 = 0101.
  - Thus we can lose any one disk and the data is still available.
- Updating a disk block needs to update both the data and the parity --- need to use write-ahead logging to support crash recovery.
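The slide's example can be verified directly, since parity is just bitwise XOR:

```python
# Parity recovery for the example above: parity is the XOR of the data
# disks, and any one lost disk equals the XOR of the survivors.

disk1, disk2, disk3 = 0b1001, 0b0101, 0b1000
parity = disk1 ^ disk2 ^ disk3     # 0b0100, as on the slide

# Lose disk2: reconstruct its contents from the other disks plus parity.
recovered = disk1 ^ disk3 ^ parity
```

This works for any single disk because XOR is its own inverse: XORing a value into the parity twice cancels it out.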
