Computer Organization and Architecture
Designing for Performance
11th Edition
Chapter 5
Cache Memory
Copyright © 2019, 2016, 2013 Pearson Education, Inc. All Rights Reserved
Figure 5.1: Cache and Main Memory
Cache Memory Principles
• Block
– The minimum unit of transfer between cache and main memory
• Frame
– To distinguish between the data transferred and the chunk of
physical memory, the term frame, or block frame, is sometimes used
with reference to caches
• Line
– A portion of cache memory capable of holding one block, so-called
because it is usually drawn as a horizontal object
• Tag
– A portion of a cache line that is used for addressing purposes
• Line size
– The number of data bytes, or block size, contained in a line
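As a concrete illustration of how these fields are sized (my sketch, not from the text), the following assumes a direct-mapped cache with byte-addressable memory and power-of-two sizes:

```python
# A sketch (illustrative assumption) of how the address fields follow
# from the cache geometry for a direct-mapped cache.
import math

def address_fields(addr_bits: int, cache_bytes: int, line_bytes: int):
    """Return (tag_bits, line_bits, offset_bits)."""
    offset_bits = int(math.log2(line_bytes))               # byte within a block
    line_bits = int(math.log2(cache_bytes // line_bytes))  # which cache line
    tag_bits = addr_bits - line_bits - offset_bits         # identifies the block
    return tag_bits, line_bits, offset_bits

# 32-bit addresses, 16 KiB cache, 64-byte lines -> (18, 8, 6)
print(address_fields(32, 16 * 1024, 64))
```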
Figure 5.3: Cache Read Operation
The processor generates the read address (RA) of a word to be read. If the word is contained in the cache (cache hit), it is delivered to the processor.
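A minimal sketch of this read flow, with the block-load-on-miss behavior made explicit (the dict-based cache and fixed block size are illustrative assumptions, not the book's code):

```python
# Sketch of the Figure 5.3 read flow: check the cache first; on a miss,
# fetch the containing block from main memory, then deliver the word.
def read_word(cache: dict, main_memory: list, ra: int, block_size: int = 8):
    block_no = ra // block_size
    if block_no in cache:                      # cache hit
        block = cache[block_no]
    else:                                      # cache miss: fetch whole block
        start = block_no * block_size
        block = main_memory[start:start + block_size]
        cache[block_no] = block                # replacement policy omitted
    return block[ra % block_size]              # deliver the requested word
```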
Figure 5.4: Typical Cache Organization
Table 5.1: Elements of Cache Design
Cache Addresses
– Logical
– Physical
Cache Size
Mapping Function
– Direct
– Associative
– Set associative
Replacement Algorithm
– Least recently used (LRU)
– First in first out (FIFO)
– Least frequently used (LFU)
– Random
Write Policy
– Write through
– Write back
Line Size
Number of Caches
– Single or two level
– Unified or split
Figure 5.5: Logical and Physical Caches
Cache Memory
• Locality of Reference
– Memory references at any given time interval tend to be confined to localized areas
– Temporal locality -- information that has been used recently is likely to be used again soon (e.g., reuse of instructions and data in loops)
– Spatial locality -- if a word is accessed, adjacent (nearby) words are likely to be accessed soon (e.g., related data items such as arrays are usually stored together; instructions are executed sequentially)
• Cache
– The property of locality of reference is what makes cache memory systems work
– A cache is a fast, small-capacity memory that should hold the information most likely to be accessed
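Both kinds of locality can be seen in a single loop (an illustrative example, not from the slides):

```python
# a[i] sits next to a[i-1] in memory (spatial locality), and the
# accumulator total is touched on every iteration (temporal locality).
a = list(range(1024))
total = 0
for i in range(len(a)):   # sequential array accesses -> spatial locality
    total += a[i]         # repeated use of the accumulator -> temporal locality
```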
Performance of Cache
• Memory Access
– All memory accesses are directed first to the cache
– If the word is in the cache (hit), it is read from the cache and provided to the CPU
– If the word is not in the cache (miss), the block containing that word is brought in, replacing a block currently in the cache
• Main issues
– How do we know whether the required word is in the cache?
– If a new block must replace one of the old blocks, which one should we choose?
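These hit/miss cases are often summarized by the average access time: hit time plus miss ratio times miss penalty (a standard figure of merit, added here as illustration; the timing values below are assumed):

```python
# Average access time = hit time + miss ratio * miss penalty.
def avg_access_time(t_hit: float, miss_ratio: float, t_penalty: float) -> float:
    return t_hit + miss_ratio * t_penalty

# 1 ns hit time, 5% miss ratio, 60 ns miss penalty -> 4.0 ns on average
print(avg_access_time(1.0, 0.05, 60.0))
```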
Table 5.3: Cache Access Methods
Direct Mapping
– Organization: sequence of m lines
– Mapping of main memory blocks to cache: each block of main memory maps to one unique line of cache
– Access using main memory address: Line portion of address used to access cache line; Tag portion used to check for hit on that line
Associative Mapping
– Organization: sequence of m lines
– Mapping of main memory blocks to cache: each block of main memory can map to any line of cache
– Access using main memory address: Tag portion of address used to check every line for hit on that line
Set-Associative Mapping
– Organization: sequence of m lines organized as v sets of k lines each (m = v × k)
– Mapping of main memory blocks to cache: each block of main memory maps to one unique cache set
– Access using main memory address: Line portion of address used to access cache set; Tag portion used to check every line in that set for hit on that line
Memory and Cache Mapping
• Mapping Function : Specification of correspondence between
main memory blocks and cache blocks
–Associative mapping
–Direct mapping
–Set-associative mapping
• To help discuss the three mapping techniques, we will use a common example, developed in the following slides.
Memory and Cache Mapping:
Associative Mapping
•Associative Mapping
–Any block of memory can be stored in any block location of the cache
▪→ Most flexible
–The mapping table is implemented in an associative memory (CAM)
▪→ Fast, but very expensive
–Mapping table: stores both the address and the content of the memory word
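A rough software analogy (an assumption for illustration, since the real mechanism is a hardware CAM): a Python dict keyed by the full block address behaves like an associative lookup in which any entry can hold any block:

```python
# The full block address is the key, so any entry can hold any block;
# hardware would compare all stored tags in parallel.
cam = {}                        # tag (block address) -> data block

def assoc_lookup(block_addr: int):
    return cam.get(block_addr)
```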
Memory and Cache Mapping:
Direct Mapping
• Direct Mapping
– Each memory block can be loaded into only one place in the cache.
– The mapping table is made of RAM instead of CAM.
– An n-bit memory address consists of two parts: a k-bit Index field and an (n − k)-bit Tag field.
– The full n-bit address is used to access main memory; the k-bit Index alone is used to access the cache.
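A minimal sketch of this address split (the parameters n = 15 and k = 9, matching a 32K-word main memory and a 512-word cache, are assumed here for illustration):

```python
# Split an n-bit address into the k-bit Index and (n - k)-bit Tag.
def split_address(addr: int, n: int, k: int):
    index = addr & ((1 << k) - 1)             # low k bits: cache line
    tag = (addr >> k) & ((1 << (n - k)) - 1)  # high n - k bits: block id
    return tag, index

tag, index = split_address(0o02777, 15, 9)
print(oct(tag), oct(index))                   # -> 0o2 0o777
```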
Memory and Cache Mapping:
Direct Mapping – contd.
• Addressing relationships between main and cache memories (addresses shown in octal).
Memory and Cache Mapping:
Direct Mapping – contd.
•Cache organization
Memory and Cache Mapping:
Direct Mapping – contd.
•Operation
–CPU generates a memory request with address (TAG; INDEX)
–Access the cache using INDEX, obtaining the stored (tag; data)
▪Compare TAG with tag
–If they match: Hit
▪Provide Cache[INDEX](data) to the CPU
–If they do not match: Miss
▪M[tag; INDEX] ← Cache[INDEX](data)
▪Cache[INDEX] ← (TAG; M[TAG; INDEX])
▪CPU ← Cache[INDEX](data)
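A Python rendering of this operation (assuming write-back on replacement and one word per line, as the steps above imply):

```python
def dm_access(cache, memory, TAG, INDEX, k):
    tag, data = cache[INDEX]
    if tag == TAG:                           # hit
        return data
    # miss: write the displaced word back, then load the requested one
    memory[(tag << k) | INDEX] = data        # M[tag; INDEX] <- cache data
    new_data = memory[(TAG << k) | INDEX]    # fetch M[TAG; INDEX]
    cache[INDEX] = (TAG, new_data)           # Cache[INDEX] <- (TAG, new data)
    return new_data

# Example: k = 3 (8-line cache), memory initialized to its own addresses
memory = list(range(64))
cache = [(0, memory[i]) for i in range(8)]
print(dm_access(cache, memory, TAG=5, INDEX=2, k=3))   # miss -> 42
```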
Memory and Cache Mapping:
Direct Mapping – contd.
• Direct mapping with a block size of 8 words
– 64 blocks (64 × 8 = 512 words in the cache)
• Each time a miss occurs, an entire block of 8 words must be transferred from main memory to cache memory.
Memory and Cache Mapping:
Set Associative Mapping
• Set Associative Mapping:
– Each memory block maps to one set of locations in the cache and can be loaded into any line of that set
• A set-associative cache with set size of two (a two-way set-associative mapping cache)
Memory and Cache Mapping:
Set Associative Mapping
•Operation
– CPU generates a memory request with address (TAG; INDEX)
– Access the cache with INDEX; the cache word holds two entries: (tag 0, data 0) and (tag 1, data 1)
– Compare TAG with tag 0 and then with tag 1
– If tag i = TAG for some i: Hit
▪ CPU ← data i
– If neither tag matches: Miss
▪Replace either (tag 0, data 0) or (tag 1, data 1); assume (tag 0, data 0) is selected for replacement.
▪M[tag 0, INDEX] ← Cache[INDEX](data 0)
▪Cache[INDEX](tag 0, data 0) ← (TAG, M[TAG, INDEX])
▪CPU ← Cache[INDEX](data 0)
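The same operation in Python (assumptions: a two-way set, write-back, and way 0 as the victim, as on the slide; a real cache would choose the victim with a replacement policy):

```python
def sa_access(cache, memory, TAG, INDEX, k):
    ways = cache[INDEX]                      # [(tag0, data0), (tag1, data1)]
    for tag, data in ways:
        if tag == TAG:                       # hit
            return data
    tag0, data0 = ways[0]                    # miss: replace way 0
    memory[(tag0 << k) | INDEX] = data0      # write back the victim
    new_data = memory[(TAG << k) | INDEX]
    ways[0] = (TAG, new_data)
    return new_data
```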
Block Replacement Policy
•Many different block replacement policies are
available:
– Random
▪Chooses one tag-data item for replacement at random
– FIFO (First In First Out)
▪Replaces the item that has been in the set the longest
– LRU (Least Recently Used)
▪Replaces the item that has been least recently used by
the CPU.
Block Replacement Policy (LRU) algorithm
• The easiest policy to implement: LRU (Least Recently Used)
• Implementation of LRU in set-associative mapping with set size = 2
– Cache word = (tag 0, data 0, U0); (tag 1, data 1, U1), where Ui = 0 or 1 (binary)
• Modifications
– Initially U0 = U1 = 1
– On a hit to (tag 0, data 0, U0): U0 ← 0, U1 ← 1 (way 1 becomes the least recently used)
– On a hit to (tag 1, data 1, U1): U1 ← 0, U0 ← 1 (way 0 becomes the least recently used)
– On a miss, find the least recently used way (Ui = 1)
▪ If U0 = 1 and U1 = 0, replace (tag 0, data 0):
– M[tag 0, INDEX] ← Cache[INDEX](data 0)
– Cache[INDEX](tag 0, data 0, U0) ← (TAG, M[TAG, INDEX], 0); U1 ← 1
▪ If U0 = 0 and U1 = 1, replace (tag 1, data 1): similar to above; U0 ← 1
▪ U0 = U1 = 0 cannot occur
▪ If U0 = U1 = 1, both are candidates: select one at random
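The U-bit scheme can be rendered compactly (my sketch of the rules above): U[i] = 1 marks way i as a replacement candidate.

```python
import random

def touch(U, i):
    """Hit on way i: it becomes most recently used."""
    U[i] = 0
    U[1 - i] = 1

def victim(U):
    """Pick the way to replace on a miss."""
    if U[0] != U[1]:
        return 0 if U[0] == 1 else 1
    return random.randrange(2)   # U0 = U1 = 1: both are candidates

U = [1, 1]        # initial state
touch(U, 0)       # hit on way 0 -> U = [0, 1]
print(victim(U))  # -> 1
```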
The most common replacement algorithms, in more detail:
• Least recently used (LRU)
– Most effective
– Replace that block in the set that has been in the cache longest with
no reference to it
– Because of its simplicity of implementation, LRU is the most popular
replacement algorithm
• First-in-first-out (FIFO)
– Replace that block in the set that has been in the cache longest
– Easily implemented as a round-robin or circular buffer technique
• Least frequently used (LFU)
– Replace that block in the set that has experienced the fewest
references
– Could be implemented by associating a counter with each line
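A minimal sketch of that counter-based LFU bookkeeping (illustrative; a Python dict stands in for a hardware counter per line):

```python
counts = {}                              # line -> reference count

def reference(line):
    counts[line] = counts.get(line, 0) + 1

def lfu_victim():
    return min(counts, key=counts.get)   # line with the fewest references
```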
Cache Write
• Write Through
– When writing into memory:
▪ If hit, both cache and memory are written in parallel
▪ If miss, only memory is written
(For a read miss, the missing block may be loaded into a cache line)
– (+) Memory is always up to date
▪ → Important when the CPU and DMA I/O are executing concurrently
– (−) Slow, due to the memory access time on every write
• Write-Back (Copy-Back)
– When writing into memory:
▪ If hit, only the cache is written
▪ If miss, the missing block is brought into the cache and the write is performed in the cache
(For a read miss, the candidate block must be written back to memory before being replaced)
– (−) Memory is not always up to date, i.e., the same item in cache and memory may have different values.
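The contrast between the two policies, reduced to a sketch (assuming one word per line and ignoring allocation details):

```python
def write_through(cache, memory, addr, value):
    if addr in cache:
        cache[addr] = value     # hit: update the cache ...
    memory[addr] = value        # ... and always update memory

dirty = set()                   # lines that memory does not yet reflect

def write_back(cache, memory, addr, value):
    cache[addr] = value         # write only to the cache
    dirty.add(addr)             # written back when the line is evicted
```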
Cache Timing Model
• Direct-mapped cache access
– The first operation is checking the Tag field of the address against the tag value in the line designated by the Line field
– If there is not a match (miss), the operation is complete
– If there is a match (hit), the cache hardware reads the data block from the line in the cache and then fetches the byte or word indicated by the Offset field of the address
– An advantage is that it allows simple and fast speculation
• Fully associative cache
– The line number is not known until the tag comparison is completed
– The hit time is the same as for a direct-mapped cache
– Because this is a content-addressable memory, the miss time is simply the tag comparison time
• Set-associative cache
– It is not possible to transmit bytes and compare tags in parallel, as can be done with direct-mapped with speculative access
– However, the circuitry can be designed so that the data block from each line in a set is loaded and then transmitted once the tag check is made
Table 5.6: Cache Timing Equations
Method                                  Time for hit                      Time for miss
Direct-Mapped                           thit = trl + txb + tct            tmiss = trl + tct
Direct-Mapped with Speculation          thit = trl + txb                  tmiss = trl + tct
Fully Associative                       thit = trl + txb + tct            tmiss = tct
Set-Associative                         thit = trl + txb + tct            tmiss = trl + tct
Set-Associative with Way Prediction     thit = trl + txb + (1 − Fp)tct    tmiss = trl + tct
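A worked illustration of these equations (all timing values, and the reading of trl as line read time, txb as byte/word transmit time, tct as tag compare time, and Fp as the fraction of correct way predictions, are my assumptions):

```python
# Assumed timings in ns.
t_rl, t_xb, t_ct = 2.0, 1.0, 1.0
Fp = 0.9

print(t_rl + t_xb + t_ct)             # direct-mapped hit:   4.0
print(t_rl + t_xb)                    # with speculation:    3.0
print(t_rl + t_xb + (1 - Fp) * t_ct)  # with way prediction: 3.1
```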
Copyright
This work is protected by United States copyright laws and is provided solely for the use of instructors in teaching their courses and assessing student learning. Dissemination or sale of any part of this work (including on the World Wide Web) will destroy the integrity of the work and is not permitted. The work and materials from it should never be made available to students except by instructors using the accompanying text in their classes. All recipients of this work are expected to abide by these restrictions and to honor the intended pedagogical purposes and the needs of other instructors who rely on these materials.