
Analyzing the Processor Bottlenecks in SPEC CPU 2000

Joshua Yi (Freescale Semiconductor Inc.)

Ajay Joshi (Univ. of Texas)
Resit Sendag (Univ. of Rhode Island)
Lieven Eeckhout (Ghent Univ.)
David Lilja (Univ. of Minnesota)

SPEC Benchmarking Workshop

January 23, 2006

Presentation Overview

• Bottleneck Characterization

• Plackett & Burman Design

• Performance / Power Bottlenecks

• Benchmark Classification

• Summary

Bottleneck Characterization
• Rank processor parameters (X) based on their effect on Y

Processor Parameters (X): X1, X2, X3, …, XN
e.g., Cache Size, Num. of ALUs

Processor Parameters (X) and Benchmark + Input Set → Microprocessor Performance Model → Performance Measure (Y)

Performance Measure (Y)
e.g., Cycles-Per-Instruction, Energy-Per-Instruction, Energy-Delay Product

• Statistical Techniques for Ranking Parameters

– ANOVA - Captures All Interactions - But 2^N Test Cases
– One-at-a-Time - N Test Cases - But Only Single Parameter Effects

Plackett & Burman (P&B) Design

• Efficient screening design to quantify significance of parameters
• Vary values of X parameters simultaneously over 2N test cases
  (N is the next multiple of 4 greater than X)
• Possible values of parameters
+1 : Higher than normal value (e.g., Num. of ALUs = 8)
-1 : Lower than normal value (e.g., Num. of ALUs = 1)

• Amount of Information
– All single parameter effects (X1, X2 … XN)
– Two parameter interactions (X1X2, X1X3 ….)
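
A minimal sketch of how such a design matrix can be built, assuming the standard P&B construction: each row is a circular right shift of the published first row, and the last row is all -1. It also assumes the 2N test-case count comes from foldover (appending the sign-flipped copy of every row), which the slides do not state explicitly. The function names `pb_run_count` and `pb_design` are hypothetical.

```python
# First row of the 8-run P&B design used in the mechanics example.
FIRST_ROW = [+1, +1, +1, -1, +1, -1, -1]


def pb_run_count(num_params, foldover=True):
    """Number of test cases: N is the next multiple of 4 greater than
    the number of parameters; foldover (assumed) doubles it to 2N."""
    n = 4 * (num_params // 4 + 1)
    return 2 * n if foldover else n


def pb_design(first_row, foldover=False):
    """Build the design matrix from its first row by circular shifts."""
    rows = []
    row = list(first_row)
    for _ in range(len(first_row)):
        rows.append(row)
        row = [row[-1]] + row[:-1]  # circular right shift by one
    rows.append([-1] * len(first_row))  # final row: every parameter low
    if foldover:
        # Mirror every row with all signs flipped (assumed foldover step).
        rows += [[-v for v in r] for r in rows]
    return rows


if __name__ == "__main__":
    for r in pb_design(FIRST_ROW):
        print(" ".join(f"{v:+d}" for v in r))
    print(pb_run_count(43))  # 88
```

For 43 parameters the next multiple of 4 is 44, so the assumed foldover design needs 88 test cases, which matches the 88 configurations reported in the experiment framework.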

Plackett & Burman Mechanics

+1: High Value for Parameter (e.g., Number of Integer ALUs = 8)
-1: Low Value for Parameter (e.g., Number of Integer ALUs = 1)
1st Row From PB Paper

Run         X1    X2    X3    X4    X5    X6    X7    Execution Time
1           +1    +1    +1    -1    +1    -1    -1     9
2           -1    +1    +1    +1    -1    +1    -1    11
3           -1    -1    +1    +1    +1    -1    +1    20
4           +1    -1    -1    +1    +1    +1    -1     1
5           -1    +1    -1    -1    +1    +1    +1     1
6           +1    -1    +1    -1    -1    +1    +1     9
7           +1    +1    -1    +1    -1    -1    +1    19
8           -1    -1    -1    -1    -1    -1    -1    74
Effect     -68   -64   -46   -42   -82  -100   -46

Effect of X6: -1*9 + 1*11 - 1*20 + 1*1 + 1*1 … - 1*74 = -100
Most Significant Parameter: X6 (largest effect magnitude)
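
The effect row in the example can be reproduced with a short sketch: each parameter's effect is the sum of the execution times, each weighted by that parameter's +1 or -1 setting in the corresponding run. The design matrix and execution times below are copied from the slide; the function name `effects` is hypothetical.

```python
# Design matrix (rows = runs, columns = X1..X7) and measured execution times.
DESIGN = [
    [+1, +1, +1, -1, +1, -1, -1],
    [-1, +1, +1, +1, -1, +1, -1],
    [-1, -1, +1, +1, +1, -1, +1],
    [+1, -1, -1, +1, +1, +1, -1],
    [-1, +1, -1, -1, +1, +1, +1],
    [+1, -1, +1, -1, -1, +1, +1],
    [+1, +1, -1, +1, -1, -1, +1],
    [-1, -1, -1, -1, -1, -1, -1],
]
EXEC_TIME = [9, 11, 20, 1, 1, 9, 19, 74]


def effects(design, responses):
    """Effect of each parameter: sum over runs of (setting * response)."""
    num_params = len(design[0])
    return [
        sum(row[j] * y for row, y in zip(design, responses))
        for j in range(num_params)
    ]


result = effects(DESIGN, EXEC_TIME)
# The most significant parameter has the largest effect magnitude.
most_significant = max(range(len(result)), key=lambda j: abs(result[j]))

print(result)                       # [-68, -64, -46, -42, -82, -100, -46]
print(f"X{most_significant + 1}")   # X6
```

Only the magnitude matters for ranking. The sign shows direction: here every effect is negative, meaning the high value of each parameter is associated with lower execution time.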

Finding Significant Bottlenecks

1. Execute Plackett and Burman Design
– Run Simulations
– Calculate Effect of All Parameters
2. For Each Benchmark
– Sort Parameters in Descending Order of Effect Magnitude
– Rank the Parameters (1 = Most Important)
3. Across Benchmarks, Average the Ranks
4. Parameters with the Lowest Average Rank are the Most Significant

Example: effect → rank for one benchmark
X1 = 100 → 5
X2 = 200 → 1
X3 = 150 → 3
X4 = 120 → 4
X5 = 175 → 2

Example: ranks in three benchmarks and their average
Parameter   Rank 1   Rank 2   Rank 3   Average
X1          5        4        5        4.7
X2          1        3        2        2.0
X3          3        2        1        2.0
X4          4        1        2        2.3
X5          2        5        4        3.7
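
The ranking and averaging steps can be sketched as follows, using the example numbers from this slide. The function names are hypothetical, and ties are not given any special treatment. The third rank column is copied from the slide as-is, including its repeated rank of 2.

```python
def rank_by_magnitude(effects):
    """Rank parameters by effect magnitude; 1 = most important."""
    order = sorted(range(len(effects)),
                   key=lambda i: abs(effects[i]), reverse=True)
    ranks = [0] * len(effects)
    for position, index in enumerate(order, start=1):
        ranks[index] = position
    return ranks


def average_ranks(per_benchmark_ranks):
    """Average each parameter's rank across benchmarks (one list each)."""
    count = len(per_benchmark_ranks)
    return [round(sum(col) / count, 1) for col in zip(*per_benchmark_ranks)]


# Effects of X1..X5 for one benchmark, as in the slide example.
first = rank_by_magnitude([100, 200, 150, 120, 175])
print(first)  # [5, 1, 3, 4, 2]

# Rank columns for three benchmarks, taken from the slide.
averages = average_ranks([first, [4, 3, 2, 1, 5], [5, 2, 1, 2, 4]])
print(averages)  # [4.7, 2.0, 2.0, 2.3, 3.7]
```

The lowest averages (X2 and X3 at 2.0) mark the parameters that matter most across the three benchmarks.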

Experiment Framework
• Plackett and Burman Design
– 43 parameters (processor core and memory core) of a superscalar microprocessor
– 88 (very) different processor configurations
• Simulation Environment
– SimpleScalar Simulator
– sim-outorder performance model
• Benchmarks
– SPEC CPU2000 benchmarks: 46 program-input pairs (ref)
– Alpha binaries compiled at -O3

P&B High/Low Values – Processor Core

Parameter               Low Value      High Value
Fetch Queue Entries     4              32
Branch Predictor        2-Level        Perfect
Branch MPred Penalty    10 Cycles      2 Cycles
RAS Entries             4              64
BTB Entries             16             512
BTB Assoc               2-Way          Fully-Assoc
Spec Branch Update      In Commit      In Decode
Decode/Issue Width      4-Way
ROB Entries             8              64
LSQ Entries             0.25 * ROB     1.0 * ROB
Memory Ports            1              4

P&B High/Low Values – Functional Units
Parameter               Low Value      High Value
Int ALUs                1              4
Int ALU Latency         2 Cycles       1 Cycle
Int ALU Throughput      1
FP ALUs                 1              4
FP ALU Latency          5 Cycles       1 Cycle
FP ALU Throughput       1
Int Mult/Div Units      1              4
Int Mult Latency        15 Cycles      2 Cycles
Int Div Latency         80 Cycles      10 Cycles
Int Mult Throughput     1
Int Div Throughput      Equal to Int Div Latency
FP Mult/Div Units       1              4
FP Mult Latency         5 Cycles       2 Cycles
FP Div Latency          35 Cycles      10 Cycles
FP Sqrt Latency         35 Cycles      15 Cycles
FP Mult Throughput      Equal to FP Mult Latency
FP Div Throughput       Equal to FP Div Latency
FP Sqrt Throughput      Equal to FP Sqrt Latency

P&B High/Low Values – Memory System (1)

Parameter                 Low Value      High Value
L1 I-Cache Size           4 KB           128 KB
L1 I-Cache Assoc          1-Way          8-Way
L1 I-Cache Block Size     16 Bytes       64 Bytes
L1 I-Cache Repl Policy    Least Recently Used
L1 I-Cache Latency        4 Cycles       1 Cycle
L1 D-Cache Size           4 KB           128 KB
L1 D-Cache Assoc          1-Way          8-Way
L1 D-Cache Block Size     16 Bytes       64 Bytes
L1 D-Cache Repl Policy    Least Recently Used
L1 D-Cache Latency        4 Cycles       1 Cycle
L2 Cache Size             256 KB         8192 KB
L2 Cache Assoc            1-Way          8-Way
L2 Cache Block Size       64 Bytes       256 Bytes

P&B High/Low Values – Memory System (2)
Parameter                 Low Value      High Value
L2 Cache Repl Policy      Least Recently Used
L2 Cache Latency          20 Cycles      5 Cycles
Mem Latency, First        200 Cycles     50 Cycles
Mem Latency, Next         0.02 * Mem Latency, First
Mem Bandwidth             4 Bytes        32 Bytes
I-TLB Size                32 Entries     256 Entries
I-TLB Page Size           4 KB           4096 KB
I-TLB Assoc               2-Way          Fully-Assoc
I-TLB Latency             80 Cycles      30 Cycles
D-TLB Size                32 Entries     256 Entries
D-TLB Page Size           Same as I-TLB Page Size
D-TLB Assoc               2-Way          Fully-Assoc
D-TLB Latency             Same as I-TLB Latency
Memory Ports              1              4

Most Significant Performance Bottlenecks

Rank  Parameter                   gzip (graphic)  gcc (200)  mcf  equake
1     ROB Entries                 1               2          3    2
2     L2 Cache Size               11              1          1    8
3     Memory Latency First        13              3          2    1
4     L2 Cache Latency            7               4          5    5
5     Branch Predictor Accuracy   2               5          8    11
6     L1 I-Cache Size             17              8          16   42
7     Number of Integer ALUs      3               6          9    37
8     Load Store Queue Entries    5               13         7    6
9     L1 D-Cache Latency          4               7          22   12
10    L1 I-Cache Block Size       29              10         34   34
11    Memory Bandwidth            23              11         4    4
12    L1 D-Cache Size             12              35         33   14

Most Significant Power Bottlenecks
Rank  Parameter                   gzip (graphic)  gcc (200)  mcf  equake
1     BTB Associativity           3               1          3    2
2     BTB Entries                 2               2          4    3
3     Branch Predictor Accuracy   1               3          11   12
4     Memory Latency First        28              6          1    1
5     L2 Cache Latency            13              4          6    11
6     L1 I-Cache Size             4               8          10   10
7     L2 Cache Size               5               39         2    8
8     ROB Entries                 16              19         7    4
9     L1 D-Cache Size             7               5          8    6
10    L1 D-Cache Block Size       23              7          19   9
11    Memory Bandwidth            25              12         5    7
12    Number of Integer ALUs      6               13         29   21

Similarity Between Benchmarks

• P&B Bottleneck Characterization for each benchmark-input set
  e.g., Vector of 43 ranks for each Benchmark: < 1, 22, 41, 5, 3 ………. >
• Remove Correlation & Reduce Dimensions using Principal Component Analysis
• Apply Clustering Algorithm (e.g., K-means, Hierarchical) to group programs

Classification Intuition:
Similar Effect → Similar Significant Parameters → Similar Bottlenecks
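
A minimal sketch of the grouping idea, under loud assumptions: it clusters only the four benchmarks and the 12 parameter ranks shown in the performance bottleneck table (not the full 43-entry vectors), it skips the Principal Component Analysis step for brevity, and it uses plain agglomerative clustering with Euclidean distance and average linkage. The slides do not say which distance or linkage was used, so those choices and all function names here are hypothetical.

```python
import math
from itertools import combinations

# Per-benchmark ranks of the 12 parameters in the performance table
# (a 12-entry slice of the 43-entry rank vectors described above).
RANKS = {
    "gzip-graphic": [1, 11, 13, 7, 2, 17, 3, 5, 4, 29, 23, 12],
    "gcc-200":      [2, 1, 3, 4, 5, 8, 6, 13, 7, 10, 11, 35],
    "mcf":          [3, 1, 2, 5, 8, 16, 9, 7, 22, 34, 4, 33],
    "equake":       [2, 8, 1, 5, 11, 42, 37, 6, 12, 34, 4, 14],
}


def distance(a, b):
    """Euclidean distance between two rank vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def average_linkage(c1, c2, vectors):
    """Mean pairwise distance between the members of two clusters."""
    pairs = [(p, q) for p in c1 for q in c2]
    return sum(distance(vectors[p], vectors[q]) for p, q in pairs) / len(pairs)


def cluster(vectors, k):
    """Merge the two closest clusters repeatedly until k remain."""
    clusters = [[name] for name in vectors]
    while len(clusters) > k:
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: average_linkage(clusters[ij[0]],
                                           clusters[ij[1]], vectors),
        )
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]  # j > i, so index i is unaffected
    return [sorted(c) for c in clusters]


print(cluster(RANKS, 3))
print(cluster(RANKS, 2))
```

On this small slice, gcc-200 and mcf merge first because their rank vectors are the closest pair. With the full rank vectors and PCA applied first, the groupings could differ.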

Classification Across All Bottlenecks

Processor Core Bottlenecks

Cluster  Benchmarks
1        gcc-expr, gcc-200, gcc-scilab
2        gzip-graphic, gzip-program, gzip-random, gzip-source
3        eon-cook, eon-kajiya, eon-rushmeier, crafty
4        galgel, equake, facerec, fma3d, sixtrack, perlbmk-makerand, perlbmk-splitmail_850, perlbmk-splitmail_957, gap, bzip2-graphic, bzip2-program, bzip2-source, twolf, apsi
5        wupwise
6        mcf, ammp, perlbmk-splitmail_535, perlbmk-splitmail_704, vortex-1, vortex-3
7        gcc-166, gcc-integrate
8        lucas
9        swim, mgrid, applu
10       gzip-log, parser
11       vpr-route, mesa, art-110, art-470, perlbmk-diffmail, vortex-2

Data Memory Bottlenecks
Cluster  Benchmarks
1        gcc-166, gcc-integrate, lucas
2        vpr-route, galgel, facerec, equake, parser, bzip2-graphic, bzip2-program, bzip2-source, apsi
3        art-110, art-470, mcf, ammp, twolf
4        wupwise, swim, mgrid, applu
5        mesa, crafty, fma3d, eon-cook, eon-kajiya, eon-rushmeier, perlbmk-diffmail, perlbmk-makerand, gap, vortex-1, vortex-2, vortex-3
6        gcc-200, gcc-expr, gcc-scilab
7        gzip-graphic, gzip-log, gzip-program, gzip-random, gzip-source, sixtrack, perlbmk-splitmail_850, perlbmk-splitmail_957
8        perlbmk-splitmail_535, perlbmk-splitmail_704

Instruction Memory Bottlenecks

Cluster  Benchmarks
1        gzip-graphic, gzip-log, gzip-random, gzip-source, art-110, art-470, facerec, ammp, parser, bzip2-graphic, bzip2-program, bzip2-source
2        mesa, crafty, fma3d, eon-cook, eon-kajiya, eon-rushmeier, perlbmk-makerand,
3        vpr-route, galgel, perlbmk-splitmail_535, perlbmk-splitmail_704
4        applu, gcc-166, gcc-integrate, lucas
5        perlbmk-diffmail, vortex-1, vortex-2, vortex-3
6        wupwise, swim, mgrid, gcc-200, gcc-expr, gcc-scilab
7        gzip-program, perlbmk-splitmail_850, mcf, equake, sixtrack, perlbmk-splitmail_957, twolf, apsi

Control Flow Bottlenecks
Cluster  Benchmarks
1        gzip-log, parser
2        gzip-graphic, gzip-program, gzip-random, gzip-source, perlbmk-splitmail_535, perlbmk-splitmail_704, perlbmk-splitmail_850, perlbmk-splitmail_957, gap, vortex-1, vortex-2, vortex-3, bzip2-graphic, bzip2-program, bzip2-source
3        mesa, equake, crafty, facerec, sixtrack, eon-cook, eon-kajiya, eon-rushmeier, perlbmk-makerand
4        swim, galgel, art-110, art-470, mcf, ammp, fma3d, apsi
5        wupwise, vpr-route, twolf
6        mgrid, applu, gcc-166, gcc-integrate
7        gcc-200, gcc-expr, gcc-scilab
8        lucas

Classification Across All Bottlenecks

Cluster  Benchmarks
1        mesa, crafty, eon-cook, eon-kajiya, eon-rushmeier, perlbmk-makerand
2        perlbmk-splitmail_535, perlbmk-splitmail_704
3        perlbmk-diffmail, vortex-1, vortex-2, vortex-3
4        wupwise, swim, mgrid, equake, fma3d, sixtrack, gap
5        applu, gcc-166, gcc-integrate
6        gzip-graphic, gzip-program, gzip-random, gzip-source, perlbmk-splitmail_850, perlbmk-splitmail_957
7        gcc-200, gcc-expr, gcc-scilab
8        gzip-log, parser, bzip2-graphic, bzip2-program, bzip2-source
9        mcf, facerec, ammp, twolf, apsi
10       vpr-route, galgel, art-110, art-470
11       lucas

Summary
• Plackett & Burman bottleneck characterization
– Computer Architect – Understand Bottlenecks
– Benchmark Designer – Similarity & Diversity
• Bottleneck Characterization of SPEC CPU2000
– ROB entries, L2 cache size, L1 I-cache size, and memory latency are key bottlenecks
– Overall power and performance bottlenecks are similar (except BTB entries)
– Bottlenecks for gzip, gcc, and perlbmk depend on the input set
– lucas has the most unique bottleneck characteristics
