Workload Characterization

Uploaded by

Momoh Gaius

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

77 views5 pages

Workload Characterization

Uploaded by

Momoh Gaius

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

WORKLOAD CHARACTERIZATION

This refers to the demands placed by the requests on the various system resources. An important
feature of computer systems is that their performance depends dramatically on the workload
(or simply load) they are subjected to. The load characterizes the quantity and the nature of
requests submitted to the system.
Workload characterization is one of the central issues in performance evaluation because it is
not always clear what aspects of the workload are important, in how much detail the workload
should be recorded, and how the workload should be represented and used.
Regardless of which performance evaluation technique is used, we need to provide input to the
model or real system under study. Many new computer and network applications and
programming paradigms are constantly emerging. Understanding the characteristics of today’s
emerging workloads is essential to design efficient and cost-effective architectures for them. It
is important to characterize web servers, database systems, and transaction processing systems,
multimedia, networks, ATM switches, and scientific workloads.
It is also useful to design models for workloads. An accurate characterization of application
and operation system behavior leads to improved architectures and designs. Analytical
modeling of workloads is a challenge and needs to be performed carefully. This is because it
takes significant amounts of time to perform trace-driven or execution-driven simulations due
to the increased complexity of the processor, memory subsystem, and the workload domain.
Quantitative characterization of workloads can help significantly in the creation and validation
of analytic models. They can capture the essential features of systems and workloads, which
can be helpful in providing early predication about the design. Moreover, quantitative and
analytical characterization of workloads is important in understanding and exploiting their
interesting features. Figure 1.1 depicts an overall block diagram of workload characterization
process.

FIGURE 1.1. Overall workload characterization process.

In this context, there are two types of relevant inputs: (a) parameters that can be controlled by
the system designer, such as resource allocation buffering technique and scheduling schemes,
and (b) input generated by the environments in which the system under study is used such as
interarrival times. Such inputs are used to drive the real system if the measurement technique
or the simulation model is used. They also can be used to determine adequate distributions for
the analytic and simulation models. In the published literature, such inputs are often called
workloads.
Workload characterization is considered an important issue in performance evaluation, as it is
not always clear what
(a) level of detail the workload should have
(b) aspects of the workload are significant, and
(c) method to be used to represent the workload.

Workload Model
 Workload characterization only builds a model of the real workload, since not every
aspect of the real workload may be captured or is relevant.
 A workload model may be executable or non-executable. For example, recording the
arrival instants and service durations of jobs creates an executable model, whereas only
determining the distributions creates a non-executable model.
 An executable model need not be a record of inputs, it can also be a program that
generates the inputs.
 Executable workloads are useful in direct measurements and trace-driven simulations,
whereas non-executable workloads are useful for analytic modeling and distribution-
driven simulations.
In workload characterization, the term ‘‘user’’ may or may not be a human being. In most
related literature, the term ‘‘workload component’’ or ‘‘workload unit’’ is used instead of user.
This means that workload characterization attempts to characterize a typical component.
Examples of workload components include (a) applications such as website, e-mail service, or
program development (b) sites such as several sites for the same company, and (c) user sessions
such as monitoring complete sessions from user login and logout and applications that can be
run during such sessions. Measured quantities, requests, and resource demands used to
characterize the workload are called parameters. Transaction types include (a) packet sizes, (b)
source and destination of packets, and (c) instructions.
In general, workload parameters are preferable over system parameters for the characterization
of workloads. The parameters of significant impact are included, whereas those of minor
impact are usually excluded.
Among the techniques that can be used to specify workload are:
(a) Averaging
The averaging is the simplest scheme. It relies on presenting a single number that summarizes
the parameter values observed, such as arithmetic mean, median/mode/geometric or harmonic
means. The arithmetic means may not be appropriate for certain applications. In such cases,
the median, mode, geometric means, and harmonic means are used. For example, in the case
of addresses in a network, the mean or median is meaningless, therefore, the mode is often
chosen.
(b) Single-parameter histogram
In the single-parameter histogram scheme, we use histograms to show the relative frequencies
of various values of the parameter under consideration. The drawback of using this scheme is
that when using individual-parameter histograms, these histograms ignore the correlation
among various parameters.
(c) Multi-parameter histogram
To avoid the problem of correlation among different parameters in the single-parameter
scheme, the multi-parameter scheme is often used. In the latter scheme, a k-dimensional
histogram is constructed to describe the distribution of k workload parameters. The difficulty
with the same technique is that it is not easy to construct joint histograms for more than two
parameters.

(d) Markov models.

Markov models are used in cases when the next request is dependant only on the last request.
In general, we can say that if the next state of the system under study depends only on the
current state, then the overall systems’ behavior follows the Markov model. Markov models
are often used in queuing analysis. We can illustrate the model by a transition matrix that gives
the values of the probabilities of the next state given present state.
(e) Clustering
The clustering scheme is used when the measured workload is made of a huge number of
components. In such a case, these huge components are categorized into a small number of
clusters such that the components in one cluster are as similar to each other as possible. This is
almost similar to what is used in clustering in pattern recognition. One class member may be
selected from each cluster to be its representative and to conduct the needed study to find out
what system design decisions are needed for that cluster/group.
(f) use of dispersion measures such as coefficient of variation (COV)
The use of dispersion measure can give better information about the variability of the data, as
the mean scheme alone is insufficient in cases where the variability in the data set is large. The
variability can be quantified using the variance, standard deviation or the COV.
𝑛

𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒 = 𝑆 2 = 1/(𝑛 − 1) ∑(𝑥𝑖 − 𝑥̅ )2

𝑖=1

and 𝐶𝑂𝑉 = 𝑆/𝑥̅

where
𝑆 2 = sample variance
𝑥𝑖 = the value of the one observation
𝑥̅ = the sample mean or the mean value of all observations
𝑛 = the number of observations
A high COV means high variance, which means in such a case, the mean is not sufficient. A
zero COV means that the variance is zero, and in such a case, the mean value gives the same
information as the complete data set.
(g) Principal component analysis.
The principal-component analysis is used to categorize workload components using the
weighted sum of their parameter values. If 𝑑𝑖 is the weight for the 𝑖 𝑡ℎ parameter 𝑥𝑖 , then the
weighted sum W is as follows:
𝑘

𝑊 = ∑ 𝑑𝑖 𝑥𝑖
𝑖=1

The value of 𝑊𝑖 is called the principal factor or principal component. In general, if we are given
a set of k parameters, such as x1, x2,…, xn, then the principal component analysis produces a
set of factors and W1, W2,…, Wk, such that: (a) the W’s are linear combinations of x’s, (b) the
W’s form an orthogonal set, which means that their inner product is zero: Inner Product =
∑ 𝑊𝑗 × 𝑊𝑗 = 0.

1.7.2 Case Study: Website Characterization

The phenomenal growth of the World-Wide Web (WWW), in both the volume of information
on it and the numbers of users desiring access to it, is dramatically increasing the performance
requirements for large-scale information servers. WWW server performance is a central issue
in providing universal, reliable, and efficient information access.
It is important that the WWW traffic workload be understood as it is crucial in the analysis of
a server’s performance. Capturing the main characteristics of such systems, such as the
distributions of file sizes and buffering schemes, is vital to provide a quantitative measure of
the aggregate overall advantage of a particular server system’s optimization. Workload
generators that can be used for such systems include SpecWeb96, WebStone, and SURGE.
In the characterization of a web server, we need to choose parameters that best describe the
characteristics of the workload of the servers and system software used, monitor the systems
to obtain some raw performance data, analyze performance data, and finally construct a
workload model of the system under investigation. Workload characterization allows us to
understand the current state of the system under investigation. Characterizing workload is also
essential to the design of new system components.
1.8 PERFORMANCE EVALUATION CHECKLIST
PE1 Define your goal. For example: dimension of the system, find the overload behaviour;
evaluate alternatives. Do you need a performance evaluation study? Aren’t the results obvious?
Are they too dependent on the input factors, which are arbitrary?
PE2 Identify the factors. What are all the factors? Are there external factors which need to be
controlled?
PE3 Define your metrics. For example: response time, server occupancy, number of
transactions per hour, Joule per Megabyte. Define not only what is measured but also under
which condition or sampling method. If the metric is multidimensional, different metric values
are not always comparable and there may not be a best metric value. However, there may be
non-dominated metric values.
PE4 Define the offered load. How is it expressed: transactions per second, number of users,
number of visits per hour? Is it measured on a real system? Artificial load generated by a
simulator, by a synthetic load generator? Load model in a theoretical model?
PE5 Know your bottlenecks. The performance often depends only on a small number of
factors, often those whose utilization (= load/capacity) is high. Make sure what you are
evaluating is one of them.
PE6 Know your system well. Know the system you are evaluating and list all factors. Use
evaluation tools that you know well. Know common performance patterns for your system.

CMG Workload Correlation and Virtualization
No ratings yet
CMG Workload Correlation and Virtualization
11 pages
CMP 415
No ratings yet
CMP 415
5 pages
Workload & WL Characterization
No ratings yet
Workload & WL Characterization
34 pages
San 06 ISPASS
No ratings yet
San 06 ISPASS
10 pages
2nd Unit
No ratings yet
2nd Unit
10 pages
A Systematic Approach To Performance Evaluation
No ratings yet
A Systematic Approach To Performance Evaluation
6 pages
Cours Part 12
No ratings yet
Cours Part 12
69 pages
CSC 417 Note
No ratings yet
CSC 417 Note
5 pages
Characterization
No ratings yet
Characterization
44 pages
10.1201 b16328 Previewpdf
No ratings yet
10.1201 b16328 Previewpdf
47 pages
Workload Characterization Guide
No ratings yet
Workload Characterization Guide
52 pages
Best Capacity Planning Method For Evaluating Large Systems
No ratings yet
Best Capacity Planning Method For Evaluating Large Systems
7 pages
Ch1a Slides
No ratings yet
Ch1a Slides
33 pages
Lec 2
No ratings yet
Lec 2
39 pages
Problem Statement # 01: Citizen Care Systems
No ratings yet
Problem Statement # 01: Citizen Care Systems
3 pages
Workloads 02 Tutorial
No ratings yet
Workloads 02 Tutorial
149 pages
Week 6 CH 16 Distributed Processors
No ratings yet
Week 6 CH 16 Distributed Processors
8 pages
Accurately Recreating Web Workloads Using Production Data
No ratings yet
Accurately Recreating Web Workloads Using Production Data
29 pages
CH 12
No ratings yet
CH 12
53 pages
Software Performance Workload Modelling
No ratings yet
Software Performance Workload Modelling
6 pages
Assignment II
No ratings yet
Assignment II
5 pages
Performance Modeling and Design of Computer Systems: Queueing Theory in Action
67% (3)
Performance Modeling and Design of Computer Systems: Queueing Theory in Action
574 pages
Guidelines For Git Business Case
No ratings yet
Guidelines For Git Business Case
6 pages
Network Management & Design Intro
No ratings yet
Network Management & Design Intro
23 pages
Software Testing Interview Guide
No ratings yet
Software Testing Interview Guide
5 pages
Topic3 Performance Concepts PDF
No ratings yet
Topic3 Performance Concepts PDF
42 pages
INFO 6055 Week 2 A
No ratings yet
INFO 6055 Week 2 A
14 pages
System Fundamentals
No ratings yet
System Fundamentals
58 pages
Dept of Cse & It VSSUT, Burla
No ratings yet
Dept of Cse & It VSSUT, Burla
6 pages
Vdoc - Pub Performance Modeling and Design of Computer Systems Queueing Theory in Action
No ratings yet
Vdoc - Pub Performance Modeling and Design of Computer Systems Queueing Theory in Action
574 pages
Performance Testing Presentation On 03july
No ratings yet
Performance Testing Presentation On 03july
36 pages
IT Infrastructure Performance Guide
No ratings yet
IT Infrastructure Performance Guide
42 pages
Performance and Evaluation CSC416 ECU Final
No ratings yet
Performance and Evaluation CSC416 ECU Final
39 pages
Mcabcamsc Project
No ratings yet
Mcabcamsc Project
23 pages
Intro
No ratings yet
Intro
10 pages
Online Job Portal
No ratings yet
Online Job Portal
126 pages
Traditional Design Tools: Tools of The System Analyst
No ratings yet
Traditional Design Tools: Tools of The System Analyst
19 pages
L36 - IO Perf Measures
No ratings yet
L36 - IO Perf Measures
10 pages
Existing Systems: Customer Description, Center Description, Ser-Vice Demands
No ratings yet
Existing Systems: Customer Description, Center Description, Ser-Vice Demands
22 pages
Performance Evaluation Guide
No ratings yet
Performance Evaluation Guide
59 pages
Network Management System
100% (1)
Network Management System
10 pages
Requirement Analysis & Specification
No ratings yet
Requirement Analysis & Specification
10 pages
Software Testing Interview Guide
No ratings yet
Software Testing Interview Guide
5 pages
Practitioner Flashcards Fronts
No ratings yet
Practitioner Flashcards Fronts
15 pages
Unit 5
No ratings yet
Unit 5
34 pages
Software Project Management
No ratings yet
Software Project Management
113 pages
RNC Troubleshoot
No ratings yet
RNC Troubleshoot
16 pages
Computer Performance Evaluation
100% (1)
Computer Performance Evaluation
6 pages
Lesson 17 Requirements Discovery
No ratings yet
Lesson 17 Requirements Discovery
24 pages
Perf Teaching 3
No ratings yet
Perf Teaching 3
16 pages
Network Measurement for CS Students
No ratings yet
Network Measurement for CS Students
17 pages
08 Handout 1
No ratings yet
08 Handout 1
3 pages
Computer Network Management: Best Practices
No ratings yet
Computer Network Management: Best Practices
9 pages
Bronchial Artery Pseudoaneurysm and Mediast - 2021 - Archivos de Bronconeumolog
No ratings yet
Bronchial Artery Pseudoaneurysm and Mediast - 2021 - Archivos de Bronconeumolog
2 pages
Software Testing
No ratings yet
Software Testing
6 pages
Changes in Control Status of COPD Over Time and T - 2021 - Archivos de Bronconeu
No ratings yet
Changes in Control Status of COPD Over Time and T - 2021 - Archivos de Bronconeu
8 pages
Cybercrime and Cybersecurity in Africa
No ratings yet
Cybercrime and Cybersecurity in Africa
6 pages
Software Engineering Q&A
No ratings yet
Software Engineering Q&A
8 pages
SW Project Management
No ratings yet
SW Project Management
10 pages
Software Maintenance
No ratings yet
Software Maintenance
7 pages
Software Analysis
No ratings yet
Software Analysis
11 pages
Sftware Requirement
No ratings yet
Sftware Requirement
8 pages
Software Design Interface
No ratings yet
Software Design Interface
7 pages
Assignment Unit1
No ratings yet
Assignment Unit1
2 pages
Medical Education Assessment Methods
No ratings yet
Medical Education Assessment Methods
12 pages
Using Computer Simulations To Enhance Science Teaching and Learning
100% (1)
Using Computer Simulations To Enhance Science Teaching and Learning
10 pages
Sophie Fini Whitepaper
No ratings yet
Sophie Fini Whitepaper
33 pages
1 s2.0 S0003687021003173 Main
No ratings yet
1 s2.0 S0003687021003173 Main
8 pages
[Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 243] Shuai Liu, Matt Glowatz, Marco Zappatore, Honghao Gao, Bing Jia, Alberto Bucciero - e-Learning, e-Education, and Onl
No ratings yet
[Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 243] Shuai Liu, Matt Glowatz, Marco Zappatore, Honghao Gao, Bing Jia, Alberto Bucciero - e-Learning, e-Education, and Onl
387 pages
Able Seafarer Engine
100% (2)
Able Seafarer Engine
197 pages
M.Tech Cad - Cam-2010
No ratings yet
M.Tech Cad - Cam-2010
60 pages
BusInt ERPsim Outline Latest
No ratings yet
BusInt ERPsim Outline Latest
5 pages
Project Plan Scope of Work Timeline: First Semester
No ratings yet
Project Plan Scope of Work Timeline: First Semester
2 pages
Using Games and Simulations To Support Learning in The Classroom
No ratings yet
Using Games and Simulations To Support Learning in The Classroom
32 pages
List of Finite Element Software Packages - 2 PDF
No ratings yet
List of Finite Element Software Packages - 2 PDF
3 pages
ACU Fusion 360 PDF
No ratings yet
ACU Fusion 360 PDF
2 pages
Generating Production Profiles For An-Oil Field
No ratings yet
Generating Production Profiles For An-Oil Field
6 pages
Ansys All You Need To Know About Hardware For Simulation
No ratings yet
Ansys All You Need To Know About Hardware For Simulation
36 pages
PHD Electrical Engineering Thesis PDF
100% (2)
PHD Electrical Engineering Thesis PDF
7 pages
Checklist For Planning and Conducting A Radiography Source (Class 7-Radioactive) Emergency Response Exercise
No ratings yet
Checklist For Planning and Conducting A Radiography Source (Class 7-Radioactive) Emergency Response Exercise
7 pages
CDMA RF Network Optimization Guidebook
No ratings yet
CDMA RF Network Optimization Guidebook
223 pages
THE THREE-POINT SNIPER'S MANUAL - A Complete Guide To Becoming An Elite Shooter
No ratings yet
THE THREE-POINT SNIPER'S MANUAL - A Complete Guide To Becoming An Elite Shooter
43 pages
Mechanism and Machine Theory: J.S. Rao
No ratings yet
Mechanism and Machine Theory: J.S. Rao
28 pages
Introduction To Informatics Assignment
No ratings yet
Introduction To Informatics Assignment
12 pages
Peace Corps OST Learning Objectives
No ratings yet
Peace Corps OST Learning Objectives
13 pages
Vertical Integration of Simulation Environments & Automated Test Suite For Validation of JLR Adas Features
No ratings yet
Vertical Integration of Simulation Environments & Automated Test Suite For Validation of JLR Adas Features
25 pages
CBC Hilot Wellness Massage NCII
100% (1)
CBC Hilot Wellness Massage NCII
70 pages
MIL - Module 15 16
No ratings yet
MIL - Module 15 16
25 pages
Efficient Selection of Agitators: Sulzer Pumps
No ratings yet
Efficient Selection of Agitators: Sulzer Pumps
2 pages
M.Tech Thesis Writing Aid
100% (3)
M.Tech Thesis Writing Aid
8 pages
Eleves: A New Software Tool For Electric Vehicle Modeling and Simulation
No ratings yet
Eleves: A New Software Tool For Electric Vehicle Modeling and Simulation
8 pages
Design of Heat Exchangers Using Aspen
No ratings yet
Design of Heat Exchangers Using Aspen
6 pages
Resume Pal Lav
No ratings yet
Resume Pal Lav
1 page

Workload Characterization

Uploaded by

Workload Characterization

Uploaded by

WORKLOAD CHARACTERIZATION

FIGURE 1.1. Overall workload characterization process.

(d) Markov models.

𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒 = 𝑆 2 = 1/(𝑛 − 1) ∑(𝑥𝑖 − 𝑥̅ )2

and 𝐶𝑂𝑉 = 𝑆/𝑥̅

1.7.2 Case Study: Website Characterization

You might also like