System Models for
Distributed and Cloud
Computing
Dr. Sanjay P. Ahuja, Ph.D.
2010-14 FIS Distinguished Professor
of Computer Science
School of Computing, UNF
Classification of Distributed Computing Systems
Distributed computing systems can be classified into four groups: clusters,
peer-to-peer networks, grids, and clouds.
A computing cluster consists of interconnected stand-alone
computers that work cooperatively as a single integrated
computing resource. The network of compute nodes is connected
by a LAN/SAN, is typically homogeneous with distributed
control, and runs Unix/Linux. Clusters are suited to HPC.
Peer-to-peer (P2P) Networks
In a P2P network, every node (peer) acts as both a client and server. Peers
act autonomously to join or leave the network. No central coordination or
central database is needed. No peer machine has a global view of the entire
P2P system. The system is self-organizing with distributed control.
Unlike a cluster or grid, a P2P network does not use a dedicated
interconnection network.
P2P Networks are classified into different groups:
Distributed File Sharing: content distribution of MP3 music, video, etc. E.g.
Gnutella, Napster, BitTorrent.
Collaboration P2P networks: Skype chatting, instant messaging, gaming etc.
Distributed P2P computing: application-specific computing; e.g.
SETI@home provides 25 Tflops of distributed computing power over 3 million
Internet host machines.
Computational and Data Grids
Grids are heterogeneous clusters interconnected by high-speed
networks. They have centralized control, are server-oriented with
authenticated security. They are suited to distributed
supercomputing. E.g. TeraGrid.
Like an electric utility power grid, a computing grid offers an
infrastructure that couples computers, software/middleware,
people, and sensors together.
The grid is constructed across LANs, WANs, or Internet backbones
at a regional, national, or global scale.
The computers used in a grid include servers, clusters, and
supercomputers. PCs, laptops, and mobile devices can be used to
access a grid system.
Clouds
A Cloud is a pool of virtualized computer resources. A cloud can
host a variety of different workloads, including batch-style
backend jobs and interactive and user-facing applications.
Workloads can be deployed and scaled out quickly through rapid
provisioning of VMs. Virtualization of server resources has enabled
cost-effective operation, benefiting both users and providers.
A cloud system should be able to monitor resource usage in real
time to enable rebalancing of allocations when needed.
Cloud computing applies a virtualized platform with elastic
resources on demand by provisioning hardware, software, and
data sets dynamically. Desktop computing is moved to a
service-oriented platform using server clusters and huge databases at
datacenters.
Advantage of Clouds over Traditional Distributed
Systems
Traditional distributed computing systems provided for on-premise
computing and were owned and operated by
autonomous administrative domains (e.g. a company).
These traditional systems encountered performance
bottlenecks, constant system maintenance, poor server
(and other resource) utilization, and increasing costs
associated with hardware/software upgrades.
Cloud computing as an on-demand computing paradigm
resolves or relieves many of these problems.
Software Environments for Distributed Systems and Clouds:
Service-Oriented Architecture (SOA) Layered Architecture
In web services, Java RMI, and CORBA, an entity is, respectively, a
service, a Java remote object, and a CORBA object. These build on the
TCP/IP network stack. On top of the network stack we have a base
software environment, which would be .NET/Apache Axis for web
services, the JVM for Java, and the ORB network for CORBA. On top of
this base environment, a higher-level environment with features
specific to the distributed computing environment is built.
Loose coupling and support of heterogeneous implementations
make services more attractive than distributed objects.
The three stacks compared (the RPC/MOM and TCP/IP layers are shared
by all three):

CORBA Stack                      | RMI Stack                        | Web Services Stack
IDL                              | Java interface                   | WSDL
CORBA Services                   | RMI Registry                     | UDDI
CORBA Stubs/Skeletons            | RMI Stubs/Skeletons              | SOAP Message
CDR binary encoding              | Java native encoding (serialization) | XML Unicode encoding
IIOP                             | JRMP                             | HTTP
RPC or Message-Oriented Middleware (WebSphere MQ or JMS)
ORB                              | JVM                              | .NET/Apache Axis
TCP/IP/DataLink/Physical
Performance Metrics and Scalability Analysis
Performance Metrics:
CPU speed: MHz or GHz, SPEC benchmarks like SPECINT
Network Bandwidth: Mbps or Gbps
System throughput: MIPS, TFlops (tera floating-point operations
per second), TPS (transactions per second), IOPS (IO operations
per second)
Other metrics: Response time, network latency, system availability
Scalability:
Scalability is the ability of a system to handle a growing amount of
work in a capable/efficient manner, or its ability to be enlarged to
accommodate that growth.
For example, it can refer to the capability of a system to increase
total throughput under an increased load when resources
(typically hardware) are added.
Scalability
Scale Vertically
To scale vertically (or scale up) means to add resources to a single
node in a system, typically involving the addition of CPUs or
memory to a single computer.
Tradeoffs
There are tradeoffs between the two models. Larger numbers of
computers mean increased management complexity, as well as a
more complex programming model and issues such as throughput
and latency between nodes.
Also, some applications do not lend themselves to a distributed
computing model.
In the past, the price difference between the two models has
favored "scale up" computing for those applications that fit its
paradigm, but recent advances in virtualization technology have
blurred that advantage, since deploying a new virtual
system/server over a hypervisor is almost always less expensive
than actually buying and installing a real one.
Scalability
One form of scalability for parallel and distributed systems is:
Size Scalability
This refers to achieving higher performance or more functionality by
increasing the machine size. Size in this case refers to adding
processors, cache, memory, storage, or I/O channels.
Scale Horizontally and Vertically
Methods of adding more resources for a particular application fall into
two broad categories:
Scale Horizontally
To scale horizontally (or scale out) means to add more nodes to a
system, such as adding a new computer to a distributed software
application. An example might be scaling out from one Web server
system to three.
The scale-out model has created an increased demand for shared
data storage with very high I/O performance, especially where
processing of large amounts of data is required.
Amdahl's Law
It is typically cheaper to add a new node to a system in order to
achieve improved performance than to perform performance tuning
to improve the capacity that each node can handle. But this approach
can have diminishing returns, as indicated by Amdahl's Law.
Consider the execution of a given program on a uniprocessor
workstation with a total execution time of T minutes. Now, let's say
that the program has been parallelized or partitioned for parallel
execution on a cluster of many processing nodes.
Assume that a fraction α of the code must be executed sequentially,
called the sequential block. Therefore, (1 − α) of the code can be
compiled for parallel execution by n processors. The total execution
time of the program is calculated by:
αT + (1 − α)T / n
where the first term is the sequential execution time on a single
processor and the second term is the parallel execution time on n
processing nodes.
All system or communication overhead is ignored here. The I/O and
exception handling time is also not included in the speedup analysis.
Amdahl's Law
Amdahl's Law states that the speedup factor of using the n-processor
system over the use of a single processor is expressed by
Speedup = S = T / [αT + (1 − α)T / n]
= 1 / [α + (1 − α) / n]
The maximum speedup of n is achievable only when α = 0, i.e. the
entire program is parallelizable.
As the cluster becomes sufficiently large, i.e. n → ∞, then S → 1/α,
an upper bound on the speedup S. This upper bound is
independent of the cluster size, n. The sequential bottleneck is the
portion of the code that cannot be parallelized.
Example: α = 0.25, so (1 − 0.25) = 0.75; then the maximum
speedup S = 1/0.25 = 4, even if one uses hundreds of processors.
Amdahl's Law teaches us that we should make the sequential
bottleneck as small as possible. Increasing the cluster size alone
may not result in a good speedup in this case.
Amdahl's Law
Example: suppose 70% of a program can be sped up if parallelized
and run on multiple CPUs instead of one CPU.
N = 4 processors:
S = 1 / [0.3 + (1 − 0.3) / 4] = 2.105
Doubling the number of processors to N = 8 processors:
S = 1 / [0.3 + (1 − 0.3) / 8] = 2.581
Doubling the processing power has improved the speedup by only
roughly one-fifth (from 2.105 to 2.581). Therefore, throwing in more
hardware is not necessarily the optimal approach.
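The speedup figures above follow directly from the formula. A minimal Python sketch (the function name amdahl_speedup is illustrative, not from the slides) reproduces them:

```python
def amdahl_speedup(alpha, n):
    """Amdahl's Law: S = 1 / (alpha + (1 - alpha)/n) for an
    n-processor system when a fraction alpha of the code
    must run sequentially."""
    return 1.0 / (alpha + (1.0 - alpha) / n)

# 30% sequential fraction, as in the example above
print(round(amdahl_speedup(0.3, 4), 3))   # 2.105
print(round(amdahl_speedup(0.3, 8), 3))   # 2.581
```

Pushing n toward infinity shows the upper bound 1/α: with α = 0.3 the speedup can never exceed about 3.33 no matter how many processors are added.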
System Efficiency
To execute a fixed workload on n processors, parallel processing may
lead to a system efficiency defined as:
System Efficiency, E = S / n = 1 / [αn + (1 − α)]
System efficiency can be rather low if the cluster size is very large.
Example: To execute a program on a cluster with n = 4, α = 0.25 and so
(1 − 0.25) = 0.75:
E = 1 / [0.25 × 4 + 0.75] = 0.57, or 57%
Now if we have 256 nodes (i.e. n = 256):
E = 1 / [0.25 × 256 + 0.75] = 0.015, or 1.5%
This is because only a few processors (4, as in the previous case) are
kept busy, while the majority of the processors (or nodes) are left idling.
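The two efficiency figures can be checked with a short sketch (the function name system_efficiency is illustrative):

```python
def system_efficiency(alpha, n):
    """Efficiency of a fixed workload on n processors:
    E = S/n = 1 / (alpha*n + (1 - alpha))."""
    return 1.0 / (alpha * n + (1.0 - alpha))

# alpha = 0.25, as in the examples above
print(round(system_efficiency(0.25, 4), 2))    # 0.57
print(round(system_efficiency(0.25, 256), 3))  # 0.015
```

Note how efficiency collapses as n grows: the sequential fraction keeps only a few processors busy regardless of cluster size.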
Fault Tolerance and System Availability
High availability (HA) is desired in all clusters, grids, P2P networks,
and cloud systems. A system is highly available if it has a long Mean
Time to Failure (MTTF) and a short Mean Time to Repair (MTTR).
System Availability = MTTF / (MTTF + MTTR)
All hardware, software, and network components may fail. Single
points of failure that bring down the entire system must be avoided
when designing distributed systems.
Adding hardware redundancy, increasing component reliability, and
designing for testability all help to enhance system availability and
dependability.
In general, as a distributed system increases in size, availability
decreases due to a higher chance of failure and a difficulty in isolating
failures.
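The availability formula is easy to apply; a minimal sketch (the MTTF/MTTR figures below are hypothetical, chosen only to illustrate):

```python
def availability(mttf, mttr):
    """System Availability = MTTF / (MTTF + MTTR).
    mttf and mttr must be in the same time unit (e.g. hours)."""
    return mttf / (mttf + mttr)

# Hypothetical figures: fail on average every 1000 h, repair in 10 h
print(round(availability(1000, 10), 4))  # 0.9901
```

A long MTTF or a short MTTR each pushes availability toward 1; with MTTR = 0 the system would be available 100% of the time.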