Trends in Computing Architecture
CMSC828E
Ramani Duraiswami
Several slides taken from a Microway/NVIDIA webinar
Some figures adapted from web sources
Problem sizes in simulation and data processing are increasing
• Change in paradigm in science
– Simulate, then test
– Fidelity demands larger simulations
– Problems being simulated are also much more complex
• Sensors are becoming more varied and cheaper, and storage is getting cheaper
– Cameras, microphones
• Other large data
– Text (all the newspapers, books, technical papers)
– Genome data
– Medical/biological data (X-ray, PET, MRI, ultrasound, electron microscopy, …)
– Climate (temperature, salinity, pressure, wind, oxygen content, …)
Ways to attack problem size growth
• Faster algorithms with better asymptotic complexity
• Faster processors
– “Moore’s law will take care of it”
• Go parallel!
– Clusters of computers
– New data-parallel chips (multicore processors, GPUs)
“Moore’s Law will take care of it”
• Not a law but an observation made by Gordon Moore in the 1960s
• Number of transistors doubles every 18 months
• Has generally been taken to mean that the “standard computer”’s performance improves exponentially, with a doubling time of 18 months
Refuting the Moore’s law argument
• Argument:
– Moore’s law: processor speed doubles every 18 months
– If we wait long enough, the computer will get fast enough to let my inefficient algorithm tackle the problem
• Is this true?
– Yes, for algorithms with linear asymptotic complexity
– No!! For algorithms with worse asymptotic complexity
– Most scientific algorithms are O(N²) or O(N³)
– For a million variables, an O(N²) algorithm does a factor of N = 10⁶ more work than an O(N) algorithm, so we would need about 20 generations of Moore’s law (log₂ 10⁶ ≈ 20 doublings) before the O(N²) algorithm was comparable
• Did no one tell you that Moore’s law is dead?
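The doubling arithmetic above can be checked with a short script (a sketch; the 18-month doubling period and the N = 10⁶ problem size are the slide’s own figures):

```python
import math

def generations_to_close_gap(n):
    """Doublings of hardware speed needed before an O(N^2) algorithm on
    future hardware matches an O(N) algorithm on today's hardware.
    The O(N^2) algorithm does a factor of N more work, so we need a
    factor-of-N speedup, i.e. log2(N) doublings."""
    return math.log2(n)

def years_to_close_gap(n, doubling_months=18):
    # Convert doublings into calendar time at one doubling per period.
    return generations_to_close_gap(n) * doubling_months / 12.0

n = 10**6
print(round(generations_to_close_gap(n)))  # 20 generations
print(round(years_to_close_gap(n)))        # 30 years at 18 months per doubling
```

Waiting roughly three decades for hardware to rescue a quadratic algorithm is rarely an option, which is why better asymptotics and parallelism both matter.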
Moore’s Law is dead:
“Issues at small scales”
– Lithography not possible
– 2D electrostatics harder to control
– “Parasitic resistance” degrades performance
– Device-to-device variations will be larger
– Ultra-thin bodies and hyper-abrupt junctions make manufacturing difficult
Moore’s Law is dead!
• Feature sizes and clock speeds on commodity chips have been stagnant over the past four years
– ~3 GHz and 45 nm
• All manufacturers are going multicore to maintain performance
– Core 2, Core 2 Duo, quad-core, …
• Shared-memory multiprocessing
– Intel has demoed several many-core systems
• Graphics processors and gaming consoles have already been on the multicore path for a decade!
Gamer Power
• Sony PlayStation 3: 2.18 teraflops, <$400; difficult to program
• Microsoft Xbox 360: 1.04 teraflops, <$300; difficult to program
• Multicore Intel box with 3 GeForce 8800 GTX GPUs in slots: ~1 teraflop for <$3000 (shown with 1 GPU)
Programming on the GPU
• GPU organized as groups of multiprocessors (8 relatively slow processors each) with a small amount of their own local memory and access to a common shared memory
• Factor-of-hundreds difference in speed as one goes up the memory hierarchy
• To achieve gains, problems must fit the GPU programming paradigm and manage memory
• Fortunately, many practically important tasks do map well, and there is ongoing work on converting others
– Image and audio processing
– Some types of linear algebra
– Many machine learning algorithms
• Research issues:
– Identifying important tasks and mapping them to the architecture
– Making it convenient for programmers to call GPU code from host code
[Figure: memory hierarchy — local memory ~50 kB, GPU shared memory ~1 GB, host memory ~2-32 GB]
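The data-parallel paradigm described above, where the same small kernel is applied independently to each element, can be sketched in plain Python (a toy stand-in for a GPU grid launch; the function names here are illustrative, not a real GPU API):

```python
def brighten_kernel(src, gain, dst, idx):
    # Body of one logical GPU "thread": it touches only element idx,
    # so all threads could run at once with no coordination.
    dst[idx] = min(255, int(src[idx] * gain))

def launch(kernel, n, *args):
    # Stand-in for a GPU grid launch: one logical thread per element.
    # A real GPU would execute these iterations in parallel.
    for idx in range(n):
        kernel(*args, idx)

pixels = [10, 100, 200, 250]
out = [0] * len(pixels)
launch(brighten_kernel, len(pixels), pixels, 2.0, out)
print(out)  # [20, 200, 255, 255]
```

Problems that decompose into many such independent per-element updates are the ones that “fit the paradigm”; those needing tight coordination between elements map less well.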
What is GPU Computing?
Computing with CPU + GPU: heterogeneous computing
[Figure: a 4-core CPU alongside a many-core GPU]
Not 2x or 3x: speedups are 20x to 150x
146x  Medical Imaging       U of Utah
36x   Molecular Dynamics    U of Illinois, Urbana
18x   Video Transcoding     Elemental Tech
50x   Matlab Computing      AccelerEyes
100x  Astrophysics          RIKEN
149x  Financial Simulation  Oxford
47x   Linear Algebra        Universidad Jaime
20x   3D Ultrasound         Techniscan
130x  Quantum Chemistry     U of Illinois, Urbana
30x   Gene Sequencing       U of Maryland
Accelerating Time to Discovery
[Chart: run times, CPU only vs. with GPU: 4.6 days vs. 27 minutes; 2.7 days vs. 30 minutes; 8 hours vs. 13 minutes; 3 hours vs. 16 minutes]
Molecular Dynamics
Available MD software:
NAMD / VMD (alpha release)
HOOMD
ACE-MD
Ongoing work:
LAMMPS
CHARMM
GROMACS
AMBER
Source: Stone, Phillips, Hardy, Schulten
Source: Anderson, Lorenz, Travesset
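The inner loop of many MD codes is an O(N²) pairwise force evaluation, which maps naturally onto one GPU thread per particle. A minimal 1D Lennard-Jones sketch (illustrative only, not code from any of the packages above):

```python
def lj_force(r, eps=1.0, sig=1.0):
    # Lennard-Jones force magnitude at separation r; it is zero at the
    # potential minimum r = 2**(1/6) * sig, repulsive (positive) closer in.
    sr6 = (sig / r) ** 6
    return 24.0 * eps * (2.0 * sr6 * sr6 - sr6) / r

def forces(xs):
    # O(N^2) all-pairs loop; each particle's force sum is independent
    # of the others, so a GPU can assign one thread per particle.
    n = len(xs)
    f = [0.0] * n
    for i in range(n):
        for j in range(n):
            if i != j:
                r = xs[j] - xs[i]
                # Positive lj_force pushes particle i away from particle j.
                f[i] += -lj_force(abs(r)) * (1.0 if r > 0 else -1.0)
    return f
```

This brute-force loop is exactly the kind of regular, compute-heavy kernel that GPUs accelerate; production codes add neighbor lists and cutoffs on top of it.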
Quantum Chemistry
Ongoing work:
Q-Chem
Gaussian
GAMESS
Source: Ufimtsev, Martinez
Source: Yasuda
Computational Fluid Dynamics (CFD)
Ongoing work:
Navier-Stokes
Lattice Boltzmann
3D Euler solver
Weather and ocean modeling
Source: Thibault, Senocak
Source: Tolke, Krafczyk
Electromagnetics / Electrodynamics
FDTD solvers:
Acceleware
EM Photonics
CUDA Tutorial
Ongoing work:
Maxwell equation solver
Ring oscillator (FDTD)
Particle beam dynamics simulator
FDTD acceleration using GPUs
Source: Acceleware
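FDTD fits GPUs well because every grid point is updated by the same stencil using only its immediate neighbors. A minimal 1D Yee-style update in plain Python (an illustrative sketch in normalized units, not code from any of the solvers above):

```python
import math

def fdtd_1d(steps, n=200, c=0.5):
    # ez: electric field, hy: magnetic field on a staggered 1D grid.
    ez = [0.0] * n
    hy = [0.0] * n
    for t in range(steps):
        # Each per-point update depends only on fixed neighbors, so a
        # GPU can assign one thread per grid point for each half-step.
        for i in range(n - 1):
            hy[i] += c * (ez[i + 1] - ez[i])
        for i in range(1, n):
            ez[i] += c * (hy[i] - hy[i - 1])
        # Soft Gaussian source injected at the middle of the grid.
        ez[n // 2] += math.exp(-((t - 30) / 10.0) ** 2)
    return ez

field = fdtd_1d(100)
```

The two inner loops are embarrassingly parallel within each time step; only the step-to-step dependence is sequential, which matches the GPU execution model closely.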
Weather, Atmospheric, & Ocean Modeling
CUDA-accelerated WRF available
Other kernels in WRF being ported
Ongoing work:
Tsunami modeling
Ocean modeling
Several CFD codes
Source: Michalakes, Vachharajani
Source: Matsuoka, Akiyama, et al.
Computational Finance
Financial computing software vendors:
SciComp: derivatives pricing modeling
Hanweck: options pricing & risk analysis
Aqumin: 3D visualization of market data
Exegy: high-volume tickers & risk analysis
QuantCatalyst: pricing & hedging engine
Oneye: algorithmic trading
Arbitragis Trading: trinomial options pricing
Source: SciComp
Ongoing work:
LIBOR Monte Carlo market model
Callable swaps and continuous time
Source: CUDA SDK
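Monte Carlo pricing is a natural GPU workload: every simulated path is independent, so paths can be farmed out one per thread. A minimal single-asset European call sketch under geometric Brownian motion (an illustrative stand-in, far simpler than the LIBOR market model mentioned above):

```python
import math
import random

def mc_call_price(s0, k, r, sigma, t, n_paths, seed=0):
    # Price a European call by averaging discounted payoffs over
    # independently simulated terminal prices under GBM.
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_paths):
        # Each path is independent: on a GPU, one thread per path.
        z = rng.gauss(0.0, 1.0)
        st = s0 * math.exp((r - 0.5 * sigma ** 2) * t
                           + sigma * math.sqrt(t) * z)
        total += max(st - k, 0.0)
    return math.exp(-r * t) * total / n_paths

price = mc_call_price(100.0, 100.0, 0.05, 0.2, 1.0, 50000)
# The Black-Scholes value for these parameters is about 10.45
```

Because the only cross-path interaction is the final average (a reduction), this structure keeps GPU threads busy with almost no synchronization, which is why financial Monte Carlo shows some of the largest reported speedups.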