The document provides an overview of digital signal processors (DSPs), including their architecture features, applications, and how they differ from general purpose processors and microcontrollers. It discusses key DSP hardware characteristics like the Harvard architecture, dedicated multiply-accumulate units, single-instruction multiple-data (SIMD) and very long instruction word (VLIW) parallelism, pipelining, saturation arithmetic, zero overhead looping, and hardware circular addressing. Examples of DSP applications like digital filtering and fast Fourier transforms are also provided.

Uploaded by

Kunal Gupta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

171 views23 pages

Lecture 4

Uploaded by

Kunal Gupta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 23

Lecture 4

Introduction to Digital Signal Processors (DSPs)

Dr. Konstantinos Tatas

Outline/objectives
• Identify the most important DSP processor
architecture features and how they relate
to DSP applications
• Understand the types of code appropriate
for DSP implementation

ACOE343 - Embedded Real-Time Processor Systems - 2

Frederick University
What is a DSP?
• A specialized microprocessor for real-time DSP applications
– Digital filtering (FIR and IIR)
– FFT
– Convolution, matrix multiplication, etc.

ANALOG INPUT → ADC → DIGITAL INPUT → DSP → DIGITAL OUTPUT → DAC → ANALOG OUTPUT

Hardware used in DSP
                   ASIC       FPGA     GPP      DSP
Performance        Very high  High     Medium   Medium-high
Flexibility        Very low   High     High     High
Power consumption  Very low   Low      Medium   Low-medium
Development time   Long       Medium   Short    Short

Common DSP features
• Harvard architecture
• Dedicated single-cycle Multiply-Accumulate (MAC) instruction (hardware MAC units)
• Single-Instruction Multiple-Data (SIMD) / Very Long Instruction Word (VLIW) architecture
• Pipelining
• Saturation arithmetic
• Zero overhead looping
• Hardware circular addressing
• Cache
• DMA
Harvard Architecture
• Physically separate memories and paths for instruction and data
[Figure: CPU connected by separate paths to a DATA MEMORY and a PROGRAM MEMORY]

Single-Cycle MAC unit
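• Can compute a sum of n products in n cycles:
$\sum_{i=0}^{n} a_i x_i$
[Figure: MAC datapath. A multiplier forms each product a_i x_i, an adder adds it to the running sum (a_i x_i + a_{i-1} x_{i-1} + ...), and a register holds the result.]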
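The accumulation performed by a MAC unit can be sketched in C as follows. This is only an illustration: the function name `mac_sum`, the 16-bit operands and the 32-bit accumulator are assumptions, not details from the slides, and the sketch does not guard against accumulator overflow for long inputs.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Sum of products: acc = a[0]*x[0] + ... + a[n-1]*x[n-1].
 * Each loop iteration is one multiply-accumulate step; on a DSP with a
 * single-cycle MAC unit, each step can map to one MAC instruction. */
int32_t mac_sum(const int16_t *a, const int16_t *x, size_t n)
{
    assert(a != NULL && x != NULL);
    int32_t acc = 0;                            /* plays the role of the accumulator register */
    for (size_t i = 0; i < n; i++) {
        acc += (int32_t)a[i] * (int32_t)x[i];   /* multiply, then accumulate */
    }
    return acc;
}
```

For example, with a = {1, 2, 3} and x = {4, 5, 6}, `mac_sum(a, x, 3)` returns 4 + 10 + 18 = 32.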

Single Instruction - Multiple Data (SIMD)
• A technique for data-level parallelism by
employing a number of processing
elements working in parallel
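A rough way to picture SIMD in plain C is a loop that applies the same operation to a small group of data items at a time. In the sketch below, `LANES` and `simd_style_add` are hypothetical names; the inner loop stands in for what a SIMD machine would do in a single instruction, with one processing element per lane. It models the idea only and is not actual SIMD code.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define LANES 4  /* hypothetical number of processing elements */

/* Element-wise c[i] = a[i] + b[i]. The inner loop over LANES models one
 * SIMD instruction: the same operation (add) applied to LANES data items. */
void simd_style_add(const int32_t *a, const int32_t *b, int32_t *c, size_t n)
{
    assert(n % LANES == 0);   /* sketch: length must be a multiple of LANES */
    for (size_t i = 0; i < n; i += LANES) {
        for (size_t lane = 0; lane < LANES; lane++) {
            c[i + lane] = a[i + lane] + b[i + lane];
        }
    }
}
```

With 8 elements and 4 lanes, the outer loop runs twice, so the work corresponds to two SIMD additions rather than eight scalar ones.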

Very Long Instruction Word (VLIW)
• A technique for instruction-level parallelism by executing instructions without dependencies (known at compile-time) in parallel
• Example of a single VLIW instruction:
F=a+b; c=e/g; d=x&y; w=z*h;
[Figure: one VLIW instruction feeding four processing units (PU) in parallel: (a, b) → F, (e, g) → c, (x, y) → d, (z, h) → w]
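How operations without dependencies can share one instruction word may be sketched with a simple greedy packer. Everything here is an assumption made for illustration: the `Op` structure, the integer register ids, and the rule that only read-after-write and write-after-write conflicts force a new word. Real VLIW compilers also account for functional-unit types and may reorder operations, which this sketch does not.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical three-operand operation: dst = src1 op src2.
 * Registers are identified by small integers. */
typedef struct {
    int dst, src1, src2;
} Op;

/* Nonzero if `later` cannot share an instruction word with `earlier`:
 * it reads the earlier result (RAW) or writes the same register (WAW). */
static int depends(const Op *earlier, const Op *later)
{
    int raw = (later->src1 == earlier->dst) || (later->src2 == earlier->dst);
    int waw = (later->dst == earlier->dst);
    return raw || waw;
}

/* Greedy in-order packing: returns the number of instruction words needed
 * when each word holds at most `width` operations. */
size_t count_vliw_words(const Op *ops, size_t n, size_t width)
{
    assert(width > 0);
    size_t words = 0;
    size_t start = 0;                 /* index of the first op in the current word */
    for (size_t i = 0; i < n; i++) {
        int must_split = (i - start >= width);          /* current word is full */
        for (size_t j = start; j < i && !must_split; j++) {
            must_split = depends(&ops[j], &ops[i]);     /* conflict inside the word */
        }
        if (i == 0 || must_split) {
            start = i;                /* open a new instruction word */
            words++;
        }
    }
    return words;
}
```

Encoding the slide's example F=a+b; c=e/g; d=x&y; w=z*h with twelve distinct register ids gives a count of one word at width 4, since no operation uses another's result.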

CISC vs. RISC vs. VLIW

Pipelining
• DSPs commonly feature deep pipelines
• TMS320C6x processors have 3 pipeline stages, each divided into a number of phases (one cycle each):
– Fetch
• Program Address Generate (PG)
• Program Address Send (PS)
• Program ready wait (PW)
• Program receive (PR)
– Decode
• Dispatch (DP)
• Decode (DC)
– Execute
• 6 to 10 phases

Saturation Arithmetic
• Results are confined to a fixed range for operations like addition and multiplication
• Results that would overflow or underflow are clamped to the maximum or minimum allowed value, respectively
• Associativity and distributivity no longer apply
• One-byte signed saturation arithmetic examples (range -128 to 127):
• 64 + 69 = 127
• -127 – 5 = -128
• (64 + 70) – 25 = 102 ≠ 64 + (70 – 25) = 109
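The clamping behaviour can be checked with a short C sketch. The helper name `sat_add8` is hypothetical; it computes the exact sum in a wider type and then clamps it to the signed-byte range, which is one common way to emulate saturation in software.

```c
#include <assert.h>
#include <stdint.h>

/* Saturating addition of two signed bytes: the exact sum is computed in a
 * wider type and then clamped to the int8_t range [-128, 127]. */
int8_t sat_add8(int8_t a, int8_t b)
{
    int sum = (int)a + (int)b;                 /* cannot overflow in int */
    if (sum > INT8_MAX) return INT8_MAX;       /* overflow saturates to 127 */
    if (sum < INT8_MIN) return INT8_MIN;       /* underflow saturates to -128 */
    assert(sum >= INT8_MIN && sum <= INT8_MAX);
    return (int8_t)sum;
}
```

Using it on the one-byte examples, `sat_add8(64, 69)` gives 127 and `sat_add8(-127, -5)` gives -128. The loss of associativity shows up as `sat_add8(sat_add8(64, 70), -25)` giving 102 while `sat_add8(64, sat_add8(70, -25))` gives 109.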

Examples
• Perform the following operations using
one-byte saturation arithmetic
• 0x77 + 0x99 =
• 0x4*0x42=
• 0x3*0x51=

Zero Overhead Looping
• Hardware support for loops with a
constant number of iterations using
hardware loop counters and loop buffers
• No branching
• No loop overhead
• No pipeline stalls or branch prediction
• No need for loop unrolling

Hardware Circular Addressing
• A data structure implementing a fixed-length queue of fixed-size objects, where objects are added to the head of the queue while items are removed from the tail of the queue.
• Requires at least 2 pointers (head and tail)
• Extensively used in digital filtering

y[n] = a0x[n]+a1x[n-1]+…+akx[n-k]

[Figure: circular buffer holding X[n], X[n-1], X[n-2], X[n-3] with Head and Tail pointers, shown at Cycle 1 and Cycle 2]
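The filter equation maps naturally onto a circular buffer. The following C sketch is a software emulation under stated assumptions: `TAPS`, `DelayLine` and `fir_step` are hypothetical names, and the modulo operations stand in for the address wrap-around that hardware circular addressing would perform without extra instructions. Because the newest sample overwrites the oldest one, a single head index is enough here; the tail is implicitly the slot after the head.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define TAPS 4   /* hypothetical filter length (k + 1 coefficients) */

typedef struct {
    int32_t x[TAPS];   /* last TAPS input samples, stored circularly */
    size_t  head;      /* index of the most recent sample x[n] */
} DelayLine;

/* Push a new sample and return
 * y[n] = a[0]*x[n] + a[1]*x[n-1] + ... + a[TAPS-1]*x[n-TAPS+1]. */
int32_t fir_step(DelayLine *d, const int32_t a[TAPS], int32_t sample)
{
    assert(d != NULL && d->head < TAPS);
    d->head = (d->head + 1) % TAPS;        /* advance head; overwrites the oldest sample */
    d->x[d->head] = sample;

    int32_t y = 0;
    size_t idx = d->head;
    for (size_t i = 0; i < TAPS; i++) {
        y += a[i] * d->x[idx];             /* a[i] * x[n-i] */
        idx = (idx + TAPS - 1) % TAPS;     /* step back one sample, wrapping around */
    }
    return y;
}
```

Starting from a zeroed delay line with coefficients {1, 2, 3, 4}, feeding in the samples 1, 1, 1, 1 produces the outputs 1, 3, 6, 10 as the buffer fills.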

Direct Memory Access (DMA)
• The feature that allows peripherals to access
main memory without the intervention of the
CPU
• Typically, the CPU initiates DMA transfer, does
other operations while the transfer is in
progress, and receives an interrupt from the
DMA controller once the operation is complete.
• Can create cache coherency problems (the data
in the cache may be different from the data in
the external memory after DMA)
• Requires a DMA controller

Cache memory
• Separate instruction and data L1 caches
(Harvard architecture)
• Cache coherence protocols required,
since most systems use DMA

DSP vs. Microcontroller
• DSP
– Harvard Architecture
– VLIW/SIMD (parallel execution units)
– No bit-level operations
– Hardware MACs
– DSP applications
• Microcontroller
– Mostly von Neumann Architecture
– Single execution unit
– Flexible bit-level operations
– No hardware MACs
– Control applications

Examples
• Estimate how long the following code fragment will take to execute on:
– A general-purpose processor with a 1 GHz operating frequency, five-stage pipelining, 5 cycles required for multiplication and 1 cycle for addition
– A DSP running at 500 MHz, with zero overhead looping, 6 independent ALUs and 2 independent single-cycle MAC units

for (i = 0; i < 8; i++) {
    a[i] = 2*i + 3;
    b[i] = 3*i + 5;
}
Review Questions
• Which of the following code fragments is appropriate for SIMD implementation?
– Fragment 1:
a[0]=b[0]+c[0];
a[2]=b[2]+c[2];
a[4]=b[4]+c[4];
a[6]=b[6]+c[6];
– Fragment 2:
a[0]=b[0]&c[0];
a[0]=b[0]%c[0];
a[0]=b[0]+c[0];
a[0]=b[0]/c[0];
• Can the following instructions be merged into one VLIW instruction? If not, into how many?
– a=b+c;
– d=c/e;
– f=d&a;
– g=b%c;

Review Questions
• Which of the following is not a typical DSP
feature?
– Dedicated multiplier/MAC
– Von Neumann memory architecture
– Pipelining
– Saturation arithmetic
• Which implementation would you choose for
lowest power consumption?
– ASIC
– FPGA
– General-Purpose Processor
– DSP
Examples
• How many VLIW instructions does the following program fragment require if there are two independent data paths (a, b), with 3 ALUs and 1 MAC unit available in each, and 8 instructions per word? How many cycles will it take to execute if these are the first instructions in the program and all instructions require 1 cycle, assuming the TMS320C6x pipeline from the Pipelining slide with 6 phases of execution?
ADD a1,a2,a3 ;a3 = a1+a2
SUB b1,b3,b4 ;b4 = b1-b3
MUL a2,a3,a5 ;a5 = a2*a3
MUL b3,b4,b2 ;b2 = b3*b4
AND a7,a0,a1 ;a1 = a7 AND a0
MUL a3,a4,a5 ;a5 = a3*a4
OR a6,a3,a2 ;a2 = a6 OR a3
References
• R. Chassaing, “DSP Applications Using C and the TMS320C6x DSK”, Wiley, 2002
• Texas Instruments, TMS320C64x datasheets
• Analog Devices, ADSP-21xx Processors
