Source Coding
Reference: En. Mohd Nazri Ahmud
Digital Communication Blocks
Source coding deals with the task of forming efficient descriptions of information sources.
Efficient descriptions permit a reduction in the memory or bandwidth resources required to
store or to transport sample realizations of the source data.
Source Coding
Source encoding is the efficient representation of data generated by a
source.
Consider a discrete source whose output, one of K different symbols $s_k$, is converted by the
source encoder into a block of 0s and 1s denoted by $b_k$.
Examples: (1) the output of a 12-bit analog-to-digital converter, which outputs one of 4096
discrete levels; (2) the 8-bit ASCII characters emitted by a computer keyboard.
A discrete source is said to be memoryless if the symbols emitted by the source are
statistically independent.
For efficient source encoding, knowledge of the statistics of the source is required.
If some source symbols are more probable than others, we can assign short code
words to frequent symbols and long code words to rare source symbols.
Assume that the kth symbol $s_k$ occurs with probability $p_k$, $k = 0, 1, \ldots, K-1$, and let
the binary code word assigned to symbol $s_k$ have length $l_k$ (in bits). The average
code-word length of the source encoder is then
$$\bar{L} = \sum_{k=0}^{K-1} p_k l_k$$
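As a quick illustration, here is a minimal Python sketch of this formula; the four-symbol source and its code-word lengths are hypothetical:

```python
def average_length(probs, lengths):
    # Lbar = sum over k of p_k * l_k
    return sum(p * l for p, l in zip(probs, lengths))

# Hypothetical source: probabilities 0.5, 0.25, 0.125, 0.125 with
# code words of lengths 1, 2, 3, 3 bits
print(average_length([0.5, 0.25, 0.125, 0.125], [1, 2, 3, 3]))  # 1.75 bits per symbol
```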
Source Coding
Let $L_{\min}$ denote the minimum possible value of the average code-word length. The coding
efficiency of the source encoder is given by
$$\eta = \frac{L_{\min}}{\bar{L}}$$
By the source-coding theorem, $L_{\min} = H(X)$, the source entropy:
$$H(X) = E\left\{ \log_2 \frac{1}{p_k} \right\} = \sum_{k=0}^{K-1} p_k \log_2 \frac{1}{p_k} \ \text{ bits per symbol}$$
where $E\{X\}$ denotes the expected value of $X$.
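A sketch of the efficiency computation under the identification $L_{\min} = H(X)$ (the function names are my own):

```python
import math

def entropy(probs):
    # H(X) = sum of p_k * log2(1/p_k), in bits per symbol
    return sum(p * math.log2(1 / p) for p in probs if p > 0)

def coding_efficiency(probs, lengths):
    avg = sum(p * l for p, l in zip(probs, lengths))
    return entropy(probs) / avg   # eta = Lmin / Lbar, with Lmin = H(X)

# The hypothetical source above: H(X) = 1.75 = Lbar, so eta = 1
print(coding_efficiency([0.5, 0.25, 0.125, 0.125], [1, 2, 3, 3]))  # 1.0
```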
Data Compaction
A waveform source is a random process indexed by some variable, classically time, so that the
waveform of interest is a time-varying waveform. Important examples of time-varying waveforms
are the outputs of transducers used in process control (such as temperature, pressure,
velocity, and flow rates), speech, and music.
Data compaction is important because the signals generated contain a significant
amount of redundant information and so waste communication resources during
transmission.
For efficient transmission, the redundant information should be removed prior to
transmission.
Data compaction is achieved by assigning short descriptions to the most
frequent outcomes of the source output and longer descriptions to the less
frequent ones.
Some source-coding schemes for data compaction:
• Prefix coding
• Huffman coding
• Lempel-Ziv coding
Prefix Coding
A prefix code is a code in which no code word is the prefix of any
other code word
Example: Consider the three source codes described below
Source symbol   Probability of occurrence   Code I   Code II   Code III
s0              0.5                         0        0         0
s1              0.25                        1        10        01
s2              0.125                       00       110       011
s3              0.125                       11       111       0111
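A direct test of this definition in Python (the function name is my own):

```python
def is_prefix_free(codewords):
    # A code is a prefix code iff no code word is a prefix of another.
    for a in codewords:
        for b in codewords:
            if a != b and b.startswith(a):
                return False
    return True

print(is_prefix_free(["0", "10", "110", "111"]))   # True  (Code II)
print(is_prefix_free(["0", "01", "011", "0111"]))  # False (Code III: 0 prefixes 01)
```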
Prefix Coding
Is Code I a prefix code? No: the bit 0, the code word for s0, is a prefix of 00, the code
word for s2; and the bit 1, the code word for s1, is a prefix of 11, the code word for s3.
Is Code II a prefix code? Yes.
Is Code III a prefix code? No: the bit 0, the code word for s0, is a prefix of 01, the code
word for s1.
A prefix code has the important property that it is always uniquely decodable.
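Unique decodability of a prefix code can be seen operationally: the decoder emits a symbol as soon as the bits read so far match a code word, since no code word extends another. A sketch, using Code II:

```python
def prefix_decode(bits, code):
    inverse = {w: s for s, w in code.items()}  # code word -> symbol
    out, buf = [], ""
    for bit in bits:
        buf += bit
        if buf in inverse:       # a match is final for a prefix code
            out.append(inverse[buf])
            buf = ""
    return out

code_ii = {"s0": "0", "s1": "10", "s2": "110", "s3": "111"}
print(prefix_decode("0101100", code_ii))  # ['s0', 's1', 's2', 's0']
```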
Prefix Coding - Example
Source symbol   Code I   Code II   Code III   Code IV
s0              0        0         0          00
s1              10       01        01         01
s2              110      001       011        10
s3              1110     0010      110        110
s4              1111     0011      111        111
Prefix code?    ✓        ✗         ✗          ✓
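Running the is_prefix_free sketch from above on these four codes reproduces the row of marks in the table:

```python
codes = {"I": ["0", "10", "110", "1110", "1111"],
         "II": ["0", "01", "001", "0010", "0011"],
         "III": ["0", "01", "011", "110", "111"],
         "IV": ["00", "01", "10", "110", "111"]}
for name, words in codes.items():
    print(name, is_prefix_free(words))   # I True, II False, III False, IV True
```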
Huffman Coding
The Huffman code is a prefix-free, variable-length code that can achieve the
shortest average code length for a given input alphabet.
Basic idea : Assign to each symbol a sequence of bits roughly equal in length
to the amount of information conveyed by the symbol.
Huffman encoding algorithm:
Step 1: The source symbols are listed in order of decreasing probability.
The two source symbols of lowest probability are assigned a 0 and 1.
Step 2: These two source symbols are regarded as being combined into
a new source symbol with probability equal to the sum of the two original
probabilities. The probability of the new symbol is placed in the list in
accordance with its value.
The procedure is repeated until we are left with a final list of only two symbols,
to which a 0 and a 1 are assigned.
The code for each source symbol is found by working backward and
tracing the sequence of 0s and 1s assigned to that symbol as well as its
successors.
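A compact sketch of this algorithm using a binary heap (the function name and input format are my own; tie-breaking may differ from a hand construction, but the average code-word length is the same):

```python
import heapq
from itertools import count

def huffman(probs):
    """Huffman coding: repeatedly merge the two least-probable entries,
    assigning a 0 and a 1; codes are read off by prepending the assigned
    bit to every symbol inside each merged group. probs: dict symbol -> p."""
    tie = count()  # tie-breaker so the heap never compares the dicts
    heap = [(p, next(tie), {s: ""}) for s, p in probs.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        p0, _, c0 = heapq.heappop(heap)   # lowest probability  -> bit 0
        p1, _, c1 = heapq.heappop(heap)   # next lowest         -> bit 1
        merged = {s: "0" + w for s, w in c0.items()}
        merged.update({s: "1" + w for s, w in c1.items()})
        heapq.heappush(heap, (p0 + p1, next(tie), merged))
    return heap[0][2]                     # dict symbol -> code word
```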
Huffman Coding – Average Code Length
$$\bar{L} = \sum_{k=0}^{K-1} p_k l_k = 0.4(2) + 0.2(2) + 0.2(2) + 0.1(3) + 0.1(3) = 2.2 \ \text{bits per symbol}$$
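The calculation above corresponds to a five-symbol source with probabilities 0.4, 0.2, 0.2, 0.1, 0.1 and code-word lengths 2, 2, 2, 3, 3. The huffman sketch above reproduces it:

```python
probs = {"s0": 0.4, "s1": 0.2, "s2": 0.2, "s3": 0.1, "s4": 0.1}
codes = huffman(probs)
print(sum(p * len(codes[s]) for s, p in probs.items()))  # 2.2 (up to float rounding)
```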
Huffman Coding – Exercise
Symbol S0 S1 S2
Probability 0.7 0.15 0.15
Compute the Huffman code.
What is the average code-word length?
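A worked answer: combining S1 and S2 (0.15 + 0.15 = 0.3) leaves only two symbols, so one valid code is S0 = 0, S1 = 10, S2 = 11 (the bit labels may be flipped), giving $\bar{L} = 0.7(1) + 0.15(2) + 0.15(2) = 1.3$ bits per symbol. Checked with the sketch above:

```python
probs = {"S0": 0.7, "S1": 0.15, "S2": 0.15}
codes = huffman(probs)   # bit labels may differ; the lengths are 1, 2, 2
print(sum(p * len(codes[s]) for s, p in probs.items()))  # 1.3 bits per symbol
```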
Huffman Coding – variations
When the probability of the combined symbol is found to equal another probability
in the list, we may proceed by placing the probability of the new symbol as high as
possible or as low as possible.
Huffman Coding – Two variations
Which one to choose? Both placements yield the same average code-word length; the usual rule
is to choose the variant whose code-word lengths have the smaller variance, which is obtained
by moving the probability of the combined symbol as high as possible.
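As an illustration, for the five-symbol source used in the earlier average-length calculation, placing the combined symbol as high as possible yields code-word lengths {2, 2, 2, 3, 3}, while placing it as low as possible yields {1, 2, 3, 4, 4}; both average 2.2 bits, but the variances $\sigma^2 = \sum_k p_k (l_k - \bar{L})^2$ differ, as this sketch shows:

```python
def length_variance(probs, lengths):
    # sigma^2 = sum of p_k * (l_k - Lbar)^2
    avg = sum(p * l for p, l in zip(probs, lengths))
    return sum(p * (l - avg) ** 2 for p, l in zip(probs, lengths))

p = [0.4, 0.2, 0.2, 0.1, 0.1]
print(length_variance(p, [2, 2, 2, 3, 3]))  # 0.16 (combined symbol moved as high as possible)
print(length_variance(p, [1, 2, 3, 4, 4]))  # 1.36 (combined symbol moved as low as possible)
```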
Huffman Coding – Exercise
Symbol S0 S1 S2 S3 S4 S5 S6
Probability 0.25 0.25 0.125 0.125 0.125 0.0625 0.0625
Compute the Huffman code by placing the probability of the combined symbol
as high as possible.
What is the average code-word length?
Huffman Coding – Exercise Answer
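Since all the probabilities here are negative powers of 2 (a dyadic source), every valid Huffman code has lengths $l_k = \log_2(1/p_k)$, i.e. {2, 2, 3, 3, 3, 4, 4}, and the average code-word length equals the entropy, 2.625 bits per symbol:

```python
probs = {"S0": 0.25, "S1": 0.25, "S2": 0.125, "S3": 0.125,
         "S4": 0.125, "S5": 0.0625, "S6": 0.0625}
codes = huffman(probs)
print(sorted(len(w) for w in codes.values()))            # [2, 2, 3, 3, 3, 4, 4]
print(sum(p * len(codes[s]) for s, p in probs.items()))  # 2.625 = H(X)
```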
Ternary Huffman Coding
The same algorithm applies with the code alphabet {0, 1, 2}: at each step the three
least-probable symbols are combined, with dummy zero-probability symbols added if necessary
so that the list reduces to exactly three symbols at the final stage.
Huffman Encoding Efficiency
The entropy $H(X)$ is the best possible average number of bits per symbol. With $n_k$ the
number of bits assigned to symbol $s_k$, the average number of bits per letter is
$$\bar{L} = \sum_k p_k n_k$$
So the efficiency is
$$\eta = \frac{H(X)}{\bar{L}}$$
Redundancy = 1 - Efficiency.
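Worked example, for the five-symbol Huffman code computed earlier ($\bar{L} = 2.2$), reusing the entropy helper defined above:

```python
p = [0.4, 0.2, 0.2, 0.1, 0.1]
H = entropy(p)        # about 2.122 bits per symbol
eta = H / 2.2         # Huffman average code-word length from the earlier example
print(eta, 1 - eta)   # efficiency ~0.965, redundancy ~0.035
```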
Take-home message: Huffman Coding
Lempel-Ziv Coding
• A major difficulty in using the Huffman code is that the symbol probabilities must
be known or estimated, and both the encoder and decoder must know the coding
tree.
• Lempel-Ziv code is an adaptive coding technique that does not require prior
knowledge of symbol probabilities
• Lempel-Ziv coding is the basis of well-known ZIP for data compression (Lossless
coding).
• It performs coding on groups of characters of varying lengths.
• The code assumes that a dictionary exists containing already-coded segments of a
sequence of alphabet symbols. Data is encoded by looking through the existing
dictionary for a match to the next short segment in the sequence being coded, as
sketched below.
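A minimal sketch of this parsing in the style of the example that follows: positions are numbered from 1, the single bits 0 and 1 are pre-stored, and each encoded block is a fixed-length binary pointer to the prefix subsequence followed by the one-bit innovation symbol. (The function name and the two-pass structure are my own.)

```python
import math

def lz_encode(bits):
    # Pass 1: parse into phrases; the dictionary pre-stores '0' and '1'.
    dictionary = {"0": 1, "1": 2}      # subsequence -> position
    phrases, phrase = [], ""
    for bit in bits:
        phrase += bit
        if phrase not in dictionary:   # a new subsequence: store it
            dictionary[phrase] = len(dictionary) + 1
            phrases.append(phrase)
            phrase = ""                # a trailing phrase already stored is left over
    # Pass 2: fixed-length blocks = pointer to the prefix + innovation bit.
    width = math.ceil(math.log2(len(dictionary)))
    return [format(dictionary[p[:-1]], f"0{width}b") + p[-1] for p in phrases]
```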
• LZ Coding Example: the input sequence 000101110010100101… is parsed into the subsequences
0, 1, 00, 01, 011, 10, 010, 100, 101, stored in positions 1 through 9 (0 and 1 are pre-stored).
Note: each encoded block is the binary code of the position of the prefix subsequence
followed by the value of the last bit (i.e., the innovation symbol taken from the
subsequence).
In this example, the binary encoded block in position 9 is 1101. The last bit, 1, is
the innovation symbol. The remaining bits, 110, point to the root subsequence 10
in position 6. Hence, the block 1101 is decoded into 101, which is correct.
The Lempel–Ziv algorithm uses fixed-length codes to represent a variable number of
source symbols; this feature makes the Lempel–Ziv code suitable for
synchronous transmission.
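The matching decoder rebuilds the dictionary on the fly, exactly as in the worked decoding above (a sketch paired with lz_encode):

```python
def lz_decode(blocks):
    dictionary = {1: "0", 2: "1"}          # position -> subsequence
    out = []
    for block in blocks:
        pointer, innovation = int(block[:-1], 2), block[-1]
        phrase = dictionary[pointer] + innovation   # prefix + innovation bit
        dictionary[len(dictionary) + 1] = phrase
        out.append(phrase)
    return "".join(out)
```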
Lempel-Ziv Coding – Exercise
Encode the following sequence using Lempel-Ziv algorithm assuming that 0
and 1 are already stored
11101001100010110100….
Lempel-Ziv Coding – Exercise Answer
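For reference, the lz_encode sketch parses this sequence into the subsequences 11, 10, 100, 110, 00, 101, 1010 (positions 3 through 9 after the pre-stored 0 and 1); decoding recovers the parsed portion of the input:

```python
blocks = lz_encode("11101001100010110100")
print(blocks)
# ['00101', '00100', '01000', '00110', '00010', '01001', '10000']
print(lz_decode(blocks))  # 1110100110001011010 (the final 0 is still being parsed)
```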
Shannon–Fano Coding Technique
Algorithm:
Step 1: Arrange all messages in descending order of probability.
Step 2: Divide the sequence into two groups in such a way that the sums of the
probabilities in each group are nearly equal.
Step 3: Assign 0 to the upper group and 1 to the lower group.
Step 4: Repeat Steps 2 and 3 within each group, and so on, until each group
contains a single message, as sketched below.
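A recursive sketch of Steps 1-4 (the function name and input format are my own; the cut point is chosen so that the two groups' probability sums are as nearly equal as possible):

```python
def shannon_fano(probabilities):
    """Shannon-Fano coding. probabilities: dict symbol -> p."""
    codes = {}

    def split(items, prefix):
        if len(items) == 1:
            codes[items[0][0]] = prefix or "0"
            return
        total = sum(p for _, p in items)
        running, cut, best = 0.0, 1, float("inf")
        for i in range(1, len(items)):
            running += items[i - 1][1]
            diff = abs(2 * running - total)   # |upper-group sum - lower-group sum|
            if diff < best:
                cut, best = i, diff
        split(items[:cut], prefix + "0")      # upper group gets 0
        split(items[cut:], prefix + "1")      # lower group gets 1

    split(sorted(probabilities.items(), key=lambda kv: -kv[1]), "")
    return codes
```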
SF Coding Example-1
• Shannon–Fano does not always produce optimal prefix codes. For this
reason, Shannon–Fano is almost never used.
• Huffman coding is almost as computationally simple and produces prefix
codes that always achieve the lowest expected code word length.
• Shannon–Fano coding is used in the IMPLODE compression method, which
is part of the ZIP file format.
SF Example-2
Message   Pi     Coding procedure   No. of bits   Code
M1        1/2    0                  1             0
M2        1/8    1 0 0              3             100
M3        1/8    1 0 1              3             101
M4        1/16   1 1 0 0            4             1100
M5        1/16   1 1 0 1            4             1101
M6        1/16   1 1 1 0            4             1110
M7        1/32   1 1 1 1 0          5             11110
M8        1/32   1 1 1 1 1          5             11111
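Running the shannon_fano sketch on this source reproduces the code words in the table; since the probabilities are dyadic, the average code-word length equals the entropy:

```python
probs = {"M1": 1/2, "M2": 1/8, "M3": 1/8, "M4": 1/16,
         "M5": 1/16, "M6": 1/16, "M7": 1/32, "M8": 1/32}
codes = shannon_fano(probs)   # {'M1': '0', 'M2': '100', ..., 'M8': '11111'}
print(sum(p * len(codes[m]) for m, p in probs.items()))  # 2.3125 bits per symbol = H(X)
```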
Proof of Source Coding Theorem
The source-coding theorem states that for a discrete memoryless source of entropy $H(X)$,
the average code-word length of any uniquely decodable code satisfies $\bar{L} \ge H(X)$.
Recall the average code-word length of the source encoder:
$$\bar{L} = \sum_{k=0}^{K-1} p_k l_k$$
Courtesy: Archana C
Cont'd…
Consider the difference
$$H(X) - \bar{L} = \sum_{k=0}^{K-1} p_k \log_2 \frac{1}{p_k} - \sum_{k=0}^{K-1} p_k l_k = \sum_{k=0}^{K-1} p_k \log_2 \frac{2^{-l_k}}{p_k}$$
Use the natural logarithm: for every $x > 0$ we have $\ln x \le x - 1$. Since
$\log_2 x = \ln x \cdot \log_2 e$, multiplying both sides by $\log_2 e$ gives
$\log_2 x \le (x - 1)\log_2 e$. Substituting $x = 2^{-l_k}/p_k$,
$$H(X) - \bar{L} \le \log_2 e \sum_{k=0}^{K-1} p_k \left( \frac{2^{-l_k}}{p_k} - 1 \right) = \log_2 e \left( \sum_{k=0}^{K-1} 2^{-l_k} - 1 \right)$$
Using the Kraft–McMillan inequality, $\sum_{k=0}^{K-1} 2^{-l_k} \le 1$ for any uniquely
decodable code, so the right-hand side is at most zero. Hence
$$\bar{L} \ge H(X)$$
with equality if and only if $p_k = 2^{-l_k}$ for all $k$.
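As a numerical sanity check of the theorem (and of the companion bound $\bar{L} < H(X) + 1$ that an optimal code achieves), one can compare the Huffman average length against the entropy of a randomly generated source, reusing the huffman and entropy sketches from earlier:

```python
import random

raw = [random.random() for _ in range(8)]
probs = {f"s{i}": x / sum(raw) for i, x in enumerate(raw)}  # random 8-symbol source
codes = huffman(probs)
avg = sum(p * len(codes[s]) for s, p in probs.items())
H = entropy(probs.values())
print(H <= avg + 1e-9 < H + 1)  # True: H(X) <= Lbar < H(X) + 1
```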
Thanks