Image Compression
Why Do We Need Compression?
Requirements may outstrip the anticipated increase of storage space and bandwidth For data storage and data transmission
DVD Video conference Printer
The bit rate of uncompressed digital cinema data exceeds 1 Gbps
Why Can We Compress?
Spatial redundancy
Neighboring pixels are not independent but correlated
Temporal redundancy
Image Compression
(Bandwidth Compression vs. Bit Rate Reduction)
Reduction of the number of bits needed to represent a given image or its information Image compression exploits the fact that all images are not equally likely
Exploits energy gaps in signal
Information vs Data
REDUNDANT DATA
INFORMATION
DATA = INFORMATION + REDUNDANT DATA
An Image Model-Ref: J.B.ONeal
Picture size is one unit wide by one unit high
Width 1 Unit
M=Number of Samples
D=Spacing Between Samples = 1
Height 1 Unit
1/2
Correlation Between Adjacent Samples
e
M1/2
e M
Compression As It Relates To Image Content
Picture Correlation Distance -1 Portrait 6.3 (Fills 1/2 Frame) Typical 16.7 (Moderate Detail) 100 People 50 2000 People 150
INTERFRAME and INTRAFRAME PROCESSING
Interframe Processing Point to Point
Predictive Encoding
Line to Line
Intraframe Processing
BIT RATE = NQF
N = NUMBER OF PIXELS
Q = QUANTIZATION BITS/PIXEL F = FRAME RATE
Channel Bit Rate Compression Ratio = 10 LOG NQF
REDUCING
CREATES
N
Q
Reduced Resolution
Contouring (Artifacts)
Image Blur
We need More Sophisticated Approaches
Selected Methods for compression
LPC Delta Modulation Bit plane encoding Transform encoding Standards
JPEG MPEG
PREDICTIVE CODING
Predictive Coding
transmit the difference between
estimate of future sample & the sample itself. - Delta modulation - DPCM
- Adaptive predictive coding
- Differential frame coding
Differential Pulse Code Modulation (DPCM)
SIMPLE DELTA MODULATION
x(t)
+ -
(t)
+ y (t)
f (t)
SIMPLE DELTA MODULATION
y(t) x(t)
f(t)
t
BIT PLANE ENCODING
abcdef
a
b c d e
f
TRANSFORM CODING
Transform Coding - transform image - code the coefficients of the transform - transmit them - reconstruct by inverse transform Benefits - transform coeff. relatively uncorrelated - energy is highly compacted - reasonable robust relative to channel errors
Potential Bit Rate Reduction for 525 Line Video Imagery
TYPE OPERATIONS COMMENTS
Decrease N (from 512 to 256)
Decrease Q Decrease F
Resolution Dynamic Range of Display Frame Rate Flicker Results Image Blurring Point to Point Redundancy
4:1 6:5 6:1 (30 Fps 5 Fps)
Delta Modulation
2:1 3:1 6 Bits/Pixel to 2 Bits/Pixel
TYPE Adaptive Intraframe PCM - Delta Optimal Transform Encoding JPEG
OPERATIONS Intraframe Redundancy Intraframe Redundancy Intraframe Redundancy Frame to Frame Redundancy
COMMENTS 5:1 6 Bits/Pixel to 1.2 Bits/Pixel 12:1 6 Bits/Pixel to 1.2 Bits/Pixel 27:1 Hundreds to one
Interframe Processing
COMPRESSION/COST RATIO RANKING RANK Technique Compression/ Compression/ Vs. Cost Ratio 6-Bit PCM 1 D 1 D MOD Adaptive MOD W-H Transform DPCM 3.0 1.1 3.0 3.3
2 D
1 D
0.454
0.429
12.7
3.0
Hybrid Techniques
Delta Modulation of transform coefficients Variable scan rate techniques Contour encoding JPEG and MPEG
Lossless or Lossy Compression
Lossless compression
There is no information loss, and the image can be reconstructed exactly the same as the original Applications: Medical imagery, Archiving
Lossy compression
Information loss is tolerable Many-to-1 mapping in compression eg. quantization Applications: commercial distribution (DVD) and rate constrained environment where lossless methods can not provide enough compression ratio
Standards
JPEG MPEG
Why Do We Need International Standards?
International standardization is conducted to achieve inter-operability .
Only syntax and decoder are specified. Encoder is not standardized and its optimization is left to the manufacturer.
Standards provide state-of-the-art technology that is developed by a group of experts in the field.
Not only solve current problems, but also anticipate the future application requirements.
What Is JPEG?
"Joint Photographic Expert Group". Voted
as international standard in 1992. Works with color and grayscale images, e.g., satellite, medical, ... Lossy and lossless
JPEG
(Intraframe coding)
First generation JPEG uses DCT+Run length Huffman entropy coding. Second generation JPEG (JPEG2000) uses wavelet transform + bit plane coding + Arithmetic entropy coding.
Why DCT Not DFT?
DCT is similar to DFT, but can provide a better approximation with fewer coefficients
The coefficients of DCT are real valued instead of complex valued in DFT.
The 64 (8 X 8) DCT Basis Functions
Each 8x8 block can be looked at as a weighted sum of these basis functions. The process of 2D DCT is also the process of finding those weights.
Zig-zag Scan DCT Blocks
Why? -- To group low frequency coefficients in top of vector. Maps 8 x 8 to a 1 x 64 vector.
Original
JPEG 27:1
JPEG2000 27:1
JPEG Compression Example
Original image
512 x 512 x 8 bits
= 2,097,152 bits
JPEG
27:1 reduction
=77,673 bits
What Is MPEG ?
"Motion Picture Expert Group", established in 1990 to create standard for delivery of audio and video. MPEG-1 : target VHS quality on a CDROM (320 x 240 + CD audio @ 1.5 Mbits/sec) .
MPEG
(Interframe Coding)
Temporal DPCM is used to remove temporal redundancy first. The motion compensated error is coded with DCT+Run length Huffman entropy coding.
MPEG
Temporal redundancy
Prediction along the motion trajectories (motion
compensation prediction)
Motion Estimation
The accuracy of motion estimation has a big influence on coding efficiency. Motion estimation is a very time-consuming work. Some fast algorithms are needed.
Motion Compensated Prediction
First frame Second frame
MPEG-1
I frame Intra-coded frame P frame Inter-coded frame or Predicted frame