IMAGE COMPRESSION & FILE FORMATS
SUHAIL Q. MIR
[email protected]
MSc. IT
Directorate of Information Technology & Support Systems
University of Kashmir
Image Compression ?
Three categories
(Text Conversion)Text in document Images may be
converted from raster to ASCII
(Vectorization) Line Art may be vectorized using
mathematical formulae. Raster to Vector
Bitonal and Continuous tonal still images can be digitized
and encoded using a variety of established Algorithms
Segmentation
(Characters, words, sentences, or Paragraphs)
Paragraph
Segmentation
Word Segmentation
Obama Character
Segmentation
Obama m
Compression
2. Vectorization
Compression
2. Vectorization
3. Image Compression
Two Techniques
Loss Less Image Compression
Lossy Compression
Lossless Image Compression
If the compression and decompression processes induce no information loss,
the compression scheme is lossless; otherwise, it is lossy.
Run-Length Encoding
1 Dimensional
2 Dimensional
Loss Less Image Compression (Run Length
Encoding 1D and 2D)
Lossy Image Compression
Reduction Irrelevant Information + Loss less Compression
Two Techniques :
Reduction of Resolution (Sub-Sampling)
Coarser Description of Each Sample (Quantization)
Lossy Image Compression
Reduction of Resolution (Sub-Sampling)
Th qck brn fx jmpd ovr th lz dg
Lossy Image Compression
Reduction of Resolution (Sub-Sampling)
Th qck brn fx jmpd ovr th lz dg
Lossy Image Compression
Coarser Description of Each Sample (Quantization)
Fractal Compression
A fractal is a never-ending pattern.
In Images A fractal is a natural phenomenon or a
mathematical set that exhibits a repeating pattern at
different scales. This property is called as “Self
Similarity.” Although Fractals are very complex, they
are made by repeating a simple process.
Some Examples
Some IMAGE File Formats
Filename extension .jpg, .jpeg, .jpe .jif, .jfif, .jfi
Magic Number : ff d8 ff
The Rate at which the Pixels are changing
The Rate at which the Pixels are changing
The Rate at which the Pixels are changing
Spatial Domain Frequency Domain
Magic Number : GIF87a/GIF89a
LOSS LESS DATA COMPRESSION
TIFF
is a computer image file format for storing raster graphics images.
The format was originally created by the company Aldus for use
in desktop publishing. Aldus was later acquired by Adobe.
File name extension : .tiff, .tif
TIFF is a flexible, adaptable file format for handling images and data
within a single file. A TIFF file can be a container
holding JPEG (lossy) and PackBits (lossless) compressed images. A
TIFF file also can include a vector-based clipping path (outlines,
croppings, image frames).
PackBits is a fast, simple lossless compression scheme for run-length encoding of data.
Apple introduced the PackBits format with the release of MacPaint on the Macintosh computer.
MAGIC NUMBER
generated by ExifTool
PNG
is a raster graphics file format that supports lossless data
compression.
PNG was created as an improved, non-patented
replacement for Graphics Interchange Format (GIF), and is
the most used lossless image compression format on the
Internet
PNG supports palette-based images (with palettes of 24-
bit RGB or 32-bit RGBA colors and grayscale images
Supports Interlacing
File type Extension : .PNG
The PNG image viewed with a hex editor
Jpeg vs PNG
WINDOWS BITMAP : .BMP
The BMP file format, also known as bitmap image file or
device independent bitmap (DIB) file format or simply a
bitmap, is a raster graphics image file format used to
store bitmap digital images.
Microsoft Windows and OS/2 operating systems.
Filename extension .bmp, .dib
LOSS LESSS COMPRESSION
Magic number
PCX
PCX, standing for Personal Computer Exchange, is
an image file format developed by the now-
defunct ZSoft .
It was the native file format for PC Paintbrush and
became one of the first widely accepted DOS imaging
standards,
PCX files commonly stored palette-indexed images
ranging from 2 or 4 colors to 16 and 256 colors, although
the format has been extended to record true-color (24-
bit) images as well.
LOSS LESSS COMPRESSION