0% found this document useful (0 votes)

20 views5 pages

DMS Report

The report discusses text compression using prefix codes, particularly Huffman coding, to efficiently reduce data size for modern communication systems like SMS and IoT. It outlines the problem statement, real-time applications, theoretical background, solution methodology, and provides a Python implementation for encoding and decoding text. The conclusion highlights the effectiveness of prefix codes in bandwidth-sensitive applications and suggests future enhancements such as integration with messaging apps and performance comparisons with other coding methods.

Uploaded by

tejasteju0012

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views5 pages

DMS Report

Uploaded by

tejasteju0012

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Dr.

AMBEDKAR INSTITUTE OF TECHNOLOGY

(An Autonomous Institute, Affiliated to Visvesvaraya Technological University, Belagavi,
Accredited by NAAC, with ‘A’ Grade)

Near Jnana Bharathi Campus, Bengaluru – 560056

Department of computer science and engineering

REPORT ON

Text Compression Using Prefix Codes

Submitted in partial fulfillment of the award of the Degree of
BACHELOR OF ENGINEERING
in
COMPUTER SCIENCE AND ENGINEERING

SUBMITTED
by

NAME USN
Venkatesh Naik S 1DA24CS416
Sandesh M 1DA23CS151
Tejas L 1DA23CS180
Rohit R P 1DA23CS148

Submitted to :
Vinutha M S

1
Text Compression Using Prefix Codes

Unit Reference: Unit 4 - Introduction to Graph Theory

1. Problem Statement:
In modern communication systems, especially SMS and IoT-based platforms, reducing data
size without losing information is crucial due to limited bandwidth. The goal is to compress
textual data efficiently using prefix codes.

2. Real-Time Application:
Prefix codes (like Huffman coding) are used in:
• SMS compression to save transmission costs

• Chat applications to reduce message payloads

• Embedded systems/IoT devices with minimal memory

3. Theoretical Background:
Prefix codes are binary codes where no code is a prefix of another. Huffman coding is a
greedy algorithm that assigns variable-length codes to input characters based on their
frequencies. Frequently occurring characters are given shorter codes.

4. Solution Methodology:

• Count frequency of each character in the input text

• Build a binary Huffman Tree using a min-heap

• Assign binary codes by traversing the tree

• Encode the text using these codes

• Decode by traversing the tree based on binary input

2
5. Python Implementation:

import heapq

from collections import Counter

class Node:

def init(self, char, freq):

self.char = char

self.freq = freq

self.left = None

self.right = None

def lt(self, other):

return self.freq < other.freq

def build_huffman_tree(text):

freq = Counter(text)

heap = [Node(char, fr) for char, fr in freq.items()]

heapq.heapify(heap)

while len(heap) > 1:

n1 = heapq.heappop(heap)
n2 = heapq.heappop(heap)

merged = Node(None, n1.freq + n2.freq)

merged.left = n1

merged.right = n2

heapq.heappush(heap, merged)

return heap[0]

def generate_codes(node, prefix="", code_map={}):

if node is None:
return

3
if node.char:

code_map[node.char] = prefix

generate_codes(node.left, prefix + "0", code_map)

generate_codes(node.right, prefix + "1", code_map)

return code_map

def encode(text, code_map):

return ''.join(code_map[char] for char in text)

def decode(encoded_text, root):

decoded = ""
node = root
for bit in encoded_text:

node = node.left if bit == "0" else node.right

if node.char:

decoded += node.char

node = root

return decoded

text = "hello hello sms compression"

root = build_huffman_tree(text)

code_map = generate_codes(root)

encoded = encode(text, code_map)

decoded = decode(encoded, root)

print("Prefix Codes:", code_map)

print("Encoded Binary:", encoded)

print("Decoded Text:", decoded)

4
6. Output Sample:

Prefix Codes: {'h': '1011', 'e': '010', 'l': '00', 'o': '111', ' ': '10', 's': '011', 'm': '1101', 'c': '1100', 'p':
'1000', 'r': '1001', 'i': '1010', 'n': '1110'}

Encoded Binary: 1011010000...

Decoded Text: hello hello sms compression

7. Conclusion:
Using prefix codes like Huffman coding allows significant reduction in the size of textual
data, making it suitable for bandwidth-sensitive applications like SMS and IoT.

8. Future Scope:

• Integrate with real-time messaging apps

• Compare with arithmetic coding or LZW for performance

• Extend to multimedia compression

******************************Thank you********************************

Adp Huffman Coding
No ratings yet
Adp Huffman Coding
15 pages
2 Huff
No ratings yet
2 Huff
3 pages
Assignment3 DSA
No ratings yet
Assignment3 DSA
3 pages
Rakib Project
No ratings yet
Rakib Project
14 pages
Huffman Coding Notes
No ratings yet
Huffman Coding Notes
7 pages
5.2 Huffman Algorithm
No ratings yet
5.2 Huffman Algorithm
12 pages
Graph Theory - Important Application of Trees Huffman Coding
No ratings yet
Graph Theory - Important Application of Trees Huffman Coding
50 pages
Data Compression
No ratings yet
Data Compression
18 pages
L10 Huffman Encoding Greedy
No ratings yet
L10 Huffman Encoding Greedy
52 pages
Mini Project
No ratings yet
Mini Project
26 pages
Huffman Coding
No ratings yet
Huffman Coding
40 pages
Getting Started: Huffman Coding
No ratings yet
Getting Started: Huffman Coding
5 pages
DAA Lab Practice 2
No ratings yet
DAA Lab Practice 2
15 pages
Huffman
No ratings yet
Huffman
24 pages
Postgrad Guide to Huffman Coding
No ratings yet
Postgrad Guide to Huffman Coding
13 pages
Discrete Mathematics
No ratings yet
Discrete Mathematics
51 pages
Data Structure: Huffman Tree:Project Submitted To: Sir Abdul Wahab
No ratings yet
Data Structure: Huffman Tree:Project Submitted To: Sir Abdul Wahab
24 pages
5c. Huffman
No ratings yet
5c. Huffman
13 pages
Huffman Coding
No ratings yet
Huffman Coding
65 pages
Huffman Code
No ratings yet
Huffman Code
7 pages
Samarth Adatia MLSP Exp2
No ratings yet
Samarth Adatia MLSP Exp2
14 pages
Assignment No-05
No ratings yet
Assignment No-05
3 pages
Huffman Code
No ratings yet
Huffman Code
5 pages
Huffman
No ratings yet
Huffman
70 pages
Report
No ratings yet
Report
43 pages
Huffman Coding: Greedy Algorithm Guide
No ratings yet
Huffman Coding: Greedy Algorithm Guide
27 pages
7.4 Huffman Coding
No ratings yet
7.4 Huffman Coding
26 pages
Coding Kompresi File Dokumen
No ratings yet
Coding Kompresi File Dokumen
8 pages
04huffman 2x2
No ratings yet
04huffman 2x2
6 pages
Algorithm Analysis of Huffman Coding Using Python
No ratings yet
Algorithm Analysis of Huffman Coding Using Python
16 pages
Huffman Assign (Hifza 117)
No ratings yet
Huffman Assign (Hifza 117)
6 pages
Huffman Code
No ratings yet
Huffman Code
25 pages
Huffman Coding for Tech Students
No ratings yet
Huffman Coding for Tech Students
77 pages
Big Homework 2: General Mentions
No ratings yet
Big Homework 2: General Mentions
11 pages
11 Huffman Coding
No ratings yet
11 Huffman Coding
25 pages
Data Compression
No ratings yet
Data Compression
28 pages
Compression: Another Example of Greedy Algorithm: Huffman Codes
No ratings yet
Compression: Another Example of Greedy Algorithm: Huffman Codes
4 pages
Huffman Code
No ratings yet
Huffman Code
29 pages
Huffman Coding Compression Intro
No ratings yet
Huffman Coding Compression Intro
4 pages
Huffman Coding
No ratings yet
Huffman Coding
7 pages
Huffman Codes and Its Implementation: Submitted by Kesarwani Aashita Int. M.Sc. in Applied Mathematics (3 Year)
No ratings yet
Huffman Codes and Its Implementation: Submitted by Kesarwani Aashita Int. M.Sc. in Applied Mathematics (3 Year)
28 pages
2 2 5huffman
No ratings yet
2 2 5huffman
52 pages
Group Assignment Multimedia System
No ratings yet
Group Assignment Multimedia System
26 pages
Unit 2
No ratings yet
Unit 2
28 pages
Huffman Encoding Project Report
No ratings yet
Huffman Encoding Project Report
36 pages
Huffman Coding: Efficient Encoding Algorithm
No ratings yet
Huffman Coding: Efficient Encoding Algorithm
16 pages
Codes
No ratings yet
Codes
16 pages
Ex 7 Daa
No ratings yet
Ex 7 Daa
8 pages
L8 - Huffman Algorithm
No ratings yet
L8 - Huffman Algorithm
52 pages
Huffman Code Greedy Approach
No ratings yet
Huffman Code Greedy Approach
15 pages
Huffman Coding for Beginners
No ratings yet
Huffman Coding for Beginners
10 pages
Design and Analysis of Algorithms (COM336) : Huffman Coding
No ratings yet
Design and Analysis of Algorithms (COM336) : Huffman Coding
1 page
Huffman Coding Explained
No ratings yet
Huffman Coding Explained
45 pages
Huffman Coding
No ratings yet
Huffman Coding
10 pages
Huffman Encoding: WWW - Cis.Upenn - Edu/ Matuszek/Cit594-2002/SLIDES/HUFFMAN
No ratings yet
Huffman Encoding: WWW - Cis.Upenn - Edu/ Matuszek/Cit594-2002/SLIDES/HUFFMAN
13 pages
Data Compression Chapter 7
No ratings yet
Data Compression Chapter 7
40 pages
Huffman Coding
No ratings yet
Huffman Coding
3 pages
CS301 Lec26
No ratings yet
CS301 Lec26
30 pages
Ccs353 Mdcs Lab Manual
No ratings yet
Ccs353 Mdcs Lab Manual
30 pages
Vetcare
No ratings yet
Vetcare
18 pages
Module1 DSDV
No ratings yet
Module1 DSDV
95 pages
A Project Report ON Coaching Management System
100% (1)
A Project Report ON Coaching Management System
66 pages
2 Smartforms
No ratings yet
2 Smartforms
7 pages
Chapter 4 Overview of Preventive Maintenance
No ratings yet
Chapter 4 Overview of Preventive Maintenance
14 pages
Google Form CAI611 PC4.1 - 4.10
No ratings yet
Google Form CAI611 PC4.1 - 4.10
1 page
Module 2 - Flowcharts and Algorithms
100% (1)
Module 2 - Flowcharts and Algorithms
23 pages
Ais CH 3
No ratings yet
Ais CH 3
39 pages
OrionSX-Datasheet 083022
No ratings yet
OrionSX-Datasheet 083022
2 pages
Hackathon 2025
No ratings yet
Hackathon 2025
2 pages
Pure+Moderation Brochure+General+2020+
No ratings yet
Pure+Moderation Brochure+General+2020+
20 pages
Digital Literacy
No ratings yet
Digital Literacy
19 pages
Dynamic Planning With A LLM
No ratings yet
Dynamic Planning With A LLM
9 pages
Software Requirement Specification
No ratings yet
Software Requirement Specification
19 pages
Checkmate Iv Celox Checkmate Iv Quik-Cup
100% (1)
Checkmate Iv Celox Checkmate Iv Quik-Cup
4 pages
Western Australian Junior (WAJO) 2000-20 With Solutions PDF
No ratings yet
Western Australian Junior (WAJO) 2000-20 With Solutions PDF
221 pages
Types and Benefits of Application Software
No ratings yet
Types and Benefits of Application Software
6 pages
Day 7 Task: Understanding Package Manager and Systemctl: Tasks
No ratings yet
Day 7 Task: Understanding Package Manager and Systemctl: Tasks
6 pages
Character Reference
No ratings yet
Character Reference
2 pages
HW3 PDF
No ratings yet
HW3 PDF
1 page
6298 Schematics List
No ratings yet
6298 Schematics List
2 pages
Sayali 2
No ratings yet
Sayali 2
49 pages
Software Process Models Guide
No ratings yet
Software Process Models Guide
30 pages
Lesson Plan - Initial Configuration of Sophos Firewall
No ratings yet
Lesson Plan - Initial Configuration of Sophos Firewall
2 pages
Mixed Signal Integrated Circuit Design
100% (1)
Mixed Signal Integrated Circuit Design
1 page
Citroen 2 CV
100% (6)
Citroen 2 CV
49 pages
Electronics Engineer Internship Letter
No ratings yet
Electronics Engineer Internship Letter
2 pages
Ar
No ratings yet
Ar
10 pages
Oces DGFS-2025
No ratings yet
Oces DGFS-2025
3 pages
AIML Lab: Regression Models Guide
No ratings yet
AIML Lab: Regression Models Guide
7 pages

DMS Report

Uploaded by

DMS Report

Uploaded by

Dr.

AMBEDKAR INSTITUTE OF TECHNOLOGY

Near Jnana Bharathi Campus, Bengaluru – 560056

Department of computer science and engineering

Text Compression Using Prefix Codes

Unit Reference: Unit 4 - Introduction to Graph Theory

• Chat applications to reduce message payloads

• Embedded systems/IoT devices with minimal memory

• Count frequency of each character in the input text

• Build a binary Huffman Tree using a min-heap

• Assign binary codes by traversing the tree

• Encode the text using these codes

• Decode by traversing the tree based on binary input

from collections import Counter

def __init__(self, char, freq):

def __lt__(self, other):

heap = [Node(char, fr) for char, fr in freq.items()]

while len(heap) > 1:

merged = Node(None, n1.freq + n2.freq)

def generate_codes(node, prefix="", code_map={}):

generate_codes(node.left, prefix + "0", code_map)

generate_codes(node.right, prefix + "1", code_map)

def encode(text, code_map):

return ''.join(code_map[char] for char in text)

def decode(encoded_text, root):

node = node.left if bit == "0" else node.right

text = "hello hello sms compression"

encoded = encode(text, code_map)

decoded = decode(encoded, root)

print("Prefix Codes:", code_map)

print("Decoded Text:", decoded)

Encoded Binary: 1011010000...

Decoded Text: hello hello sms compression

• Integrate with real-time messaging apps

• Extend to multimedia compression

You might also like

def init(self, char, freq):

def lt(self, other):