
For a linear transformation y = Hx of a P-dimensional zero-mean Gaussian random vector x with covariance matrix Σ_xx, the pdf of y is

f_Y(y) = \frac{1}{(2\pi)^{P/2}\,|\Sigma_{xx}|^{1/2}\,|H|} \exp\left(-\frac{1}{2}\, y^T (H^{-1})^T \Sigma_{xx}^{-1} H^{-1} y\right) = \frac{1}{(2\pi)^{P/2}\,|\Sigma_{yy}|^{1/2}} \exp\left(-\frac{1}{2}\, y^T \Sigma_{yy}^{-1} y\right)          (2.148)
where Σ_yy = H Σ_xx H^T. Note that a linear transformation of a Gaussian process yields another Gaussian process.
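As a quick numerical check of this result (not part of the original text; it assumes NumPy and an arbitrarily chosen H and Σ_xx), the following sketch draws samples of a zero-mean Gaussian vector x, transforms them through H, and compares the sample covariance of y = Hx with H Σ_xx H^T.

```python
import numpy as np

# Check that y = Hx, with x zero-mean Gaussian with covariance Sigma_xx,
# has covariance Sigma_yy = H Sigma_xx H^T (illustrative matrices only).
rng = np.random.default_rng(0)

P = 3
H = np.array([[1.0, 0.5, 0.0],
              [0.0, 1.0, 0.3],
              [0.2, 0.0, 1.0]])
Sigma_xx = np.array([[2.0, 0.4, 0.1],
                     [0.4, 1.0, 0.2],
                     [0.1, 0.2, 0.5]])

# Draw many samples of x and pass them through the linear transformation.
x = rng.multivariate_normal(mean=np.zeros(P), cov=Sigma_xx, size=200_000)
y = x @ H.T

Sigma_yy_theory = H @ Sigma_xx @ H.T
Sigma_yy_sample = np.cov(y, rowvar=False)

# The difference should be small (sampling error only).
print(np.max(np.abs(Sigma_yy_sample - Sigma_yy_theory)))
```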

Applications

In this section we consider applications of probability models in signal coding, citation indexing, pattern recognition and noise reduction.

Entropy Coding

Consider a communication system with an alphabet X of M symbols x1, x2, ..., xM with probabilities P1 = P(x1), P2 = P(x2), ..., PM = P(xM). The entropy of X is

H(X) = -\sum_{i=1}^{M} P(x_i) \log_2 P(x_i)

Entropy gives the minimum number of bits required to encode the source. This theoretical minimum is usually approached by encoding N samples of the process simultaneously with K bits, where K/N ≥ H(X); as N becomes large, K/N approaches the entropy H(X) of X for an efficient coder.

The simplest method of encoding an M-valued variable is a fixed-length coding scheme that uniformly assigns N binary digits to each of the M values, with N = Nint(log2 M), where Nint denotes rounding up to the nearest integer. The efficiency of a coding scheme in terms of its entropy is defined as H(X)/N. When N = H(X), the coding efficiency is H(X)/N = 1, or 100%.

When the source symbols are not equally probable, a more efficient method is entropy coding. Entropy coding is a variable-length coding procedure that assigns codewords of variable lengths to the symbols xk, such that more probable symbols, which occur more frequently, are assigned shorter codewords, and less probable symbols, which occur less frequently, are assigned longer codewords.
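As a small worked illustration of these definitions (not from the original text; it uses the five-symbol source of Example 2.23 below), the following Python fragment computes H(X), the fixed-length codeword size N, the resulting coding efficiency H(X)/N and the maximum bit-rate saving log2 M − H(X).

```python
import math

# Entropy of a discrete source and the efficiency of a fixed-length code.
# Probabilities are those of the five-symbol source in Example 2.23.
probs = [0.4, 0.2, 0.2, 0.1, 0.1]

H = -sum(p * math.log2(p) for p in probs)   # entropy, bits/symbol (~2.12)
M = len(probs)
N = math.ceil(math.log2(M))                 # fixed-length codeword size (3)

print(f"H(X)              = {H:.2f} bits/symbol")
print(f"fixed-length N    = {N} bits/symbol")
print(f"coding efficiency = {H / N:.2f}")                    # H(X)/N
print(f"max rate saving   = {math.log2(M) - H:.2f} bits/symbol")
```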

Figure 2.18 Illustration of a tree code and Huffman coding: symbols x1, x2, x3, x4, x5 with probabilities 0.4, 0.2, 0.2, 0.1, 0.1 are assigned the codewords 0, 10, 110, 1110 and 1111 respectively.

An example of such a code is the Morse code, which dates back to the nineteenth century. If the entropy coding is ideal, the bit rate at the output of a uniform M-level quantiser can be reduced by an amount of log2 M − H(X).

A simple form of entropy coding is Huffman coding, which creates an efficient set of prefix codes for a given source. The ease with which Huffman codes can be created and used makes them an extremely popular tool for data compression. The procedure consists of arranging the source symbols in a column in decreasing order of probability. The two lowest-probability entries are combined by drawing a line out from each and connecting them; the combined probability is then treated as that of a single symbol, and the combination step is repeated until all symbols have been merged into a single node. Binary codewords are assigned by moving from the root of the tree at the right-hand side towards the left, assigning a 1 to the lower branch and a 0 to the upper branch at each point where a pair of entries has been combined.
Example 2.23 Given 5 symbols x1, x2, ..., x5 with probabilities P(x1) = 0.4, P(x2) = P(x3) = 0.2 and P(x4) = P(x5) = 0.1, design a binary variable-length code for this source. Figure 2.18 illustrates the design of a Huffman code for this source.

The average codeword length = 4×0.1 + 4×0.1 + 3×0.2 + 2×0.2 + 1×0.4 = 2.2 bits/symbol

The entropy of X is H(X) ≈ 2.12 bits/symbol, so the average codeword length of 2.2 bits/symbol is close to this minimum possible value. We can get closer to the minimum average codeword length by encoding pairs of letters, or blocks of more than two symbols at a time, at the cost of added complexity. The Huffman code has an important prefix property: no codeword is a prefix, i.e. an initial part, of another codeword. Codewords can therefore be readily concatenated (in a comma-free fashion) and be uniquely and unambiguously decoded.
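For readers who wish to experiment, here is a minimal Python sketch of the Huffman construction (an illustration, not the book's own code). Because several symbols have equal probabilities, the individual codewords it produces may differ from those in Figure 2.18, but any Huffman code for this source has the same average length of 2.2 bits/symbol.

```python
import heapq

def huffman_code(probabilities):
    """Build a binary Huffman code for a dict mapping symbol -> probability."""
    # Heap entries: (probability, tie-break counter, subtree), where a subtree
    # is either a symbol or a (subtree, subtree) pair.
    heap = [(p, i, sym) for i, (sym, p) in enumerate(probabilities.items())]
    heapq.heapify(heap)
    count = len(heap)
    while len(heap) > 1:
        # Combine the two lowest-probability entries into a single node.
        p1, _, t1 = heapq.heappop(heap)
        p2, _, t2 = heapq.heappop(heap)
        heapq.heappush(heap, (p1 + p2, count, (t1, t2)))
        count += 1

    codes = {}
    def assign(tree, prefix):
        # Walk the tree, appending 1 for one branch and 0 for the other.
        if isinstance(tree, tuple):
            assign(tree[0], prefix + "1")
            assign(tree[1], prefix + "0")
        else:
            codes[tree] = prefix or "0"
    assign(heap[0][2], "")
    return codes

probs = {"x1": 0.4, "x2": 0.2, "x3": 0.2, "x4": 0.1, "x5": 0.1}
codes = huffman_code(probs)
avg_len = sum(probs[s] * len(codes[s]) for s in probs)
print(codes)
print(f"average codeword length = {avg_len:.2f} bits/symbol")  # 2.20
```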
Web Page Citation Ranking and Indexing

Search engines on the world wide web have to sort billions of web pages and websites. A good set of search keywords focuses the search on documents and websites that contain the search words. However, the problem remains that the contents of many websites are not of the required quality, and there are also misleading websites containing words aimed at attracting visitors in order to increase hit rates or advertising revenue. For efficient information management, websites and their information content need to be ranked using an objective quality measure. An objective measure of the quality of published information on any medium is citation, which has long been used in academic research. A map of the hyperlinks and pointers on the web allows rapid calculation of a web page's rank in terms of citations. Page rank is a good way to prioritise the results of web keyword searches.
Citation Ranking in Web Page Rank Calculation

Search engines usually find many web pages that contain the search keywords. The problem is how to present the web links containing the search keywords in rank order, such that the rank of a page represents a measure of the quality of the information on the page. The relevance of a web page containing the search text string can be determined from the following analysis of the page:

(a) A page title containing the search words is an indicator of the relevance of the topic of the page, but not of its quality.

(b) The number of times the search words are mentioned in the web page is also an indicator of relevance.

(c) The number of citations of the web page from other web pages is an objective indicator of its quality as perceived by web users.

(d) Each citation of a web page can be weighted by its importance, which is itself the weighted citation count of the citing page.

The simplest way to rank a web page is to count the total number of citation links pointing to that page and then divide this by the total number of citation links on the web. This method would rank a web page using a simple probability measure, defined as the frequency of citation links. However, as with the tradition of academic research, a citation itself needs to be weighted by the quality of the source of the citation, i.e. by the citation ranking of the source itself. A weighted citation gives some approximation of a page's importance or quality, where each source of citation is weighted by its own citation ranking. Let PR(A) denote the page rank of a web page A, and assume that pages T1, ..., Tn point to it. The page rank of A can be defined as
PR(A) = (1-d) + d (PR(T1)/C(T1) + ... + PR(Tn)/C(Tn))

where C(T) is defined as the number of links going out of page T. The parameter d is a damping factor which can be set between 0 and 1; usually d is set to 0.85. Note that the PageRanks form a probability distribution over web pages, so the sum of all web pages' PageRanks will be one. PageRank, or PR(A), can be calculated using a simple iterative algorithm, and corresponds to the principal eigenvector of the normalised link matrix of the web.

PageRank can be thought of as a model of user behaviour. It is assumed there is a "random surfer" who is given a web page at random and keeps clicking on links, never hitting "back", but eventually gets bored and starts on another random page. The probability that the random surfer visits a page is its PageRank, and the damping factor d is the probability at each page that the random surfer will get bored and request another random page. One important variation is to add the damping factor d to only a single page, or a group of pages. This allows for personalisation and can make it nearly impossible to deliberately mislead the system in order to get a higher ranking.

Another intuitive justification is that a page can have a high PageRank if there are many pages that point to it, or if there are some pages that point to it which themselves have a high PageRank. Intuitively, pages that are well cited from many places around the web are worth looking at. Also, pages that have perhaps only one citation, from something like the Yahoo homepage, are also generally worth looking at.
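A minimal sketch of the iterative calculation described above is shown below; the four-page link graph, the page names and the number of iterations are invented purely for illustration.

```python
# Iterative PageRank following the formula stated above:
# PR(A) = (1 - d) + d * sum over citing pages T of PR(T)/C(T).
# links[page] lists the pages that `page` points to (hypothetical graph).
links = {
    "A": ["B", "C"],
    "B": ["C"],
    "C": ["A"],
    "D": ["C"],
}

d = 0.85                                   # damping factor
pages = list(links)
pr = {p: 1.0 / len(pages) for p in pages}  # initial guess

for _ in range(50):                        # iterate until roughly converged
    new_pr = {}
    for p in pages:
        # Sum PR(T)/C(T) over every page T that links to p.
        incoming = sum(pr[t] / len(links[t]) for t in pages if p in links[t])
        new_pr[p] = (1 - d) + d * incoming
    pr = new_pr

print(pr)   # pages with more (and better-ranked) in-links get higher PR
```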

If a page was not of high quality, or was a broken link, it is quite likely that Yahoo's homepage would not link to it. PageRank handles both these cases, and everything in between, by recursively propagating weights through the link structure of the web.

2.7 Summary

The theory of statistical processes is central to the development of signal processing algorithms. We began this chapter with basic definitions of deterministic signals, random signals and random processes. A random process generates random signals, and the collection of all signals that can be generated by a random process is the space of the process. Probabilistic models and statistical measures, originally developed for random variables, were extended to model random signals. Although random signals are completely described in terms of probabilistic models, for many applications it may be sufficient to characterise a process in terms of a set of relatively simple statistics such as the mean, the autocorrelation function, the covariance and the power spectrum. Much of the theory and application of signal processing is concerned with the identification, extraction and utilisation of structures and patterns in a signal process. The correlation function and its Fourier transform, the power spectrum, are particularly important because they can be used to identify patterns in a stochastic process.

We considered the concepts of stationary, ergodic stationary and non-stationary processes. The concept of a stationary process is central to the theory of linear time-invariant systems, and furthermore even non-stationary processes can be modelled with a chain of stationary sub-processes, as described in Chapter 5 on hidden Markov models. For signal processing applications, a number of useful pdfs, including the Gaussian, the mixture Gaussian, the Markov and the Poisson processes, were considered. These pdf models are employed extensively in the remainder of this book. Signal processing normally involves the filtering or transformation of an input signal to an output signal; we derived general expressions for the pdf of the output of a system in terms of the pdf of the input. We also considered some applications of stochastic processes for modelling random noise such as white noise, clutter, shot noise and impulsive noise.