INDIAN INSTITUTE OF TECHNOLOGY ROORKEE
NAME OF DEPT. /CENTRE: Electronics and Computer Engineering
1. Subject Code: EC 518N Course Title: Speech and Audio Processing
2. Contact Hours: L: 3 T: 0 P: 0
3. Examination Duration (Hrs.): Theory 0 3 Practical 0 0
4. Relative Weight: CWS 15 PRS 00 MTE 35 ETE 50 E PRE 00
5. Credits: 0 3 6. Semester
Autumn Spring Both
7. Pre-requisite: EC - 311 and EC - 411 or equivalent
8. Subject Area: MSC
9. Objective: To acquaint the students with the concepts in speech and audio processing, and their
applications in communication systems.
10. Details of the Course:
Sl. Contents Contact
No. Hours
1. Digital speech processing and its applications, production and classification of 7
speech sounds, lossless tube models, digital models for speech signals;
Analysis and synthesis of pole-zero speech models, Levinson recursion, lattice
synthesis filter.
2. Time dependent processing of speech, pitch period estimation, frequency 6
domain pitch estimation; Discrete-time short-time Fourier transform and its
application, phase vocoder, channel vocoder.
3. Homomorphic speech processing, waveform coders, hybrid coders and vector 9
quantization of speech; Model based coding: Linear predictive, RELP, MELP,
CELP; Speech synthesis.
4. Principles of speech recognition, spectral distance measures, dynamic time 7
warping, word recognition using phoneme units, hidden Markov models and
word recognition, speech recognition systems, speaker recognition.
5. Ear physiology, psychoacoustics, perception model and auditory system as 7
filter bank; Filter bank design and modified discrete cosine transform
algorithm for audio compression in MP3 and AAC coders; Standards for high-
fidelity audio coding.
6. Tree-structured filter banks, multicomplementary filter banks; Properties of 6
wavelets and scaling functions, wavelet transform; Filter banks and wavelets,
applications of wavelet signal processing in audio and speech coding.
Total 42
11. Suggested Books:
Sl. Name of Books / Authors Year of
No. Publication
1. Rabiner, L.R. and Schafer, R.W., Digital Processing of Speech 2006
Signals, Pearson Education.
2. Quatieri, T.F., Discrete-Time Speech Signal Processing: Principles and 2002
Practice, Pearson Education.
3. Furui, S., Digital Speech Processing, Synthesis and Recognition, 2nd 2000
Ed., CRC Press.
4. Fliege, N.J., Multi Rate Digital Signal Processing, John Wiley & 1999
Sons.
5. Spanias, A., Painter, T. and Venkatraman, A., Audio Signal Processing 2007
and Coding, John Wiley & Sons.
6. Gold, B. and Morgan, N., Speech and Audio Signal Processing, John 2002
Wiley & Sons.