International Journal of Computer Theory and Engineering, Vol. 2, No. 4, August 2010, 1793-8201
Exploring the Discrete Wavelet Transform as a Tool for Hindi Speech Recognition
Shivesh Ranjan
Abstract—In this paper, we propose a new scheme for the recognition of isolated words in Hindi speech, based on the Discrete Wavelet Transform (DWT). We first compute the DWT coefficients of the speech signal, and then calculate Linear Predictive Coding (LPC) coefficients of those DWT coefficients. Our scheme then applies the K-means algorithm to the resulting LPC coefficients to form a vector quantized (VQ) codebook. A spoken Hindi word is recognized by first computing its DWT coefficients, then calculating the LPC coefficients of those DWT coefficients, and finally deciding in favor of the Hindi word whose corresponding centroid in the VQ codebook gives the minimum squared Euclidean distance from the word under test.

Index Terms—discrete wavelet transform, linear predictive coding, vector quantization, Hindi, speech recognition.
I. INTRODUCTION

Hindi is the most widely spoken language in India; a speech recognition scheme for Hindi is therefore expected to find widespread use in diverse fields such as railway ticket reservation, cellular-phone-based banking services, and air-ticket reservation. Our approach is primarily concerned with exploring a new feature extraction method using the Discrete Wavelet Transform (DWT) and the calculation of Linear Predictive Coding (LPC) coefficients. We have avoided Hidden Markov Models (HMMs) [2,4] in our scheme, as our main focus is confined to showing how features derived by applying the DWT can be used to recognize Hindi speech. Earlier work on Hindi speech recognition using wavelets [3] has also employed linear prediction on the DWT coefficients, but our approach does not calculate linear prediction coefficients separately for the approximation and detail coefficients. Instead, we find the LPC coefficients of the DWT coefficients in a manner very similar to that used for finding the LPC coefficients of an actual speech signal [4]. The use of the DWT for speech recognition has also been investigated in [8].

To demonstrate our scheme, we first constructed a database of 10 Hindi words (the numbers 1 through 10 in Hindi) sampled at 8 kHz. Ten samples of each word were taken, giving a 100-word database. DWT and LPC analysis was carried out on each of the words, followed by the K-means algorithm [1,4] to form a 10-entry VQ codebook. A different set of 100 samples (of the same ten words) was then taken, and recognition was attempted for each word in the test set; thus, a total of 100 recognition attempts were made. All the Hindi speech samples, both those used to construct the VQ codebook and the 100 test samples of the ten Hindi words, were taken from the same speaker (an adult male native speaker).

Shivesh Ranjan is with Electronics and Communication Engineering, Manipal Institute of Technology, Manipal 576104, India (e-mail: [email protected]).

II. THE DISCRETE WAVELET TRANSFORM

The DWT can be used for multi-resolution analysis (MRA) [5,6], in which a given signal is decomposed into what are known as approximation and detail coefficients. A given function f(t) satisfying certain conditions [5] can be expressed through the representation

f(t) = \sum_{k} a(L,k)\,\phi_{L,k}(t) + \sum_{j=1}^{L} \sum_{k} d(j,k)\,\psi_{j,k}(t),

where \psi(t) is the mother wavelet, \phi(t) is the scaling function, and \phi_{L,k}(t) = 2^{-L/2}\,\phi(2^{-L}t - k) and \psi_{j,k}(t) = 2^{-j/2}\,\psi(2^{-j}t - k) are their dilated and translated versions. a(L,k) is called the approximation coefficient at scale L, and d(j,k) is called the detail coefficient at scale j. The approximation and detail coefficients can be expressed as

a(L,k) = \int f(t)\,\phi_{L,k}(t)\,dt, \qquad d(j,k) = \int f(t)\,\psi_{j,k}(t)\,dt.

Based on the choice of the mother wavelet \psi(t) and scaling function \phi(t), different families of wavelets can be constructed [5,6,9,10]. We used three distinct families of DWTs in our recognition scheme: the Daubechies wavelets (db), the discrete Meyer wavelet (dmey), and the Coiflets (coif).
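To make the decomposition concrete, the sketch below computes the multi-level DWT of a speech signal and concatenates the coefficients in the order used later in this paper. It assumes the PyWavelets (pywt) library and NumPy; the paper does not name the toolkit it used, so the function and its defaults are illustrative only.

```python
# Sketch: multi-level DWT decomposition of a speech signal, assuming
# the PyWavelets (pywt) library; the paper does not specify a toolkit.
import numpy as np
import pywt

def dwt_decompose(signal, wavelet="db8", level=5):
    """Return DWT coefficients ordered as in the paper: the highest-level
    approximation coefficients first, then the detail coefficients from
    the highest level down to level 1."""
    # wavedec already returns [cA_L, cD_L, cD_{L-1}, ..., cD_1]
    coeffs = pywt.wavedec(signal, wavelet, level=level)
    return np.concatenate(coeffs)

# Example: one second of (placeholder) speech sampled at 8 kHz.
speech = np.random.randn(8000)  # stand-in for a real utterance
features = dwt_decompose(speech, wavelet="db8", level=5)  # db8 Lev5
print(features.shape)
```

The same call with wavelet="coif5" or wavelet="dmey" covers the other two families used here.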
TABLE I. THE TEN HINDI WORDS AND THE SYMBOLS USED IN THE PAPER

Number   Hindi Word   Symbol Used in the Paper
1        ek           one
2        do           two
3        teen         three
4        char         four
5        paanch       five
6        chhae        six
7        saat         seven
8        aath         eight
9        nau          nine
10       dus          ten
III. SPEECH DATABASE CONSTRUCTION AND DWT COEFFICIENTS COMPUTATION

A. Construction of the Database
An adult male, native speaker of Hindi was asked to utter the Hindi words (1 through 10, see Table I), and his voice was sampled at 8 kHz. The speech signal of each word was then isolated from silence. The samples were stored in ascending order: first the ten samples corresponding to the word one (ek), then the ten samples of two, and so on.
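The paper does not describe its silence-removal procedure; the following is a minimal sketch of one common energy-based approach, with an illustrative frame length and threshold, assuming NumPy.

```python
# Sketch: simple energy-based endpointing to isolate a word from silence.
# The paper does not describe its silence-removal method; the frame size
# and threshold used here are illustrative assumptions.
import numpy as np

def trim_silence(signal, frame_len=80, threshold_ratio=0.02):
    # Split the signal into non-overlapping frames and compute frame energy.
    frames = signal[: len(signal) // frame_len * frame_len].reshape(-1, frame_len)
    energy = (frames ** 2).sum(axis=1)
    # Keep everything between the first and last "active" frames.
    active = np.where(energy > threshold_ratio * energy.max())[0]
    if active.size == 0:
        return signal
    start, end = active[0] * frame_len, (active[-1] + 1) * frame_len
    return signal[start:end]
```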
B. Calculating the DWT Approximation and Detail Coefficients
Each of the 100 speech samples was then decomposed into approximation and detail coefficients using the DWT. Five different sets of decomposition were carried out on each of the 100 speech samples, using five different DWTs from three wavelet families. Three of the five decompositions used the Daubechies wavelets, as they have been reported to be highly successful in wavelet-based speech compression schemes [7]. A single decomposition of the 100 samples was performed with each of the remaining two families, the Coiflets and the discrete Meyer wavelets. Fig. 1 shows this decomposition process. The DWTs and the symbols used for them in this paper are:

  Daubechies wavelets [9]: Daubechies8, 3-level decomposition (db8 Lev3); Daubechies8, 5-level decomposition (db8 Lev5); Daubechies10, 5-level decomposition (db10 Lev5)
  Coiflets [9]: Coiflets5, 5-level decomposition (coif5 Lev5)
  Discrete Meyer wavelets [10]: discrete Meyer, 5-level decomposition (dmey Lev5)

Figure 1. Decomposition of speech signal using DWTs

IV. FORMATION OF VQ CODEBOOK AND TESTING

We obtained five sets of DWT coefficients from the previous step. Each set had 100 entries, and each entry was the collection of DWT coefficients of the speech signal from which it was derived.

A. Computing LPC Coefficients from the Approximation and Detail Coefficients
To compute LPC coefficients from the DWT coefficients, we employed a method similar to that used for finding the LPC coefficients of speech signals [4]. The DWT coefficients of each speech signal were arranged in descending order of scale: first the highest-level approximation coefficients, then the detail coefficients of the same level, followed by the detail coefficients of successively lower levels. The DWT coefficients were then divided into frames 160 samples long, with an overlap of 80 samples between successive frames, and each frame was multiplied by a 160-point Hamming window. Unlike with speech signals, no pre-emphasis [4] was applied. The 10th-order LPC coefficients of each frame were computed, and the process was carried out on the first 9 frames of the DWT coefficients of each speech signal. We thus obtained 90 LPC coefficients per speech signal, ten for each of the nine 160-sample frames; in effect, we used the first 800 terms of each of the 100 sets of DWT coefficients. At the end of this stage, we had five sets of LPC coefficients, each with 100 rows (ten utterances of each of the ten words) and 90 columns (the 90 LPC coefficients derived from the DWT coefficients of each speech signal). These sets were then used to construct their respective databases for recognition (discussed later).
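The framing and LPC analysis described above can be sketched as follows. The Levinson-Durbin recursion is the standard autocorrelation-method formulation [4]; the function names and the small numerical guard are ours, not the paper's.

```python
# Sketch: framing, Hamming windowing, and 10th-order LPC analysis of the
# concatenated DWT coefficients, following the procedure of Section IV.A.
import numpy as np

def lpc(frame, order=10):
    """Autocorrelation-method LPC via the Levinson-Durbin recursion."""
    n = len(frame)
    # Autocorrelation lags r[0..order].
    r = np.correlate(frame, frame, mode="full")[n - 1 : n + order]
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0] + 1e-12  # tiny guard against an all-zero frame
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1 : 0 : -1])
        k = -acc / err  # reflection coefficient
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a[1:]  # ten LPC coefficients per frame

def dwt_lpc_features(dwt_coeffs, n_frames=9, frame_len=160, hop=80, order=10):
    """First 800 terms -> 9 overlapping Hamming-windowed frames
    -> 9 x 10 = 90 LPC coefficients per speech signal."""
    window = np.hamming(frame_len)
    feats = []
    for i in range(n_frames):
        frame = dwt_coeffs[i * hop : i * hop + frame_len] * window
        feats.append(lpc(frame, order))
    return np.concatenate(feats)  # 90-element row vector
```

With n_frames=19 the same function reproduces the 190-element vectors of Section IV.E (the first 1600 terms).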
B. Using the K-Means Algorithm to Form a VQ Codebook
We used the K-means algorithm [1,4] to cluster the LPC coefficients computed in the previous stage into ten clusters and thereby form the VQ codebook. The algorithm clustered the points of the 100-by-90 LPC coefficient matrix into ten clusters and returned an index and a cluster centroid location for each of the 100 entries. Since our recognition scheme relied on the K-means algorithm for recognition, and we did not use HMMs in the later stages, we proposed the following algorithm to form a VQ codebook. As the order of the entries in the 100-by-90 LPC coefficient matrix was known (the first ten entries corresponded to the word one, the next ten to two, and so on), we used this information to our advantage. Starting from the first ten indices returned by the K-means algorithm, we chose the index appearing the largest number of times in that group of ten as the index of the group, and designated the corresponding centroid as the centroid of the group. The same process was repeated for the next ten indices, and so on, until all ten groups (i.e., a total of 100 entries) were assigned. This simple algorithm would fail if a certain index were in the majority in more than one group, because the assignment of that index would then become ambiguous; however, such a situation was not observed, and unique indices were assigned to each of the ten groups (group1 through group10). We did devise a conflict-resolution scheme for the simplified case in which a given index appears in the majority in two groups; for the more complex case of more than two groups, the simple scheme we present later will not work. However, such an ambiguous case is unlikely, as we did not encounter any. Group1 corresponded to the first ten entries of the LPC coefficient matrix, which were the LPC coefficients derived from the DWT coefficients of the first ten original speech signals. So we used the index of group1 as the index for the Hindi word one (i.e., ek), and the corresponding centroid was identified as the centroid of the cluster in which the LPC coefficients derived from the word one lay. Similarly, we obtained the indices and centroids for each of the remaining nine Hindi words (two through ten). This information was then used to form a VQ table with the corresponding Hindi word as its index and the related centroid as its content.
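A sketch of this codebook-formation step is given below. It assumes scikit-learn's KMeans (the paper does not name an implementation) and ignores the conflict case, which the conflict-resolution scheme of the next subsection addresses.

```python
# Sketch: forming the 10-entry VQ codebook by K-means clustering followed
# by majority voting within each known group of ten utterances.
import numpy as np
from sklearn.cluster import KMeans

def build_codebook(features, words, utterances_per_word=10):
    """features: 100-by-90 matrix, rows ordered word by word (ten rows each).
    Returns {word: centroid}, using the majority cluster index of each group."""
    km = KMeans(n_clusters=len(words), n_init=10).fit(features)
    codebook = {}
    for g, word in enumerate(words):
        labels = km.labels_[g * utterances_per_word : (g + 1) * utterances_per_word]
        majority = np.bincount(labels).argmax()  # index appearing most often
        codebook[word] = km.cluster_centers_[majority]
    return codebook
```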
C. Conflict Resolution Scheme to Form a VQ Codebook
Consider the case in which the same index appears in the majority in more than one group. As mentioned previously, it then becomes difficult to assign unique indices to the different groups. To overcome this, we propose the following simple approach (a code sketch of this logic follows Table II below).

Case 1: The same index appears in the majority in two groups, but its distribution is unequal; that is, the number of times the majority index appears differs between the two groups. In this case, the ambiguity is resolved by assigning the index to the group in which it appears the greater number of times.

Case 2: The same index appears in the majority in two groups, and its distribution in both is equal. In this case, the index is assigned to one of the groups arbitrarily. Once it is assigned, the other group is searched for the index appearing the second largest number of times, and that index is assigned as the index of the group.

We were also tempted to form the VQ codebook from the DWT coefficients themselves, rather than from the LPC coefficients derived from them. To this end, we ran our codebook-formation algorithm directly on the DWT coefficients of the speech signals. The algorithm failed completely: the DWT coefficients of the 100 samples did not group into ten clusters with ten different majority indices. Instead, a few indices were in the majority in all ten groups, making the assignment of a particular index to a group virtually impossible. This ruled out using the DWT coefficients directly, since our approach relies on obtaining a ten-entry VQ codebook. Table II shows the number of distinct indices that were in the majority in the ten groups under our rule for assigning an index to a group. To sum up, we needed ten distinct indices to recognize ten different words, while this approach assigned just a single index to all ten groups; so it was rejected, ruling out any possibility of performing recognition without finding the LPC coefficients of the DWT coefficients.

TABLE II. NUMBER OF DISTINCT INDICES ASSIGNED TO THE 10 GROUPS WHEN CLUSTERING THE DWT COEFFICIENTS DIRECTLY

DWT                          db8 Lev3   db8 Lev5   db10 Lev5   coif5 Lev5   dmey Lev5
Distinct indices assigned        1          1          1           1            1
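As promised above, the following sketch illustrates the two conflict-resolution cases. The data layout (per-group occurrence counts of each cluster index) is our own illustrative choice, and giving the losing group its runner-up index in Case 1 is our inference; the paper states the rule only in prose.

```python
# Sketch of the conflict-resolution rule of Section IV.C, for the case in
# which the same cluster index is in the majority in two groups.
import numpy as np

def resolve_conflict(counts_g1, counts_g2, index):
    """counts_g1, counts_g2: occurrence counts of every cluster index in the
    two conflicting groups; index: the index in the majority in both."""
    if counts_g1[index] != counts_g2[index]:
        # Case 1: unequal distribution -- the group where the index appears
        # more often keeps it.
        winner = 1 if counts_g1[index] > counts_g2[index] else 2
    else:
        # Case 2: equal distribution -- assign arbitrarily (here: group 1).
        winner = 1
    loser_counts = np.array(counts_g2 if winner == 1 else counts_g1, dtype=float)
    loser_counts[index] = -1                     # exclude the contested index
    return winner, int(np.argmax(loser_counts))  # second-most-frequent index
```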
D. Testing Isolated Words Using the Centroids and Indices of the VQ Codebook
To test a given Hindi word, we first found its DWT coefficients and then the LPC coefficients of those DWT coefficients. The LPC coefficients were matched against each entry in the VQ table, and a decision was made in favor of the index whose content (i.e., centroid) gave the minimum squared Euclidean distance with respect to the word under test. We tested 10 different samples of each Hindi word, for a total of 100 samples per DWT type. Fig. 2 shows the overall recognition scheme, taking the db8 Lev3 (Daubechies8, 3-level) decomposition as an example. Recognition was carried out in the same way for each of the four remaining DWT decompositions, and the performance in each case was noted.

Figure 2. Recognition using db8 Lev3 decomposition
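A sketch of this decision rule, reusing the feature-extraction helpers sketched earlier (names hypothetical), is:

```python
# Sketch: nearest-centroid recognition (Section IV.D). The test word's
# 90-element LPC feature vector is compared with every codebook centroid,
# and the word with the minimum squared Euclidean distance wins.
import numpy as np

def recognize(test_features, codebook):
    distances = {word: np.sum((test_features - centroid) ** 2)
                 for word, centroid in codebook.items()}
    return min(distances, key=distances.get)

# Usage (helper names from the earlier sketches):
# word = recognize(dwt_lpc_features(dwt_decompose(trim_silence(speech))),
#                  codebook)
```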
E. Effect of the Number of Terms of the DWT Coefficients Used in the LPC Calculation on Recognition
We were also interested in observing whether varying the number of DWT coefficient terms used in the LPC calculation had any effect on the overall recognition. For this, we followed the same procedure, except that instead of finding the 90 LPC coefficients from the first 800 samples of each set of DWT coefficients, we used the first 1600 terms. In this case, each speech signal yielded a 190-element row vector (19 frames) after the LPC calculation stage; for the 100 speech signals, we thus obtained a 100-by-190 matrix. Everything else remained as already discussed for the 800-sample case (as in IV.B).
V. RESULTS
TABLE III. SUCCESS PERCENTAGE (%) OF EACH WORD FOR THE FIVE DWT-BASED APPROACHES (FIRST 800 TERMS)

Hindi Word      db8 Lev3   db8 Lev5   db10 Lev5   coif5 Lev5   dmey Lev5
One (ek)           90         70         50           70          70
Two (do)           90        100        100           70         100
Three (teen)      100         80         90           90          60
Four (char)        50        100         80          100          90
Five (paanch)      80        100        100          100          90
Six (chhae)        30         60         70           70          70
Seven (saat)       70         90         90          100          80
Eight (aath)       50         60         30           20          60
Nine (nau)         80         90         90           90          50
Ten (dus)          90        100         90           90         100
Table III shows the success percentage of each of the five DWT-based approaches when the first 800 samples were used to form the 90-element LPC coefficient vector for each word. Table IV shows the results when the first 1600 samples were taken, forming a 190-element row vector of LPC coefficients, for the particular cases of the db8 Lev3 (Daubechies8, 3-level) and db8 Lev5 based decompositions. We emphasize that the other DWTs are also expected to give different results when the number of samples is varied; we chose these two only to examine the nature of the effect.

Fig. 3 shows the average success percentage of each of the five DWT-based recognition schemes, and Fig. 4 shows the average success percentage of recognition of each of the ten words. Both correspond to the original case, in which the first 800 terms of the DWT coefficients were used to find the 10th-order LPC coefficients.

Figure 3. Success of different DWTs

Figure 4. Success of individual words
TABLE IV. SUCCESS PERCENTAGE (%) WITH THE FIRST 1600 TERMS OF THE DWT COEFFICIENTS

Hindi Word      db8 Lev3 (1600 terms)   db8 Lev5 (1600 terms)
One (ek)                 90                      70
Two (do)                100                     100
Three (teen)            100                      90
Four (char)             100                     100
Five (paanch)           100                     100
Six (chhae)              90                      90
Seven (saat)             90                      90
Eight (aath)             80                      90
Nine (nau)              100                      80
Ten (dus)               100                     100
To appreciate the effect of the increased number of samples on word recognition, Fig. 5 shows the percentage increase in performance for each of the two cases in which double the original number of samples (i.e., 1600 samples) was used.
Figure 5. Percentage increase in performance
VI. CONCLUSION

As the results show, when working with the first 800 samples of the DWT coefficients to compute the LPC coefficients, the Daubechies8 5-level decomposition gave the highest success percentage in the recognition of Hindi speech, and it clearly emerges as the candidate of choice for our DWT-based speech recognition scheme. The Daubechies10 5-level decomposition and the discrete Meyer wavelets gave comparable performance, while the Daubechies8 3-level decomposition gave the poorest performance. The word eight (aath) had the poorest success percentage in this approach. Doubling the number of samples had a very positive impact on performance; in fact, recognition with db8 Lev3 improved by an overwhelming 23 percent. It should be kept in mind, however, that the price paid for this improvement was an overall increase in computational complexity. Also important is the fact that recognition of the word eight improved greatly, suggesting that the DWT coefficients in the first 800 samples were relatively inefficient for recognizing this word. This paper aimed at exploring the DWT as a tool for the recognition of Hindi speech, with the main focus on identifying the type of DWT most likely to give superior performance over other DWT types in speech recognition. The recognition approach in this paper, after the feature extraction stage, is clearly not very robust, as we have kept our approach limited to identifying
the best possible wavelet family for DWT-based Hindi speech recognition. It should also be observed that this approach can be used for the recognition of speech in other languages. Making our scheme speaker-independent would require a large number of utterance samples from speakers of different age groups, genders, accents, etc. The data (indices and centroids) obtained by applying the K-means algorithm to the LPC coefficients could then be used to train HMMs [2,4], and an HMM-based speech recognition scheme could be employed [2]. Such a scheme is expected to give good performance for speaker-independent speech recognition.

REFERENCES
[1] J. MacQueen, "Some methods for classification and analysis of multivariate observations," Proc. Fifth Berkeley Symposium on Mathematical Statistics and Probability, June 21-July 18, 1965 and December 27, 1965-January 7, 1966, pp. 281-297.
[2] B. H. Juang, L. R. Rabiner, S. E. Levinson, and M. M. Sondhi, "Recent developments in the application of hidden Markov models to speaker-independent isolated word recognition," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, March 1985, pp. 9-12.
[3] Aditya Sharma, M. C. Shrotiya, Omar Farooq, and Z. A. Abbas, "Hybrid wavelet based LPC features for Hindi speech recognition," International Journal of Information and Communication Technology, 2008, pp. 373-381.
[4] Lawrence R. Rabiner and B. H. Juang, Fundamentals of Speech Recognition, 2nd Indian reprint, Pearson Education, Delhi, 1993, pp. 133-167, 357-422.
[5] Gilbert Strang and Truong Nguyen, Wavelets and Filter Banks, Wellesley-Cambridge Press, MA, 1997, pp. 174-220, 365-382.
[6] Andrew K. Chan and Jaideva C. Goswami, Fundamentals of Wavelets, Wiley-India edition, John Wiley & Sons, New Delhi, 1999, pp. 89-97.
[7] Nikhil Rao, "Speech Compression Using Wavelets," ELEC 4801 thesis project, School of Information Technology and Electrical Engineering, The University of Queensland, October 2001.
[8] Brian Gamulkiewicz and Michael Weeks, "Wavelets based speech recognition," Proc. IEEE International Symposium on Micro-NanoMechatronics and Human Science, Dec. 2003, Vol. 2, pp. 678-681, doi: 10.1109/MWSCAS.2003.1562377.
[9] Ingrid Daubechies, Ten Lectures on Wavelets, SIAM, 1992, pp. 115-132, 194-292, 258-259.
[10] Martin Vetterli and Jelena Kovacevic, Wavelets and Subband Coding, Prentice Hall, 1995, pp. 233-238.