Speech Recognition using MFCC and Neural Networks
- Divyesh S. Mistry
- Dr.A.V.Kulkarni
Speech Recognition ; Mel Frequency Cepstrum Coefficients (MFCC) ; Neural Networks ; Learning Vector Quantization
The most common mode of communication between humans is speech. As this is the most preferred way, humans would like to use speech to interact with machines also. That is why, automatic speech recognition has gained a lot of popularity. Many approaches for speech recognition exist like Dynamic Time Warping (DTW), Hidden Markov Model (HMM). This paper shows how Neural Network (NN) can be used for speech recognition and also investigates its performance in speech recognition. Learning Vector Quantization Neural Network has been applied. For the feature extraction of speech Mel Frequency Cepstrum Coefficients (MFCC) has been used which gives a set of feature vectors of speech waveform. Earlier research has shown MFCC to be more accurate and effective than other feature extraction techniques in the speech recognition. The work has been done on MATLAB and experimental results show that system is able to recognize words at sufficiently high accuracy.
Divyesh S. Mistry, Dr.A.V.Kulkarni. "Speech Recognition using MFCC and Neural Networks".INTERNATIONAL JOURNAL OF ENGINEERING DEVELOPMENT AND RESEARCH ISSN:2321-9939, Vol.2, Issue 2, pp.2122-2129, URL :https://rjwave.org/ijedr/papers/IJEDR1402134.pdf
Volume 2 Issue 2
Pages. 2122-2129