A Comparative Study of Multilayer Feed-forward Neural Network and Radial Basis Function Neural Network Models for Speech Recognition

Priyanka Tyagi; Dr. Jayant Shekhar

doi:10.32628/IJSRSET162429

Authors

Priyanka Tyagi M. Tech Student, Department of Computer Science, Subharti University, Meerut, Uttar Pradesh, India
Dr. Jayant Shekhar Professor (Director, SITE), Department of Computer Science, Subharti University, Meerut, Uttar Pradesh, India

Keywords:

Automatic Speech Recognition, Digital Signal Processing, Sampling, Quantization, Feed-Forward Neural Network, Radial Basis Network.

Abstract

The most common way of human-to-human communication is speech. As speech provides the easiest and most natural way of interaction, it becomes the need of human-to-machine communication as well. Automatic speech recognition (ASR) is the technology to enable machines to understand, process and recognize speech. Due to its applicability in various application domains, ASR becomes one of the most fascinating areas of pattern recognition. In this paper, we are analyzing the performances of multilayer feed-forward neural network and Radial basis function neural network models for the recognition of speech signals. The work is conducted in four stages: speech signal acquisition & pre-processing, feature pattern vector creation, implementation & training of selected neural network models and comparative analysis of the performances of selected neural networks.

Proposed work is conducted with 10 speech samples of English alphabets .Digital signal processing operations are applied on signals to convert them and make them appropriate for further processing. Five feature pattern vectors are created to be used for training and testing of the network models. Performance of selected neutral network models is measured and analyzed for the created feature pattern vectors. Results indicate that feed-forward neural network model performs better than the Radial basis function neural network for all the test pattern vectors.

References

WouterGevaert, GeorgiTsenov and ValeriMladenov, “Neural Networks used for SpeechRecognition”, Journal of Automatic Control, pg. 1-7, Vol. 20, 2010.
Gerasimos Potamianos, ChalapathyNeti, Guillaume Gravier, AshutoshGarg and Andrew W. Senior, “Recent Advances in the Automatic Recogntion of Audio-Visual speech’, Proceedings of the IEEE, pp. 1-18, Vol. 91, No. 9, 2003.
Nidhi Srivastava, “Speech Recognition Using Artificial Neural Network “, International Journal of Engineering Science and Innovative Technology (IJESIT), pg. 406-412, Vol. 3, Issue 3, 2014.
A.Anusuya and S. K. Katti, “Speech Recognition by Machine: A Review”, Int. Journal of Computer Science and Information Security,pg. 181-205, Vol. 6, No. 3, 2009.
Santosh K. Gaikwad, Bharti W. Gawali and PravinYannawar, “A Review on Speech
Recognition Technique”, International Journal of Computer Applications, pg. 16-24, Vol. 10, No. 3, 2010.
Landauer, C. Kamm, and S. Singhal, “Learning a Minimally Structured Backpropagation Network to Recognize Speech,” In Proceedings of Ninth Annual Conference of Cogn. Sc.Soc., pp. 531–536, 1987.
Sir Charles Wheatstone, The Scientific Papers of Sir Charles Wheatstone, London: Taylorand Francis, 1879.
H.Davis, R.Biddulph, and S.Balashek, “AutomaticRecognition of spoken Digits”, Acoust.Soc.Am.,24(6):637-642,1952.
Bishnu S. Atal and Lawrence R. Rabiner, “A Pattern Recognition Approach to Voiced-Unvoiced-Silence Classification with Applications to Speech Recognition”, IEEE Transactions on Acoustics, Speech and Signal Processing, pp 201-212, Vol. ASSP-24, No. 3, 1976.
R.Rabiner, S.E.Levinson, A.E.Rosenberg, andJ.G.Wilpon, “Speaker Independent Recognition ofIsolated Words Using Clustering Techniques”, IEEETrans. Acoustics, Speech, Signal Proc., ASSP-27:336-349, 1979.
al.,“Energy Conditioned SpectralEstimation for Recognition of Noisy Speed” , IEEETransactions on Audio, Speech and Language processing,Vol.1,No.1, Jan 1993.
Xiaodong Cui et.al., “A Study of Variable-ParameterGaussian Mixture Hidden Markov Modeling for NoisySpeech Recognition”, IEEE Transactions On Audio,Speech, And Language Processing, Vol. 15, No. 4, 2007.
Syed Ayaz Ali Shah, Azzam ul Asar and S.F. Shukat, “ Neural Network Solution for Secure Interactive Voice Response”, World Applied Sciences Journal, pg. 1264-1269, Vol. 6, No. 9, 2009.
Shih F.Y., “Image Processing and Pattern Recognition - Fundamentals and Techniques”, Wiley Pub. 2010.
Sandrine Revaz, “Statistical Models in Automatic Speech Recognition”, Master’s Thesis, Department of Mathematics, University of Fribourg Idiap, 2015.
John G. Proakis and Dimitris G. MAnolakis, “Digital Signal Processing – Principles, Algorithms and Applications”, Prentice Hall Publication, Third Edition, 2005.
DimitrisManolakis and Vinay Ingle, “Applied Digital Signal Processing – Theory and Practice”, Cambridge University Press, First Edition, 2011.
Yagnanarayana B., “Artificial Intelligence”, Prentice Hall Pub., Ninth Edition, 2004.
Jesus O. D. and Hagan M. T., “Backpropagation Algorithms for a Broad Class of Dynamic Networks”, IEEE Transactions on Neural Networks, pp. 14-27, Vol. 18, no. 1, 2007.
Powell M.J.D., “Radial Basis Functions for Multivariate Interpolation: A Review”, In Algorithms for the Approximation of Functions and Data, J.C. Mason and M.G. Cox, eds., Clarendon Press, pp. 143-167, 1987.
Chen S., Cowan C.F.N. and Grant P. M. “Orthogonal Least Square Learning Algorithm for Radial Basis Function Networks”, IEEE Transactions on Neural Networks, pg. 302-309, Vol.2, No. 2, 1991.

A Comparative Study of Multilayer Feed-forward Neural Network and Radial Basis Function Neural Network Models for Speech Recognition

Authors

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite