Open Journal Systems

Design and Development of Automatic Speech Recognition of Marathi Numerals

Yogesh K. Gedam

crossmark logo side by side horizontal

Abstract


Speech is the way of communication of human being in this world. It is need to develop a system which can interface with computer. For speech recognition feature extraction technique and the feature matching technique are used. For the feature extraction, we used MFCC and for feature matching and the classification we used the Support Vector Machine (SVM) technique. We used PCA for dimension reduction. Database collected is in the form of ‘.wav’ file. The data collected is of Marathi numbers starting from Shunya (One) to Nau (Nine). Data is divided into male and female speakers. It is recorded in noisy environment. Feature are extracted and then classified with SVM. This paper is describes the speech recognition of Marathi digits.


Full Text:

PDF

References


P. P. Shrishrimal, R. R. Deshmukh, V. B. Waghmare, “Development of Isolated Words Speech Database of Marathi Words For Agriculture Purpose”, Asian Journal of Computer Science and Information Technology (AJCSIT), Vol. 2, No. 7, pp. 217-218, ISSN: 2249-5126

Yogesh K. Gedam, Sujata S. Magare, Amrapali C. Dabhade, Ratnadeep R. Deshmukh, “Development of Automatic Speech Recognition of Marathi Numerals - A Review”, International Journal of Engineering and Innovative Technology (IJEIT), Volume 3, Issue 9, March 2014

Priyanka Mishra, Suyash Agrawal, “Recognition Of Speaker Using Mel Frequency Cepstral Coefficient & Vector Quantization”, International Journal Of Science, Engineering And Technology Research (IJSETR),Volume 1, Issue 6, December 2012

Omprakash Prabhakar, Navneet Kumar Sahu, “A Survey On: Voice Command Recognition Technique”, International Journal of Advanced Research in Computer Science And Software Engineering, Volume 3, Issue 5, May 2013

PRAAT Software , Cited On 03/08/2013 Source: http://Www.Fon.Hum.Uva.Nl/Praat,

Yogesh K. Gedam, Paras V. Kothare , Ratnadeep R. Deshmukh, “Design and Development of Speech Database of Marathi Numerals”, International Journal of Advanced Research in Computer Science and Software Engineering 3 (4),March - 2013

Bhupinder Singh, Rupinder Kaur, Nidhi Devgun, Ramandeep Kaur, “The Process Of Feature Extraction In Automatic Speech Recognition System For Computer Machine Interaction With Humans: A Review”, International Journal Of Advanced Research In Computer Science And Software Engineering , ISSN: 2277 128X , Volume 2, Issue 2, February 2012

Stolcke A., Shriberg E., Ferrer L., Kajarekar S., Sonmez K., Tur G. (2007), “Speech Recognition As Feature Extraction For Speaker Recognition”, SAFE, Washington D.C., USA Pp 11-13. [9] Zheng Fang, Zhang Guoliang, Song Zhanjiang, “Comparison Of Different Implementations Of MFCC”, Journal Of Computer Sci. Technology., 16(6):582–589, 2001

H. Hermansky, “Perceptual Linear Predictive (PLP) Analysis Of Speech”, J. Acoustic. Soc. America, 87, 1990

Maruti Limkar, Rama Rao, Vidya Sagvekar, “Isolated Digit Recognition Using MFCC AND DTW”, International Journal On Advanced Electrical And Electronics Engineering, (IJAEEE), ISSN (Print): 2278-8948, Volume-1, Issue-1, 2012

Siddheshwar S. Gangonda, Dr. Prachi Mukherji, “Speech Processing For Marathi Numeral Recognition Using MFCC And DTW Features”, International Journal Of Engineering Research And Applications (IJERA) ISSN: 2248-9622

Shivanker Dev Dhingra , Geeta Nijhawan , Poonam Pandit, “Isolated Speech Recognition Using MFCC And DTW”, International Journal Of Advanced Research In Electrical, Electronics And Instrumentation Engineering, Vol. 2, Issue 8, August 2013

“MFCC and Its Applications In Speaker Recognition” Vibha Tiwari, Deptt. Of Electronics Engg., Gyan Ganga Institute Of Technology And Management, Bhopal, (MP) INDIA (Received 5 Nov., 2009, Accepted 10 Feb., 2010).

V. Vapnik, “The Nature of Statistical Learning Theory”, Springer Verlag, 1995

Yongming Qu, George Ostrouchov, Nagiza Samatova, Al Geist, "Principal Component Analysis for Dimension Reduction in Massive Distributed Data Sets", sited on 12 Feb 2014

Source: www.ornl.gov/~webworks/cppr/y2001/pres/116016.pdf

Sven E. Kr¨Uger, Martin Schaff ¨Oner, Marcel Katz, Edin Andelic, Andreas Wendemuth, Speech Recognition With Support Vector Machines In A Hybrid System

Harsha K M, Facila Chinchu O, Cini Kurian, Kannan Balakrishnan, “A Comparative Study Of HMM and SVM in Malayalam Digit Recognition”

A. Ganapathiraju, J. Hamaker, and J. Picone, “Hybrid SVM/HMM Architectures for Speech Recognition”, in Proc. ICSLP, Beijing, 2000.

C. Chandra Sekhar, W. F. Lee, K. Takeda And F. Itakura, “Acoustic Modeling Of Sub Word Units Using Support Vector Machines”, Workshop On Spoken Language Processing, TIFR, Mumbai, India

Mamta Maloo, K.V. Kale, “Support Vector Machine Based Gujarati Numeral Recognition”, International Journal on Computer Science and Engineering (IJCSE)




DOI: http://dx.doi.org/10.15520/ajcsit.v4i10.11

Refbacks

  • There are currently no refbacks.