Ensemble Feature Extraction Modules for Improved Hindi Speech Recognition System
Speech is the most natural way of communication between human beings. The field of speech recognition generates intrigues of man – machine conversation and due to its versatile applications; automatic speech recognition systems have been designed. In this paper we are presenting a novel approach for Hindi speech recognition by ensemble feature extraction modules of ASR systems and their outputs have been combined using voting technique ROVER. Experimental results have been shown that proposed system will produce better result than traditional ASR systems.
Keywords: ASR, MFCC, PLP, LPCC, ROVER.
Download Full-Text
ABOUT THE AUTHORS
Malay Kumar
Malay Kumar was received his B. Tech. degree from Kanpur University, Kanpur, India in 2010 and pursuing his M. Tech. degree from prestigious National Institute of Technology, Kurukshetra, India. He is working in the area of speech processing from last one and half year and also opt this area as his dissertation work, his research work involves around working with different open source recognition tools, implementation of various modeling units’ word, phoneme, triphone and syllable models and working with system integration techniques like Rover for Hindi language.
Rajesh Kumar Aggarwal
R. K. Aggarwal was received his M. Tech. degree in 2006 and pursuing PhD from National Institute of Technology, Kurukshetra, INDIA. Currently he is also working as an Associate Professor in the Department of Computer Engineering of the same Institute. He has published more than 24 research papers in various International/National journals and conferences and also worked as an active reviewer in many of them. He has delivered several invited talks, keynote addresses and also chaired the sessions in reputed conferences. His research interests include speech processing, soft computing, statistical modeling and science and spirituality. He is a life member of Computer Society of India (CSI) and Indian Society for Technical Education (ISTE). He has been involved in various academic, administrative and social affairs of many organizations having more than 20 years of experience in this field.
Gaurav Leekha
Gaurav Leekha has received his M.Tech degree in 2010 from Kurukshetra University, INDIA. Currently he is working as an Asst. Professor in Computer Science and Engineering department of M.M. University, Solan, Himachal Pardesh, INDIA. He is working in the area of speech recognition for Indian languages from last 3 years and published several papers in National/International conferences. He has also attended many workshops on speech recognition in various reputed institutes.
Yogesh Kumar
Yogesh Kumar is M.Tech. student in National Institute of Technology, Kurukshetra, India. He have great interest in the area of speech processing for Indian languages.
Malay Kumar
Malay Kumar was received his B. Tech. degree from Kanpur University, Kanpur, India in 2010 and pursuing his M. Tech. degree from prestigious National Institute of Technology, Kurukshetra, India. He is working in the area of speech processing from last one and half year and also opt this area as his dissertation work, his research work involves around working with different open source recognition tools, implementation of various modeling units’ word, phoneme, triphone and syllable models and working with system integration techniques like Rover for Hindi language.
Rajesh Kumar Aggarwal
R. K. Aggarwal was received his M. Tech. degree in 2006 and pursuing PhD from National Institute of Technology, Kurukshetra, INDIA. Currently he is also working as an Associate Professor in the Department of Computer Engineering of the same Institute. He has published more than 24 research papers in various International/National journals and conferences and also worked as an active reviewer in many of them. He has delivered several invited talks, keynote addresses and also chaired the sessions in reputed conferences. His research interests include speech processing, soft computing, statistical modeling and science and spirituality. He is a life member of Computer Society of India (CSI) and Indian Society for Technical Education (ISTE). He has been involved in various academic, administrative and social affairs of many organizations having more than 20 years of experience in this field.
Gaurav Leekha
Gaurav Leekha has received his M.Tech degree in 2010 from Kurukshetra University, INDIA. Currently he is working as an Asst. Professor in Computer Science and Engineering department of M.M. University, Solan, Himachal Pardesh, INDIA. He is working in the area of speech recognition for Indian languages from last 3 years and published several papers in National/International conferences. He has also attended many workshops on speech recognition in various reputed institutes.
Yogesh Kumar
Yogesh Kumar is M.Tech. student in National Institute of Technology, Kurukshetra, India. He have great interest in the area of speech processing for Indian languages.