Tuesday 23rd of April 2024
 

Punjabi Automatic Speech Recognition Using HTK


Mohit Dua, R.K. Aggarwal, Virender Kadyan and Shelza Dua

This paper aims to discuss the implementation of an isolated word Automatic Speech Recognition system (ASR) for an Indian regional language Punjabi. The HTK toolkit based on Hidden Markov Model (HMM), a statistical approach, is used to develop the system. Initially the system is trained for 115 distinct Punjabi words by collecting data from eight speakers and then is tested by using samples from six speakers in real time environments. To make the system more interactive and fast a GUI has been developed using JAVA platform for implementing the testing module. The paper also describes the role of each HTK tool, used in various phases of system development, by presenting a detailed architecture of an ASR system developed using HTK library modules and tools. The experimental results show that the overall system performance is 95.63% and 94.08%.

Keywords: Automatic Speech Recognition system, Mel Frequency Cepstral Coefficient (MFCC), HMM, HTK, P-ASR

Download Full-Text


ABOUT THE AUTHORS

Mohit Dua
Mohit Dua did his B.Tech. degree in Computer Science and Engineering from Kurukshetra University, Kurukshetra, INDIA in 2004 and M.Tech degree in Computer Engineering from National Institute of Technology, Kurukshetra, INDIA in 2012. He is presently working as Assistant Professor in Department of Computer Engineering at NIT Kurukshetra, INDIA with more than 7 years of academic experience. He is a life member of Computer Society of India (CSI) and Indian Society for Technical Education (ISTE). His research interests include Speech processing, Theory of Formal languages and Statistical modeling.

R.K. Aggarwal
R.K. Aggarwal received his M.Tech. degree in 2006 and is pursuing Ph.D. from National Institute of Technology, Kurukshetra, INDIA. Currently, He is working as an Associate Professor in the Department of Computer Engineering of the same Institute. He has published more than 30 research papers in various International/National journals and conferences and also worked as an active reviewer in many of them. He has delivered several invited talks, keynote addresses and also chaired the sessions in reputed conferences. His research interests include speech processing, soft computing, statistical modeling and science and spirituality. He is a life member of Computer Society of India (CSI) and Indian Society for Technical Education (ISTE). He has been involved in various academic, administrative and social affairs of many organizations having more than 20 years of experience in this field.

Virender Kadyan
DIET Karnal

Shelza Dua
Department of Electronics & Communication Engg.


IJCSI Published Papers Indexed By:

 

 

 

 
+++
About IJCSI

IJCSI is a refereed open access international journal for scientific papers dealing in all areas of computer science research...

Learn more »
Join Us
FAQs

Read the most frequently asked questions about IJCSI.

Frequently Asked Questions (FAQs) »
Get in touch

Phone: +230 911 5482
Email: info@ijcsi.org

More contact details »