Authors
Md Ekhlasur Rahaman, SM Shamsul Alam, Himadri Shekhar Mondal, Ahmed Saif Muntaseer, Rajib Mandal, M Raihan
Publication date
2019/7/6
Conference
2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT)
Pages
1-4
Publisher
IEEE
Description
Speech signal processing has become an important mode of interaction with computers. In this paper, the Mel Frequency Cepstral Coefficient (MFCC) technique is used to process speech samples for recognition. MFCCs describe the short-term power spectrum of a speech signal, based on a linear cosine transform (we used FFT and DCT in our work) of a log power spectrum on a nonlinear Mel scale of frequency. We used the Dynamic Time Warping algorithm and the Cross Correlation algorithm to match feature vectors. We recorded five reference words, “One” through “Five”. The feature vectors generated from these reference signals are stored in a database. A test sample of any number from “One” to “Five” is then recorded, and the algorithms are applied to match it against the recorded reference voices. In our paper, the recognition techniques show different …
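The matching step described in the abstract, comparing a test utterance against stored reference feature vectors with Dynamic Time Warping, can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names `dtw_distance` and `recognize`, the Euclidean frame distance, and the toy one-dimensional "features" are all assumptions made for the example. In practice each frame would be an MFCC vector.

```python
import math


def dtw_distance(a, b):
    """Accumulated DTW cost between two sequences of feature vectors.

    Assumption: frames are compared with Euclidean distance and the
    warping path may step horizontally, vertically, or diagonally.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = best accumulated cost aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j],      # frame of a repeated against b
                cost[i][j - 1],      # frame of b repeated against a
                cost[i - 1][j - 1],  # both sequences advance
            )
    return cost[n][m]


def recognize(test, references):
    """Return the label of the reference with the smallest DTW cost."""
    return min(references, key=lambda label: dtw_distance(test, references[label]))


# Hypothetical toy "database": one short feature sequence per word.
references = {
    "One": [(0.0,), (1.0,), (2.0,)],
    "Two": [(5.0,), (5.0,), (5.0,)],
}
# A slower rendition of "One": the first frame is held for longer.
test_sample = [(0.0,), (0.0,), (1.0,), (2.0,)]
best_label = recognize(test_sample, references)
```

Because DTW allows a frame to be repeated along the warping path, the stretched test sample aligns with the "One" reference at zero cost even though the two sequences have different lengths.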
Total citations
Citations per year (chart): 2020, 2021, 2022, 2023
Scholar articles
ME Rahaman, SMS Alam, HS Mondal, AS Muntaseer… - 2019 10th international conference on computing …, 2019