查看文章

arxiv.org 中的 [PDF]

Gender identification using mfcc for telephone applications-a comparative study

作者

Jamil Ahmad, Mustansar Fiaz, Soon-il Kwon, Maleerat Sodanil, Bay Vo, Sung Wook Baik

发表日期

2016/1/7

期刊

arXiv preprint arXiv:1601.01577

简介

Gender recognition is an essential component of automatic speech recognition and interactive voice response systems. Determining gender of the speaker reduces the computational burden of such systems for any further processing. Typical methods for gender recognition from speech largely depend on features extraction and classification processes. The purpose of this study is to evaluate the performance of various state-of-the-art classification methods along with tuning their parameters for helping selection of the optimal classification methods for gender recognition tasks. Five classification schemes including k-nearest neighbor, na\"ive Bayes, multilayer perceptron, random forest, and support vector machine are comprehensively evaluated for determination of gender from telephonic speech using the Mel-frequency cepstral coefficients. Different experiments were performed to determine the effects of training data sizes, length of the speech streams, and parameter tuning on classification performance. Results suggest that SVM is the best classifier among all the five schemes for gender recognition.

引用总数

被引用次数：41

201620172018201920202021202220232 4 2 8 5 6 8 5

学术搜索中的文章

Gender identification using mfcc for telephone applications-a comparative study

J Ahmad, M Fiaz, S Kwon, M Sodanil, B Vo, SW Baik - arXiv preprint arXiv:1601.01577, 2016

被引用次数：41 相关文章所有 4 个版本