查看文章

Autoencoder-Based Speech Features for Manipuri Dialect Identification

作者

Thangjam Clarinda Devi, Kabita Thaoroijam

发表日期

2022/6/26

图书

Advanced Machine Intelligence and Signal Processing

页码范围

697-705

出版商

Springer Nature Singapore

简介

A dialect identification system for Manipuri using deep-stacked autoencoder-based speech features has been presented in this paper. For this work, Manipuri speech corpora have been used available at Linguistic Data Consortium for Indian Languages (LDC-IL). Mel-frequency cepstrum coefficients (MFCC) were extracted from input speech signals, and then autoencoder (AE) is used for dimensionality reduction. Support vector machines (SVM) and deep neural network (DNN) are used for classification of three major Manipuri dialects, i.e. Sekmai, Kakching and Imphal. Results have shown an average dialect identification performance in terms of accuracy of approximately 70% and 72.77% for SVM and DNN, respectively. The performance of both the classifiers has been compared with and without using autoencoder. The role of stacked autoencoder-based features is found to be significant in Manipuri dialect …

学术搜索中的文章

Autoencoder-Based Speech Features for Manipuri Dialect Identification

TC Devi, K Thaoroijam - Advanced Machine Intelligence and Signal Processing, 2022