Enhanced Indonesian ethnic speaker recognition using data augmentation deep neural network

K Nugroho, E Noersasongko - Journal of King Saud University-Computer …, 2022 - Elsevier
Speaker Recognition is a challenging topic in Speech Processing research area. The
various models proposed have succeeded in achieving a fairly high level of accuracy in this …

Speaker identification based on normalized pitch frequency and Mel Frequency Cepstral Coefficients

MA Nasr, M Abd-Elnaby, AS El-Fishawy… - International Journal of …, 2018 - Springer
This paper presents an efficient approach for automatic speaker identification based on
cepstral features and the Normalized Pitch Frequency (NPF). Most relevant speaker …

Convolutional Neural Network Based Real Time Arabic Speech Recognition to Arabic Braille for Hearing and Visually Impaired

S Bhatia, A Devi, RI Alsuwailem, A Mashat - Frontiers in Public Health, 2022 - frontiersin.org
Natural Language Processing (NLP) is a group of theoretically inspired computer structures
for analyzing and modeling clearly going on texts at one or extra degrees of linguistic …

Kurdish speaker identification based on one dimensional convolutional neural network

ZK Abdul - Computational Methods for Differential Equations, 2019 - cmde.tabrizu.ac.ir
Voice is one of the vital biometrics in human identification and/or verification area. In this
paper, two different models are proposed for speaker identification which are a 1D …

Data augmentation and deep neural networks for the classification of Pakistani racial speakers recognition

A Amjad, L Khan, HT Chang - PeerJ Computer Science, 2022 - peerj.com
Speech emotion recognition (SER) systems have evolved into an important method for
recognizing a person in several applications, including e-commerce, everyday interactions …

Advancing Voice biometrics for Dysarthria Speakers using Multitaper LFCC and Voice Conversion Data Augmentation

S Salim, W Ahmad - IEEE Transactions on Information …, 2024 - ieeexplore.ieee.org
Patients with dysarthria and physical impairments face challenges with traditional user
interfaces. An Automatic Speaker Verification (ASV) system can enhance accessibility by …

Multitaper-mel spectrograms for keyword spotting

DB de Souza, KJ Bakri… - IEEE Signal …, 2022 - ieeexplore.ieee.org
Keyword spotting (KWS) is one of the speech recognition tasks most sensitive to the quality
of the feature representation. However, the research on KWS has traditionally focused on …

Speaker recognition based on dynamic time warping and Gaussian mixture model

N Zhang, Y Yao - 2020 39th Chinese Control Conference (CCC …, 2020 - ieeexplore.ieee.org
At present, most of the speaker recognition models are based on the MFCC cepstrum
feature of the mixture Gaussian model, because MFCC represents the speaker's voice …

Frequency domain analysis of MFCC feature extraction in children's speech recognition system

R Hidayat - JURNAL INFOTEL, 2022 - ejournal.ittelkom-pwt.ac.id
The research on speech recognition systems currently focuses on the analysis of robust
speech recognition systems. When the speech signals are combined with noise, the …

Arabic digits speech recognition and speaker identification in noisy environment using a hybrid model of VQ and GMM

A Ouisaadane, S Safi, M Frikel - … Computing Electronics and …, 2020 - telkomnika.uad.ac.id
This paper presents an automatic speaker identification and speech recognition for Arabic
digits in noisy environment. In this work, the proposed system is able to identify the speaker …