Speaker identification and verification of noisy speech using multitaper MFCC and Gaussian...

[HTML][HTML] Enhanced Indonesian ethnic speaker recognition using data augmentation deep neural network

K Nugroho, E Noersasongko - Journal of King Saud University-Computer …, 2022 - Elsevier

Speaker Recognition is a challenging topic in Speech Processing research area. The
various models proposed have succeeded in achieving a fairly high level of accuracy in this …

被引用次数：25 相关文章所有 2 个版本

[PDF] researchgate.net

Speaker identification based on normalized pitch frequency and Mel Frequency Cepstral Coefficients

MA Nasr, M Abd-Elnaby, AS El-Fishawy… - International Journal of …, 2018 - Springer

This paper presents an efficient approach for automatic speaker identification based on
cepstral features and the Normalized Pitch Frequency (NPF). Most relevant speaker …

被引用次数：37 相关文章所有 3 个版本

[PDF] frontiersin.org

Convolutional Neural Network Based Real Time Arabic Speech Recognition to Arabic Braille for Hearing and Visually Impaired

S Bhatia, A Devi, RI Alsuwailem, A Mashat - Frontiers in Public Health, 2022 - frontiersin.org

Natural Language Processing (NLP) is a group of theoretically inspired computer structures
for analyzing and modeling clearly going on texts at one or extra degrees of linguistic …

被引用次数：14 相关文章所有 6 个版本

[PDF] tabrizu.ac.ir

Kurdish speaker identification based on one dimensional convolutional neural network

ZK Abdul - Computational Methods for Differential Equations, 2019 - cmde.tabrizu.ac.ir

Voice is one of the vital biometrics in human identification and/or verification area. In this
paper, two different models are proposed for speaker identification which are a 1D …

被引用次数：20 相关文章所有 7 个版本

[PDF] peerj.com

Data augmentation and deep neural networks for the classification of Pakistani racial speakers recognition

A Amjad, L Khan, HT Chang - PeerJ Computer Science, 2022 - peerj.com

Speech emotion recognition (SER) systems have evolved into an important method for
recognizing a person in several applications, including e-commerce, everyday interactions …

被引用次数：5 相关文章所有 11 个版本

Advancing Voice biometrics for Dysarthria Speakers using Multitaper LFCC and Voice Conversion Data Augmentation

S Salim, W Ahmad - IEEE Transactions on Information …, 2024 - ieeexplore.ieee.org

Patients with dysarthria and physical impairments face challenges with traditional user
interfaces. An Automatic Speaker Verification (ASV) system can enhance accessibility by …

Multitaper-mel spectrograms for keyword spotting

DB de Souza, KJ Bakri… - IEEE Signal …, 2022 - ieeexplore.ieee.org

Keyword spotting (KWS) is one of the speech recognition tasks most sensitive to the quality
of the feature representation. However, the research on KWS has traditionally focused on …

被引用次数：2 相关文章所有 2 个版本

Speaker recognition based on dynamic time warping and Gaussian mixture model

N Zhang, Y Yao - 2020 39th Chinese Control Conference (CCC …, 2020 - ieeexplore.ieee.org

At present, most of the speaker recognition models are based on the MFCC cepstrum
feature of the mixture Gaussian model, because MFCC represents the speaker's voice …

被引用次数：6 相关文章所有 2 个版本

[PDF] ittelkom-pwt.ac.id

Frequency domain analysis of MFCC feature extraction in children's speech recognition system

R Hidayat - JURNAL INFOTEL, 2022 - ejournal.ittelkom-pwt.ac.id

The research on speech recognition systems currently focuses on the analysis of robust
speech recognition systems. When the speech signals are combined with noise, the …

被引用次数：5 相关文章所有 3 个版本

[PDF] uad.ac.id

Arabic digits speech recognition and speaker identification in noisy environment using a hybrid model of VQ and GMM

A Ouisaadane, S Safi, M Frikel - … Computing Electronics and …, 2020 - telkomnika.uad.ac.id

This paper presents an automatic speaker identification and speech recognition for Arabic
digits in noisy environment. In this work, the proposed system is able to identify the speaker …

被引用次数：4 相关文章所有 7 个版本

高级搜索

QQ 群