An overview of text-independent speaker recognition: From features to supervectors

T Kinnunen, H Li - Speech communication, 2010 - Elsevier
This paper gives an overview of automatic speaker recognition technology, with an
emphasis on text-independent recognition. Speaker recognition has been studied actively …

Overview of speaker modeling and its applications: From the lens of deep speaker representation learning

S Wang, Z Chen, KA Lee, Y Qian… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …

Modulation spectral features for speech emotion recognition using deep neural networks

P Singh, M Sahidullah, G Saha - Speech Communication, 2023 - Elsevier
This work explores the use of constant-Q transform based modulation spectral features
(CQT-MSF) for speech emotion recognition (SER). The human perception and analysis of sound …

An investigation of multidimensional voice program parameters in three different databases for voice pathology detection and classification

A Al-Nasheri, G Muhammad, M Alsulaiman, Z Ali… - Journal of Voice, 2017 - Elsevier
Summary. Background and Objective: Automatic voice-pathology detection and classification
systems may help clinicians to detect the existence of any voice pathologies and the type of …

An automatic health monitoring system for patients suffering from voice complications in smart cities

Z Ali, G Muhammad, MF Alhamid - IEEE Access, 2017 - ieeexplore.ieee.org
Current evolutions in the Internet of Things and cloud computing make it believable to build
smart cities and homes. Smart cities provide smart technologies to residents for the …

Optimization of data-driven filterbank for automatic speaker verification

S Sarangi, M Sahidullah, G Saha - Digital Signal Processing, 2020 - Elsevier
Most of the speech processing applications use triangular filters spaced in mel-scale for
feature extraction. In this paper, we propose a new data-driven filter design method which …

Investigation of voice pathology detection and classification on different frequency regions using correlation functions

A Al-Nasheri, G Muhammad, M Alsulaiman, Z Ali - Journal of Voice, 2017 - Elsevier
Summary. Objectives and Background: Automatic voice pathology detection and
classification systems effectively contribute to the assessment of voice disorders, which …

Empirical mode decomposition for adaptive AM-FM analysis of speech: A review

R Sharma, L Vignolo, G Schlotthauer… - Speech …, 2017 - Elsevier
This work reviews the advancements in the non-conventional analysis of speech signals,
particularly from an AM-FM analysis point of view. The benefits of such an analysis, as …

A unique approach in text independent speaker recognition using MFCC feature sets and probabilistic neural network

KS Ahmad, AS Thosar, JH Nirmal… - … on Advances in Pattern …, 2015 - ieeexplore.ieee.org
This paper motivates the use of combination of mel frequency cepstral coefficients (MFCC)
and its delta derivatives (DMFCC and DDMFCC) calculated using mel spaced Gaussian …

[PDF] Speaker recognition by voice: an analytical review

ВН Сорокин, ВВ Вьюгин, АА Тананыкин - Информационные процессы, 2012 - jip.ru
The problem of recognizing a speaker by voice was posed more than 40 years ago,
and research in this area is still ongoing. The solution to this problem …