An investigation of dependencies between frequency components and speaker characteristics...

T Kinnunen, H Li - Speech communication, 2010 - Elsevier

This paper gives an overview of automatic speaker recognition technology, with an
emphasis on text-independent recognition. Speaker recognition has been studied actively …

被引用次数：2071 相关文章所有 26 个版本

[PDF] arxiv.org

Overview of speaker modeling and its applications: From the lens of deep speaker representation learning

S Wang, Z Chen, KA Lee, Y Qian… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org

Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …

被引用次数：4 相关文章所有 4 个版本

[PDF] arxiv.org

Modulation spectral features for speech emotion recognition using deep neural networks

P Singh, M Sahidullah, G Saha - Speech Communication, 2023 - Elsevier

This work explores the use of constant-Q transform based modulation spectral features (CQT-
MSF) for speech emotion recognition (SER). The human perception and analysis of sound …

被引用次数：40 相关文章所有 7 个版本

[PDF] essex.ac.uk

An investigation of multidimensional voice program parameters in three different databases for voice pathology detection and classification

A Al-Nasheri, G Muhammad, M Alsulaiman, Z Ali… - Journal of Voice, 2017 - Elsevier

Summary Background and Objective Automatic voice-pathology detection and classification
systems may help clinicians to detect the existence of any voice pathologies and the type of …

被引用次数：136 相关文章所有 8 个版本

[PDF] ieee.org

An automatic health monitoring system for patients suffering from voice complications in smart cities

Z Ali, G Muhammad, MF Alhamid - Ieee Access, 2017 - ieeexplore.ieee.org

Current evolutions in the Internet of Things and cloud computing make it believable to build
smart cities and homes. Smart cities provide smart technologies to residents for the …

被引用次数：112 相关文章所有 8 个版本

[PDF] arxiv.org

Optimization of data-driven filterbank for automatic speaker verification

S Sarangi, M Sahidullah, G Saha - Digital Signal Processing, 2020 - Elsevier

Most of the speech processing applications use triangular filters spaced in mel-scale for
feature extraction. In this paper, we propose a new data-driven filter design method which …

被引用次数：69 相关文章所有 8 个版本

[PDF] essex.ac.uk

Investigation of voice pathology detection and classification on different frequency regions using correlation functions

A Al-Nasheri, G Muhammad, M Alsulaiman, Z Ali - Journal of Voice, 2017 - Elsevier

Summary Objectives and Background Automatic voice pathology detection and
classification systems effectively contribute to the assessment of voice disorders, which …

被引用次数：111 相关文章所有 14 个版本

[PDF] unl.edu.ar

Empirical mode decomposition for adaptive AM-FM analysis of speech: A review

R Sharma, L Vignolo, G Schlotthauer… - Speech …, 2017 - Elsevier

This work reviews the advancements in the non-conventional analysis of speech signals,
particularly from an AM-FM analysis point of view. The benefits of such an analysis, as …

被引用次数：89 相关文章所有 10 个版本

A unique approach in text independent speaker recognition using MFCC feature sets and probabilistic neural network

KS Ahmad, AS Thosar, JH Nirmal… - … on Advances in Pattern …, 2015 - ieeexplore.ieee.org

This paper motivates the use of combination of mel frequency cepstral coefficients (MFCC)
and its delta derivatives (DMFCC and DDMFCC) calculated using mel spaced Gaussian …

被引用次数：107 相关文章所有 2 个版本

[PDF] jip.ru

[PDF][PDF] Распознавание личности по голосу: аналитический обзор

ВН Сорокин, ВВ Вьюгин, АА Тананыкин - Информационные процессы, 2012 - jip.ru

Задача распознавания диктора по его голосу была поставлена более 40 лет тому
назад, и исследования в этой области все еще продолжаются. Решение этой задачи …

被引用次数：133 相关文章所有 4 个版本

高级搜索

QQ 群