Deep speaker embeddings for Speaker Verification: Review and experimental comparison

M Jakubec, R Jarina, E Lieskovska, P Kasak - Engineering Applications of …, 2024 - Elsevier
The construction of speaker-specific acoustic models for automatic speaker recognition is
almost exclusively based on deep neural network-based speaker embeddings. This work …

Multi-view self-attention based transformer for speaker recognition

R Wang, J Ao, L Zhou, S Liu, Z Wei, T Ko… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Initially developed for natural language processing (NLP), Transformer model is now widely
used for speech processing tasks such as speaker recognition, due to its powerful sequence …

Residual networks for text-independent speaker identification: Unleashing the power of residual learning

P Gambhir, A Dev, P Bansal, DK Sharma… - Journal of Information …, 2024 - Elsevier
The human voice, a dynamic signal, conveys valuable information for speaker identification,
encompassing gender, age, emotions, and language. In the biometrics industry, identifying …

Global–local self-attention based transformer for speaker verification

F Xie, D Zhang, C Liu - Applied Sciences, 2022 - mdpi.com
Transformer models are now widely used for speech processing tasks due to their powerful
sequence modeling capabilities. Previous work determined an efficient way to model …

PulmoListener: Continuous Acoustic Monitoring of Chronic Obstructive Pulmonary Disease in the Wild

S Bhalla, S Liaqat, R Wu, AS Gershon… - Proceedings of the …, 2023 - dl.acm.org
Prior work has shown the utility of acoustic analysis in controlled settings for assessing
chronic obstructive pulmonary disease (COPD)---one of the most common respiratory …

A run-through: Text independent speaker identification using deep learning

P Gambhir, A Dev - Artificial Intelligence and Speech Technology, 2021 - taylorfrancis.com
Speaker identification determines an unknown speaker's identity which is a 1: N match
where the speech is compared against multiple templates. Speaker identification is …

[PDF][PDF] Self-supervised Learning Representation based Accent Recognition with Persistent Accent Memory

R Li, Z Xie, H Xu, Y Peng, H Liu, H Huang, ES Chng - isca-archive.org
Accent recognition (AR) is challenging due to the lack of training data as well as the accents
are entangled with speakers and regional characteristics. This paper aims to improve AR …

Robustness of language recognition system to transmission channel

R Duroselle - 2021 - hal.science
Language recognition is the task of predicting the language used in a test speech utterance.
Since 2017, the best performing systems have been based on a deep neural network which …

[PDF][PDF] Robustesse au canal des systemes de reconnaissance de la langue

R Duroselle - hal.science
La tâche de reconnaissance de la langue consiste à prédire la langue utilisée dans un
enregistrement audio contenant de la parole. L'étude d'un tel traitement trouve sa …