T-vectors: Weakly supervised speaker identification using hierarchical transformer model

M Jakubec, R Jarina, E Lieskovska, P Kasak - Engineering Applications of …, 2024 - Elsevier

The construction of speaker-specific acoustic models for automatic speaker recognition is
almost exclusively based on deep neural network-based speaker embeddings. This work …

被引用次数：16 相关文章所有 2 个版本

[PDF] arxiv.org

Multi-view self-attention based transformer for speaker recognition

R Wang, J Ao, L Zhou, S Liu, Z Wei, T Ko… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

Initially developed for natural language processing (NLP), Transformer model is now widely
used for speech processing tasks such as speaker recognition, due to its powerful sequence …

被引用次数：49 相关文章所有 5 个版本

Residual networks for text-independent speaker identification: Unleashing the power of residual learning

P Gambhir, A Dev, P Bansal, DK Sharma… - Journal of Information …, 2024 - Elsevier

The human voice, a dynamic signal, conveys valuable information for speaker identification,
encompassing gender, age, emotions, and language. In the biometrics industry, identifying …

被引用次数：3 相关文章所有 2 个版本

[PDF] mdpi.com

Global–local self-attention based transformer for speaker verification

F Xie, D Zhang, C Liu - Applied Sciences, 2022 - mdpi.com

Transformer models are now widely used for speech processing tasks due to their powerful
sequence modeling capabilities. Previous work determined an efficient way to model …

被引用次数：8 相关文章所有 4 个版本

PulmoListener: Continuous Acoustic Monitoring of Chronic Obstructive Pulmonary Disease in the Wild

S Bhalla, S Liaqat, R Wu, AS Gershon… - Proceedings of the …, 2023 - dl.acm.org

Prior work has shown the utility of acoustic analysis in controlled settings for assessing
chronic obstructive pulmonary disease (COPD)---one of the most common respiratory …

被引用次数：2 相关文章所有 2 个版本

[PDF] researchgate.net

A run-through: Text independent speaker identification using deep learning

P Gambhir, A Dev - Artificial Intelligence and Speech Technology, 2021 - taylorfrancis.com

Speaker identification determines an unknown speaker's identity which is a 1: N match
where the speech is compared against multiple templates. Speaker identification is …

被引用次数：3 相关文章所有 5 个版本

[PDF] isca-archive.org

[PDF][PDF] Self-supervised Learning Representation based Accent Recognition with Persistent Accent Memory

R Li, Z Xie, H Xu, Y Peng, H Liu, H Huang, ES Chng - isca-archive.org

Accent recognition (AR) is challenging due to the lack of training data as well as the accents
are entangled with speakers and regional characteristics. This paper aims to improve AR …

被引用次数：1 相关文章所有 2 个版本

[PDF] hal.science

Robustness of language recognition system to transmission channel

R Duroselle - 2021 - hal.science

Language recognition is the task of predicting the language used in a test speech utterance.
Since 2017, the best performing systems have been based on a deep neural network which …

[PDF][PDF] Robustesse au canal des systemes de reconnaissance de la langue

R Duroselle - hal.science

La tâche de reconnaissance de la langue consiste à prédire la langue utilisée dans un
enregistrement audio contenant de la parole. L'étude d'un tel traitement trouve sa …

高级搜索

QQ 群