In defence of metric learning for speaker recognition

JS Chung, J Huh, S Mun, M Lee, HS Heo… - arXiv preprint arXiv …, 2020 - arxiv.org
The objective of this paper is' open-set'speaker recognition of unseen speakers, where ideal
embeddings should be able to condense information into a compact utterance-level …

Leveraging speaker attribute information using multi task learning for speaker verification and diarization

C Luu, P Bell, S Renals - arXiv preprint arXiv:2010.14269, 2020 - arxiv.org
Deep speaker embeddings have become the leading method for encoding speaker identity
in speaker recognition tasks. The embedding space should ideally capture the variations …

Masked multi-center angular margin loss for language recognition

M Ju, Y Xu, D Ke, K Su - EURASIP Journal on Audio, Speech, and Music …, 2022 - Springer
Abstract Language recognition based on embedding aims to maximize inter-class variance
and minimize intra-class variance. Previous researches are limited to the training constraint …

Cosine-distance virtual adversarial training for semi-supervised speaker-discriminative acoustic embeddings

FL Kreyssig, PC Woodland - arXiv preprint arXiv:2008.03756, 2020 - arxiv.org
In this paper, we propose a semi-supervised learning (SSL) technique for training deep
neural networks (DNNs) to generate speaker-discriminative acoustic embeddings (speaker …

On metric-based deep embedding learning for text-independent speaker verification

HB Kashani, S Reza, IS Rezaei - 2020 6th Iranian Conference …, 2020 - ieeexplore.ieee.org
As a state-of-the-art solution for speaker verification problems, deep neural networks have
been usefully employed for extracting speaker embeddings which represent speaker …

Active and Semi-Supervised Learning for Speech Recognition

F Kreyssig - 2024 - repository.cam.ac.uk
Recent years have seen significant advances in speech recognition technology, which can
largely be attributed to the combination of the rise in deep learning in speech recognition …