A review of speaker diarization: Recent advances with deep learning

TJ Park, N Kanda, D Dimitriadis, KJ Han… - Computer Speech & …, 2022 - Elsevier
Speaker diarization is a task to label audio or video recordings with classes that correspond
to speaker identity, or in short, a task to identify “who spoke when”. In the early years …

Spoofing and countermeasures for speaker verification: A survey

Z Wu, N Evans, T Kinnunen, J Yamagishi, F Alegre… - speech …, 2015 - Elsevier
While biometric authentication has advanced significantly in recent years, evidence shows
the technology can be susceptible to malicious spoofing attacks. The research community …

ASVspoof: the automatic speaker verification spoofing and countermeasures challenge

Z Wu, J Yamagishi, T Kinnunen… - IEEE Journal of …, 2017 - ieeexplore.ieee.org
Concerns regarding the vulnerability of automatic speaker verification (ASV) technology
against spoofing can undermine confidence in its reliability and form a barrier to exploitation …

An overview of text-independent speaker recognition: From features to supervectors

T Kinnunen, H Li - Speech communication, 2010 - Elsevier
This paper gives an overview of automatic speaker recognition technology, with an
emphasis on text-independent recognition. Speaker recognition has been studied actively …

A study of interspeaker variability in speaker verification

P Kenny, P Ouellet, N Dehak, V Gupta… - … on Audio, Speech …, 2008 - ieeexplore.ieee.org
We propose a new approach to the problem of estimating the hyperparameters which define
the interspeaker variability model in joint factor analysis. We tested the proposed estimation …

Microsoft speaker diarization system for the voxceleb speaker recognition challenge 2020

X Xiao, N Kanda, Z Chen, T Zhou… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
This paper describes the Microsoft speaker diarization system for monaural multi-talker
recordings in the wild, evaluated at the diarization track of the VoxCeleb Speaker …

On joint optimization of automatic speaker verification and anti-spoofing in the embedding space

A Gomez-Alanis, JA Gonzalez-Lopez… - IEEE Transactions …, 2020 - ieeexplore.ieee.org
Biometric systems are exposed to spoofing attacks which may compromise their security,
and voice biometrics based on automatic speaker verification (ASV), is no exception. To …

Deconstructing cross-entropy for probabilistic binary classifiers

D Ramos, J Franco-Pedroso, A Lozano-Diez… - Entropy, 2018 - mdpi.com
In this work, we analyze the cross-entropy function, widely used in classifiers both as a
performance measure and as an optimization objective. We contextualize cross-entropy in …

Tutorial on logistic-regression calibration and fusion: converting a score to a likelihood ratio

GS Morrison - Australian Journal of Forensic Sciences, 2013 - Taylor & Francis
Logistic-regression calibration and fusion are potential steps in the calculation of forensic
likelihood ratios. The present paper provides a tutorial on logistic-regression calibration and …

Language identification: A tutorial

E Ambikairajah, H Li, L Wang, B Yin… - IEEE Circuits and …, 2011 - ieeexplore.ieee.org
This tutorial presents an overview of the progression of spoken language identification (LID)
systems and current developments. The introduction provides a background on automatic …