End-to-end speaker segmentation for overlap-aware resegmentation

H Bredin, A Laurent - arXiv preprint arXiv:2104.04045, 2021 - arxiv.org
Speaker segmentation consists in partitioning a conversation between one or more
speakers into speaker turns. Usually addressed as the late combination of three sub-tasks …

Overview of speaker modeling and its applications: From the lens of deep speaker representation learning

S Wang, Z Chen, KA Lee, Y Qian… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …

Encoder-decoder based attractors for end-to-end neural diarization

S Horiguchi, Y Fujita, S Watanabe… - … /ACM Transactions on …, 2022 - ieeexplore.ieee.org
This paper investigates an end-to-end neural diarization (EEND) method for an unknown
number of speakers. In contrast to the conventional cascaded approach to speaker …

Towards neural diarization for unlimited numbers of speakers using global and local attractors

S Horiguchi, S Watanabe, P García… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org
Attractor-based end-to-end diarization is achieving comparable accuracy to the carefully
tuned conventional clustering-based methods on challenging datasets. However, the main …

From simulated mixtures to simulated conversations as training data for end-to-end neural diarization

F Landini, A Lozano-Diez, M Diez, L Burget - arXiv preprint arXiv …, 2022 - arxiv.org
End-to-end neural diarization (EEND) is nowadays one of the most prominent research
topics in speaker diarization. EEND presents an attractive alternative to standard cascaded …

EEND-SS: Joint end-to-end neural speaker diarization and speech separation for flexible number of speakers

S Maiti, Y Ueda, S Watanabe, C Zhang… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org
In this paper, we present a novel framework that jointly performs three tasks: speaker
diarization, speech separation, and speaker counting. Our proposed framework integrates …

Online neural diarization of unlimited numbers of speakers using global and local attractors

S Horiguchi, S Watanabe, P García… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org
A method to perform offline and online speaker diarization for an unlimited number of
speakers is described in this paper. End-to-end neural diarization (EEND) has achieved …

Attention-based encoder-decoder network for end-to-end neural speaker diarization with target speaker attractor

Z Chen, B Han, S Wang, Y Qian - arXiv preprint arXiv:2305.10704, 2023 - arxiv.org
This paper proposes a novel Attention-based Encoder-Decoder network for End-to-End
Neural speaker Diarization (AED-EEND). In AED-EEND system, we incorporate the target …

Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer

Z Chen, B Han, S Wang, Y Qian - IEEE/ACM Transactions on …, 2024 - ieeexplore.ieee.org
Deep neural network-based systems have significantly improved the performance of
speaker diarization tasks. However, end-to-end neural diarization (EEND) systems often …

Online streaming end-to-end neural diarization handling overlapping speech and flexible numbers of speakers

Y Xue, S Horiguchi, Y Fujita, Y Takashima… - arXiv preprint arXiv …, 2021 - arxiv.org
We propose a streaming diarization method based on an end-to-end neural diarization
(EEND) model, which handles flexible numbers of speakers and overlapping speech. In our …