End-to-end speaker diarization conditioned on speech activity and overlap detection

H Bredin, A Laurent - arXiv preprint arXiv:2104.04045, 2021 - arxiv.org

Speaker segmentation consists in partitioning a conversation between one or more
speakers into speaker turns. Usually addressed as the late combination of three sub-tasks …

被引用次数：208 相关文章所有 17 个版本

[PDF] arxiv.org

Overview of speaker modeling and its applications: From the lens of deep speaker representation learning

S Wang, Z Chen, KA Lee, Y Qian… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org

Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …

被引用次数：4 相关文章所有 4 个版本

[PDF] ieee.org

Encoder-decoder based attractors for end-to-end neural diarization

S Horiguchi, Y Fujita, S Watanabe… - … /ACM Transactions on …, 2022 - ieeexplore.ieee.org

This paper investigates an end-to-end neural diarization (EEND) method for an unknown
number of speakers. In contrast to the conventional cascaded approach to speaker …

被引用次数：70 相关文章所有 6 个版本

[PDF] arxiv.org

Towards neural diarization for unlimited numbers of speakers using global and local attractors

S Horiguchi, S Watanabe, P García… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org

Attractor-based end-to-end diarization is achieving comparable accuracy to the carefully
tuned conventional clustering-based methods on challenging datasets. However, the main …

被引用次数：43 相关文章所有 6 个版本

[PDF] arxiv.org

From simulated mixtures to simulated conversations as training data for end-to-end neural diarization

F Landini, A Lozano-Diez, M Diez, L Burget - arXiv preprint arXiv …, 2022 - arxiv.org

End-to-end neural diarization (EEND) is nowadays one of the most prominent research
topics in speaker diarization. EEND presents an attractive alternative to standard cascaded …

被引用次数：44 相关文章所有 8 个版本

[PDF] arxiv.org

EEND-SS: Joint end-to-end neural speaker diarization and speech separation for flexible number of speakers

S Maiti, Y Ueda, S Watanabe, C Zhang… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org

In this paper, we present a novel framework that jointly performs three tasks: speaker
diarization, speech separation, and speaker counting. Our proposed framework integrates …

被引用次数：32 相关文章所有 5 个版本

[PDF] ieee.org

Online neural diarization of unlimited numbers of speakers using global and local attractors

S Horiguchi, S Watanabe, P García… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org

A method to perform offline and online speaker diarization for an unlimited number of
speakers is described in this paper. End-to-end neural diarization (EEND) has achieved …

被引用次数：27 相关文章所有 7 个版本

[PDF] arxiv.org

Attention-based encoder-decoder network for end-to-end neural speaker diarization with target speaker attractor

Z Chen, B Han, S Wang, Y Qian - arXiv preprint arXiv:2305.10704, 2023 - arxiv.org

This paper proposes a novel Attention-based Encoder-Decoder network for End-to-End
Neural speaker Diarization (AED-EEND). In AED-EEND system, we incorporate the target …

被引用次数：20 相关文章所有 5 个版本

[PDF] arxiv.org

Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer

Z Chen, B Han, S Wang, Y Qian - IEEE/ACM Transactions on …, 2024 - ieeexplore.ieee.org

Deep neural network-based systems have significantly improved the performance of
speaker diarization tasks. However, end-to-end neural diarization (EEND) systems often …

被引用次数：20 相关文章所有 5 个版本

[PDF] arxiv.org

Online streaming end-to-end neural diarization handling overlapping speech and flexible numbers of speakers

Y Xue, S Horiguchi, Y Fujita, Y Takashima… - arXiv preprint arXiv …, 2021 - arxiv.org

We propose a streaming diarization method based on an end-to-end neural diarization
(EEND) model, which handles flexible numbers of speakers and overlapping speech. In our …

被引用次数：30 相关文章所有 8 个版本

高级搜索

QQ 群