Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization

P Singh, S Ganapathy - arXiv preprint arXiv:2401.12850, 2024 - arxiv.org
Speaker diarization, the task of segmenting an audio recording based on speaker identity,
constitutes an important speech pre-processing step for several downstream applications …

Implicit Self-supervised Language Representation for Spoken Language Diarization

J Mishra, SRM Prasanna - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org
The use of spoken language diarization (LD) as a preprocessing system might be essential
in a code-switched (CS) scenario. Furthermore, implicit frameworks are preferable to …

TalTech-IRIT-LIS Speaker and Language Diarization Systems for DISPLACE 2024

J Kalda, T Alumäe, M Lebourdais, H Bredin… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper describes the submissions of team TalTech-IRIT-LIS to the DISPLACE 2024
challenge. Our team participated in the speaker diarization and language diarization tracks …

The Second DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments

SB Kalluri, P Singh, PR Chowdhuri, A Kulkarni… - arXiv preprint arXiv …, 2024 - arxiv.org
The DIarization of SPeaker and LAnguage in Conversational Environments (DISPLACE)
2024 challenge is the second in the series of DISPLACE challenges, which involves tasks of …

Graph Clustering Approaches for Speaker Diarization of Conversational Speech

P Singh - 2023 - leap.ee.iisc.ac.in
In this era of advanced machine intelligence, real-world speech applications need to be
equipped to deal with conversations involving multiple speakers. An essential first step in …