Multimodal speaker diarization

A Noulas, G Englebienne… - IEEE Transactions on …, 2011 - ieeexplore.ieee.org
We present a novel probabilistic framework that fuses information coming from the audio
and video modality to perform speaker diarization. The proposed framework is a Dynamic …

Speaker diarization using speaker embedding(s) and trained generative model

IL Moreno, LCC Rus - US Patent 10,978,059, 2021 - Google Patents
Speaker diarization techniques that enable processing of audio data to generate one or
more refined versions of the audio data, where each of the refined versions of the audio data …

Speaker diarization using speaker embedding(s) and trained generative model

IL Moreno, LCC Rus - US Patent 11,735,176, 2023 - Google Patents
Speaker diarization techniques that enable processing of audio data to generate one or
more refined versions of the audio data, where each of the refined versions of the audio data …