Speaker segmentation and clustering

M Kotti, V Moschou, C Kotropoulos - Signal processing, 2008 - Elsevier
This survey focuses on two challenging speech processing topics, namely: speaker
segmentation and speaker clustering. Speaker segmentation aims at finding speaker …

[PDF][PDF] Распознавание личности по голосу: аналитический обзор

ВН Сорокин, ВВ Вьюгин, АА Тананыкин - Информационные процессы, 2012 - jip.ru
Задача распознавания диктора по его голосу была поставлена более 40 лет тому
назад, и исследования в этой области все еще продолжаются. Решение этой задачи …

Tristounet: triplet loss for speaker turn embedding

H Bredin - 2017 IEEE international conference on acoustics …, 2017 - ieeexplore.ieee.org
TristouNet is a neural network architecture based on Long Short-Term Memory recurrent
networks, meant to project speech sequences into a fixed-dimensional euclidean space …

Joint speech recognition and speaker diarization via sequence transduction

LE Shafey, H Soltau, I Shafran - arXiv preprint arXiv:1907.05337, 2019 - arxiv.org
Speech applications dealing with conversations require not only recognizing the spoken
words, but also determining who spoke when. The task of assigning words to speakers is …

An open-source state-of-the-art toolbox for broadcast news diarization

M Rouvier, G Dupuy, P Gay, E Khoury, T Merlin… - Interspeech, 2013 - hal.science
This paper presents the LIUM open-source speaker diarization toolbox, mostly dedicated to
broadcast news. This tool includes both Hierarchical Agglomerative Clustering using well …

LIUM SpkDiarization: an open source toolkit for diarization

S Meignier, T Merlin - CMU SPUD Workshop, 2010 - hal.science
This paper presents an open-source diarization toolkit which is mostly dedicated to speaker
and developed by the LIUM. This toolkit includes hierarchical agglomerative clustering …

Speaker change detection in broadcast tv using bidirectional long short-term memory networks

R Yin, H Bredin, C Barras - Interspeech 2017, 2017 - hal.science
Speaker change detection is an important step in a speaker di-arization system. It aims at
finding speaker change points in the audio stream. In this paper, it is treated as a sequence …

Diarization of telephone conversations using factor analysis

P Kenny, D Reynolds, F Castaldo - IEEE Journal of Selected …, 2010 - ieeexplore.ieee.org
We report on work on speaker diarization of telephone conversations which was begun at
the Robust Speaker Recognition Workshop held at Johns Hopkins University in 2008. Three …

A review on speaker diarization systems and approaches

MH Moattar, MM Homayounpour - Speech Communication, 2012 - Elsevier
Speaker indexing or diarization is an important task in audio processing and retrieval.
Speaker diarization is the process of labeling a speech signal with labels corresponding to …

Sociophone: Everyday face-to-face interaction monitoring platform using multi-phone sensor fusion

Y Lee, C Min, C Hwang, J Lee, I Hwang, Y Ju… - Proceeding of the 11th …, 2013 - dl.acm.org
In this paper, we propose SocioPhone, a novel initiative to build a mobile platform for face-to-
face interaction monitoring. Face-to-face interaction, especially conversation, is a …