The problem of recognizing a speaker by voice was posed more than 40 years ago, and research in this area is still ongoing. Solving this problem …
H Bredin - 2017 IEEE international conference on acoustics …, 2017 - ieeexplore.ieee.org
TristouNet is a neural network architecture based on Long Short-Term Memory recurrent networks, meant to project speech sequences into a fixed-dimensional euclidean space …
Speech applications dealing with conversations require not only recognizing the spoken words, but also determining who spoke when. The task of assigning words to speakers is …
This paper presents the LIUM open-source speaker diarization toolbox, mostly dedicated to broadcast news. This tool includes both Hierarchical Agglomerative Clustering using well …
S Meignier, T Merlin - CMU SPUD Workshop, 2010 - hal.science
This paper presents an open-source diarization toolkit which is mostly dedicated to speaker and developed by the LIUM. This toolkit includes hierarchical agglomerative clustering …
Speaker change detection is an important step in a speaker diarization system. It aims at finding speaker change points in the audio stream. In this paper, it is treated as a sequence …
P Kenny, D Reynolds, F Castaldo - IEEE Journal of Selected …, 2010 - ieeexplore.ieee.org
We report on work on speaker diarization of telephone conversations which was begun at the Robust Speaker Recognition Workshop held at Johns Hopkins University in 2008. Three …
Speaker indexing or diarization is an important task in audio processing and retrieval. Speaker diarization is the process of labeling a speech signal with labels corresponding to …
In this paper, we propose SocioPhone, a novel initiative to build a mobile platform for face-to-face interaction monitoring. Face-to-face interaction, especially conversation, is a …