Multistage speaker diarization of broadcast news

M Kotti, V Moschou, C Kotropoulos - Signal processing, 2008 - Elsevier

This survey focuses on two challenging speech processing topics, namely: speaker
segmentation and speaker clustering. Speaker segmentation aims at finding speaker …

被引用次数：183 相关文章所有 9 个版本

[PDF] jip.ru

[PDF][PDF] Распознавание личности по голосу: аналитический обзор

ВН Сорокин, ВВ Вьюгин, АА Тананыкин - Информационные процессы, 2012 - jip.ru

Задача распознавания диктора по его голосу была поставлена более 40 лет тому
назад, и исследования в этой области все еще продолжаются. Решение этой задачи …

被引用次数：127 相关文章所有 4 个版本

[PDF] arxiv.org

Tristounet: triplet loss for speaker turn embedding

H Bredin - 2017 IEEE international conference on acoustics …, 2017 - ieeexplore.ieee.org

TristouNet is a neural network architecture based on Long Short-Term Memory recurrent
networks, meant to project speech sequences into a fixed-dimensional euclidean space …

被引用次数：222 相关文章所有 5 个版本

[PDF] arxiv.org

Joint speech recognition and speaker diarization via sequence transduction

LE Shafey, H Soltau, I Shafran - arXiv preprint arXiv:1907.05337, 2019 - arxiv.org

Speech applications dealing with conversations require not only recognizing the spoken
words, but also determining who spoke when. The task of assigning words to speakers is …

被引用次数：105 相关文章所有 7 个版本

[PDF] hal.science

An open-source state-of-the-art toolbox for broadcast news diarization

M Rouvier, G Dupuy, P Gay, E Khoury, T Merlin… - Interspeech, 2013 - hal.science

This paper presents the LIUM open-source speaker diarization toolbox, mostly dedicated to
broadcast news. This tool includes both Hierarchical Agglomerative Clustering using well …

被引用次数：206 相关文章所有 18 个版本

[PDF] hal.science

LIUM SpkDiarization: an open source toolkit for diarization

S Meignier, T Merlin - CMU SPUD Workshop, 2010 - hal.science

This paper presents an open-source diarization toolkit which is mostly dedicated to speaker
and developed by the LIUM. This toolkit includes hierarchical agglomerative clustering …

被引用次数：248 相关文章所有 9 个版本

[PDF] hal.science

Speaker change detection in broadcast tv using bidirectional long short-term memory networks

R Yin, H Bredin, C Barras - Interspeech 2017, 2017 - hal.science

Speaker change detection is an important step in a speaker di-arization system. It aims at
finding speaker change points in the audio stream. In this paper, it is treated as a sequence …

被引用次数：98 相关文章所有 6 个版本

[PDF] academia.edu

Diarization of telephone conversations using factor analysis

P Kenny, D Reynolds, F Castaldo - IEEE Journal of Selected …, 2010 - ieeexplore.ieee.org

We report on work on speaker diarization of telephone conversations which was begun at
the Robust Speaker Recognition Workshop held at Johns Hopkins University in 2008. Three …

被引用次数：174 相关文章所有 8 个版本

[PDF] academia.edu

A review on speaker diarization systems and approaches

MH Moattar, MM Homayounpour - Speech Communication, 2012 - Elsevier

Speaker indexing or diarization is an important task in audio processing and retrieval.
Speaker diarization is the process of labeling a speech signal with labels corresponding to …

被引用次数：116 相关文章所有 6 个版本

[PDF] smu.edu.sg

Sociophone: Everyday face-to-face interaction monitoring platform using multi-phone sensor fusion

Y Lee, C Min, C Hwang, J Lee, I Hwang, Y Ju… - Proceeding of the 11th …, 2013 - dl.acm.org

In this paper, we propose SocioPhone, a novel initiative to build a mobile platform for face-to-
face interaction monitoring. Face-to-face interaction, especially conversation, is a …

被引用次数：117 相关文章所有 21 个版本

高级搜索

QQ 群