查看文章

usc.edu 中的 [PDF]

Unsupervised speaker indexing using generic models

作者

Soonil Kwon, Shri Narayanan

发表日期

2005/8/15

期刊

IEEE Transactions on Speech and Audio processing

卷号

期号

页码范围

1004-1013

出版商

IEEE

简介

Unsupervised speaker indexing sequentially detects points where a speaker identity changes in a multispeaker audio stream, and categorizes each speaker segment, without any prior knowledge about the speakers. This paper addresses two challenges: The first relates to sequential speaker change detection. The second relates to speaker modeling in light of the fact that the number/identity of the speakers is unknown. To address this issue, a predetermined generic speaker-independent model set, called the sample speaker models (SSM), is proposed. This set can be useful for more accurate speaker modeling and clustering without requiring training models on target speaker data. Once a speaker-independent model is selected from the generic sample models, it is progressively adapted into a specific speaker-dependent model. Experiments were performed with data from the Speaker Recognition Benchmark …

引用总数

被引用次数：86

200520062007200820092010201120122013201420152016201720182019202020212022202320245 4 14 6 2 6 7 8 3 6 3 3 2 4 2 3 2 4 2

学术搜索中的文章

Unsupervised speaker indexing using generic models

S Kwon, S Narayanan - IEEE Transactions on Speech and Audio processing, 2005

被引用次数：86 相关文章所有 6 个版本