Speaker change detection using a new weighted distance measure.

M Kotti, V Moschou, C Kotropoulos - Signal processing, 2008 - Elsevier

This survey focuses on two challenging speech processing topics, namely: speaker
segmentation and speaker clustering. Speaker segmentation aims at finding speaker …

被引用次数：183 相关文章所有 9 个版本

[PDF] academia.edu

A review on speaker diarization systems and approaches

MH Moattar, MM Homayounpour - Speech Communication, 2012 - Elsevier

Speaker indexing or diarization is an important task in audio processing and retrieval.
Speaker diarization is the process of labeling a speech signal with labels corresponding to …

被引用次数：116 相关文章所有 6 个版本

[PDF] joensuu.fi

Real-time speaker identification and verification

T Kinnunen, E Karpov, P Franti - IEEE Transactions on Audio …, 2005 - ieeexplore.ieee.org

In speaker identification, most of the computation originates from the distance or likelihood
computations between the feature vectors of the unknown speaker and the models in the …

被引用次数：307 相关文章所有 24 个版本

[PDF] uef.fi

[PDF][PDF] Spectral features for automatic text-independent speaker recognition

T Kinnunen - Licentiate's thesis, 2003 - cs.uef.fi

Front-end or feature extractor is the first component in an automatic speaker recognition
system. Feature extraction transforms the raw speech signal into a compact but effective …

被引用次数：222 相关文章所有 11 个版本

[PDF] usc.edu

Unsupervised speaker indexing using generic models

S Kwon, S Narayanan - IEEE Transactions on Speech and …, 2005 - ieeexplore.ieee.org

Unsupervised speaker indexing sequentially detects points where a speaker identity
changes in a multispeaker audio stream, and categorizes each speaker segment, without …

被引用次数：86 相关文章所有 6 个版本

[PDF] tamu.edu

Development of a remote therapy tool for childhood apraxia of speech

A Parnandi, V Karappa, T Lan, M Shahin… - ACM Transactions on …, 2015 - dl.acm.org

We present a multitier system for the remote administration of speech therapy to children
with apraxia of speech. The system uses a client-server architecture model and facilitates …

被引用次数：37 相关文章所有 3 个版本

[PDF] arxiv.org

Hierarchical RNN with static sentence-level attention for text-based speaker change detection

Z Meng, L Mou, Z Jin - Proceedings of the 2017 ACM on Conference on …, 2017 - dl.acm.org

Speaker change detection (SCD) is an important task in dialog modeling. Our paper
addresses the problem of text-based SCD, which differs from existing audio-based studies …

被引用次数：33 相关文章所有 5 个版本

[PDF] city.ac.uk

Computationally efficient and robust BIC-based speaker segmentation

M Kotti, E Benetos… - IEEE transactions on audio …, 2008 - ieeexplore.ieee.org

An algorithm for automatic speaker segmentation based on the Bayesian information
criterion (BIC) is presented. BIC tests are not performed for every window shift, as previously …

被引用次数：58 相关文章所有 26 个版本

[PDF] researchgate.net

Multistage newton's approach for training radial basis function neural networks

K Tyagi, C Rane, B Irie, M Manry - SN Computer Science, 2021 - Springer

A systematic four-step batch approach is presented for the second-order training of radial
basis function (RBF) neural networks for estimation. First, it is shown that second-order …

被引用次数：8 相关文章所有 3 个版本

[PDF] cuhk.edu.hk

A multitask learning framework for speaker change detection with content information from unsupervised speech decomposition

H Su, D Zhao, L Dang, M Li, X Wu… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

Speaker Change Detection (SCD) is a task of determining the time boundaries between
speech segments of different speakers. SCD system can be applied to many tasks, such as …

被引用次数：5 相关文章所有 2 个版本

高级搜索

QQ 群