Speaker segmentation and clustering

M Kotti, V Moschou, C Kotropoulos - Signal processing, 2008 - Elsevier
This survey focuses on two challenging speech processing topics, namely: speaker
segmentation and speaker clustering. Speaker segmentation aims at finding speaker …

A review on speaker diarization systems and approaches

MH Moattar, MM Homayounpour - Speech Communication, 2012 - Elsevier
Speaker indexing or diarization is an important task in audio processing and retrieval.
Speaker diarization is the process of labeling a speech signal with labels corresponding to …

Real-time speaker identification and verification

T Kinnunen, E Karpov, P Franti - IEEE Transactions on Audio …, 2005 - ieeexplore.ieee.org
In speaker identification, most of the computation originates from the distance or likelihood
computations between the feature vectors of the unknown speaker and the models in the …

[PDF][PDF] Spectral features for automatic text-independent speaker recognition

T Kinnunen - Licentiate's thesis, 2003 - cs.uef.fi
Front-end or feature extractor is the first component in an automatic speaker recognition
system. Feature extraction transforms the raw speech signal into a compact but effective …

Unsupervised speaker indexing using generic models

S Kwon, S Narayanan - IEEE Transactions on Speech and …, 2005 - ieeexplore.ieee.org
Unsupervised speaker indexing sequentially detects points where a speaker identity
changes in a multispeaker audio stream, and categorizes each speaker segment, without …

Development of a remote therapy tool for childhood apraxia of speech

A Parnandi, V Karappa, T Lan, M Shahin… - ACM Transactions on …, 2015 - dl.acm.org
We present a multitier system for the remote administration of speech therapy to children
with apraxia of speech. The system uses a client-server architecture model and facilitates …

Hierarchical RNN with static sentence-level attention for text-based speaker change detection

Z Meng, L Mou, Z Jin - Proceedings of the 2017 ACM on Conference on …, 2017 - dl.acm.org
Speaker change detection (SCD) is an important task in dialog modeling. Our paper
addresses the problem of text-based SCD, which differs from existing audio-based studies …

Computationally efficient and robust BIC-based speaker segmentation

M Kotti, E Benetos… - IEEE transactions on audio …, 2008 - ieeexplore.ieee.org
An algorithm for automatic speaker segmentation based on the Bayesian information
criterion (BIC) is presented. BIC tests are not performed for every window shift, as previously …

Multistage newton's approach for training radial basis function neural networks

K Tyagi, C Rane, B Irie, M Manry - SN Computer Science, 2021 - Springer
A systematic four-step batch approach is presented for the second-order training of radial
basis function (RBF) neural networks for estimation. First, it is shown that second-order …

A multitask learning framework for speaker change detection with content information from unsupervised speech decomposition

H Su, D Zhao, L Dang, M Li, X Wu… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Speaker Change Detection (SCD) is a task of determining the time boundaries between
speech segments of different speakers. SCD system can be applied to many tasks, such as …