Voxceleb2: Deep speaker recognition

JS Chung, A Nagrani, A Zisserman - arXiv preprint arXiv:1806.05622, 2018 - arxiv.org
… It also addresses a lack of ethnic diversity in the VoxCeleb1 dataset (section 3); (ii) we propose
deep ResNet-based architectures for speaker embedding suitable for spectrogram inputs

Speaker recognition using LPC, MFCC, ZCR features with ANN and SVM classifier for large input database

N Chauhan, T Isshiki, D Li - 2019 IEEE 4th international …, 2019 - ieeexplore.ieee.org
speakers voice samples as a input for recognition purpose. Main goal of this work is to obtain
better accuracy for speaker recognition system for large number of voice database. In this …

Speaker recognition using neural networks and conventional classifiers

KR Farrell, RJ Mammone… - IEEE Transactions on …, 1994 - ieeexplore.ieee.org
… The leaves represent exclusive partitions of the input data. A test vector will be evaluated at
each node and directed to one of two subsequent nodes based on a decision. This decision …

Speaker recognition—general classifier approaches and data fusion methods

RP Ramachandran, KR Farrell, R Ramachandran… - Pattern recognition, 2002 - Elsevier
… In the text-independent case, there is no restriction on the sentence or phrase to be spoken,
whereas in the text-dependent case, the input sentence or phrase is fixed for each speaker. …

Deep learning methods in speaker recognition: a review

D Sztahó, G Szaszák, A Beke - arXiv preprint arXiv:1911.06615, 2019 - arxiv.org
… representations of unlabelled input data. In (Banerjee et al., 2018), spectrograms (25 ms
window size, 10 ms timestep) have been fed as input speech data after applying PCA …

An automatic speaker recognition system

P Chakraborty, F Ahmed, MM Kabir… - … 2007, Kitakyushu, Japan …, 2008 - Springer
… After the enrolment session, the acoustic vectors extracted from input speech of a speaker
… In this work, the utterances of several speakers are taken and the data are divided in music (…

Report: A vector quantization approach to speaker recognition

FK Soong, AE Rosenberg, BH Juang… - AT&T technical …, 1987 - ieeexplore.ieee.org
… where R, is the autocorrelation matrix of speech input data associated with the vector a. Using
this distortion measure, and the VQ codebook training algorithm proposed by Linde, Buzo, …

An overview of speaker recognition technology

S Furui - Automatic Speech and Speaker Recognition …, 1996 - Springer
… Then the likelihood of inputdatabases designed for speaker recognition and related areas
include KING corpus and SWITCHBOARD corpus [24]. It is crucial to extend these databases

Multivariability speaker recognition database in Indian scenario

BC Haris, G Pradhan, A Misra, SRM Prasanna… - International Journal of …, 2012 - Springer
… To remove the non-speech portions from input data, an energy based voice activity detector
with fixed threshold was used. The cepstral mean subtraction was applied on all features so …

Speaker recognition: A tutorial

JP Campbell - Proceedings of the IEEE, 1997 - ieeexplore.ieee.org
… of the computed input feature vectors to models of the claimed speaker or feature vector …
to a representation problem, we seek other means to reduce the dimensionality of the data. …