Forensic speaker recognition: A new method based on extracting accent and language information from short utterances

S Saleem, F Subhan, N Naseer, A Bais… - Forensic Science …, 2020 - Elsevier
… as follows: (1) $\Lambda_c(Y) = \log p(Y \mid \lambda_c) - \log p(Y \mid \lambda_{ubm})$ (2) $\hat{c} = \arg\max_{1 \le c \le N} \Lambda_c$
where $Y$ is a sequence of MFCC features of a test speech sample. Accent/language/…

A comprehensive review on speaker recognition

B Saritha, MA Laskar, RH Laskar - … in Speech and Music Technology …, 2022 - Springer
speaker identification and speaker verification. The process of identifying an unknown speaker
from a set of known speakers is speaker … The acoustic feature sequence is obtained from …

A survey of speaker recognition: Fundamental theories, recognition methods and opportunities

MM Kabir, MF Mridha, J Shin, I Jahan, AQ Ohi - IEEE Access, 2021 - ieeexplore.ieee.org
… The study showed that the speaker trait is primarily a deterministic short-time feature rather
… that aggregated the variable-length input sequence into an utterance level representation. …

Lithuanian speech recognition using purely phonetic deep learning

L Pipiras, R Maskeliūnas, R Damaševičius - Computers, 2019 - mdpi.com
… for Polish speech recognition. Pakoci et al. [38] used the n-gram model for tuning … are based
on the phoneme sequences processing, which is simpler than classical speech recognition

[HTML] Using a small amount of text-independent speech data for a BiLSTM large-scale speaker identification approach

MK Nammous, K Saeed, P Kobojek - Journal of King Saud University …, 2022 - Elsevier
… of the sequence. For any given timestep t c in a … Speaker Identification task. The dataset
was manually prepared in three languages: Arabic, English, and Polish for 24 speakers with a bit …

[PDF] Spine2Net: SpineNet with Res2Net and Time-Squeeze-and-Excitation Blocks for Speaker Recognition.

M Rybicka, J Villalba, P Zelasko, N Dehak… - Interspeech, 2021 - isca-archive.org
… In both models, the sequence of the main building blocks is similar. The input feature map
is … The residual part of the ResNet-34 exhibits the same block sequence, with the difference …

Dual supervised learning for non-native speech recognition

K Radzikowski, R Nowak, L Wang, O Yoshie - … Journal on Audio, Speech …, 2019 - Springer
speech recognition model ($M_{STT}$), which can recognize phonemes for a given sound
sequence… Since there are not many popular benchmarks for ASR of either Japanese or Polish

Automatic speech recognition system for tonal languages: State-of-the-art survey

J Kaur, A Singh, V Kadyan - Archives of Computational Methods in …, 2021 - Springer
… In this paper a systematic survey on Automatic Speech Recognition (ASR) for tonal …
signal and classify it as the last learned pattern in a sequence. The performance as well as …

Natural language processing: Speaker, language, and gender identification with LSTM

MK Nammous, K Saeed - Advanced Computing and Systems for Security …, 2019 - Springer
… of speaker identification in text-independent continuous speech … , and Polish; for each
language, there are eight speakers. … LSTM can capture the long dependencies in a sequence by …

[PDF] Machine learning-based analysis of English lateral allophones

M Piotrowska, G Korvel, B Kostek… - … Journal of Applied …, 2019 - intapi.sciendo.com
… phenomena are particularly difficult for Polish speakers. … of the audio-visual speech recognition
system. Recently, more … on the basis of Pn sequences. The features are denoted as the …