Consensus on validation of forensic voice comparison

GS Morrison, E Enzinger, V Hughes, M Jessen… - Science & Justice, 2021 - Elsevier
Since the 1960s, there have been calls for forensic voice comparison to be empirically
validated under casework conditions. Since around 2000, there have been an increasing …

An overview of Indian spoken language recognition from machine learning perspective

S Dey, M Sahidullah, G Saha - ACM Transactions on Asian and Low …, 2022 - dl.acm.org
Automatic spoken language identification (LID) is a very important research field in the era of
multilingual voice-command-based human-computer interaction. A front-end LID module …

A comprehensive study on bilingual and multilingual speech emotion recognition using a two-pass classification scheme

P Heracleous, A Yoneyama - PloS one, 2019 - journals.plos.org
Emotion recognition plays an important role in human-computer interaction. Previously and
currently, many studies focused on speech emotion recognition using several classifiers and …

VoxWatch: an open-set speaker recognition benchmark on VoxCeleb

R Peri, SO Sadjadi, D Garcia-Romero - arXiv preprint arXiv:2307.00169, 2023 - arxiv.org
Despite its broad practical applications such as in fraud prevention, open-set speaker
identification (OSI) has received less attention in the speaker recognition community …

Statistical models in forensic voice comparison

SM Geoffrey, E Ewald, D Ramos… - Handbook of forensic …, 2020 - taylorfrancis.com
The purpose of forensic voice comparison is to assist a court of law in deciding whether the
voices on two (or more) recordings were produced by the same speaker or by different …

Addressing the semi-open set dialect recognition problem under resource-efficient considerations

S Dey, G Saha - Speech Communication, 2023 - Elsevier
This work presents a resource-efficient solution for the spoken dialect recognition task under
semi-open set evaluation scenarios, where a closed set model is exposed to unknown class …

Performance evaluation and implementations of MFCC, SVM and MLP algorithms in the FPGA board

S Khamlich, F Khamlich, I Atouf… - International journal of …, 2021 - hrcak.srce.hr
Sažetak One of the most difficult speech recognition tasks is accurate recognition of human-
to-human communication. Advances in deep learning over the last few years have produced …

Capturing the quantity and location of adult wh-words in the preschool classroom using a sensing tool system

Y Seven, DW Irvin, PV Kothalkar, S Dutta… - Early Childhood …, 2024 - Elsevier
Observational approaches may limit researchers' ability to comprehensively capture
preschool classroom conversations, including the use of wh-words. In the current proof-of …

Speech-to-text recognition in University English as a Foreign Language Learning

KTC Chen - Education and Information Technologies, 2022 - Springer
This study explored the potential of adopting speech-to-text recognition (STR) technology for
English as a foreign language (EFL) oral training in class at the university level. An action …

Emotional speaker recognition based on machine and deep learning

TJ Sefara, TB Mokgonyane - 2020 2nd International …, 2020 - ieeexplore.ieee.org
Speaker recognition is a method which recognise a speaker from characteristics of a voice.
Speaker recognition technologies have been widely used in many domains. Most speaker …