On the issues of intra-speaker variability and realism in speech, speaker, and language recogniti...

GS Morrison, E Enzinger, V Hughes, M Jessen… - Science & Justice, 2021 - Elsevier

Since the 1960s, there have been calls for forensic voice comparison to be empirically
validated under casework conditions. Since around 2000, there have been an increasing …

被引用次数：72 相关文章所有 14 个版本

[PDF] arxiv.org

An overview of Indian spoken language recognition from machine learning perspective

S Dey, M Sahidullah, G Saha - ACM Transactions on Asian and Low …, 2022 - dl.acm.org

Automatic spoken language identification (LID) is a very important research field in the era of
multilingual voice-command-based human-computer interaction. A front-end LID module …

被引用次数：11 相关文章所有 9 个版本

[PDF] plos.org

A comprehensive study on bilingual and multilingual speech emotion recognition using a two-pass classification scheme

P Heracleous, A Yoneyama - PloS one, 2019 - journals.plos.org

Emotion recognition plays an important role in human-computer interaction. Previously and
currently, many studies focused on speech emotion recognition using several classifiers and …

被引用次数：35 相关文章所有 9 个版本

[PDF] arxiv.org

VoxWatch: an open-set speaker recognition benchmark on VoxCeleb

R Peri, SO Sadjadi, D Garcia-Romero - arXiv preprint arXiv:2307.00169, 2023 - arxiv.org

Despite its broad practical applications such as in fraud prevention, open-set speaker
identification (OSI) has received less attention in the speaker recognition community …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

Statistical models in forensic voice comparison

SM Geoffrey, E Ewald, D Ramos… - Handbook of forensic …, 2020 - taylorfrancis.com

The purpose of forensic voice comparison is to assist a court of law in deciding whether the
voices on two (or more) recordings were produced by the same speaker or by different …

被引用次数：34 相关文章所有 8 个版本

Addressing the semi-open set dialect recognition problem under resource-efficient considerations

S Dey, G Saha - Speech Communication, 2023 - Elsevier

This work presents a resource-efficient solution for the spoken dialect recognition task under
semi-open set evaluation scenarios, where a closed set model is exposed to unknown class …

被引用次数：1 相关文章所有 2 个版本

[PDF] srce.hr

Performance evaluation and implementations of MFCC, SVM and MLP algorithms in the FPGA board

S Khamlich, F Khamlich, I Atouf… - International journal of …, 2021 - hrcak.srce.hr

Sažetak One of the most difficult speech recognition tasks is accurate recognition of human-
to-human communication. Advances in deep learning over the last few years have produced …

被引用次数：14 相关文章所有 6 个版本

[PDF] nsf.gov

Capturing the quantity and location of adult wh-words in the preschool classroom using a sensing tool system

Y Seven, DW Irvin, PV Kothalkar, S Dutta… - Early Childhood …, 2024 - Elsevier

Observational approaches may limit researchers' ability to comprehensively capture
preschool classroom conversations, including the use of wh-words. In the current proof-of …

被引用次数：3 相关文章所有 2 个版本

Speech-to-text recognition in University English as a Foreign Language Learning

KTC Chen - Education and Information Technologies, 2022 - Springer

This study explored the potential of adopting speech-to-text recognition (STR) technology for
English as a foreign language (EFL) oral training in class at the university level. An action …

被引用次数：8 相关文章所有 6 个版本

[PDF] csir.co.za

Emotional speaker recognition based on machine and deep learning

TJ Sefara, TB Mokgonyane - 2020 2nd International …, 2020 - ieeexplore.ieee.org

Speaker recognition is a method which recognise a speaker from characteristics of a voice.
Speaker recognition technologies have been widely used in many domains. Most speaker …

被引用次数：16 相关文章所有 6 个版本

高级搜索

QQ 群