A survey on deep reinforcement learning for audio-based applications

S Latif, H Cuayáhuitl, F Pervez, F Shamshad… - Artificial Intelligence …, 2023 - Springer
Deep reinforcement learning (DRL) is poised to revolutionise the field of artificial intelligence
(AI) by endowing autonomous systems with high levels of understanding of the real world …

Machine learning for stuttering identification: Review, challenges and future directions

SA Sheikh, M Sahidullah, F Hirsch, S Ouni - Neurocomputing, 2022 - Elsevier
Stuttering is a speech disorder during which the flow of speech is interrupted by involuntary
pauses and repetition of sounds. Stuttering identification is an interesting interdisciplinary …

Viewmaker networks: Learning views for unsupervised representation learning

A Tamkin, M Wu, N Goodman - arXiv preprint arXiv:2010.07432, 2020 - arxiv.org
Many recent methods for unsupervised representation learning train models to be invariant
to different" views," or distorted versions of an input. However, designing these views …

Privacy-preserving voice analysis via disentangled representations

R Aloufi, H Haddadi, D Boyle - Proceedings of the 2020 ACM SIGSAC …, 2020 - dl.acm.org
Voice User Interfaces (VUIs) are increasingly popular and built into smartphones, home
assistants, and Internet of Things (IoT) devices. Despite offering an always-on convenient …

Reinforcement learning and bandits for speech and language processing: Tutorial, review and outlook

B Lin - Expert Systems with Applications, 2023 - Elsevier
In recent years, reinforcement learning and bandits have transformed a wide range of real-
world applications including healthcare, finance, recommendation systems, robotics, and …

Stutternet: Stuttering detection using time delay neural network

SA Sheikh, M Sahidullah, F Hirsch… - 2021 29th European …, 2021 - ieeexplore.ieee.org
This paper introduces StutterNet, a novel deep learning based stuttering detection capable
of detecting and identifying various types of disfluencies. Most of the existing work in this …

[HTML][HTML] Designing virtual reality–based conversational agents to train clinicians in verbal de-escalation skills: Exploratory usability study

N Moore, N Ahmadpour, M Brown, P Poronnik… - JMIR Serious …, 2022 - games.jmir.org
Background Violence and aggression are significant workplace challenges faced by
clinicians worldwide. Traditional methods of training consist of “on-the-job learning” and role …

Multitask learning from augmented auxiliary data for improving speech emotion recognition

S Latif, R Rana, S Khalifa, R Jurdak… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Despite the recent progress in speech emotion recognition (SER), state-of-the-art systems
lack generalisation across different conditions. A key underlying reason for poor …

Deep speaker recognition: Process, progress, and challenges

AQ Ohi, MF Mridha, MA Hamid, MM Monowar - IEEE Access, 2021 - ieeexplore.ieee.org
Speaker recognition is related to human biometrics dealing with the identification of
speakers from their speech. Speaker recognition is an active research area and being …

Latent representation learning for structural characterization of catalysts

PK Routh, Y Liu, N Marcella, B Kozinsky… - The Journal of …, 2021 - ACS Publications
Supervised machine learning-enabled mapping of the X-ray absorption near edge structure
(XANES) spectra to local structural descriptors offers new methods for understanding the …