Deep representation learning in speech processing: Challenges, recent advances, and future trends

S Latif, H Cuayáhuitl, F Pervez, F Shamshad… - Artificial Intelligence …, 2023 - Springer

Deep reinforcement learning (DRL) is poised to revolutionise the field of artificial intelligence
(AI) by endowing autonomous systems with high levels of understanding of the real world …

被引用次数：70 相关文章所有 10 个版本

[PDF] sciencedirect.com

Machine learning for stuttering identification: Review, challenges and future directions

SA Sheikh, M Sahidullah, F Hirsch, S Ouni - Neurocomputing, 2022 - Elsevier

Stuttering is a speech disorder during which the flow of speech is interrupted by involuntary
pauses and repetition of sounds. Stuttering identification is an interesting interdisciplinary …

被引用次数：50 相关文章所有 11 个版本

[PDF] arxiv.org

Viewmaker networks: Learning views for unsupervised representation learning

A Tamkin, M Wu, N Goodman - arXiv preprint arXiv:2010.07432, 2020 - arxiv.org

Many recent methods for unsupervised representation learning train models to be invariant
to different" views," or distorted versions of an input. However, designing these views …

被引用次数：68 相关文章所有 5 个版本

[PDF] arxiv.org

Privacy-preserving voice analysis via disentangled representations

R Aloufi, H Haddadi, D Boyle - Proceedings of the 2020 ACM SIGSAC …, 2020 - dl.acm.org

Voice User Interfaces (VUIs) are increasingly popular and built into smartphones, home
assistants, and Internet of Things (IoT) devices. Despite offering an always-on convenient …

被引用次数：63 相关文章所有 4 个版本

[PDF] arxiv.org

Reinforcement learning and bandits for speech and language processing: Tutorial, review and outlook

B Lin - Expert Systems with Applications, 2023 - Elsevier

In recent years, reinforcement learning and bandits have transformed a wide range of real-
world applications including healthcare, finance, recommendation systems, robotics, and …

被引用次数：19 相关文章所有 7 个版本

[PDF] arxiv.org

Stutternet: Stuttering detection using time delay neural network

SA Sheikh, M Sahidullah, F Hirsch… - 2021 29th European …, 2021 - ieeexplore.ieee.org

This paper introduces StutterNet, a novel deep learning based stuttering detection capable
of detecting and identifying various types of disfluencies. Most of the existing work in this …

被引用次数：49 相关文章所有 13 个版本

[HTML] jmir.org

[HTML][HTML] Designing virtual reality–based conversational agents to train clinicians in verbal de-escalation skills: Exploratory usability study

N Moore, N Ahmadpour, M Brown, P Poronnik… - JMIR Serious …, 2022 - games.jmir.org

Background Violence and aggression are significant workplace challenges faced by
clinicians worldwide. Traditional methods of training consist of “on-the-job learning” and role …

被引用次数：22 相关文章所有 10 个版本

[PDF] arxiv.org

Multitask learning from augmented auxiliary data for improving speech emotion recognition

S Latif, R Rana, S Khalifa, R Jurdak… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Despite the recent progress in speech emotion recognition (SER), state-of-the-art systems
lack generalisation across different conditions. A key underlying reason for poor …

被引用次数：19 相关文章所有 8 个版本

[PDF] ieee.org

Deep speaker recognition: Process, progress, and challenges

AQ Ohi, MF Mridha, MA Hamid, MM Monowar - IEEE Access, 2021 - ieeexplore.ieee.org

Speaker recognition is related to human biometrics dealing with the identification of
speakers from their speech. Speaker recognition is an active research area and being …

被引用次数：35 相关文章所有 5 个版本

[PDF] osti.gov

Latent representation learning for structural characterization of catalysts

PK Routh, Y Liu, N Marcella, B Kozinsky… - The Journal of …, 2021 - ACS Publications

Supervised machine learning-enabled mapping of the X-ray absorption near edge structure
(XANES) spectra to local structural descriptors offers new methods for understanding the …

被引用次数：40 相关文章所有 6 个版本

高级搜索

QQ 群