[HTML][HTML] Speechformer-ctc: Sequential modeling of depression detection with speech temporal classification

J Wang, V Ravi, J Flint, A Alwan - Speech communication, 2024 - Elsevier
Speech-based automatic depression detection systems have been extensively explored
over the past few years. Typically, each speaker is assigned a single label (Depressive or …

Diagnostic accuracy of deep learning using speech samples in depression: a systematic review and meta-analysis

L Liu, L Liu, HA Wafa, F Tydeman… - Journal of the …, 2024 - academic.oup.com
Objective This study aims to conduct a systematic review and meta-analysis of the
diagnostic accuracy of deep learning (DL) using speech samples in depression. Materials …

Enhancing multimodal depression detection with intra-and inter-sample contrastive learning

M Li, Y Wei, Y Zhu, S Wei, B Wu - Information Sciences, 2024 - Elsevier
Multimodal depression detection (MDD) has garnered significant interest in recent years.
Current methods typically integrate multimodal information within samples to distinguish …

Self-supervised embeddings for detecting individual symptoms of depression

SH Dumpala, K Dikaios, A Nunes, F Rudzicz… - arXiv preprint arXiv …, 2024 - arxiv.org
Depression, a prevalent mental health disorder impacting millions globally, demands
reliable assessment systems. Unlike previous studies that focus solely on either detecting …

On the effects of obfuscating speaker attributes in privacy-aware depression detection

N Aloshban, A Esposito, A Vinciarelli, T Guha - Pattern Recognition Letters, 2024 - Elsevier
Detection of depressive symptoms from spoken content has emerged as an efficient Artificial
Intelligence (AI) tool for diagnosing this serious mental health condition. Since speech is a …

Scenario of Use Scheme: Threat Model Specification for Speaker Privacy Protection in the Medical Domain

MU Rahman, M Larson, L Bosch… - arXiv preprint arXiv …, 2024 - arxiv.org
Speech recordings are being more frequently used to detect and monitor disease, leading to
privacy concerns. Beyond cryptography, protection of speech can be addressed by …

Privacy-oriented manipulation of speaker representations

F Teixeira, A Abad, B Raj, I Trancoso - IEEE Access, 2024 - ieeexplore.ieee.org
Speaker embeddings are ubiquitous, with applications ranging from speaker recognition
and diarization to speech synthesis and voice anonymization. The amount of information …

[HTML][HTML] A Privacy-Preserving Unsupervised Speaker Disentanglement Method for Depression Detection from Speech

V Ravi, J Wang, J Flint, A Alwan - CEUR workshop proceedings, 2024 - ncbi.nlm.nih.gov
The proposed method focuses on speaker disentanglement in the context of depression
detection from speech signals. Previous approaches require patient/speaker labels …

Bidirectional Multimodal Block-Recurrent Transformers for Depression Detection

X Jia, X Zhao, B Tang, R Jiang - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Depression, as a prevalent and severe psychological disorder, has become a burden to
individuals, families and societies all over the world. Recently, some deep learning methods …

Accuracy and Privacy in Speech-Based Modeling of Major Depression: Innovative Approaches Through Data Augmentation, and Speaker Identity Disentanglement

V Ravi - 2024 - search.proquest.com
Abstract Major Depressive Disorder (MDD) is a prevalent mental illness that affects a
significant portion of the global population. Despite its severity, traditional diagnostic …