A weakly supervised learning framework for detecting social anxiety and depression

A Salekin, JW Eberle, JJ Glenn, BA Teachman… - Proceedings of the …, 2018 - dl.acm.org
Although social anxiety and depression are common, they are often underdiagnosed and
undertreated, in part due to difficulties identifying and accessing individuals in need of …

Automatic speech emotion detection system using multi-domain acoustic feature selection and classification models

N Semwal, A Kumar… - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Emotions exhibited by a speaker can be detected by analyzing his/her speech, facial
expressions and gestures or by combining these properties. This paper concentrates on …

Natural language processing: Speaker, language, and gender identification with LSTM

MK Nammous, K Saeed - Advanced Computing and Systems for Security …, 2019 - Springer
Long short-term memory (LSTM) is a state-of-the-art network used for different tasks related
to natural language processing (NLP), pattern recognition, and classification. It has been …

Hierarchical sparse coding framework for speech emotion recognition

D Torres-Boza, MC Oveneke, F Wang, D Jiang… - Speech …, 2018 - Elsevier
Finding an appropriate feature representation for audio data is central to speech emotion
recognition. Most existing audio features rely on hand-crafted feature encoding techniques …

Joint discrete and continuous emotion prediction using ensemble and end-to-end approaches

EA AlBadawy, Y Kim - Proceedings of the 20th ACM International …, 2018 - dl.acm.org
This paper presents a novel approach in continuous emotion prediction that characterizes
dimensional emotion labels jointly with continuous and discretized representations …

Unsupervised learning for expressive speech synthesis

I Jauk - 2017 - upcommons.upc.edu
Nowadays, especially with the upswing of neural networks, speech synthesis is almost
totally data driven. The goal of this thesis is to provide methods for automatic and …

[PDF][PDF] Factor Analysis Based Speaker Normalisation for Continuous Emotion Prediction.

T Dang, V Sethu, E Ambikairajah - INTERSPEECH, 2016 - researchgate.net
Speaker variability has been shown to be a significant confounding factor in speech based
emotion classification systems and a number of speaker normalisation techniques have …

[PDF][PDF] Creating expressive synthetic voices by unsupervised clustering of audiobooks

I Jauk, A Bonafonte, P Lopez-Otero… - … Annual Conference of …, 2015 - academia.edu
In this work we design an approach for automatic feature selection and voice creation for
expressive synthesis. Our approach is guided by two main goals:(1) increasing the flexibility …

An i-vector gplda system for speech based emotion recognition

KW Gamage, V Sethu, PN Le… - 2015 Asia-Pacific …, 2015 - ieeexplore.ieee.org
In this paper, we propose the use of a Gaussian Probabilistic Linear Discriminant Analysis
(GPLDA) back-end for utterance level emotion classification based on i-vectors representing …

Acoustic feature prediction from semantic features for expressive speech using deep neural networks

I Jauk, A Bonafonte, S Pascual - 2016 24th European Signal …, 2016 - ieeexplore.ieee.org
The goal of the study is to predict acoustic features of expressive speech from semantic
vector space representations. Though a lot of successful work was invested in …