iVectors for continuous emotion recognition

A Salekin, JW Eberle, JJ Glenn, BA Teachman… - Proceedings of the …, 2018 - dl.acm.org

Although social anxiety and depression are common, they are often underdiagnosed and
undertreated, in part due to difficulties identifying and accessing individuals in need of …

被引用次数：96 相关文章所有 8 个版本

Automatic speech emotion detection system using multi-domain acoustic feature selection and classification models

N Semwal, A Kumar… - 2017 IEEE International …, 2017 - ieeexplore.ieee.org

Emotions exhibited by a speaker can be detected by analyzing his/her speech, facial
expressions and gestures or by combining these properties. This paper concentrates on …

被引用次数：48 相关文章

Natural language processing: Speaker, language, and gender identification with LSTM

MK Nammous, K Saeed - Advanced Computing and Systems for Security …, 2019 - Springer

Long short-term memory (LSTM) is a state-of-the-art network used for different tasks related
to natural language processing (NLP), pattern recognition, and classification. It has been …

被引用次数：37 相关文章所有 4 个版本

Hierarchical sparse coding framework for speech emotion recognition

D Torres-Boza, MC Oveneke, F Wang, D Jiang… - Speech …, 2018 - Elsevier

Finding an appropriate feature representation for audio data is central to speech emotion
recognition. Most existing audio features rely on hand-crafted feature encoding techniques …

被引用次数：33 相关文章所有 5 个版本

[PDF] researchgate.net

Joint discrete and continuous emotion prediction using ensemble and end-to-end approaches

EA AlBadawy, Y Kim - Proceedings of the 20th ACM International …, 2018 - dl.acm.org

This paper presents a novel approach in continuous emotion prediction that characterizes
dimensional emotion labels jointly with continuous and discretized representations …

被引用次数：15 相关文章所有 2 个版本

[PDF] upc.edu

Unsupervised learning for expressive speech synthesis

I Jauk - 2017 - upcommons.upc.edu

Nowadays, especially with the upswing of neural networks, speech synthesis is almost
totally data driven. The goal of this thesis is to provide methods for automatic and …

被引用次数：11 相关文章所有 9 个版本

[PDF] researchgate.net

[PDF][PDF] Factor Analysis Based Speaker Normalisation for Continuous Emotion Prediction.

T Dang, V Sethu, E Ambikairajah - INTERSPEECH, 2016 - researchgate.net

Speaker variability has been shown to be a significant confounding factor in speech based
emotion classification systems and a number of speaker normalisation techniques have …

被引用次数：11 相关文章所有 4 个版本

[PDF] academia.edu

[PDF][PDF] Creating expressive synthetic voices by unsupervised clustering of audiobooks

I Jauk, A Bonafonte, P Lopez-Otero… - … Annual Conference of …, 2015 - academia.edu

In this work we design an approach for automatic feature selection and voice creation for
expressive synthesis. Our approach is guided by two main goals:(1) increasing the flexibility …

被引用次数：14 相关文章所有 7 个版本

[PDF] researchgate.net

An i-vector gplda system for speech based emotion recognition

KW Gamage, V Sethu, PN Le… - 2015 Asia-Pacific …, 2015 - ieeexplore.ieee.org

In this paper, we propose the use of a Gaussian Probabilistic Linear Discriminant Analysis
(GPLDA) back-end for utterance level emotion classification based on i-vectors representing …

被引用次数：10 相关文章所有 3 个版本

[PDF] eurasip.org

Acoustic feature prediction from semantic features for expressive speech using deep neural networks

I Jauk, A Bonafonte, S Pascual - 2016 24th European Signal …, 2016 - ieeexplore.ieee.org

The goal of the study is to predict acoustic features of expressive speech from semantic
vector space representations. Though a lot of successful work was invested in …

被引用次数：8 相关文章所有 5 个版本

高级搜索

QQ 群