查看文章

researchgate.net 中的 [PDF]

Fusing active orientation models and mid-term audio features for automatic depression estimation

作者

Christos Smailis, Nikolaos Sarafianos, Theodoros Giannakopoulos, Stavros Perantonis

发表日期

2016

期刊

Proceedings of the 9th ACM International Conference on PErvasive Technologies Related to Assistive Environments

出版商

ACM

简介

In this paper, we predict a human's depression level in the BDI-II scale, using facial and voice features. Active orientation models (AOM) and several voice features were extracted from the video and audio modalities. Long-term and mid-term features were computed and a fusion is performed in the feature space. Videos from the Depression Recognition Sub-Challenge of the 2014 Audio-Visual Emotion Challenge and Workshop (AVEC 2014) were used and support vector regression models were trained to predict the depression level. We demonstrated that the fusion of AOMs with audio features leads to better performance compared to individual modalities. The obtained regression results indicate the robustness of the proposed technique, under different settings, as well as an RMSE improvement compared to the AVEC 2014 video baseline.

引用总数

被引用次数：7

201720182019202020212022202320242 1 1 1 1 1

学术搜索中的文章

Fusing active orientation models and mid-term audio features for automatic depression estimation

C Smailis, N Sarafianos, T Giannakopoulos… - Proceedings of the 9th ACM International Conference …, 2016

被引用次数：7 相关文章所有 2 个版本