作者
Christos Smailis, Nikolaos Sarafianos, Theodoros Giannakopoulos, Stavros Perantonis
发表日期
2016
期刊
Proceedings of the 9th ACM International Conference on PErvasive Technologies Related to Assistive Environments
出版商
ACM
简介
In this paper, we predict a human's depression level in the BDI-II scale, using facial and voice features. Active orientation models (AOM) and several voice features were extracted from the video and audio modalities. Long-term and mid-term features were computed and a fusion is performed in the feature space. Videos from the Depression Recognition Sub-Challenge of the 2014 Audio-Visual Emotion Challenge and Workshop (AVEC 2014) were used and support vector regression models were trained to predict the depression level. We demonstrated that the fusion of AOMs with audio features leads to better performance compared to individual modalities. The obtained regression results indicate the robustness of the proposed technique, under different settings, as well as an RMSE improvement compared to the AVEC 2014 video baseline.
引用总数
20172018201920202021202220232024211111
学术搜索中的文章
C Smailis, N Sarafianos, T Giannakopoulos… - Proceedings of the 9th ACM International Conference …, 2016