作者
Patrik Jonell, Taras Kucherenko, Gustav Eje Henter, Jonas Beskow
发表日期
2020/10/20
期刊
Proceedings of the 20th International Conference on Intelligent Virtual Agents
简介
To enable more natural face-to-face interactions, conversational agents need to adapt their behavior to their interlocutors. One key aspect of this is generation of appropriate non-verbal behavior for the agent, for example facial gestures, here defined as facial expressions and head movements. Most existing gesture-generating systems do not utilize multi-modal cues from the interlocutor when synthesizing non-verbal behavior. Those that do, typically use deterministic methods that risk producing repetitive and non-vivid motions. In this paper, we introduce a probabilistic method to synthesize interlocutor-aware facial gestures - represented by highly expressive FLAME parameters - in dyadic conversations. Our contributions are: a) a method for feature extraction from multi-party video and speech recordings, resulting in a representation that allows for independent control and manipulation of expression and speech …
引用总数
学术搜索中的文章
P Jonell, T Kucherenko, GE Henter, J Beskow - Proceedings of the 20th ACM International Conference …, 2020
P Jonell, T Kucherenko, GE Henter, J Beskow - Proceedings of the 20th ACM International Conference …, 2020