A review of evaluation practices of gesture generation in embodied conversational agents

P Wolfert, N Robinson… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Embodied conversational agents (ECAs) are often designed to produce nonverbal behavior
to complement or enhance their verbal communication. One such form of the nonverbal …

Generating diverse and natural 3d human motions from text

C Guo, S Zou, X Zuo, S Wang, W Ji… - Proceedings of the …, 2022 - openaccess.thecvf.com
Automated generation of 3D human motions from text is a challenging problem. The
generated motions are expected to be sufficiently diverse to explore the text-grounded …

Listen, denoise, action! audio-driven motion synthesis with diffusion models

S Alexanderson, R Nagy, J Beskow… - ACM Transactions on …, 2023 - dl.acm.org
Diffusion models have experienced a surge of interest as highly expressive yet efficiently
trainable probabilistic models. We show that these models are an excellent fit for …

Beat: A large-scale semantic and emotional multi-modal dataset for conversational gestures synthesis

H Liu, Z Zhu, N Iwamoto, Y Peng, Z Li, Y Zhou… - European conference on …, 2022 - Springer
Achieving realistic, vivid, and human-like synthesized conversational gestures conditioned
on multi-modal data is still an unsolved problem due to the lack of available datasets …

Synthesis of compositional animations from textual descriptions

A Ghosh, N Cheema, C Oguz… - Proceedings of the …, 2021 - openaccess.thecvf.com
How can we animate 3D-characters from a movie script or move robots by simply telling
them what we would like them to do?" How unstructured and complex can we make a …

Action2motion: Conditioned generation of 3d human motions

C Guo, X Zuo, S Wang, S Zou, Q Sun, A Deng… - Proceedings of the 28th …, 2020 - dl.acm.org
Action recognition is a relatively established task, where given an input sequence of human
motion, the goal is to predict its action category. This paper, on the other hand, considers a …

Language2pose: Natural language grounded pose forecasting

C Ahuja, LP Morency - 2019 International Conference on 3D …, 2019 - ieeexplore.ieee.org
Generating animations from natural language sentences finds its applications in aa number
of domains such as movie script visualization, virtual human animation and, robot motion …

Speech-based gesture generation for robots and embodied agents: A scoping review

Y Liu, G Mohammadi, Y Song, W Johal - Proceedings of the 9th …, 2021 - dl.acm.org
Humans use gestures as a means of non-verbal communication. Often accompanying
speech, these gestures have several purposes but in general, aim to convey an intended …

Analyzing input and output representations for speech-driven gesture generation

T Kucherenko, D Hasegawa, GE Henter… - Proceedings of the 19th …, 2019 - dl.acm.org
This paper presents a novel framework for automatic speech-driven gesture generation,
applicable to human-agent interaction including both virtual agents and robots. Specifically …

Learning speech-driven 3d conversational gestures from video

I Habibie, W Xu, D Mehta, L Liu, HP Seidel… - Proceedings of the 21st …, 2021 - dl.acm.org
We propose the first approach to synthesize the synchronous 3D conversational body and
hand gestures, as well as 3D face and head animations, of a virtual character from speech …