A Comprehensive Review of Data‐Driven Co‐Speech Gesture Generation

S Nyatsanga, T Kucherenko, C Ahuja… - Computer Graphics …, 2023 - Wiley Online Library
Gestures that accompany speech are an essential part of natural and efficient embodied
human communication. The automatic generation of such co‐speech gestures is a long …

Gesturediffuclip: Gesture diffusion model with clip latents

T Ao, Z Zhang, L Liu - ACM Transactions on Graphics (TOG), 2023 - dl.acm.org
The automatic generation of stylized co-speech gestures has recently received increasing
attention. Previous systems typically allow style control via predefined text labels or example …

Listen, denoise, action! audio-driven motion synthesis with diffusion models

S Alexanderson, R Nagy, J Beskow… - ACM Transactions on …, 2023 - dl.acm.org
Diffusion models have experienced a surge of interest as highly expressive yet efficiently
trainable probabilistic models. We show that these models are an excellent fit for …

Diffusestylegesture: Stylized audio-driven co-speech gesture generation with diffusion models

S Yang, Z Wu, M Li, Z Zhang, L Hao, W Bao… - arXiv preprint arXiv …, 2023 - arxiv.org
The art of communication beyond speech there are gestures. The automatic co-speech
gesture generation draws much attention in computer animation. It is a challenging task due …

Qpgesture: Quantization-based and phase-guided motion matching for natural speech-driven gesture generation

S Yang, Z Wu, M Li, Z Zhang, L Hao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Speech-driven gesture generation is highly challenging due to the random jitters of human
motion. In addition, there is an inherent asynchronous relationship between human speech …

The GENEA Challenge 2023: A large-scale evaluation of gesture generation models in monadic and dyadic settings

T Kucherenko, R Nagy, Y Yoon, J Woo… - Proceedings of the 25th …, 2023 - dl.acm.org
This paper reports on the GENEA Challenge 2023, in which participating teams built speech-
driven gesture-generation systems using the same speech and motion dataset, followed by …

Emotional speech-driven 3d body animation via disentangled latent diffusion

K Chhatre, N Athanasiou, G Becherini… - Proceedings of the …, 2024 - openaccess.thecvf.com
Existing methods for synthesizing 3D human gestures from speech have shown promising
results but they do not explicitly model the impact of emotions on the generated gestures …

Gesturemaster: Graph-based speech-driven gesture generation

C Zhou, T Bian, K Chen - … of the 2022 International Conference on …, 2022 - dl.acm.org
This paper describes the GestureMaster entry to the GENEA (Generation and Evaluation of
Non-verbal Behaviour for Embodied Agents) Challenge 2022. Given speech audio and text …

The diffusestylegesture+ entry to the genea challenge 2023

S Yang, H Xue, Z Zhang, M Li, Z Wu, X Wu… - Proceedings of the 25th …, 2023 - dl.acm.org
In this paper, we introduce the DiffuseStyleGesture+, our solution for the Generation and
Evaluation of Non-verbal Behavior for Embodied Agents (GENEA) Challenge 2023, which …

The IVI Lab entry to the GENEA Challenge 2022–A Tacotron2 based method for co-speech gesture generation with locality-constraint attention mechanism

CJ Chang, S Zhang, M Kapadia - Proceedings of the 2022 International …, 2022 - dl.acm.org
This paper describes the IVI Lab entry to the GENEA Challenge 2022. We formulate the
gesture generation problem as a sequence-to-sequence conversion task with text, audio …