T Sha, W Zhang, T Shen, Z Li, T Mei - ACM Computing Surveys, 2023 - dl.acm.org
Deep person generation has attracted extensive research attention due to its wide applications in virtual agents, video conferencing, online shopping, and art/movie …
Achieving realistic, vivid, and human-like synthesized conversational gestures conditioned on multi-modal data is still an unsolved problem due to the lack of available datasets …
S Yang, Z Wu, M Li, Z Zhang, L Hao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Speech-driven gesture generation is highly challenging due to the random jitters of human motion. In addition, there is an inherent asynchronous relationship between human speech …
B Wu, C Liu, CT Ishi, J Shi, H Ishiguro - International Journal of Social …, 2023 - Springer
Gestures, a form of body language, significantly influence how users perceive humanoid robots. Recent data-driven methods for co-speech gestures have successfully enhanced the …
Although substantial progress has been made in audio-driven talking video synthesis, there still remain two major difficulties: existing works 1) need a long sequence of training dataset …
A Vidal, C Busso - Speech Communication, 2023 - Elsevier
The synthesis of lip movements is an important problem for a socially interactive agent (SIA). It is important to generate lip movements that are synchronized with speech and have …
Virtual agent research has evolved into a substantial body of work, albeit one with a fragmented structure and overlapping, and at times inconsistent, definitions and results. The …
M Zhang, D Jin, C Gu, F Hong, Z Cai, J Huang… - arXiv preprint arXiv …, 2024 - arxiv.org
Human motion generation, a cornerstone technique in animation and video production, has widespread applications in various tasks like text-to-motion and music-to-dance. Previous …
Communication using manual (hand) gestures is considered a defining property of social robots, and their physical embodiment and presence; therefore, we see a need for a …