A review of evaluation practices of gesture generation in embodied conversational agents

P Wolfert, N Robinson… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Embodied conversational agents (ECAs) are often designed to produce nonverbal behavior
to complement or enhance their verbal communication. One such form of the nonverbal …

Cited by 65 - Related articles - All 7 versions

Generating diverse and natural 3d human motions from text

C Guo, S Zou, X Zuo, S Wang, W Ji… - Proceedings of the …, 2022 - openaccess.thecvf.com

Automated generation of 3D human motions from text is a challenging problem. The
generated motions are expected to be sufficiently diverse to explore the text-grounded …

Cited by 315 - Related articles - All 6 versions

Listen, denoise, action! audio-driven motion synthesis with diffusion models

S Alexanderson, R Nagy, J Beskow… - ACM Transactions on …, 2023 - dl.acm.org

Diffusion models have experienced a surge of interest as highly expressive yet efficiently
trainable probabilistic models. We show that these models are an excellent fit for …

Cited by 97 - Related articles - All 4 versions

Beat: A large-scale semantic and emotional multi-modal dataset for conversational gestures synthesis

H Liu, Z Zhu, N Iwamoto, Y Peng, Z Li, Y Zhou… - European conference on …, 2022 - Springer

Achieving realistic, vivid, and human-like synthesized conversational gestures conditioned
on multi-modal data is still an unsolved problem due to the lack of available datasets …

Cited by 86 - Related articles - All 7 versions

Synthesis of compositional animations from textual descriptions

A Ghosh, N Cheema, C Oguz… - Proceedings of the …, 2021 - openaccess.thecvf.com

"How can we animate 3D-characters from a movie script or move robots by simply telling
them what we would like them to do?" "How unstructured and complex can we make a …

Cited by 138 - Related articles - All 11 versions

Action2motion: Conditioned generation of 3d human motions

C Guo, X Zuo, S Wang, S Zou, Q Sun, A Deng… - Proceedings of the 28th …, 2020 - dl.acm.org

Action recognition is a relatively established task, where given an input sequence of human
motion, the goal is to predict its action category. This paper, on the other hand, considers a …

Cited by 291 - Related articles - All 4 versions

Language2pose: Natural language grounded pose forecasting

C Ahuja, LP Morency - 2019 International Conference on 3D …, 2019 - ieeexplore.ieee.org

Generating animations from natural language sentences finds its applications in a number
of domains such as movie script visualization, virtual human animation, and robot motion …

Cited by 239 - Related articles - All 6 versions

Speech-based gesture generation for robots and embodied agents: A scoping review

Y Liu, G Mohammadi, Y Song, W Johal - Proceedings of the 9th …, 2021 - dl.acm.org

Humans use gestures as a means of non-verbal communication. Often accompanying
speech, these gestures have several purposes but in general, aim to convey an intended …

Cited by 22 - Related articles

Analyzing input and output representations for speech-driven gesture generation

T Kucherenko, D Hasegawa, GE Henter… - Proceedings of the 19th …, 2019 - dl.acm.org

This paper presents a novel framework for automatic speech-driven gesture generation,
applicable to human-agent interaction including both virtual agents and robots. Specifically …

Cited by 157 - Related articles - All 8 versions

Learning speech-driven 3d conversational gestures from video

I Habibie, W Xu, D Mehta, L Liu, HP Seidel… - Proceedings of the 21st …, 2021 - dl.acm.org

We propose the first approach to synthesize the synchronous 3D conversational body and
hand gestures, as well as 3D face and head animations, of a virtual character from speech …

Cited by 88 - Related articles - All 10 versions
