Foundations & trends in multimodal machine learning: Principles, challenges, and open questions

PP Liang, A Zadeh, LP Morency - ACM Computing Surveys, 2024 - dl.acm.org
Multimodal machine learning is a vibrant multi-disciplinary research field that aims to design
computer agents with intelligent capabilities such as understanding, reasoning, and learning …

Digitizing intangible cultural heritage embodied: State of the art

Y Hou, S Kenderdine, D Picca, M Egloff… - Journal on Computing …, 2022 - dl.acm.org
Intangible cultural heritage (ICH) as a field of research and site for digital efforts has grown
significantly since the UNESCO 2003 Convention for the Safeguarding of Intangible …

Motionbert: A unified perspective on learning human motion representations

W Zhu, X Ma, Z Liu, L Liu, W Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a unified perspective on tackling various human-centric video tasks by learning
human motion representations from large-scale and heterogeneous data resources …

Back to mlp: A simple baseline for human motion prediction

W Guo, Y Du, X Shen, V Lepetit… - Proceedings of the …, 2023 - openaccess.thecvf.com
This paper tackles the problem of human motion prediction, consisting in forecasting future
body poses from historically observed sequences. State-of-the-art approaches provide good …

Belfusion: Latent diffusion for behavior-driven human motion prediction

G Barquero, S Escalera… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Stochastic human motion prediction (HMP) has generally been tackled with generative
adversarial networks and variational autoencoders. Most prior works aim at predicting highly …

Flex: Full-body grasping without full-body grasps

P Tendulkar, D Surís… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Synthesizing 3D human avatars interacting realistically with a scene is an important problem
with applications in AR/VR, video games, and robotics. Towards this goal, we address the …

Unrealego: A new dataset for robust egocentric 3d human motion capture

H Akada, J Wang, S Shimada, M Takahashi… - … on Computer Vision, 2022 - Springer
We present UnrealEgo, ie, a new large-scale naturalistic dataset for egocentric 3D human
pose estimation. UnrealEgo is based on an advanced concept of eyeglasses equipped with …

Mocap Everyone Everywhere: Lightweight Motion Capture With Smartwatches and a Head-Mounted Camera

J Lee, H Joo - Proceedings of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
We present a lightweight and affordable motion capture method based on two smartwatches
and a head-mounted camera. In contrast to the existing approaches that use six or more …

[HTML][HTML] Conditional motion in-betweening

J Kim, T Byun, S Shin, J Won, S Choi - Pattern Recognition, 2022 - Elsevier
Motion in-betweening (MIB) is a process of generating intermediate skeletal movement
between the given start and target poses while preserving the naturalness of the motion …

Human movement datasets: An interdisciplinary scoping review

T Olugbade, M Bieńkiewicz, G Barbareschi… - ACM Computing …, 2022 - dl.acm.org
Movement dataset reviews exist but are limited in coverage, both in terms of size and
research discipline. While topic-specific reviews clearly have their merit, it is critical to have a …