Making images real again: A comprehensive survey on deep image composition

L Niu, W Cong, L Liu, Y Hong, B Zhang, J Liang… - arXiv preprint arXiv …, 2021 - arxiv.org
As a common image editing operation, image composition aims to combine the foreground
from one image and another background image, resulting in a composite image. However …

Visual affordance and function understanding: A survey

M Hassanin, S Khan, M Tahtali - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
Nowadays, robots are dominating the manufacturing, entertainment, and healthcare
industries. Robot vision aims to equip robots with the capabilities to discover information …

Long-term human motion prediction with scene context

Z Cao, H Gao, K Mangalam, QZ Cai, M Vo… - Computer Vision–ECCV …, 2020 - Springer
Human movement is goal-directed and influenced by the spatial layout of the objects in the
scene. To plan future human motion, it is crucial to perceive the environment–imagine how …

Stochastic scene-aware motion prediction

M Hassan, D Ceylan, R Villegas… - Proceedings of the …, 2021 - openaccess.thecvf.com
A long-standing goal in computer vision is to capture, model, and realistically synthesize
human behavior. Specifically, by learning from data, our goal is to enable virtual humans to …

Diffusion-guided reconstruction of everyday hand-object interaction clips

Y Ye, P Hebbar, A Gupta… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We tackle the task of reconstructing hand-object interactions from short video clips. Given an
input video, our approach casts 3D inference as a per-video optimization and recovers a …

Affordpose: A large-scale dataset of hand-object interactions with affordance-driven hand pose

J Jian, X Liu, M Li, R Hu, J Liu - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
How human interact with objects depends on the functional roles of the target objects, which
introduces the problem of affordance-aware hand-object interaction. It requires a large …

DECO: Dense estimation of 3D human-scene contact in the wild

S Tripathi, A Chatterjee, JC Passy… - Proceedings of the …, 2023 - openaccess.thecvf.com
Understanding how humans use physical contact to interact with the world is key to enabling
human-centric artificial intelligence. While inferring 3D contact is crucial for modeling …

Populating 3D scenes by learning human-scene interaction

M Hassan, P Ghosh, J Tesch… - Proceedings of the …, 2021 - openaccess.thecvf.com
Humans live within a 3D space and constantly interact with it to perform tasks. Such
interactions involve physical contact between surfaces that is semantically meaningful. Our …

Human poseitioning system (hps): 3d human pose estimation and self-localization in large scenes from body-mounted sensors

V Guzov, A Mir, T Sattler… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Abstract We introduce (HPS) Human POSEitioning System, a method to recover the full 3D
pose of a human registered with a 3D scan of the surrounding environment using wearable …

Putting people in their place: Affordance-aware human insertion into scenes

S Kulal, T Brooks, A Aiken, J Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com
We study the problem of inferring scene affordances by presenting a method for realistically
inserting people into scenes. Given a scene image with a marked region and an image of a …