Agentformer: Agent-aware transformers for socio-temporal multi-agent forecasting

Y Yuan, X Weng, Y Ou… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Predicting accurate future trajectories of multiple agents is essential for autonomous systems
but is challenging due to the complex interaction between agents and the uncertainty in …

Precog: Prediction conditioned on goals in visual multi-agent settings

N Rhinehart, R McAllister, K Kitani… - Proceedings of the …, 2019 - openaccess.thecvf.com
For autonomous vehicles (AVs) to behave appropriately on roads populated by human-
driven vehicles, they must be able to reason about the uncertain intentions and decisions of …

Dlow: Diversifying latent flows for diverse human motion prediction

Y Yuan, K Kitani - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
Deep generative models are often used for human motion prediction as they are able to
model multi-modal data distributions and characterize diverse human behavior. While much …

Ego-topo: Environment affordances from egocentric video

T Nagarajan, Y Li, C Feichtenhofer… - Proceedings of the …, 2020 - openaccess.thecvf.com
First-person video naturally brings the use of a physical environment to the forefront, since it
shows the camera wearer interacting fluidly in a space based on his intentions. However …

Multi-label affordance mapping from egocentric vision

L Mur-Labadia, JJ Guerrero… - Proceedings of the …, 2023 - openaccess.thecvf.com
Accurate affordance detection and segmentation with pixel precision is an important piece in
many complex systems based on interactions, such as robots and assitive devices. We …

Epic fields: Marrying 3d geometry and video understanding

V Tschernezki, A Darkhalil, Z Zhu… - Advances in …, 2024 - proceedings.neurips.cc
Neural rendering is fuelling a unification of learning, 3D geometry and video understanding
that has been waiting for more than two decades. Progress, however, is still hampered by a …

Predicting the future from first person (egocentric) vision: A survey

I Rodin, A Furnari, D Mavroeidis… - Computer Vision and …, 2021 - Elsevier
Egocentric videos can bring a lot of information about how humans perceive the world and
interact with the environment, which can be beneficial for the analysis of human behaviour …

HYPER: Learned hybrid trajectory prediction via factored inference and adaptive sampling

X Huang, G Rosman, I Gilitschenski… - … on Robotics and …, 2022 - ieeexplore.ieee.org
Modeling multi-modal high-level intent is important for ensuring diversity in trajectory
prediction. Existing approaches explore the discrete nature of human intent before …

Analyzing the variety loss in the context of probabilistic trajectory prediction

LA Thiede, PP Brahma - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Trajectory or behavior prediction of traffic agents is an important component of autonomous
driving and robot planning in general. It can be framed as a probabilistic future sequence …

EgoEnv: Human-centric environment representations from egocentric video

T Nagarajan, SK Ramakrishnan… - Advances in …, 2023 - proceedings.neurips.cc
First-person video highlights a camera-wearer's activities in the context of their persistent
environment. However, current video understanding approaches reason over visual features …