作者
Yu-Ke Li, Pin Wang, Mang Ye, Ching-Yao Chan
发表日期
2021/10/17
图书
Proceedings of the 29th ACM International Conference on Multimedia
页码范围
451-459
简介
Multi-person action forecasting is an emerging task and a pivotal step towards video understanding. The major challenge lies in estimating a distribution characterizing the upcoming actions of all individuals in the scene. The state-of-the-art solutions attempt to solve this problem via a step-by-step prediction procedure. However, they are not adequate to address some particular limitations, such as the compounding errors, the innate uncertainty of the future and the spatio-temporal contexts. To handle the multi-person action forecasting challenges, we put forth a novel imitative learning framework upon the basis of inverse reinforcement learning. Specifically, we aim to learn a policy to model the aforementioned distribution up to a coming horizon through an objective that naturally solves the compounding errors. Such a policy is able to explore multiple plausible futures via extrapolating a series of latent variables and …
引用总数
学术搜索中的文章
YK Li, P Wang, M Ye, CY Chan - Proceedings of the 29th ACM International Conference …, 2021