Can't make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models

H Mittal, N Agarwal, SY Lo… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We introduce PlausiVL, a large video-language model for anticipating action sequences that
are plausible in the real world. While significant efforts have been made towards anticipating …