Recent progress in deep learning is essentially based on a “big data for small tasks” paradigm, under which massive amounts of data are used to train a classifier for a single …
B Xiaohan Nie, C Xiong… - Proceedings of the IEEE …, 2015 - openaccess.thecvf.com
Action recognition and pose estimation from video are closely related tasks for understanding human motion; most methods, however, learn separate models and combine …
A Stergiou, R Poppe - Computer Vision and Image Understanding, 2019 - Elsevier
Many videos depict people, and it is their interactions that inform us of their activities, relation to one another and the cultural and social setting. With advances in human action …
P Wei, Y Zhao, N Zheng, SC Zhu - IEEE Transactions on Pattern …, 2016 - ieeexplore.ieee.org
In this paper, we present a 4D human-object interaction (4DHOI) model for solving three vision tasks jointly: i) event segmentation from a video sequence, ii) event recognition and …
C Wang, C Xu, D Tao - IEEE Transactions on Artificial …, 2020 - ieeexplore.ieee.org
Image animation animates a still image of the object of interest using poses extracted from another video sequence. Through training on a large-scale video dataset, most existing …
SS Kumar, M John - 2016 IEEE International Carnahan …, 2016 - ieeexplore.ieee.org
This paper addresses an optical-flow-based approach for recognizing human actions and human-human interactions in video sequences. We propose a local …
EP Ijjina, CK Mohan - 2014 13th International Conference on …, 2014 - ieeexplore.ieee.org
In this paper, we propose a deep convolutional network architecture for recognizing human actions in videos using action bank features. Action bank features computed against a …
W Li, J Joo, H Qi, SC Zhu - IEEE Transactions on Multimedia, 2016 - ieeexplore.ieee.org
This paper presents a novel method for automatically detecting and tracking news topics from multimodal TV news data. We propose a multimodal topic and-or graph (MT-AOG) to …
Developing an expert text detection system for video indexing and retrieving is a challenging task due to low resolution, complex background, non-illumination and movement of text …