Authors
Yun Zhai, Mubarak Shah
Publication date
2006/10/23
Book
Proceedings of the 14th ACM international conference on Multimedia
Pages
815-824
Description
The human visual system actively seeks interesting regions in images to reduce the search effort in tasks such as object detection and recognition. Similarly, prominent actions in video sequences are more likely to attract our attention first than their surrounding neighbors. In this paper, we propose a spatiotemporal video attention detection technique for detecting the attended regions that correspond to both interesting objects and actions in video sequences. Both spatial and temporal saliency maps are constructed and then fused dynamically to produce the overall spatiotemporal attention model. In the temporal attention model, motion contrast is computed based on the planar motions (homographies) between images, which are estimated by applying RANSAC to point correspondences in the scene. To compensate for the non-uniform spatial distribution of interest points, spanning areas of motion segments are …
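The homography step mentioned in the description can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes point correspondences are already available as two N x 2 NumPy arrays, uses a plain (unnormalized) direct linear transform for the fit, and the function names, the 2-pixel inlier threshold, and the iteration count are all hypothetical choices made for illustration.

```python
import numpy as np

def fit_homography(src, dst):
    """Direct linear transform: H (up to scale) with dst ~ H @ src, from >= 4 pairs."""
    rows = []
    for (x, y), (u, v) in zip(src, dst):
        rows.append([-x, -y, -1, 0, 0, 0, u * x, u * y, u])
        rows.append([0, 0, 0, -x, -y, -1, v * x, v * y, v])
    # The right singular vector of the smallest singular value spans the null space.
    _, _, vt = np.linalg.svd(np.asarray(rows, dtype=float))
    return vt[-1].reshape(3, 3)

def project(h, pts):
    """Apply homography h to an (N, 2) array of points."""
    pts_h = np.hstack([pts, np.ones((len(pts), 1))])
    mapped = pts_h @ h.T
    return mapped[:, :2] / mapped[:, 2:3]

def ransac_homography(src, dst, n_iters=500, thresh=2.0, seed=0):
    """Illustrative RANSAC loop; thresh is the reprojection error in pixels (assumed)."""
    rng = np.random.default_rng(seed)
    best_inliers = np.zeros(len(src), dtype=bool)
    for _ in range(n_iters):
        # A homography has 8 degrees of freedom, so 4 correspondences form a minimal sample.
        idx = rng.choice(len(src), size=4, replace=False)
        h = fit_homography(src[idx], dst[idx])
        with np.errstate(all="ignore"):  # degenerate samples may divide by zero
            err = np.linalg.norm(project(h, src) - dst, axis=1)
        inliers = err < thresh
        if inliers.sum() > best_inliers.sum():
            best_inliers = inliers
    # Refit on the whole consensus set and fix the scale so that H[2, 2] == 1.
    h = fit_homography(src[best_inliers], dst[best_inliers])
    return h / h[2, 2], best_inliers
```

Points flagged as outliers by such a fit move inconsistently with the estimated planar motion. How the paper turns the estimated homographies into motion contrast is not specified in the truncated description above, so that step is left out of the sketch.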
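Total citations
Citations per year: 2007: 9, 2008: 16, 2009: 15, 2010: 19, 2011: 31, 2012: 56, 2013: 119, 2014: 129, 2015: 141, 2016: 132, 2017: 113, 2018: 104, 2019: 69, 2020: 72, 2021: 68, 2022: 60, 2023: 66, 2024: 27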
Scholar articles
Y Zhai, M Shah - Proceedings of the 14th ACM international conference …, 2006