作者
Lixin Duan, Dong Xu, Ivor Wai-Hung Tsang, Jiebo Luo
发表日期
2011/12/27
期刊
IEEE Transactions on pattern analysis and machine intelligence
卷号
34
期号
9
页码范围
1667-1680
出版商
IEEE
简介
We propose a visual event recognition framework for consumer videos by leveraging a large amount of loosely labeled web videos (e.g., from YouTube). Observing that consumer videos generally contain large intraclass variations within the same type of events, we first propose a new method, called Aligned Space-Time Pyramid Matching (ASTPM), to measure the distance between any two video clips. Second, we propose a new transfer learning method, referred to as Adaptive Multiple Kernel Learning (A-MKL), in order to 1) fuse the information from multiple pyramid levels and features (i.e., space-time features and static SIFT features) and 2) cope with the considerable variation in feature distributions between videos from two domains (i.e., web video domain and consumer video domain). For each pyramid level and each type of local features, we first train a set of SVM classifiers based on the combined training …
引用总数
201020112012201320142015201620172018201920202021202220232024826444354606736473428161381
学术搜索中的文章
L Duan, D Xu, IWH Tsang, J Luo - IEEE Transactions on pattern analysis and machine …, 2011