Authors
Yen-Ting Liu, Yu-Jhe Li, Fu-En Yang, Shang-Fu Chen, Yu-Chiang Frank Wang
Publication date
2019/9/22
Conference
2019 IEEE International Conference on Image Processing (ICIP)
Pages
3377-3381
Publisher
IEEE
Description
Video summarization remains a challenging task. Given the abundance of video data on the Internet, this task draws significant attention in the vision community and benefits a wide range of applications, e.g., video retrieval and search. To effectively perform video summarization by deriving the keyframes that represent the given input video, we propose a novel framework named Hierarchical Multi-Attention Network (H-MAN), which comprises a shot-level reconstruction model and a multi-head attention model. Our attention model has a two-stage hierarchical structure that produces various attention maps, and we are among the first to utilize a multi-attention mechanism in the video summarization task, which brings improved performance. The quantitative and qualitative results demonstrate the effectiveness of our model, which performs favorably against state-of-the-art approaches.
Total citations
Citations per year, 2020 to 2024 (chart)
Scholar articles
YT Liu, YJ Li, FE Yang, SF Chen, YCF Wang - 2019 IEEE international conference on image …, 2019