A comprehensive survey and mathematical insights towards video summarization

P Narwal, N Duhan, KK Bhatia - Journal of Visual Communication and …, 2022 - Elsevier
Video summarization is a technique to reduce the original raw video into a short video
summary. Video summarization automates the task of acquiring key frames/segments from …

MSMO: Multimodal summarization with multimodal output

J Zhu, H Li, T Liu, Y Zhou, J Zhang… - Proceedings of the 2018 …, 2018 - aclanthology.org
Multimodal summarization has drawn much attention due to the rapid growth of multimedia
data. The output of the current multimodal summarization systems is usually represented in …

Multimodal summarization with guidance of multimodal reference

J Zhu, Y Zhou, J Zhang, H Li, C Zong, C Li - Proceedings of the AAAI …, 2020 - aaai.org
Multimodal summarization with multimodal output (MSMO) is to generate a multimodal
summary for a multimodal news report, which has been proven to effectively improve users' …

Multi-modal summarization for asynchronous collection of text, image, audio and video

H Li, J Zhu, C Ma, J Zhang, C Zong - Proceedings of the 2017 …, 2017 - aclanthology.org
The rapid increase of the multimedia data over the Internet necessitates multi-modal
summarization from collections of text, image, audio and video. In this work, we propose an …

A survey of recent work on video summarization: approaches and techniques

V Tiwari, C Bhatnagar - Multimedia Tools and Applications, 2021 - Springer
The volume of video data generated has seen an exponential growth over the years and
video summarization has emerged as a process that can facilitate efficient storage, quick …

Read, watch, listen, and summarize: Multi-modal summarization for asynchronous text, image, audio and video

H Li, J Zhu, C Ma, J Zhang… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
Automatic text summarization is a fundamental natural language processing (NLP)
application that aims to condense a source text into a shorter version. The rapid increase in …

A comprehensive review of image retargeting

X Fan, Z Zhang, L Sun, B Xiao, TS Durrani - Neurocomputing, 2024 - Elsevier
With the development of display technologies, image retargeting plays a significant role in
computer vision and pattern recognition communities currently. Image retargeting aims to …

Unsupervised sound localization via iterative contrastive learning

YB Lin, HY Tseng, HY Lee, YY Lin, MH Yang - Computer Vision and Image …, 2023 - Elsevier
Sound localization aims to find the source of the audio signal in the visual scene. However, it
is labor-intensive to annotate the correlations between the signals sampled from the audio …

A novel multi-modal neural network approach for dynamic and generic sports video summarization

P Narwal, N Duhan, KK Bhatia - Engineering Applications of Artificial …, 2023 - Elsevier
Video summarization is a video compression/compaction technique to create a shorter yet
informative version of original video. Video summarization has offered solutions to plenty of …

A salient dictionary learning framework for activity video summarization via key-frame extraction

I Mademlis, A Tefas, I Pitas - Information Sciences, 2018 - Elsevier
Recently, dictionary learning methods for unsupervised video summarization have
surpassed traditional video frame clustering approaches. This paper addresses static …