Attention, please! A survey of neural attention models in deep learning

A de Santana Correia, EL Colombini - Artificial Intelligence Review, 2022 - Springer
In humans, Attention is a core property of all perceptual and cognitive operations. Given our
limited ability to process competing sources, attention mechanisms select, modulate, and …

Video summarization using deep neural networks: A survey

E Apostolidis, E Adamantidou, AI Metsai… - Proceedings of the …, 2021 - ieeexplore.ieee.org
Video summarization technologies aim to create a concise and complete synopsis by
selecting the most informative parts of the video content. Several approaches have been …

A review on video summarization techniques

P Meena, H Kumar, SK Yadav - Engineering Applications of Artificial …, 2023 - Elsevier
The exponential growth of technology has resulted in a profusion of advanced imaging
devices and eases internet accessibility, leading to an increase in the creation and use of …

Learning a deep multi-scale feature ensemble and an edge-attention guidance for image fusion

J Liu, X Fan, J Jiang, R Liu, Z Luo - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Image fusion integrates a series of images acquired from different sensors, e.g., infrared and
visible, outputting an image with richer information than either one. Traditional and recent …

Real-time video emotion recognition based on reinforcement learning and domain knowledge

K Zhang, Y Li, J Wang, E Cambria… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Multimodal emotion recognition in conversational videos (ERC) has developed rapidly in recent
years. To fully extract the relative context from video clips, most studies build their models on …

Concealed attack for robust watermarking based on generative model and perceptual loss

Q Li, X Wang, B Ma, X Wang, C Wang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
While existing watermarking attack methods can disturb the correct extraction of watermark
information, the visual quality of watermarked images will be greatly damaged. Therefore, a …

Conversational AI: An overview of methodologies, applications & future scope

P Kulkarni, A Mahabaleshwarkar… - 2019 5th …, 2019 - ieeexplore.ieee.org
Conversational AI is a sub-domain of Artificial Intelligence that deals with speech-based or
text-based AI agents that have the capability to simulate and automate conversations and …

DSNet: A flexible detect-to-summarize network for video summarization

W Zhu, J Lu, J Li, J Zhou - IEEE Transactions on Image …, 2020 - ieeexplore.ieee.org
In this paper, we propose a Detect-to-Summarize network (DSNet) framework for supervised
video summarization. Our DSNet contains anchor-based and anchor-free counterparts. The …

Transparency by design: Closing the gap between performance and interpretability in visual reasoning

D Mascharka, P Tran, R Soklaski… - Proceedings of the …, 2018 - openaccess.thecvf.com
Visual question answering requires high-order reasoning about an image, which is a
fundamental capability needed by machine systems to follow complex directives. Recently …

Align and attend: Multimodal summarization with dual contrastive losses

B He, J Wang, J Qiu, T Bui… - Proceedings of the …, 2023 - openaccess.thecvf.com
The goal of multimodal summarization is to extract the most important information from
different modalities to form summaries. Unlike unimodal summarization, the multimodal …