Semantic guidance network for video captioning

L Guo, H Zhao, ZW Chen, ZY Han - Scientific Reports, 2023 - nature.com
Abstract video captioning is a more challenging task that aims to generate abundant natural
language descriptions, and it has become a promising direction for artificial intelligence …

Cascaded frameworks in underwater optical image restoration

B Li, Z Chen, L Lu, P Qi, L Zhang, Q Ma, H Hu, J Zhai… - Information …, 2025 - Elsevier
Optical imaging and vision technology have become crucial research topics in the field of
underwater and ocean scenes. These technologies play a vital role in advancing …

[HTML][HTML] A deep learning model for generating fundus autofluorescence images from color fundus photography

F Song, W Zhang, Y Zheng, D Shi, M He - Advances in ophthalmology …, 2023 - Elsevier
Abstract Background Fundus Autofluorescence (FAF) is a valuable imaging technique used
to assess metabolic alterations in the retinal pigment epithelium (RPE) associated with …

Saliency-Driven Hand Gesture Recognition Incorporating Histogram of Oriented Gradients (HOG) and Deep Learning

F Jafari, A Basu - Sensors, 2023 - mdpi.com
Hand gesture recognition is a vital means of communication to convey information between
humans and machines. We propose a novel model for hand gesture recognition based on …

Multiple forgery detection in digital video based on inconsistency in video quality assessment attributes

HD Panchal, HB Shah - Multimedia Systems, 2023 - Springer
With the enormous development of video capture, sharing, and editing tools, the authenticity
and correctness of videos are under threat. Videos captured by a CCTV or any surveillance …

Deep learning-based forgery identification and localization in videos

R Gowda, D Pawar - Signal, Image and Video Processing, 2023 - Springer
The video forensics capabilities are constantly improving in terms of evidence accumulating,
analysis, processing, and storage. Video forensic analysis involves scientific investigation …

[HTML][HTML] Entropy feature and peak-means clustering based slowly moving object detection in head and shoulder video sequences

PK Sahoo, P Kanungo, S Mishra… - Journal of King Saud …, 2022 - Elsevier
With the increase in demand for video conferencing and IOT applications, efficient video
coding standards are necessary. The performance of MPEG-4 coding scheme depends on …

Joint target geometry and polarization properties for polarization image fusion

J Duan, J Liu, Y Hao, G Chen, Y Zheng, L Jia - Optics and Lasers in …, 2024 - Elsevier
Traditional polarization image fusion focuses on mining information from the source image
and realizes polarization image fusion by finding the feature balance point between two …

A video codec based on background extraction and moving object detection

S Hadi, A Shahbahrami, H Azgomi - Multimedia Tools and Applications, 2024 - Springer
Cameras are the primary data sources in video surveillance systems and produce massive
data every second. Video surveillance is an extremely beneficial functionality brought to us …

[HTML][HTML] Event triggered intelligent video recording system using MS-SSIM for smart home security

HA Khalaf, AS Tolba, MZ Rashid - AIN Shams engineering journal, 2018 - Elsevier
This paper presents an intelligent system for event-triggered video recording for smart home
applications. Video recording is triggered through a collaborative sensing strategy. PIR …