A comprehensive survey of transformers for computer vision

S Jamil, M Jalil Piran, OJ Kwon - Drones, 2023 - mdpi.com
As a special type of transformer, vision transformers (ViTs) can be used for various computer
vision (CV) applications. Convolutional neural networks (CNNs) have several potential …

An overview of recent work in media forensics: Methods and threats

K Bhagtani, AKS Yadav, ER Bartusiak, Z Xiang… - arXiv preprint arXiv …, 2022 - arxiv.org
In this paper, we review recent work in media forensics for digital images, video, audio
(specifically speech), and documents. For each data modality, we discuss synthesis and …

Emergency events detection based on integration of federated learning and active learning

K Alfalqi, M Bellaiche - International Journal of Information Technology, 2023 - Springer
Social media networks now make it easy to access, in real-time, massive amounts of
information from all over the world. They are often the primary source of information for …

Deep learning for precipitation nowcasting: A survey from the perspective of time series forecasting

S An, TJ Oh, E Sohn, D Kim - Expert Systems with Applications, 2025 - Elsevier
Deep learning-based time series forecasting has dominated the short-term precipitation
forecasting field with the help of its ability to estimate motion flow in high-resolution datasets …

[PDF][PDF] An overview on the generation and detection of synthetic and manipulated satellite images

L Abady, ED Cannas, P Bestagini… - … on Signal and …, 2022 - nowpublishers.com
Due to the reduction of technological costs and the increase of satellite launches, satellite
images are becoming more popular and easier to obtain. Besides serving benevolent …

An overview of recent work in multimedia forensics

K Bhagtani, AKS Yadav, ER Bartusiak… - 2022 IEEE 5th …, 2022 - computer.org
An Overview of Recent Work in Multimedia Forensics Toggle navigation IEEE Computer
Society Digital Library Jobs Tech News Resource Center Press Room Advertising About Us …

CDS-Net: Cooperative dual-stream network for image manipulation detection

H Wang, J Deng, X Lin, W Tang, S Wang - Pattern Recognition Letters, 2023 - Elsevier
To accurately locate manipulated regions, many existing approaches employ a dual-stream
framework to extract a wide range of manipulation clues, including local noise, edge …

Comparative analysis of vision transformer models for facial emotion recognition using augmented balanced datasets

S Bobojanov, BM Kim, M Arabboev, S Begmatov - Applied Sciences, 2023 - mdpi.com
Facial emotion recognition (FER) has a huge importance in the field of human–machine
interface. Given the intricacies of human facial expressions and the inherent variations in …

LLM-Enhanced multimodal detection of fake news

J Wang, Z Zhu, C Liu, R Li, X Wu - PloS one, 2024 - journals.plos.org
Fake news detection is growing in importance as a key topic in the information age.
However, most current methods rely on pre-trained small language models (SLMs), which …

Learning from synthetic InSAR with vision transformers: The case of volcanic unrest detection

NI Bountos, D Michail… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
The detection of early signs of volcanic unrest preceding an eruption in the form of ground
deformation in interferometric synthetic aperture radar (InSAR) data is critical for assessing …