ScanDMM: A deep Markov model of scanpath prediction for 360° images

X Sui, Y Fang, H Zhu, S Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Scanpath prediction for 360° images aims to produce dynamic gaze behaviors based on
the human visual perception mechanism. Most existing scanpath prediction methods for …

EyeFormer: predicting personalized scanpaths with transformer-guided reinforcement learning

Y Jiang, Z Guo, H Rezazadegan Tavakoli… - Proceedings of the 37th …, 2024 - dl.acm.org
From a visual-perception perspective, modern graphical user interfaces (GUIs) comprise a
complex graphics-rich two-dimensional visuospatial arrangement of text, images, and …

Impact of Design Decisions in Scanpath Modeling

P Emami, Y Jiang, Z Guo, LA Leiva - Proceedings of the ACM on Human …, 2024 - dl.acm.org
Modeling visual saliency in graphical user interfaces (GUIs) makes it possible to understand how
people perceive GUI designs and what elements attract their attention. One aspect that is …

Beyond Average: Individualized Visual Scanpath Prediction

X Chen, M Jiang, Q Zhao - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Understanding how attention varies across individuals has significant scientific and societal
impacts. However, existing visual scanpath models treat attention uniformly, neglecting …

Exploring the benefits of images with frequency visual content in predicting human ocular scanpaths using Artificial Neural Networks

CJ Do Nascimento, ME Orchard, C Devia - Expert Systems with …, 2024 - Elsevier
We present a study of an artificial neural architecture that predicts human ocular scanpaths
while they are free-viewing different image types. This analysis is made by comparing …

Visual Question Answering in Robotic Surgery: A Comprehensive Review

D Ding, T Yao, R Luo, X Sun - IEEE Access, 2025 - ieeexplore.ieee.org
Visual Question Answering (VQA) in robotic surgery is rapidly becoming a pivotal
technology in medical AI, addressing the complex challenge of interpreting multimodal …

Gaze Scanpath Transformer: Predicting Visual Search Target by Spatiotemporal Semantic Modeling of Gaze Scanpath

T Nishiyasu, Y Sato - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
We introduce a new method called the Gaze Scanpath Transformer for predicting a search
target category during a visual search task. Previous methods for estimating visual search …

GazeSearch: Radiology Findings Search Benchmark

TT Pham, TP Nguyen, Y Ikebe, A Awasthi… - arXiv preprint arXiv …, 2024 - arxiv.org
Medical eye-tracking data is an important information source for understanding how
radiologists visually interpret medical images. This information not only improves the …

ScanTD: 360° scanpath prediction based on time-series diffusion

Y Wang, FL Zhang, NA Dodgson - Proceedings of the 32nd ACM …, 2024 - dl.acm.org
Scanpath generation in 360° images aims to model the realistic trajectories of gaze points
that viewers follow when exploring panoramic environments. Existing methods for scanpath …

Scanpath prediction in panoramic videos via expected code length minimization

M Li, K Fan, K Ma - arXiv preprint arXiv:2305.02536, 2023 - arxiv.org
Predicting human scanpaths when exploring panoramic videos is a challenging task due to
the spherical geometry and the multimodality of the input, and the inherent uncertainty and …