From a visual-perception perspective, modern graphical user interfaces (GUIs) comprise a complex graphics-rich two-dimensional visuospatial arrangement of text, images, and …
P Emami, Y Jiang, Z Guo, LA Leiva - Proceedings of the ACM on Human …, 2024 - dl.acm.org
Modeling visual saliency in graphical user interfaces (GUIs) makes it possible to understand how people perceive GUI designs and what elements attract their attention. One aspect that is …
CJ Do Nascimento, ME Orchard, C Devia - Expert Systems with …, 2024 - Elsevier
We present a study of an artificial neural architecture that predicts human ocular scanpaths during free viewing of different image types. This analysis is made by comparing …
D Ding, T Yao, R Luo, X Sun - IEEE Access, 2025 - ieeexplore.ieee.org
Visual Question Answering (VQA) in robotic surgery is rapidly becoming a pivotal technology in medical AI, addressing the complex challenge of interpreting multimodal …
T Nishiyasu, Y Sato - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
We introduce a new method called the Gaze Scanpath Transformer for predicting a search target category during a visual search task. Previous methods for estimating visual search …
TT Pham, TP Nguyen, Y Ikebe, A Awasthi… - arXiv preprint arXiv …, 2024 - arxiv.org
Medical eye-tracking data is an important information source for understanding how radiologists visually interpret medical images. This information not only improves the …
Y Wang, FL Zhang, NA Dodgson - Proceedings of the 32nd ACM …, 2024 - dl.acm.org
Scanpath generation in 360° images aims to model the realistic trajectories of gaze points that viewers follow when exploring panoramic environments. Existing methods for scanpath …
M Li, K Fan, K Ma - arXiv preprint arXiv:2305.02536, 2023 - arxiv.org
Predicting human scanpaths when exploring panoramic videos is a challenging task due to the spherical geometry and the multimodality of the input, and the inherent uncertainty and …