Multi-Modal Deep Learning for Computer Vision and Its Application

B Li - 2023 - ora.ox.ac.uk
… image visual attributes with corresponding semantic text descriptions, allowing the network
to learn to understand semantic meaning of both text and image information, and then …

Communicating visualizations without visuals: Investigation of visualization alternative text for people with visual impairments

C Jung, S Mehta, A Kulkarni, Y Zhao… - … and computer graphics, 2021 - ieeexplore.ieee.org
… in their head while listening to alternative texts and wish to carry … recommendations to compose
an informative alternative text. … In Proceedings of the IEEE conference on computer vision

Making memes accessible

C Gleason, A Pavel, X Liu, P Carrington… - Proceedings of the 21st …, 2019 - dl.acm.org
text template or rendered in the audio template using text-to-speech. In our evaluation of …
Twitter users with vision impairments, we found that most users preferred alternative text memes …

FuseCap: Leveraging large language models for enriched fused image captions

N Rotstein, D Bensaïd, S Brody… - … of Computer Vision, 2024 - openaccess.thecvf.com
… in computer vision [7, 32, 34, 36, 49, 55]. In recent years, image captioning tasks [3, 44] have
gained significant research attention and interest due to the success of Vision Language (VL…

Automated scoring of figural tests of creativity with computer vision

S Acar, P Organisciak, D Dumas - The Journal of Creative …, 2023 - Wiley Online Library
… verbal tests such as Alternate Uses Test (e.g., Organisciak et al., 2023). Here, we evaluated
similar applications of computer vision models to scoring divergent thinking in figural tests. …

Reading text in the wild with convolutional neural networks

M Jaderberg, K Simonyan, A Vedaldi… - … journal of computer vision, 2016 - Springer
… The increase of powerful computer vision techniques and the overwhelming increase in
the volume of images produced over the last decade has seen a rapid development of text

The Paleographer's Eye ex machina: Using Computer Vision To Assist Humanists in Scribal Hand Identification

S Grieggs, CEM Henderson… - … of Computer Vision, 2024 - openaccess.thecvf.com
Computer vision offers solutions with spectacular performance on writer identification and
retrieval benchmarks, but these have not been widely adopted by the paleography community …

Attesting similarity: Supporting the organization and study of art image collections with computer vision

S Lang, B Ommer - Digital Scholarship in the Humanities, 2018 - academic.oup.com
… The collaboration between computer vision and art history has provided tools to access and
… of a collaboration and presents work by the Computer Vision group of Heidelberg University. …

CLIPPO: Image-and-language understanding from pixels only

M Tschannen, B Mustafa… - … on Computer Vision and …, 2023 - openaccess.thecvf.com
… ing language understanding exclusively from alt-texts is fundamentally limited. Therefore,
we augment image/alt-text contrastive pretraining with language-based contrastive training. …

Teaching computer vision: Bringing research benchmarks to the classroom

T Hassner, I Bayaz - ACM Transactions on Computing Education (TOCE …, 2015 - dl.acm.org
This article concerns the design of effective computer vision programming exercises and
presents a novel means of designing these assignments. We describe three recent case …