Ron Litman
Amazon AI
Verified email at amazon.com
Title
Cited by
Year
SCATTER: Selective context attentional scene text recognizer
R Litman, O Anschel, S Tsiper, R Litman, S Mazor, R Manmatha
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
Cited by: 164; Year: 2020
Sequence-to-sequence contrastive learning for text recognition
A Aberdam, R Litman, S Tsiper, O Anschel, R Slossberg, S Mazor, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 121; Year: 2021
LaTr: Layout-aware transformer for scene-text VQA
AF Biten, R Litman, Y Xie, S Appalaraju, R Manmatha
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 92; Year: 2022
Multimodal semi-supervised learning for text recognition
A Aberdam, R Ganz, S Mazor, R Litman
arXiv preprint arXiv:2205.03873, 2022
Cited by: 22; Year: 2022
Out-of-vocabulary challenge report
S Garcia-Bordils, A Mafla, AF Biten, O Nuriel, A Aberdam, S Mazor, ...
European Conference on Computer Vision, 359-375, 2022
Cited by: 17; Year: 2022
TextAdaIN: Paying attention to shortcut learning in text recognizers
O Nuriel, S Fogel, R Litman
European Conference on Computer Vision, 427-445, 2022
Cited by: 15*; Year: 2022
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
A Aberdam, D Bensaïd, A Golts, R Ganz, O Nuriel, R Tichauer, S Mazor, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Cited by: 11; Year: 2023
Towards Models that Can See and Read
R Ganz, O Nuriel, A Aberdam, Y Kittenplon, S Mazor, R Litman
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Cited by: 10; Year: 2023
On calibration of scene-text recognition models
R Slossberg, O Anschel, A Markovitz, R Litman, A Aberdam, S Tsiper, ...
European Conference on Computer Vision, 263-279, 2022
Cited by: 10; Year: 2022
Question aware vision transformer for multimodal reasoning
R Ganz, Y Kittenplon, A Aberdam, E Ben Avraham, O Nuriel, S Mazor, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 3; Year: 2024
GRAM: Global reasoning for multi-page VQA
T Blau, S Fogel, R Ronen, A Golts, R Ganz, E Ben Avraham, A Aberdam, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 1; Year: 2024
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding
O Abramovich, N Nayman, S Fogel, I Lavi, R Litman, S Tsiper, R Tichauer, ...
arXiv preprint arXiv:2407.12594, 2024
2024
M3T: A new benchmark dataset for multi-modal document-level machine translation
B Hsu, X Liu, H Li, Y Fujinuma, M Nadejde, X Niu, Y Kittenplon, R Litman, ...
arXiv preprint arXiv:2406.08255, 2024
2024
Residual context refinement network architecture for optical character recognition
R Litman, O Anschel, S Tsiper, R Litman, S Mazor, J Wu, R Manmatha
US Patent 11,308,354, 2022
2022
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition Supplementary Material
A Aberdam, D Bensaïd, A Golts, R Ganz, O Nuriel, R Tichauer, S Mazor, ...
Towards Models that Can See and Read Supplementary Material
R Ganz, O Nuriel, A Aberdam, Y Kittenplon, S Mazor, R Litman
LaTr: Layout-Aware Transformer for Scene-Text VQA Supplementary Material
AF Biten, R Litman, Y Xie, S Appalaraju, R Manmatha
Supplementary Material: Sequence-to-Sequence Contrastive Learning for Text Recognition
A Aberdam, R Litman, S Tsiper, O Anschel, R Slossberg, S Mazor, ...
SCATTER: Selective Context Attentional Scene Text Recognizer Supplementary Materials
R Litman, O Anschel, S Tsiper, R Litman, S Mazor, R Manmatha