Vision transformer for fast and efficient scene text recognition

R Atienza - International conference on document analysis and …, 2021 - Springer
Scene text recognition (STR) enables computers to read text in natural scenes such as
object labels, road signs and instructions. STR helps machines perform informed decisions …

Vision Transformer for Fast and Efficient Scene Text Recognition

R Atienza - arXiv e-prints, 2021 - ui.adsabs.harvard.edu
Scene text recognition (STR) enables computers to read text in natural scenes such as
object labels, road signs and instructions. STR helps machines perform informed decisions …

Vision Transformer for Fast and Efficient Scene Text Recognition

R Atienza - arXiv preprint arXiv:2105.08582, 2021 - arxiv.org
Scene text recognition (STR) enables computers to read text in natural scenes such as
object labels, road signs and instructions. STR helps machines perform informed decisions …

Vision Transformer for Fast and Efficient Scene Text Recognition

R Atienza - 2021 - pythonawesome.com
ViTSTR is a simple single-stage model that uses a pre-trained Vision Transformer (ViT) to
perform Scene Text Recognition (ViTSTR). It has a comparable accuracy with state-of-the-art …

Vision Transformer for Fast and Efficient Scene Text Recognition

R Atienza - International Conference on Document Analysis and …, 2021 - dl.acm.org
Scene text recognition (STR) enables computers to read text in natural scenes such as
object labels, road signs and instructions. STR helps machines perform informed decisions …