Transformers in remote sensing: A survey

AA Aleissaee, A Kumar, RM Anwer, S Khan… - Remote Sensing, 2023 - mdpi.com
Deep learning-based algorithms have seen a massive popularity in different areas of remote
sensing image analysis over the past decade. Recently, transformer-based architectures …

Text recognition in the wild: A survey

X Chen, L Jin, Y Zhu, C Luo, T Wang - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
The history of text can be traced back over thousands of years. Rich and precise semantic
information carried by text is important in a wide range of vision-based application …

Few could be better than all: Feature sampling and grouping for scene text detection

J Tang, W Zhang, H Liu, MK Yang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Recently, transformer-based methods have achieved promising progresses in object
detection, as they can eliminate the post-processes like NMS and enrich the deep …

Revisiting scene text recognition: A data perspective

Q Jiang, J Wang, D Peng, C Liu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
This paper aims to re-assess scene text recognition (STR) from a data-oriented perspective.
We begin by revisiting the six commonly used benchmarks in STR and observe a trend of …

Expanding performance boundaries of open-source multimodal models with model, data, and test-time scaling

Z Chen, W Wang, Y Cao, Y Liu, Z Gao, E Cui… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce InternVL 2.5, an advanced multimodal large language model (MLLM) series
that builds upon InternVL 2.0, maintaining its core model architecture while introducing …

MOST: A multi-oriented scene text detector with localization refinement

M He, M Liao, Z Yang, H Zhong… - Proceedings of the …, 2021 - openaccess.thecvf.com
Over the past few years, the field of scene text detection has progressed rapidly that modern
text detectors are able to hunt text in various challenging scenarios. However, they might still …

Seglink++: Detecting dense and arbitrary-shaped scene text by instance-aware component grouping

J Tang, Z Yang, Y Wang, Q Zheng, Y Xu, X Bai - Pattern recognition, 2019 - Elsevier
State-of-the-art methods have achieved impressive performances on multi-oriented text
detection. Yet, they usually have difficulty in handling curved and dense texts, which are …

Total-text: toward orientation robustness in scene text detection

CK Ch'ng, CS Chan, CL Liu - International Journal on Document Analysis …, 2020 - Springer
At present, text orientation is not diverse enough in the existing scene text datasets.
Specifically, curve-orientated text is largely out-numbered by horizontal and multi-oriented …

Benchmarking chinese text recognition: Datasets, baselines, and an empirical study

H Yu, J Chen, B Li, J Ma, M Guan, X Xu, X Wang… - arXiv preprint arXiv …, 2021 - arxiv.org
The flourishing blossom of deep learning has witnessed the rapid development of text
recognition in recent years. However, the existing text recognition methods are mainly …

Dtrocr: Decoder-only transformer for optical character recognition

M Fujitake - Proceedings of the IEEE/CVF Winter …, 2024 - openaccess.thecvf.com
Typical text recognition methods rely on an encoder-decoder structure, in which the encoder
extracts features from an image, and the decoder produces recognized text from these …