Textdiffuser: Diffusion models as text painters

J Chen, Y Huang, T Lv, L Cui… - Advances in Neural …, 2024 - proceedings.neurips.cc
Diffusion models have gained increasing attention for their impressive generation abilities
but currently struggle with rendering accurate and coherent text. To address this issue, we …

A text attention network for spatial deformation robust scene text image super-resolution

J Ma, Z Liang, L Zhang - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
Scene text image super-resolution aims to increase the resolution and readability of the text
in low-resolution images. Though significant improvement has been achieved by deep …

Learning generative structure prior for blind text image super-resolution

X Li, W Zuo, CC Loy - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Blind text image super-resolution (SR) is challenging as one needs to cope with diverse font
styles and unknown degradation. To address the problem, existing methods perform …

Reading and writing: Discriminative and generative modeling for self-supervised text recognition

M Yang, M Liao, P Lu, J Wang, S Zhu, H Luo… - Proceedings of the 30th …, 2022 - dl.acm.org
Existing text recognition methods usually need large-scale training data. Most of them rely
on synthetic training data due to the lack of annotated real images. However, there is a …

Chinese text recognition with a pre-trained clip-like model through image-ids aligning

H Yu, X Wang, B Li, X Xue - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Scene text recognition has been studied for decades due to its broad applications. However,
despite Chinese characters possessing different characteristics from Latin characters, such …

Text prior guided scene text image super-resolution

J Ma, S Guo, L Zhang - IEEE Transactions on Image …, 2023 - ieeexplore.ieee.org
Scene text image super-resolution (STISR) aims to improve the resolution and visual quality
of low-resolution (LR) scene text images, while simultaneously boost the performance of text …

Benchmarking chinese text recognition: Datasets, baselines, and an empirical study

H Yu, J Chen, B Li, J Ma, M Guan, X Xu, X Wang… - arXiv preprint arXiv …, 2021 - arxiv.org
The flourishing blossom of deep learning has witnessed the rapid development of text
recognition in recent years. However, the existing text recognition methods are mainly …

Text gestalt: Stroke-aware scene text image super-resolution

J Chen, H Yu, J Ma, B Li, X Xue - … of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org
In the last decade, the blossom of deep learning has witnessed the rapid development of
scene text recognition. However, the recognition of low-resolution scene text images …

Maskocr: Text recognition with masked encoder-decoder pretraining

P Lyu, C Zhang, S Liu, M Qiao, Y Xu, L Wu… - arXiv preprint arXiv …, 2022 - arxiv.org
Text images contain both visual and linguistic information. However, existing pre-training
techniques for text recognition mainly focus on either visual representation learning or …

A benchmark for chinese-english scene text image super-resolution

J Ma, Z Liang, W Xiang, X Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Scene Text Image Super-resolution (STISR) aims to recover high-resolution (HR)
scene text images with visually pleasant and readable text content from the given low …