Vision transformers in image restoration: A survey

AM Ali, B Benjdira, A Koubaa, W El-Shafai, Z Khan… - Sensors, 2023 - mdpi.com
The Vision Transformer (ViT) architecture has been remarkably successful in image
restoration. For a while, Convolutional Neural Networks (CNN) predominated in most …

A review of document image enhancement based on document degradation problem

Y Zhou, S Zuo, Z Yang, J He, J Shi, R Zhang - Applied Sciences, 2023 - mdpi.com
Document image enhancement methods are often used to improve the accuracy and
efficiency of automated document analysis and recognition tasks such as character …

Docdiff: Document enhancement via residual diffusion models

Z Yang, B Liu, Y Xxiong, L Yi, G Wu, X Tang… - Proceedings of the 31st …, 2023 - dl.acm.org
Removing degradation from document images not only improves their visual quality and
readability, but also enhances the performance of numerous automated document analysis …

Deblurgan-cnn: effective image denoising and recognition for noisy handwritten characters

S Gonwirat, O Surinta - IEEE Access, 2022 - ieeexplore.ieee.org
Many problems can reduce handwritten character recognition performance, such as image
degradation, light conditions, low-resolution images, and even the quality of the capture …

From leaves to labels: Building modular machine learning networks for rapid herbarium specimen analysis with LeafMachine2

WN Weaver, SA Smith - Applications in Plant Sciences, 2023 - Wiley Online Library
Premise Quantitative plant traits play a crucial role in biological research. However,
traditional methods for measuring plant morphology are time consuming and have limited …

Ocr-idl: Ocr annotations for industry document library dataset

AF Biten, R Tito, L Gomez, E Valveny… - European Conference on …, 2022 - Springer
Pretraining has proven successful in Document Intelligence tasks where deluge of
documents are used to pretrain the models only later to be finetuned on downstream tasks …

High-fidelity document stain removal via a large-scale real-world dataset and a memory-augmented transformer

M Li, H Sun, Y Lei, X Zhang, Y Dong, Y Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
Document images are often degraded by various stains, significantly impacting their
readability and hindering downstream applications such as document digitization and …

Text-DIAE: a self-supervised degradation invariant autoencoder for text recognition and document enhancement

MA Souibgui, S Biswas, A Mafla, AF Biten… - proceedings of the …, 2023 - ojs.aaai.org
In this paper, we propose a Text-Degradation Invariant Auto Encoder (Text-DIAE), a self-
supervised model designed to tackle two tasks, text recognition (handwritten or scene-text) …

GDB: gated convolutions-based document binarization

Z Yang, B Liu, Y Xiong, G Wu - Pattern Recognition, 2024 - Elsevier
Document binarization is a crucial pre-processing step for various document analysis tasks.
However, existing methods fail to accurately capture stroke edges, primarily due to the …

UIE-Convformer: Underwater Image Enhancement Based on Convolution and Feature Fusion Transformer

B Wang, H Xu, G Jiang, M Yu, T Ren… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Due to the light scattering and absorption of impurities, the quality of underwater imaging is
poor, which seriously affects underwater exploration and research. To address the problem …