Document layout analysis: a comprehensive survey

GM Binmakhashen, SA Mahmoud - ACM Computing Surveys (CSUR), 2019 - dl.acm.org
Document layout analysis (DLA) is a preprocessing step of document understanding
systems. It is responsible for detecting and annotating the physical structure of documents …

dhSegment: A generic deep-learning approach for document segmentation

SA Oliveira, B Seguin, F Kaplan - 2018 16th International …, 2018 - ieeexplore.ieee.org
In recent years there have been multiple successful attempts tackling document processing
problems separately by designing task specific hand-tuned strategies. We argue that the …

A two-stage method for text line detection in historical documents

T Grüning, G Leifert, T Strauß, J Michael… - International Journal on …, 2019 - Springer
This work presents a two-stage text line detection method for historical documents. Each
detected text line is represented by its baseline. In a first stage, a deep neural network called …

[HTML][HTML] OCR4all—An open-source tool providing a (semi-) automatic OCR workflow for historical printings

C Reul, D Christ, A Hartelt, N Balbach, M Wehner… - Applied Sciences, 2019 - mdpi.com
Optical Character Recognition (OCR) on historical printings is a challenging task mainly due
to the complexity of the layout and the highly variant typography. Nevertheless, in the last …

[HTML][HTML] A survey of historical document image datasets

K Nikolaidou, M Seuret, H Mokayed… - International Journal on …, 2022 - Springer
This paper presents a systematic literature review of image datasets for document image
analysis, focusing on historical documents, such as handwritten manuscripts and early …

[HTML][HTML] Deep learning for historical document analysis and recognition—a survey

F Lombardi, S Marinai - Journal of Imaging, 2020 - mdpi.com
Nowadays, deep learning methods are employed in a broad range of research fields. The
analysis and recognition of historical documents, as we survey in this work, is not an …

A comprehensive study of imagenet pre-training for historical document image analysis

L Studer, M Alberti, V Pondenkandath… - 2019 International …, 2019 - ieeexplore.ieee.org
Automatic analysis of scanned historical documents comprises a wide range of image
analysis tasks, which are often challenging for machine learning due to a lack of human …

cBAD: ICDAR2017 competition on baseline detection

M Diem, F Kleber, S Fiel, T Grüning… - 2017 14th IAPR …, 2017 - ieeexplore.ieee.org
The cBAD competition aims at benchmarking state-of-the-art baseline detection algorithms.
It is in line with previous competitions such as the ICDAR 2013 Handwriting Segmentation …

A layered approach to stereo reconstruction

S Baker, R Szeliski, P Anandan - Proceedings. 1998 IEEE …, 1998 - ieeexplore.ieee.org
We propose a framework for extracting structure from stereo which represents the scene as
a collection of approximately planar layers. Each layer consists of an explicit 3D plane …

docExtractor: An off-the-shelf historical document element extraction

T Monnier, M Aubry - 2020 17th International Conference on …, 2020 - ieeexplore.ieee.org
We present docExtractor, a generic approach for extracting visual elements such as text
lines or illustrations from historical documents without requiring any real data annotation. We …