Document layout analysis: a comprehensive survey

GM Binmakhashen, SA Mahmoud - ACM Computing Surveys (CSUR), 2019 - dl.acm.org
Document layout analysis (DLA) is a preprocessing step of document understanding
systems. It is responsible for detecting and annotating the physical structure of documents …

DocBank: A benchmark dataset for document layout analysis

M Li, Y Xu, L Cui, S Huang, F Wei, Z Li… - arXiv preprint arXiv …, 2020 - arxiv.org
Document layout analysis usually relies on computer vision models to understand
documents while ignoring textual information that is vital to capture. Meanwhile, high quality …

Feature selection in multimedia: the state-of-the-art review

PY Lee, WP Loh, JF Chin - Image and vision computing, 2017 - Elsevier
Multimedia data mining, particularly feature selection (FS), has been successfully applied in
recent classification and recognition works. However, only a few studies in the contemporary …

A ranking-based feature selection approach for handwritten character recognition

ND Cilia, C De Stefano, F Fontanella… - Pattern Recognition …, 2019 - Elsevier
Feature selection is generally considered a very important step in any pattern recognition
process. Its aim is that of reducing the computational cost of the classification task, in an …

Comparing filter and wrapper approaches for feature selection in handwritten character recognition

ND Cilia, T D'Alessandro, C De Stefano… - Pattern Recognition …, 2023 - Elsevier
It is generally agreed that the selection of an appropriate set of features is a fundamental
process in the development of any pattern recognition system. Its purpose is to identify the …

Binarization free layout analysis for arabic historical documents using fully convolutional networks

BK Barakat, J El-Sana - … workshop on arabic and derived script …, 2018 - ieeexplore.ieee.org
We present a Fully Convolutional Network based method for layout analysis of non-
binarized historical Arabic manuscripts. The document image is segmented into main text …

Layout analysis on challenging historical arabic manuscripts using siamese network

R Alaasam, B Kurar, J El-Sana - 2019 International Conference …, 2019 - ieeexplore.ieee.org
This paper presents layout analysis for historical Arabic documents using siamese network.
Given pages from different documents, we divide them into patches of similar sizes. We train …

Unsupervised deep learning for handwritten page segmentation

A Droby, BK Barakat, B Madi… - … on Frontiers in …, 2020 - ieeexplore.ieee.org
Segmenting handwritten document images into regions with homogeneous patterns is an
important pre-processing step for many document images analysis tasks. Hand-labeling …

[图书][B] Handwriting analysis with focus on writer identification and writer retrieval

V Christlein - 2019 - search.proquest.com
In the course of the mass digitization of historical as well as contemporary sources, an
individual examination by means of historical or forensic experts is no longer feasible. A …

Historical document layout analysis using anisotropic diffusion and geometric features

GM BinMakhashen, SA Mahmoud - International Journal on Digital …, 2020 - Springer
There are several digital libraries worldwide which maintain valuable historical manuscripts.
Usually, digital copies of these manuscripts are offered to researchers and readers in raster …