A comprehensive survey of mostly textual document segmentation algorithms since 2008

S Eskenazi, P Gomez-Krämer, JM Ogier - Pattern recognition, 2017 - Elsevier
In document image analysis, segmentation is the task that identifies the regions of a
document. The increasing number of applications of document analysis requires a good …

Segmentation and recognition for historical Tibetan document images

L Ma, C Long, L Duan, X Zhang, Y Li, Q Zhao - IEEE Access, 2020 - ieeexplore.ieee.org
As a shining pearl in traditional Tibetan culture, historical Tibetan documents have received
extensive attention from historians, linguists and Buddhist scholars. These documents are …

A review of algorithms for detecting text regions in images and videos

YA Bolotova, VG Spitsyn, PM Osina - Computer Optics, 2017 - cyberleninka.ru
The article reviews methods for detecting and segmenting text regions
in images and videos. It defines a generalized algorithm for the operation of systems …

Optical character recognition of printed Persian/Arabic documents

M Shafii - 2014 - scholar.uwindsor.ca
Texts are an important representation of language. Due to the volume of texts generated and
the historical value of some documents, it is imperative to use computers to read generated …

Layout analysis and content enrichment of digitized books

C Grana, G Serra, M Manfredi, D Coppi… - Multimedia Tools and …, 2016 - Springer
In this paper we describe a system for automatically analyzing old documents and creating
hyper linking between different epochs, thus opening ancient documents to young people …

A document straight line based segmentation for complex layout extraction

H Alhéritière, F Cloppet, C Kurtz… - 2017 14th IAPR …, 2017 - ieeexplore.ieee.org
Document layout extraction is a difficult step in the image interpretation process due to the
high complexity of documents. The main challenge relies on the huge gap between both the …

Text extraction for historical Tibetan document images based on connected component analysis and corner point detection

X Zhang, L Duan, L Ma, J Wu - … , CCCV 2017, Tianjin, China, October 11 …, 2017 - Springer
In this paper, we present a text extraction method for historical Tibetan document images.
The task of text extraction is considered as text area detection and location problem. Firstly …

A super resolution framework for low resolution document image OCR

D Ma, G Agam - Document Recognition and Retrieval XX, 2013 - spiedigitallibrary.org
Optical character recognition is widely used for converting document images into digital
media. Existing OCR algorithms and tools produce good results from high resolution, good …

Improved ant colony optimization for document image segmentation

HS Abdullah, AH Jasim - International Journal of Computer …, 2016 - researchgate.net
In this paper, features are initialized for each region in a document image; features of each
pixel are extracted for clustering process and enhancing the heuristic function for …

Illustrations segmentation in digitized documents using local correlation features

D Coppi, C Grana, R Cucchiara - Procedia Computer Science, 2014 - Elsevier
In this paper we propose an approach for Document Layout Analysis based on local
correlation features. We identify and extract illustrations in digitized documents by learning …