Document layout analysis: a comprehensive survey

GM Binmakhashen, SA Mahmoud - ACM Computing Surveys (CSUR), 2019 - dl.acm.org
Document layout analysis (DLA) is a preprocessing step of document understanding
systems. It is responsible for detecting and annotating the physical structure of documents …

DocBank: A benchmark dataset for document layout analysis

M Li, Y Xu, L Cui, S Huang, F Wei, Z Li… - arXiv preprint arXiv …, 2020 - arxiv.org
Document layout analysis usually relies on computer vision models to understand
documents while ignoring textual information that is vital to capture. Meanwhile, high quality …

A two-stage method for text line detection in historical documents

T Grüning, G Leifert, T Strauß, J Michael… - International Journal on …, 2019 - Springer
This work presents a two-stage text line detection method for historical documents. Each
detected text line is represented by its baseline. In a first stage, a deep neural network called …

Understanding the performance of TCP pacing

A Aggarwal, S Savage… - … IEEE INFOCOM 2000 …, 2000 - ieeexplore.ieee.org
Many researchers have observed that TCP's congestion control mechanisms can lead to
bursty traffic flows on modern high-speed networks, with a negative impact on overall …

[PDF][PDF] Collaboro: a collaborative (meta) modeling tool

JLC Izquierdo, J Cabot - PeerJ Computer Science, 2016 - peerj.com
Motivation Scientists increasingly rely on intelligent information systems to help them in their
daily tasks, in particular for managing research objects, like publications or datasets. The …

Text line segmentation in indian ancient handwritten documents using faster R-CNN

A Jindal, R Ghosh - Multimedia Tools and Applications, 2023 - Springer
Textline segmentation in ancient handwritten documents is still considered as a challenging
task in document analysis and recognition field even though various rule-based methods …

Seam carving for text line extraction on color and grayscale historical manuscripts

N Arvanitopoulos, S Süsstrunk - 2014 14th International …, 2014 - ieeexplore.ieee.org
We propose a novel algorithm for automatic text line extraction on color and gray scale
manuscript pages without prior binarization. Our algorithm is based on seam carving to …

Labeling, cutting, grouping: an efficient text line segmentation method for medieval manuscripts

M Alberti, L Vögtlin, V Pondenkandath… - 2019 International …, 2019 - ieeexplore.ieee.org
This paper introduces a new way for text-line extraction by integrating deep-learning based
pre-classification and state-of-the-art segmentation methods. Text-line extraction in complex …

[PDF][PDF] A probabilistic formulation of keyword spotting

J Puigcerver - PhD thesis, 2018 - pdfs.semanticscholar.org
This thesis, first defines the goal of Keyword Spotting from a Decision Theory perspective.
Then, the problem is tackled following a probabilistic formulation. More precisely, Keyword …

A document analysis deep learning regression model for initial coin offerings success prediction

J Wang, R Chen, W Xu, Y Tang, Y Qin - Expert Systems with Applications, 2022 - Elsevier
Initial coin offerings (ICOs) provide an early-stage financing method for blockchain-based
ventures. During the ICO process, whitepapers are important not only as promotional …