Optical character recognition with neural networks and post-correction with finite state methods

S Drobac, K Lindén - International Journal on Document Analysis and …, 2020 - Springer
The optical character recognition (OCR) quality of the historical part of the Finnish
newspaper and journal corpus is rather low for reliable search and scientific research on the …

A set of benchmarks for handwritten text recognition on historical documents

JA Sánchez, V Romero, AH Toselli, M Villegas… - Pattern Recognition, 2019 - Elsevier
Abstract Handwritten Text Recognition is a important requirement in order to make visible
the contents of the myriads of historical documents residing in public and private archives …

The Carabela project and manuscript collection: large-scale probabilistic indexing and content-based classification

E Vidal, V Romero, AH Toselli… - … on Frontiers in …, 2020 - ieeexplore.ieee.org
The main aim of the Carabela project was to develop and apply techniques that allow textual
searching on massive Spanish collections of 15th-19th century manuscripts. The project …

A probabilistic framework for lexicon-based keyword spotting in handwritten text images

E Vidal, AH Toselli, J Puigcerver - arXiv preprint arXiv:2104.04556, 2021 - arxiv.org
Query by String Keyword Spotting (KWS) is here considered as a key technology for
indexing large collections of handwritten text images to allow fast textual access to the …

Lexicon-based probabilistic indexing of handwritten text images

E Vidal, AH Toselli, J Puigcerver - Neural Computing and Applications, 2023 - Springer
Keyword Spotting (KWS) is here considered as a basic technology for Probabilistic Indexing
(PrIx) of large collections of handwritten text images to allow fast textual access to the …

Probabilistic multi-word spotting in handwritten text images

AH Toselli, E Vidal, J Puigcerver… - Pattern Analysis and …, 2019 - Springer
Keyword spotting techniques are becoming cost-effective solutions for information retrieval
in handwritten documents. We explore the extension of the single-word, line-level …

Influence of text line segmentation in handwritten text recognition

V Romero, JA Sanchez, V Bosch… - 2015 13th …, 2015 - ieeexplore.ieee.org
Text line segmentation is the process by which text lines in a document image are localized
and extracted. It is an important step in off-line Handwritten Text Recognition (HTR) given …

Handwritten text recognition for historical documents in the transcriptorium project

JA Sánchez, V Bosch, V Romero, K Depuydt… - Proceedings of the first …, 2014 - dl.acm.org
Transcription of historical handwritten documents is a crucial problem for making easier the
access to these documents to the general public. Currently, huge amount of historical …

Handwritten text recognition results on the Bentham collection with improved classical n-gram-HMM methods

AH Toselli, E Vidal - Proceedings of the 3rd international workshop on …, 2015 - dl.acm.org
Handwritten Text Recognition experiments and results are presented on the historical
Bentham text image dataset used in the ICFHR-2014 HTRtS competition. The successful …

Interactive graph-matching using active query strategies

F Serratosa, X Cortés - Pattern Recognition, 2015 - Elsevier
Given two graphs, the aim of graph matching is to find the׳׳ best׳׳ matching between the
nodes of one graph and the nodes of the other graph. Due to distortions of the data and the …