OCR of historical printings with an application to building diachronic corpora: A case study using the RIDGES herbal corpus

U Springmann, A Lüdeling - arXiv preprint arXiv:1608.02153, 2016 - arxiv.org
This article describes the results of a case study that applies Neural Network-based Optical
Character Recognition (OCR) to scanned images of books printed between 1487 and 1870 …

OCR of historical printings of Latin texts: problems, prospects, progress

U Springmann, D Najock, H Morgenroth… - Proceedings of the First …, 2014 - dl.acm.org
This paper deals with the application of OCR methods to historical printings of Latin texts.
Whereas the problem of recognizing historical printings of modern languages has been the …

Improving OCR accuracy for classical critical editions

F Boschetti, M Romanello, A Babeu, D Bamman… - Research and Advanced …, 2009 - Springer
This paper describes a work-flow designed to populate a digital library of ancient Greek
critical editions with highly accurate OCR scanned text. While the most recently available …

Automatic quality evaluation and (semi-) automatic improvement of OCR models for historical printings

U Springmann, F Fink, KU Schulz - arXiv preprint arXiv:1606.05157, 2016 - arxiv.org
Good OCR results for historical printings rely on the availability of recognition models trained
on diplomatic transcriptions as ground truth, which is both a scarce resource and time …

Optical recognition of psaltic Byzantine chant notation

C Dalitz, GK Michalakis, C Pranzas - International Journal of Document …, 2008 - Springer
This paper describes a document recognition system for the modern neume based notation
of Byzantine music. We propose algorithms for page segmentation, lyrics removal …

A corpus-based approach to philological issues

F Boschetti - 2010 - eprints-phd.biblio.unitn.it
The aim of this work is the application of techniques developed in the domain of corpus
linguistics to a collection of ancient Greek texts, taking into account not only the canonical …

A multiple-choice test recognition system based on the Gamera framework

A Spadaccini, V Rizzo - arXiv preprint arXiv:1105.3834, 2011 - arxiv.org
This article describes JECT-OMR, a system that analyzes digital images representing scans
of multiple-choice tests compiled by students. The system performs a structural analysis of …

Optical Character Recognition with the Gamera Framework

C Dalitz, R Baston - Doc. Image Analysis with the Gamera …, 2009 - researchgate.net
Due to its flexibility, the Gamera framework for document analysis and recognition has been
used in the past primarily for very specific document types like ancient scripts and music …

Finding what you need, and knowing what you can find: digital tools for palaeographers in musicology and beyond

J Craig-McFeely - 2010 - ora.ox.ac.uk
This chapter examines three projects that provide musicologists with a range of resources
for managing and exploring their materials: DIAMM (Digital Image Archive of Medieval …

Digital Aeschylus Breadth and Depth Issues in Digital Libraries

F Boschetti - Workshop on Advanced Technologies for Digital …, 2009 - pro.unibz.it
Digital Libraries can grow along two different dimensions: breadth and depth. In the first
case, works of many authors extend the existing collections. In the second case, different …