This paper introduces LatinCy, a set of trained general purpose Latin-language" core" pipelines for use with the spaCy natural language processing framework. The models are …
E de Graaf, S Stopponi, J Bos… - The 13th Conference …, 2022 - research.rug.nl
To facilitate corpus searches by classicists as well as to reduce data sparsity when training models, we focus on the automatic lemmatization of ancient Greek inscriptions, which have …
C Swaelens, I De Vos, E Lefever - Language Resources and Evaluation, 2023 - Springer
In this paper, we explore the feasibility of developing a part-of-speech tagger for not- normalised, Byzantine Greek epigrams. Hence, we compared three different transformer …
O Hellwig, S Nehrdich, S Sellmer - Language Resources and Evaluation, 2023 - Springer
This paper describes the first data-driven parser for Vedic Sanskrit, an ancient Indo-Aryan language in which a corpus of important religious and philosophical texts has been …
C Swaelens, I De Vos, E Lefever - Proceedings of the 17th …, 2023 - aclanthology.org
In this paper, we present the interim results of a transformer-based annotation pipeline for Ancient and Medieval Greek. As the texts in the Database of Byzantine Book Epigrams have …
This paper stems from the project A World of Possibilities. Modal pathways over an extra- long period of time: the diachrony of modality in the Latin language (WoPoss) which involves …
DJ Waters - International Journal on Digital Libraries, 2023 - Springer
This article advances the thesis that three decades of investments by national and international funders, combined with those of scholars, technologists, librarians, archivists …
These proceedings include the papers accepted for presentation at the Second Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2022). 1 The …
M Vierros, E Henriksson - Journal of open humanities data, 2021 - researchportal.helsinki.fi
Abstract The PapyGreek Treebanks dataset contains documentary texts written in Postclassical Greek (ca. 300 BCE–700 CE), morphosyntactically annotated according to …