Interlinking through lemmas. the lexical collection of the lila knowledge base of linguistic resources for latin

M Passarotti, F Mambrini, G Franzini… - Studi e Saggi …, 2020 - studiesaggilinguistici.it
This paper presents the structure of the LiLa Knowledge Base, ie a collection of multifarious
linguistic resources for Latin described with the same vocabulary of knowledge description …

Latincy: Synthetic trained pipelines for latin nlp

PJ Burns - arXiv preprint arXiv:2305.04365, 2023 - arxiv.org
This paper introduces LatinCy, a set of trained general purpose Latin-language" core"
pipelines for use with the spaCy natural language processing framework. The models are …

AGILe: The first lemmatizer for Ancient Greek inscriptions

E de Graaf, S Stopponi, J Bos… - The 13th Conference …, 2022 - research.rug.nl
To facilitate corpus searches by classicists as well as to reduce data sparsity when training
models, we focus on the automatic lemmatization of ancient Greek inscriptions, which have …

Linguistic annotation of Byzantine book epigrams

C Swaelens, I De Vos, E Lefever - Language Resources and Evaluation, 2023 - Springer
In this paper, we explore the feasibility of developing a part-of-speech tagger for not-
normalised, Byzantine Greek epigrams. Hence, we compared three different transformer …

Data-driven dependency parsing of Vedic Sanskrit

O Hellwig, S Nehrdich, S Sellmer - Language Resources and Evaluation, 2023 - Springer
This paper describes the first data-driven parser for Vedic Sanskrit, an ancient Indo-Aryan
language in which a corpus of important religious and philosophical texts has been …

Medieval social media: Manual and automatic annotation of byzantine Greek marginal writing

C Swaelens, I De Vos, E Lefever - Proceedings of the 17th …, 2023 - aclanthology.org
In this paper, we present the interim results of a transformer-based annotation pipeline for
Ancient and Medieval Greek. As the texts in the Database of Byzantine Book Epigrams have …

Multi-layered semantic annotation and the formalisation of annotation schemas for the investigation of modality in a Latin corpus

H Bermúdez-Sabel, F Dell'Oro, P Marongiu - Language Resources and …, 2024 - Springer
This paper stems from the project A World of Possibilities. Modal pathways over an extra-
long period of time: the diachrony of modality in the Latin language (WoPoss) which involves …

The emerging digital infrastructure for research in the humanities

DJ Waters - International Journal on Digital Libraries, 2023 - Springer
This article advances the thesis that three decades of investments by national and
international funders, combined with those of scholars, technologists, librarians, archivists …

[PDF][PDF] In Search of the Flocks: How to Perform Onomasiological Queries in an Ancient Greek Corpus?

A Keersmaekers, T Van Hal - Proceedings of the LREC 2022 …, 2022 - lirias.kuleuven.be
These proceedings include the papers accepted for presentation at the Second Workshop
on Language Technologies for Historical and Ancient Languages (LT4HALA 2022). 1 The …

PapyGreek Treebanks: A Dataset of Linguistically Annotated Greek Documentary Papyri

M Vierros, E Henriksson - Journal of open humanities data, 2021 - researchportal.helsinki.fi
Abstract The PapyGreek Treebanks dataset contains documentary texts written in
Postclassical Greek (ca. 300 BCE–700 CE), morphosyntactically annotated according to …