Robust generalization strategies for morpheme glossing in an endangered language documentation context

M Ginn, A Palmer - arXiv preprint arXiv:2311.02777, 2023 - arxiv.org
Generalization is of particular importance in resource-constrained settings, where the
available training data may represent only a small fraction of the distribution of possible …

Multiple Sources are Better Than One: Incorporating External Knowledge in Low-Resource Glossing

C Yang, G Nicolai, M Silfverberg - arXiv preprint arXiv:2406.11085, 2024 - arxiv.org
In this paper, we address the data scarcity problem in automatic data-driven glossing for
low-resource languages by coordinating multiple sources of linguistic expertise. We supplement …

Morph classifier

V John - 2024 - dspace.cuni.cz
Morphological classification is the task of classifying morphs, the forms of morphemes, in
already segmented words. Since there are more and greater resources for morphological …