[PDF][PDF] Machine learning for ancient languages: A survey

T Sommerschield, Y Assael, J Pavlopoulos… - Computational …, 2023 - direct.mit.edu
Ancient languages preserve the cultures and histories of the past. However, their study is
fraught with difficulties, and experts must tackle a range of challenging text-based tasks, from …

Latin bert: A contextual language model for classical philology

D Bamman, PJ Burns - arXiv preprint arXiv:2009.10053, 2020 - arxiv.org
We present Latin BERT, a contextual language model for the Latin language, trained on
642.7 million words from a variety of sources spanning the Classical era to the 21st century …

Disentangling the cultural evolution of ancient China: a digital humanities perspective

S Duan, J Wang, H Yang, Q Su - Humanities and Social Sciences …, 2023 - nature.com
Being recognized among the cradles of human civilization, ancient China nurtured the
longest continuous academic traditions and humanistic spirits, which continue to impact …

Profiling of intertextuality in Latin literature using word embeddings

PJ Burns, JA Brofos, K Li, P Chaudhuri… - Proceedings of the …, 2021 - aclanthology.org
Identifying intertextual relationships between authors is of central importance to the study of
literature. We report an empirical analysis of intertextuality in classical Latin literature using …

On the feasibility of automated detection of allusive text reuse

E Manjavacas, B Long, M Kestemont - arXiv preprint arXiv:1905.02973, 2019 - arxiv.org
The detection of allusive text reuse is particularly challenging due to the sparse evidence on
which allusive references rely---commonly based on none or very few shared words …

Penerapan Simhash dan Hamming distance dalam Deteksi kemiripan Teks Berita

D Sebastian, LD Krisnawati… - Jurnal Terapan Teknologi …, 2022 - katalog.ukdw.ac.id
Daur Ulang Text didefinisikan sebagai pemanfaatan sumber tulisan yang ada untuk
penulisan sebuah teks baru. Persentase penggunaan ulang teks dari sumber sebelumnya …

The Making of Coptic Wordnet

L Slaughter, LM Da Costa, S Miyagawa… - Proceedings of the …, 2019 - aclanthology.org
With the increasing availability of wordnets for ancient languages, such as Ancient Greek
and Latin, gaps remain in the coverage of less studied languages of antiquity. This paper …

Deriving consensus for multi-parallel corpora: an English Bible study

P Xia, D Yarowsky - Proceedings of the Eighth International Joint …, 2017 - aclanthology.org
What can you do with multiple noisy versions of the same text? We present a method which
generates a single consensus between multi-parallel corpora. By maximizing a function of …

[HTML][HTML] A Database of Intertexts in Valerius Flaccus' Argonautica 1: A Benchmarking Resource for the Evaluation of Computational Intertextual Search of Latin …

JP Dexter, P Chaudhuri… - Journal of …, 2024 - openhumanitiesdata.metajnl.com
Abstract Characterization of intertextual references among authors is fundamental for the
study of Latin literature. In this paper, we describe a large-scale intertextuality dataset …

Modeling Semantic Change through Large Language Models

F Periti - 2024 - tesidottorato.depositolegale.it
This PhD thesis focuses on computational modeling of semantic change through large
language models. It investigates the modeling of lexical semantic change, where words …