[PDF][PDF] Machine learning for ancient languages: A survey

T Sommerschield, Y Assael, J Pavlopoulos… - Computational …, 2023 - direct.mit.edu
Ancient languages preserve the cultures and histories of the past. However, their study is
fraught with difficulties, and experts must tackle a range of challenging text-based tasks, from …

Histred: A historical document-level relation extraction dataset

S Yang, M Choi, Y Cho, J Choo - arXiv preprint arXiv:2307.04285, 2023 - arxiv.org
Despite the extensive applications of relation extraction (RE) tasks in various domains, little
has been explored in the historical context, which contains promising data across hundreds …

Learnable feature alignment with attention-based data augmentation for handling data issue in ancient documents

A Jalali, S Lee, M Lee - Applied Soft Computing, 2024 - Elsevier
Recognizing ancient cursive handwritten characters presents unique challenges due to the
diversity of writing styles and significant class imbalances, where some characters have …

Detecting sequential genre change in eighteenth-century texts

J Zhang, YC Ryan, I Rastas, F Ginter… - CEUR Workshop …, 2022 - researchportal.bath.ac.uk
Abstract Machine classification of historical books into genres is a common task for NLP-
based classifiers and has a number of applications, from literary analysis to information …

Exploiting Hanja-based Resources in Processing Korean Historic Documents written by Common Literati

H Moon, M Kang, J Seo, S Eo, C Park, Y Yang… - IEEE …, 2024 - ieeexplore.ieee.org
This research aims to explore the comprehension of historical Korean archives authored by
common literati. Numerous endeavors have been made to study Korean historical …

Classical Philology in the Time of AI: Exploring the Potential of Parallel Corpora in Ancient Language

T Yousef, C Palladino, F Shamsian - Proceedings of the Ancient …, 2023 - aclanthology.org
This paper provides an overview of diverse applications of parallel corpora in ancient
languages, particularly Ancient Greek. In the first part, we provide the fundamental principles …

Development of robust NER Models and Named Entity Tagsets for Ancient Greek

C Palladino, T Yousef - Proceedings of the Third …, 2024 - portal.findresearcher.sdu.dk
This contribution presents a novel approach to the development and evaluation of
transformer-based models for Named Entity Recognition and Classification in Ancient Greek …

T5 meets Tybalt: Author Attribution in Early Modern English Drama Using Large Language Models

RMM Hicke, D Mimno - arXiv preprint arXiv:2310.18454, 2023 - arxiv.org
Large language models have shown breakthrough potential in many NLP domains. Here we
consider their use for stylometry, specifically authorship identification in Early Modern …

Enhancement of text recognition for hanja handwritten documents of Ancient Korea

J Ahna, T Jang, Q Fengnyu, H Lee, J Lee… - arXiv preprint arXiv …, 2024 - arxiv.org
We implemented a high-performance optical character recognition model for classical
handwritten documents using data augmentation with highly variable cropping within the …

[PDF][PDF] НОВЫЙ ВЗГЛЯД НА ПРОШЛОЕ: КАК ЯЗЫКОВЫЕ МОДЕЛИ МЕНЯЮТ ИЗУЧЕНИЕ ДРЕВНИХ ТЕКСТОВ A NEW WAY OF LOOKING AT THE PAST: HOW …

АВ Кузнецов - РЕДАКЦИОННАЯ КОЛЛЕГИЯ, 2023 - sibay-uunit.ru
В статье рассмотрено применение больших языковых моделей в исторических
исследованиях для атрибуции, датировки текстов, реконструкции утраченных …