HUE: Pretrained model and dataset for understanding Hanja documents of ancient Korea

Machine learning for ancient languages: A survey

T Sommerschield, Y Assael, J Pavlopoulos… - Computational …, 2023 - direct.mit.edu

Ancient languages preserve the cultures and histories of the past. However, their study is
fraught with difficulties, and experts must tackle a range of challenging text-based tasks, from …

Cited by 52 · Related articles · All 7 versions

[PDF] arxiv.org

Histred: A historical document-level relation extraction dataset

S Yang, M Choi, Y Cho, J Choo - arXiv preprint arXiv:2307.04285, 2023 - arxiv.org

Despite the extensive applications of relation extraction (RE) tasks in various domains, little
has been explored in the historical context, which contains promising data across hundreds …

Cited by 6 · Related articles · All 5 versions

Learnable feature alignment with attention-based data augmentation for handling data issue in ancient documents

A Jalali, S Lee, M Lee - Applied Soft Computing, 2024 - Elsevier

Recognizing ancient cursive handwritten characters presents unique challenges due to the
diversity of writing styles and significant class imbalances, where some characters have …

[PDF] bath.ac.uk

Detecting sequential genre change in eighteenth-century texts

J Zhang, YC Ryan, I Rastas, F Ginter… - CEUR Workshop …, 2022 - researchportal.bath.ac.uk

Machine classification of historical books into genres is a common task for NLP-based
classifiers and has a number of applications, from literary analysis to information …

Cited by 6 · Related articles · All 9 versions

[PDF] ieee.org

Exploiting Hanja-based Resources in Processing Korean Historic Documents written by Common Literati

H Moon, M Kang, J Seo, S Eo, C Park, Y Yang… - IEEE …, 2024 - ieeexplore.ieee.org

This research aims to explore the comprehension of historical Korean archives authored by
common literati. Numerous endeavors have been made to study Korean historical …

Classical Philology in the Time of AI: Exploring the Potential of Parallel Corpora in Ancient Language

T Yousef, C Palladino, F Shamsian - Proceedings of the Ancient …, 2023 - aclanthology.org

This paper provides an overview of diverse applications of parallel corpora in ancient
languages, particularly Ancient Greek. In the first part, we provide the fundamental principles …

Cited by 2 · Related articles · All 7 versions

[PDF] sdu.dk

Development of robust NER Models and Named Entity Tagsets for Ancient Greek

C Palladino, T Yousef - Proceedings of the Third …, 2024 - portal.findresearcher.sdu.dk

This contribution presents a novel approach to the development and evaluation of
transformer-based models for Named Entity Recognition and Classification in Ancient Greek …

Cited by 1 · Related articles · All 4 versions

[PDF] arxiv.org

T5 meets Tybalt: Author Attribution in Early Modern English Drama Using Large Language Models

RMM Hicke, D Mimno - arXiv preprint arXiv:2310.18454, 2023 - arxiv.org

Large language models have shown breakthrough potential in many NLP domains. Here we
consider their use for stylometry, specifically authorship identification in Early Modern …

Cited by 7 · Related articles · All 3 versions

[PDF] arxiv.org

Enhancement of text recognition for hanja handwritten documents of Ancient Korea

J Ahna, T Jang, Q Fengnyu, H Lee, J Lee… - arXiv preprint arXiv …, 2024 - arxiv.org

We implemented a high-performance optical character recognition model for classical
handwritten documents using data augmentation with highly variable cropping within the …

[PDF] НОВЫЙ ВЗГЛЯД НА ПРОШЛОЕ: КАК ЯЗЫКОВЫЕ МОДЕЛИ МЕНЯЮТ ИЗУЧЕНИЕ ДРЕВНИХ ТЕКСТОВ A NEW WAY OF LOOKING AT THE PAST: HOW …

АВ Кузнецов - РЕДАКЦИОННАЯ КОЛЛЕГИЯ, 2023 - sibay-uunit.ru

The article examines the use of large language models in historical
research for attribution, dating of texts, reconstruction of lost …
