The STEM-ECR dataset: grounding scientific entity references in STEM scholarly content to authoritative encyclopedic and lexicographic sources

J D'Souza, A Hoppe, A Brack, MY Jaradeh… - arXiv preprint arXiv …, 2020 - arxiv.org
We introduce the STEM (Science, Technology, Engineering, and Medicine) Dataset for
Scientific Entity Extraction, Classification, and Resolution, version 1.0 (STEM-ECR v1.0) …

Automatic Subject-Based Contextualisation of Programming Assignment Lists.

SC Fonseca, FD Pereira, EHT Oliveira… - … Educational Data Mining …, 2020 - ERIC
As programming must be learned by doing, introductory programming course learners need
to solve many problems, e.g., on systems such as 'Online Judges'. However, as such courses …

Zero-shot learning to extract assessment criteria and medical services from the preventive healthcare guidelines using large language models

X Luo, FM Tahabi, T Marc, LA Haunert… - Journal of the …, 2024 - academic.oup.com
Objectives: The integration of these preventive guidelines with Electronic Health Records
(EHRs) systems, coupled with the generation of personalized preventive care …

FITAnnotator: A flexible and intelligent text annotation system

Y Li, B Yu, L Quangang, T Liu - … of the 2021 Conference of the …, 2021 - aclanthology.org
In this paper, we introduce FITAnnotator, a generic web-based tool for efficient text
annotation. Benefiting from the fully modular architecture design, FITAnnotator provides a …

From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)

MI Nešić, R Stanković, C Schöch… - Proceedings of the 8th …, 2022 - aclanthology.org
In this paper we present the wikification of the ELTeC (European Literary Text Collection),
developed within the COST Action “Distant Reading for European Literary …

[PDF] Data Collection and Annotation Pipeline for Social Good Projects.

C Scheunemann, J Naumann… - AI4SG@AAAI …, 2020 - public.ukp.informatik.tu-darmstadt.de
Vast amounts of data are generated during crisis events through both formal and informal
sources, and this data can be used to make a positive impact in all phases of crisis events …

Mapping the Unmapped: Transmedial Representations of Premodern Geographies

C Palladino - Berichte. Geographie und Landeskunde, 2021 - biblioscout.net
This paper discusses the problem of modeling descriptive geographies of the premodern
world, with a focus on Greco-Roman sources. As premodern way-finding mechanisms are …

[HTML] Lost at Sea: A Dataset of 25+ SEA Words Morpho-Semantically Annotated in Ancient Greek and Latin

A Farina - 2023 - openhumanitiesdata.metajnl.com
This paper describes a dataset containing more than 25 Ancient Greek and Latin words
(nouns, verbs, adjectives) connected to the semantic field SEA ('sea', 'water', 'wave', 'shore', …

Interactive distributed corpus exploration and annotation infrastructure for large corpora and knowledge-bases (INCEpTION)

R Eckart de Castilho, I Gurevych - tuprints.ulb.tu-darmstadt.de
Final Report: Interactive distributed corpus exploration and annotation infrastructure for large
corpora and knowledge-bases (INCEpTION) …

[PDF] La prise en compte de la dimension langagière dans l'évaluation d'écrits explicatifs d'élèves en classe de sixième

F Badin, P Schneeberger, M Rebière… - Actes des onzièmes …, 2020 - dipot.ulb.ac.be
In France, a large body of research centered on the relationships between language and
learning in science has developed since the 1980s. Within the …