[HTML][HTML] GERNERMED: An open German medical NER model

J Frei, F Kramer - Software Impacts, 2022 - Elsevier
Recent advancements in natural language processing (NLP) have been achieved by the
use of increasingly complex neural networks. In clinical context, NLP is a key technique to …

GGPONC: a corpus of German medical text with rich metadata based on clinical practice guidelines

F Borchert, C Lohr, L Modersohn, T Langer… - arXiv preprint arXiv …, 2020 - arxiv.org
The lack of publicly accessible text corpora is a major obstacle for progress in natural
language processing. For medical applications, unfortunately, all language communities …

[HTML][HTML] German medical named entity recognition model and data set creation using machine translation and word alignment: Algorithm development and validation

J Frei, F Kramer - JMIR Formative Research, 2023 - formative.jmir.org
Background Data mining in the field of medical data analysis often needs to rely solely on
the processing of unstructured data to retrieve relevant data. For German natural language …

Recovering patient journeys: a corpus of biomedical entities and relations on Twitter (BEAR)

A Wührl, R Klinger - arXiv preprint arXiv:2204.09952, 2022 - arxiv.org
Text mining and information extraction for the medical domain has focused on scientific text
generated by researchers. However, their direct access to individual patient experiences or …

A medical information extraction workbench to process german clinical text

R Roller, L Seiffe, A Ayach, S Möller, O Marten… - arXiv preprint arXiv …, 2022 - arxiv.org
Background: In the information extraction and natural language processing domain,
accessible datasets are crucial to reproduce and compare results. Publicly available …

DOPA METER–A Tool Suite for Metrical Document Profiling and Aggregation

C Lohr, U Hahn - Proceedings of the 2023 Conference on …, 2023 - aclanthology.org
We present DOPA METER, a tool suite for the metrical investigation of written language, that
provides diagnostic means for its division into discourse categories, such as registers …

Clinical Document Corpora and Assorted Domain Proxies: A Survey of Diversity in Corpus Design, with Focus on German Text Data

U Hahn - arXiv preprint arXiv:2412.00230, 2024 - arxiv.org
We survey clinical document corpora, with focus on German textual data. Due to rigid data
privacy legislation in Germany these resources, with only few exceptions, are stored in safe …

Replace, Paraphrase or Fine-tune? Evaluating Automatic Simplification for Medical Texts in Spanish

L Campillos-Llanos, AR Terroba Reinares… - 2024 - digital.csic.es
Patients can not always completely understand medical documents given the myriad of
technical terms they contain. Automatic text simplification techniques can help, but they must …

Replace, Paraphrase or Fine-tune? Evaluating Automatic Simplification for Medical Texts in Spanish

LC Llanos, AR Terroba, R Bartolomé… - Proceedings of the …, 2024 - aclanthology.org
Patients can not always completely understand medical documents given the myriad of
technical terms they contain. Automatic text simplification techniques can help, but they must …

Mitigating gender bias in neural machine translation using counterfactual data

A Wong - 2020 - academicworks.cuny.edu
Recent advances in deep learning have greatly improved the ability of researchers to
develop effective machine translation systems. In particular, the application of modern …