Pre-trained language models in biomedical domain: A systematic survey

B Wang, Q Xie, J Pei, Z Chen, P Tiwari, Z Li… - ACM Computing …, 2023 - dl.acm.org
Pre-trained language models (PLMs) have been the de facto paradigm for most natural
language processing tasks. This also benefits the biomedical domain: researchers from …

Negation and speculation in NLP: a Survey, Corpora, methods, and applications

A Mahany, H Khaled, NS Elmitwally, N Aljohani… - Applied Sciences, 2022 - mdpi.com
Negation and speculation are universal linguistic phenomena that affect the performance of
Natural Language Processing (NLP) applications, such as those for opinion mining and …

DrBERT: A robust pre-trained model in French for biomedical and clinical domains

Y Labrak, A Bazoge, R Dufour, M Rouvier… - arXiv preprint arXiv …, 2023 - arxiv.org
In recent years, pre-trained language models (PLMs) achieve the best performance on a
wide range of natural language processing (NLP) tasks. While the first models were trained …

FrenchMedMCQA: A French multiple-choice question answering dataset for medical domain

Y Labrak, A Bazoge, R Dufour, M Rouvier… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper introduces FrenchMedMCQA, the first publicly available Multiple-Choice
Question Answering (MCQA) dataset in French for medical domain. It is composed of 3,105 …

Can reproducibility be improved in clinical natural language processing? A study of 7 clinical NLP suites

W Digan, A Névéol, A Neuraz, M Wack… - Journal of the …, 2021 - academic.oup.com
Background The increasing complexity of data streams and computational processes in
modern clinical health information systems makes reproducibility challenging. Clinical …

FlauBERT vs. CamemBERT: Understanding patient's answers by a French medical chatbot

C Blanc, A Bailly, É Francis, T Guillotin, F Jamal… - Artificial Intelligence in …, 2022 - Elsevier
In a number of circumstances, obtaining health-related information from a patient is time-
consuming, whereas a chatbot interacting efficiently with that patient might help saving …

Can synthetic text help clinical named entity recognition? a study of electronic health records in French

N Hiebel, O Ferret, K Fort, A Névéol - The 17th Conference of the …, 2023 - inria.hal.science
In sensitive domains, the sharing of corpora is restricted due to confidentiality, copyrights, or
trade secrets. Automatic text generation can help alleviate these issues by producing …

Supervised learning for the detection of negation and of its scope in French and Brazilian Portuguese biomedical corpora

C Dalloux, V Claveau, N Grabar… - Natural Language …, 2021 - cambridge.org
Automatic detection of negated content is often a prerequisite in information extraction
systems in various domains. In the biomedical domain especially, this task is important …

Contextualized French language models for biomedical named entity recognition

J Copara, J Knafou, N Naderi, C Moro… - Actes de la 6e …, 2020 - aclanthology.org
Named entity recognition (NER) is key for biomedical applications as it allows knowledge
discovery in free text data. As entities are semantic phrases, their meaning is conditioned to …

UMLS-KGI-BERT: Data-centric knowledge integration in transformers for biomedical entity recognition

A Mannion, T Chevalier, D Schwab… - arXiv preprint arXiv …, 2023 - arxiv.org
Pre-trained transformer language models (LMs) have in recent years become the dominant
paradigm in applied NLP. These models have achieved state-of-the-art performance on …