Named entity recognition and relation detection for biomedical information extraction

N Perera, M Dehmer, F Emmert-Streib - Frontiers in cell and …, 2020 - frontiersin.org
The number of scientific publications in the literature is steadily growing, containing our
knowledge in the biomedical, health, and clinical sciences. Since there is currently no …

[HTML][HTML] AMMU: a survey of transformer-based biomedical pretrained language models

KS Kalyan, A Rajasekharan, S Sangeetha - Journal of biomedical …, 2022 - Elsevier
Transformer-based pretrained language models (PLMs) have started a new era in modern
natural language processing (NLP). These models combine the power of transformers …

BioWordVec, improving biomedical word embeddings with subword information and MeSH

Y Zhang, Q Chen, Z Yang, H Lin, Z Lu - Scientific data, 2019 - nature.com
Distributed word representations have become an essential foundation for biomedical
natural language processing (BioNLP), text mining and information retrieval. Word …

BioRED: a rich biomedical relation extraction dataset

L Luo, PT Lai, CH Wei, CN Arighi… - Briefings in …, 2022 - academic.oup.com
Automated relation extraction (RE) from biomedical literature is critical for many downstream
text mining applications in both research and real-world settings. However, most existing …

Learning from imbalanced data

H He, EA Garcia - IEEE Transactions on knowledge and data …, 2009 - ieeexplore.ieee.org
With the continuous expansion of data availability in many large-scale, complex, and
networked systems, such as surveillance, security, Internet, and finance, it becomes critical …

The relationship between Precision-Recall and ROC curves

J Davis, M Goadrich - Proceedings of the 23rd international conference …, 2006 - dl.acm.org
Receiver Operator Characteristic (ROC) curves are commonly used to present results for
binary decision problems in machine learning. However, when dealing with highly skewed …

Information extraction

S Sarawagi - Foundations and Trends® in Databases, 2008 - nowpublishers.com
The automatic extraction of information from unstructured sources has opened up new
avenues for querying, organizing, and analyzing data by drawing upon the clean semantics …

Domain adaptation: challenges, methods, datasets, and applications

P Singhal, R Walambe, S Ramanna, K Kotecha - IEEE access, 2023 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) trained on one dataset (source domain) do not perform well
on another set of data (target domain), which is different but has similar properties as the …

A study on software fault prediction techniques

SS Rathore, S Kumar - Artificial Intelligence Review, 2019 - Springer
Software fault prediction aims to identify fault-prone software modules by using some
underlying properties of the software project before the actual testing process begins. It …

Subsequence kernels for relation extraction

R Mooney, R Bunescu - Advances in neural information …, 2005 - proceedings.neurips.cc
We present a new kernel method for extracting semantic relations between entities in natural
language text, based on a generalization of subsequence kernels. This kernel uses three …