Information extraction from scientific articles: a survey

Z Nasar, SW Jaffry, MK Malik - Scientometrics, 2018 - Springer
In last few decades, with the advent of World Wide Web (WWW), world is being overloaded
with huge data. This huge data carries potential information that once extracted, can be used …

Machine learning vs. rules and out-of-the-box vs. retrained: An evaluation of open-source bibliographic reference and citation parsers

D Tkaczyk, A Collins, P Sheridan, J Beel - … of the 18th ACM/IEEE on joint …, 2018 - dl.acm.org
Bibliographic reference parsing refers to extracting machine-readable metadata, such as the
names of the authors, the title, or journal name, from bibliographic reference strings. Many …

A benchmark of pdf information extraction tools using a multi-task and multi-domain evaluation framework for academic documents

N Meuschke, A Jagdale, T Spinde, J Mitrović… - International Conference …, 2023 - Springer
Extracting information from academic PDF documents is crucial for numerous indexing,
retrieval, and analysis use cases. Choosing the best tool to extract specific content elements …

A structural SVM approach for reference parsing

X Zhang, J Zou, DX Le… - 2010 Ninth international …, 2010 - ieeexplore.ieee.org
MEDLINE®, the flagship database of the US National Library of Medicine, is a critical source
of information for biomedical research and clinical medicine. The automated extraction of …

Automatic document metadata extraction based on deep networks

R Liu, L Gao, D An, Z Jiang, Z Tang - … 2017, Dalian, China, November 8–12 …, 2018 - Springer
Metadata information extraction from academic papers is of great value to many applications
such as scholar search, digital library, and so on. This task has attracted much attention from …

Locating and parsing bibliographic references in HTML medical articles

J Zou, D Le, GR Thoma - International Journal on Document Analysis and …, 2010 - Springer
The set of references that typically appear toward the end of journal articles is sometimes,
though not always, a field in bibliographic (citation) databases. But even if references do not …

Ondux: on-demand unsupervised learning for information extraction

E Cortez, AS da Silva, MA Gonçalves… - Proceedings of the 2010 …, 2010 - dl.acm.org
Information extraction by text segmentation (IETS) applies to cases in which data values of
interest are organized in implicit semi-structured records available in textual sources (eg …

Comparing free reference extraction pipelines

T Backes, A Iurshina, MA Shahid, P Mayr - International Journal on Digital …, 2024 - Springer
In this paper, we compare the performance of several popular pre-trained reference
extraction and segmentation toolkits combined in different pipeline configurations on three …

Disease named entity recognition using semisupervised learning and conditional random fields

N Suakkaphong, Z Zhang… - Journal of the American …, 2011 - Wiley Online Library
Abstract Information extraction is an important text‐mining task that aims at extracting
prespecified types of information from large text collections and making them available in …

Machine Learning Approaches for Entity Extraction from Citation Strings

V Jain, N Baliyan, S Kumar - International Conference on Information …, 2023 - Springer
As the technological evolution is on the rise, so is the amount of research literature. It
becomes important for the listing websites to easily parse the citation to store the metadata …