A flexible approach for extracting metadata from bibliographic citations

Z Nasar, SW Jaffry, MK Malik - Scientometrics, 2018 - Springer

In last few decades, with the advent of World Wide Web (WWW), world is being overloaded
with huge data. This huge data carries potential information that once extracted, can be used …

Cited by 126 - Related articles - All 6 versions

[PDF] arxiv.org

Machine learning vs. rules and out-of-the-box vs. retrained: An evaluation of open-source bibliographic reference and citation parsers

D Tkaczyk, A Collins, P Sheridan, J Beel - … of the 18th ACM/IEEE on joint …, 2018 - dl.acm.org

Bibliographic reference parsing refers to extracting machine-readable metadata, such as the
names of the authors, the title, or journal name, from bibliographic reference strings. Many …

Cited by 72 - Related articles - All 3 versions

[PDF] arxiv.org

A benchmark of pdf information extraction tools using a multi-task and multi-domain evaluation framework for academic documents

N Meuschke, A Jagdale, T Spinde, J Mitrović… - International Conference …, 2023 - Springer

Extracting information from academic PDF documents is crucial for numerous indexing,
retrieval, and analysis use cases. Choosing the best tool to extract specific content elements …

Cited by 10 - Related articles - All 10 versions

[PDF] springer.com

A structural SVM approach for reference parsing

X Zhang, J Zou, DX Le… - 2010 Ninth international …, 2010 - ieeexplore.ieee.org

MEDLINE®, the flagship database of the US National Library of Medicine, is a critical source
of information for biomedical research and clinical medicine. The automated extraction of …

Cited by 41 - Related articles - All 16 versions

[PDF] ccf.org.cn

Automatic document metadata extraction based on deep networks

R Liu, L Gao, D An, Z Jiang, Z Tang - … 2017, Dalian, China, November 8–12 …, 2018 - Springer

Metadata information extraction from academic papers is of great value to many applications
such as scholar search, digital library, and so on. This task has attracted much attention from …

Cited by 21 - Related articles - All 2 versions

[HTML] nih.gov

Locating and parsing bibliographic references in HTML medical articles

J Zou, D Le, GR Thoma - International Journal on Document Analysis and …, 2010 - Springer

The set of references that typically appear toward the end of journal articles is sometimes,
though not always, a field in bibliographic (citation) databases. But even if references do not …

Cited by 39 - Related articles - All 10 versions

[PDF] cortez.me

Ondux: on-demand unsupervised learning for information extraction

E Cortez, AS da Silva, MA Gonçalves… - Proceedings of the 2010 …, 2010 - dl.acm.org

Information extraction by text segmentation (IETS) applies to cases in which data values of
interest are organized in implicit semi-structured records available in textual sources (e.g. …

Cited by 48 - Related articles - All 9 versions

Comparing free reference extraction pipelines

T Backes, A Iurshina, MA Shahid, P Mayr - International Journal on Digital …, 2024 - Springer

In this paper, we compare the performance of several popular pre-trained reference
extraction and segmentation toolkits combined in different pipeline configurations on three …

Disease named entity recognition using semisupervised learning and conditional random fields

N Suakkaphong, Z Zhang… - Journal of the American …, 2011 - Wiley Online Library

Abstract Information extraction is an important text‐mining task that aims at extracting
prespecified types of information from large text collections and making them available in …

Cited by 30 - Related articles - All 11 versions

Machine Learning Approaches for Entity Extraction from Citation Strings

V Jain, N Baliyan, S Kumar - International Conference on Information …, 2023 - Springer

As the technological evolution is on the rise, so is the amount of research literature. It
becomes important for the listing websites to easily parse the citation to store the metadata …

Cited by 1 - Related articles - All 2 versions
