OCR++: a robust framework for information extraction from scholarly articles

M Singh, B Barua, P Palod, M Garg… - arXiv preprint arXiv …, 2016 - arxiv.org
This paper proposes OCR++, an open-source framework designed for a variety of
information extraction tasks from scholarly articles including metadata (title, author names …

Citationie: Leveraging the citation graph for scientific information extraction

V Viswanathan, G Neubig, P Liu - arXiv preprint arXiv:2106.01560, 2021 - arxiv.org
Automatically extracting key information from scientific documents has the potential to help
scientists work more efficiently and accelerate the pace of scientific progress. Prior work has …

New methods for metadata extraction from scientific literature

D Tkaczyk - arXiv preprint arXiv:1710.10201, 2017 - arxiv.org
Within the past few decades we have witnessed digital revolution, which moved scholarly
communication to electronic media and also resulted in a substantial increase in its volume …

FLAG-PDFe: Features oriented metadata extraction framework for scientific publications

MW Ahmed, MT Afzal - IEEE Access, 2020 - ieeexplore.ieee.org
The unprecedented growth of the research publications in diversified domains has
overwhelmed the research community. It requires a cumbersome process to extract this …

GROBID: Combining automatic bibliographic data recognition and term extraction for scholarship publications

P Lopez - Research and Advanced Technology for Digital …, 2009 - Springer
Based on state of the art machine learning techniques, GROBID (GeneRation Of
BIbliographic Data) performs reliable bibliographic data extractions from scholar articles …

Machine learning techniques for automatically extracting contextual information from scientific publications

S Klampfl, R Kern - … Challenges: Second SemWebEval Challenge at ESWC …, 2015 - Springer
Scholarly publishing increasingly requires automated systems that semantically enrich
documents in order to support management and quality assessment of scientific output …

Section-wise indexing and retrieval of research articles

A Shahid, MT Afzal - Cluster Computing, 2018 - Springer
Relevant information extraction is a dire need of the scholarly community. There are a
number of systems available to find relevant information from scientific literature such as …

[图书][B] Automatic structure and keyphrase analysis of scientific publications

A Constantin - 2014 - search.proquest.com
Purpose. This work addresses an escalating problem within the realm of scientific
publishing, that stems from accelerated publication rates of article formats difficult to process …

A heuristic baseline method for metadata extraction from scanned electronic theses and dissertations

MH Choudhury, J Wu, WA Ingram, EA Fox - Proceedings of the ACM …, 2020 - dl.acm.org
Extracting metadata from scholarly papers is an important text mining problem. Widely used
open-source tools such as GROBID are designed for born-digital scholarly papers but often …

Information extraction from scientific articles: a survey

Z Nasar, SW Jaffry, MK Malik - Scientometrics, 2018 - Springer
In last few decades, with the advent of World Wide Web (WWW), world is being overloaded
with huge data. This huge data carries potential information that once extracted, can be used …