S2ORC: The Semantic Scholar Open Research Corpus

K Lo, LL Wang, M Neumann, R Kinney… - arXiv preprint arXiv …, 2019 - arxiv.org
We introduce S2ORC, a large corpus of 81.1M English-language academic papers
spanning many academic disciplines. The corpus consists of rich metadata, paper abstracts …

scite: A smart citation index that displays the context of citations and classifies their intent using deep learning

JM Nicholson, M Mordaunt, P Lopez… - Quantitative Science …, 2021 - direct.mit.edu
Citation indices are tools used by the academic community for research and research
evaluation that aggregate scientific literature output and measure impact by collating citation …

Citation recommendation: approaches and datasets

M Färber, A Jatowt - International Journal on Digital Libraries, 2020 - Springer
Citation recommendation describes the task of recommending citations for a given text. Due
to the overload of published scientific works in recent years on the one hand, and the need …

A global health crisis with divided research traditions? A comparative review of Brazilian and international research in communication on the COVID-19 pandemic

FB Souza Martins, J Yu… - Annals of the International …, 2023 - academic.oup.com
The COVID-19 pandemic resulted in substantial international scientific research from high-income
countries, with fewer contributions from low- and middle-income countries (LMIC) …

Information extraction from scientific articles: a survey

Z Nasar, SW Jaffry, MK Malik - Scientometrics, 2018 - Springer
In the last few decades, with the advent of the World Wide Web (WWW), the world is being overloaded
with huge data. This huge data carries potential information that, once extracted, can be used …

unarXive: a large scholarly data set with publications' full-text, annotated in-text citations, and links to metadata

T Saier, M Färber - Scientometrics, 2020 - Springer
In recent years, scholarly data sets have been used for various purposes, such as paper
recommendation, citation recommendation, citation context analysis, and citation context …

GeoDeepShovel: A platform for building scientific database from geoscience literature with AI assistance

S Zhang, H Xu, Y Jia, Y Wen, D Wang… - Geoscience Data …, 2023 - Wiley Online Library
With the rapid development of big data science, the research paradigm in the field of
geosciences has also begun to shift to big data‐driven scientific discovery. Researchers …

On an optimal analogy-based software effort estimation

P Phannachitta - Information and Software Technology, 2020 - Elsevier
Context: An analogy-based software effort estimation technique estimates the required effort
for a new software project based on the total effort used in completing past similar projects …

A benchmark of PDF information extraction tools using a multi-task and multi-domain evaluation framework for academic documents

N Meuschke, A Jagdale, T Spinde, J Mitrović… - International Conference …, 2023 - Springer
Extracting information from academic PDF documents is crucial for numerous indexing,
retrieval, and analysis use cases. Choosing the best tool to extract specific content elements …

20 NIMEs: Twenty Years of New Interfaces for Musical Expression

S Fasciani, J Goode - NIME 2021, 2021 - nime.pubpub.org
This paper provides figures and metrics over twenty years of New Interfaces for Musical
Expression conferences, which are derived by analyzing the publicly available paper …