Automatic metadata extraction incorporating visual features from scanned electronic theses and dissertations

MH Choudhury, HR Jayanetti, J Wu… - 2021 ACM/IEEE …, 2021 - ieeexplore.ieee.org
Electronic Theses and Dissertations (ETDs) contain domain knowledge that can be used for
many digital library tasks, such as analyzing citation networks and predicting research …

Visual descriptor extraction from patent figure captions: A case study of data efficiency between BiLSTM and transformer

X Wei, J Wu, K Ajayi, D Oyen - Proceedings of the 22nd ACM/IEEE Joint …, 2022 - dl.acm.org
Technical drawings used for illustrating designs are ubiquitous in patent documents,
especially design patents. Different from natural images, these drawings are usually made …

Freshness and Informativity Weighted Cognitive Extent and Its Correlation with Cumulative Citation Count

Z Wang, J Wu - arXiv preprint arXiv:2412.03557, 2024 - arxiv.org
In this paper, we revisit cognitive extent, originally defined as the number of unique phrases
in a quota. We introduce Freshness and Informative Weighted Cognitive Extent (FICE) …

Theory entity extraction for social and behavioral sciences papers using distant supervision

X Wei, L Salsabil, J Wu - Proceedings of the 22nd ACM Symposium on …, 2022 - dl.acm.org
Theories and models, which are common in scientific papers in almost all domains, usually
provide the foundations of theoretical analysis and experiments. Understanding the use of …

New approach to the chunk recognition in Polish

M Oleksy, W Walentynowicz, J Wieczorek - Procedia Computer Science, 2021 - Elsevier
This paper discusses the problem of shallow parsing of Polish, most specifically—chunking.
We describe the linguistic work on annotation guidelines development, manual corpus …