A neural approach for text extraction from scholarly figures

D Morris, P Tang, R Ewerth - 2019 International Conference on …, 2019 - ieeexplore.ieee.org
In recent years, the problem of scene text extraction from images has received extensive
attention and significant progress. However, text extraction from scholarly figures such as …

LAMBERT: layout-aware language modeling for information extraction

Ł Garncarek, R Powalski, T Stanisławek… - … conference on document …, 2021 - Springer
We introduce a simple new approach to the problem of understanding documents where
non-trivial layout influences the local semantics. To this end, we modify the Transformer …

More of that, please: Domain Adaptation of Information Extraction through Examples & Feedback

B Hättasch, C Binnig - Proceedings of the 2024 Workshop on Human-In …, 2024 - dl.acm.org
Automatic information extraction, eg, into a tabular format, is crucial for leveraging
knowledge in large text collections. Yet, creating such extraction pipelines for custom target …

Exploiting language models for annotation-efficient knowledge discovery

J Huang - 2023 - ideals.illinois.edu
With tremendous amounts of texts across the Internet nowadays, it is incredibly difficult for
people to manually seek for valuable knowledge from massive corpora, thus automatic …

NeCo@ ALQAC 2023: Legal Domain Knowledge Acquisition for Low-Resource Languages through Data Enrichment

HL Nguyen, DQ Nguyen, HT Nguyen… - … on Knowledge and …, 2023 - ieeexplore.ieee.org
In recent years, natural language processing has gained significant popularity in various
sectors, including the legal domain. This paper presents NeCo Team's solutions to the …

An Unsupervised Learning Method to improve Legal Document Retrieval task at ALQAC 2022

DT Nguyen, H Nguyen, T Le… - 2022 14th International …, 2022 - ieeexplore.ieee.org
Document retrieval for domain-specific has been an important and challenging research in
NLP, particularly legal documents. The main challenge in the legal domain is the close …

Sources of success for information extraction methods

D Kauchak, J Smarr, C Elkan - 2002 - escholarship.org
In this paper, we examine an important recent rule-based information extraction (IE)
technique named Boosted Wrapper Induction (BWI), by conducting experiments on a wider …

KGI: an integrated framework for knowledge intensive language tasks

MFM Chowdhury, M Glass, G Rossiello… - arXiv preprint arXiv …, 2022 - arxiv.org
In this paper, we present a system to showcase the capabilities of the latest state-of-the-art
retrieval augmented generation models trained on knowledge-intensive language tasks …

A medical information extraction workbench to process german clinical text

R Roller, L Seiffe, A Ayach, S Möller, O Marten… - arXiv preprint arXiv …, 2022 - arxiv.org
Background: In the information extraction and natural language processing domain,
accessible datasets are crucial to reproduce and compare results. Publicly available …

Eigen: Expert-Informed Joint Learning Aggregation for High-Fidelity Information Extraction from Document Images

A Singh, V Subramanian… - … Learning for Health …, 2023 - proceedings.mlr.press
Abstract Information Extraction (IE) from document images is challenging due to the high
variability of layout formats. Deep models such as etc. In this work, we propose a novel …