Key information extraction from documents: Evaluation and generator

O Bensch, M Popa, C Spille - arXiv preprint arXiv:2106.14624, 2021 - arxiv.org
Extracting information from documents usually relies on natural language processing
methods working on one-dimensional sequences of text. In some cases, for example, for the …

Attend, copy, parse end-to-end information extraction from documents

RB Palm, F Laws, O Winther - 2019 International Conference …, 2019 - ieeexplore.ieee.org
Document information extraction tasks performed by humans create data consisting of a
PDF or document image input, and extracted string outputs. This end-to-end data is naturally …

Improving information extraction on business documents with specific pre-training tasks

T Douzon, S Duffner, C Garcia, J Espinas - International Workshop on …, 2022 - Springer
Abstract Transformer-based Language Models are widely used in Natural Language
Processing related tasks. Thanks to their pre-training, they have been successfully adapted …

Integrating coordinates with context for information extraction in document images

Z Jiang, Z Huang, Y Lian, J Guo… - … Conference on Document …, 2019 - ieeexplore.ieee.org
Information extraction from document collections is a fundamental and important step to
understand, structure and analyze data. Many approaches with rules and deep learning …

Information extraction from text intensive and visually rich banking documents

B Oral, E Emekligil, S Arslan, G Eryiǧit - Information Processing & …, 2020 - Elsevier
Document types, where visual and textual information plays an important role in their
analysis and understanding, pose a new and attractive area for information extraction …

Cutie: Learning to understand documents with convolutional universal text information extractor

X Zhao, E Niu, Z Wu, X Wang - arXiv preprint arXiv:1903.12363, 2019 - arxiv.org
Extracting key information from documents, such as receipts or invoices, and preserving the
interested texts to structured data is crucial in the document-intensive streamline processes …

Bros: A pre-trained language model focusing on text and layout for better key information extraction from documents

T Hong, D Kim, M Ji, W Hwang, D Nam… - Proceedings of the AAAI …, 2022 - ojs.aaai.org
Key information extraction (KIE) from document images requires understanding the
contextual and spatial semantics of texts in two-dimensional (2D) space. Many recent …

Information extraction from invoices

A Hamdi, E Carel, A Joseph, M Coustaty… - … Conference on Document …, 2021 - Springer
The present paper is focused on information extraction from key fields of invoices using two
different methods based on sequence labeling. Invoices are semi-structured documents in …

Information extraction of domain-specific business documents with limited data

MT Nguyen, DT Le, NH Son, BC Minh… - … Joint Conference on …, 2021 - ieeexplore.ieee.org
Information extraction is a key corner-stone in the digitization of office data which requires
the conversion of unstructured to structured data. However, in the actual application to …

An iterative graph learning convolution network for key information extraction based on the document inductive bias

J Deng, Y Zhang, X Zhang, Z Tang, L Gao - International Conference on …, 2023 - Springer
Recently, there has been growing interest in automating the extraction of key information
from document images. Previous methods mainly focus on modelling the complex …