An Empirical Study of Information Extraction from Vietnamese Documents

L Nguyen, TD Nguyen, MT Dang, DN Vu… - … on Computing and …, 2023 - ieeexplore.ieee.org
Information Extraction (IE) is the procedure of transforming unstructured data into structured
formats. IE is vital to many document intelligence applications and has high potential in …

NetBERT: a pre-trained language representation model for computer networking

A Louis - 2020 - researchgate.net
Obtaining accurate information about products in a fast and efficient way is becoming
increasingly important at Cisco as the related documentation rapidly grows. Thanks to recent …

Data-Efficient Information Extraction from Form-Like Documents

B Gunel, N Potti, S Tata, JB Wendt, M Najork… - arXiv preprint arXiv …, 2022 - arxiv.org
Automating information extraction from form-like documents at scale is a pressing need due
to its potential impact on automating business workflows across many industries like …

Attend, copy, parse: End-to-end information extraction from documents

RB Palm, F Laws, O Winther - 2019 International Conference …, 2019 - ieeexplore.ieee.org
Document information extraction tasks performed by humans create data consisting of a
PDF or document image input, and extracted string outputs. This end-to-end data is naturally …

Business document information extraction: Towards practical benchmarks

M Skalický, Š Šimsa, M Uřičář, M Šulc - International Conference of the …, 2022 - Springer
Information extraction from semi-structured documents is crucial for frictionless
business-to-business (B2B) communication. While machine learning problems related to …

CUTIE: Learning to understand documents with convolutional universal text information extractor

X Zhao, E Niu, Z Wu, X Wang - arXiv preprint arXiv:1903.12363, 2019 - arxiv.org
Extracting key information from documents, such as receipts or invoices, and preserving the
interested texts to structured data is crucial in the document-intensive streamline processes …

A comparative study of information extraction strategies using an attention-based neural network

S Tarride, A Lemaitre, B Coüasnon… - International Workshop on …, 2022 - Springer
This article focuses on information extraction in historical handwritten marriage records.
Traditional approaches rely on a sequential pipeline of two consecutive tasks: handwriting …

ICL-D3IE: In-context learning with diverse demonstrations updating for document information extraction

J He, L Wang, Y Hu, N Liu, H Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Large language models (LLMs), such as GPT-3 and ChatGPT, have demonstrated
remarkable results in various natural language processing (NLP) tasks with in-context …

Language models for document understanding

T Douzon - 2023 - theses.hal.science
Every day, an uncountable amount of documents are received and processed by companies
worldwide. In an effort to reduce the cost of processing each document, the largest …

CNN-IETS: A CNN-based probabilistic approach for information extraction by text segmentation

M Hu, Z Li, Y Shen, A Liu, G Liu, K Zheng… - Proceedings of the 2017 …, 2017 - dl.acm.org
Information Extraction by Text Segmentation (IETS) aims at segmenting text inputs to extract
implicit data values contained in them. The state-of-art IETS approaches mainly rely on …