Tablebank: Table benchmark for image-based table detection and recognition

H Dong, Z Cheng, X He, M Zhou, A Zhou… - arXiv preprint arXiv …, 2022 - arxiv.org

Since a vast number of tables can be easily collected from web pages, spreadsheets, PDFs,
and various other document types, a flurry of table pre-training frameworks have been …

被引用次数：53 相关文章所有 4 个版本

[PDF] ieee.org

Current status and performance analysis of table recognition in document images with deep neural networks

KA Hashmi, M Liwicki, D Stricker, MA Afzal… - IEEE …, 2021 - ieeexplore.ieee.org

The first phase of table recognition is to detect the tabular area in a document.
Subsequently, the tabular structures are recognized in the second phase in order to extract …

被引用次数：59 相关文章所有 5 个版本

[PDF] arxiv.org

Dit: Self-supervised pre-training for document image transformer

J Li, Y Xu, T Lv, L Cui, C Zhang, F Wei - Proceedings of the 30th ACM …, 2022 - dl.acm.org

Image Transformer has recently achieved significant progress for natural image
understanding, either using supervised (ViT, DeiT, etc.) or self-supervised (BEiT, MAE, etc.) …

被引用次数：122 相关文章所有 4 个版本

[PDF] thecvf.com

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

D Prasad, A Gadpal, K Kapadni… - Proceedings of the …, 2020 - openaccess.thecvf.com

An automatic table recognition method for interpretation of tabular data in document images
majorly involves solving two problems of table detection and table structure recognition. The …

被引用次数：205 相关文章所有 9 个版本

[PDF] arxiv.org

LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis

Z Shen, R Zhang, M Dell, BCG Lee, J Carlson… - Document Analysis and …, 2021 - Springer

Recent advances in document image analysis (DIA) have been primarily driven by the
application of neural networks. Ideally, research outcomes could be easily deployed in …

被引用次数：127 相关文章所有 12 个版本

[PDF] arxiv.org

DocBank: A benchmark dataset for document layout analysis

M Li, Y Xu, L Cui, S Huang, F Wei, Z Li… - arXiv preprint arXiv …, 2020 - arxiv.org

Document layout analysis usually relies on computer vision models to understand
documents while ignoring textual information that is vital to capture. Meanwhile, high quality …

被引用次数：171 相关文章所有 5 个版本

[PDF] arxiv.org

Layoutxlm: Multimodal pre-training for multilingual visually-rich document understanding

Y Xu, T Lv, L Cui, G Wang, Y Lu, D Florencio… - arXiv preprint arXiv …, 2021 - arxiv.org

Multimodal pre-training with text, layout, and image has achieved SOTA performance for
visually-rich document understanding tasks recently, which demonstrates the great potential …

被引用次数：107 相关文章所有 2 个版本

[PDF] thecvf.com

PubTables-1M: Towards comprehensive table extraction from unstructured documents

B Smock, R Pesala, R Abraham - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

Recently, significant progress has been made applying machine learning to the problem of
table structure inference and extraction from unstructured documents. However, one of the …

被引用次数：72 相关文章所有 5 个版本

[PDF] arxiv.org

Table structure recognition using top-down and bottom-up cues

S Raja, A Mondal, CV Jawahar - … Conference, Glasgow, UK, August 23–28 …, 2020 - Springer

Tables are information-rich structured objects in document images. While significant work
has been done in localizing tables as graphic objects in document images, only limited …

被引用次数：100 相关文章所有 7 个版本

[PDF] thecvf.com

Global table extractor (gte): A framework for joint table identification and cell structure recognition using visual context

X Zheng, D Burdick, L Popa… - Proceedings of the …, 2021 - openaccess.thecvf.com

Documents are often the format of choice for knowledge sharing and preservation in
business and science, within which are tables that capture most of the critical data …

被引用次数：129 相关文章所有 7 个版本

高级搜索

QQ 群