Table pre-training: A survey on model architectures, pre-training objectives, and downstream tasks

H Dong, Z Cheng, X He, M Zhou, A Zhou… - arXiv preprint arXiv …, 2022 - arxiv.org
Since a vast number of tables can be easily collected from web pages, spreadsheets, PDFs,
and various other document types, a flurry of table pre-training frameworks have been …

Deep learning for table detection and structure recognition: A survey

M Salaheldin Kasem, A Abdallah, A Berendeyev… - ACM Computing …, 2024 - dl.acm.org
Tables are everywhere, from scientific journals, articles, websites, and newspapers all the
way to items we buy at the supermarket. Detecting them is thus of utmost importance to …

Tuta: Tree-based transformers for generally structured table pre-training

Z Wang, H Dong, R Jia, J Li, Z Fu, S Han… - Proceedings of the 27th …, 2021 - dl.acm.org
We propose TUTA, a unified pre-training architecture for understanding generally structured
tables. Noticing that understanding a table requires spatial, hierarchical, and semantic …

Rethinking table recognition using graph neural networks

SR Qasim, H Mahmood, F Shafait - … Conference on Document …, 2019 - ieeexplore.ieee.org
Document structure analysis, such as zone segmentation and table recognition, is a
complex problem in document processing and is an active area of research. The recent …

Lgpma: Complicated table structure recognition with local and global pyramid mask alignment

L Qiao, Z Li, Z Cheng, P Zhang, S Pu, Y Niu… - … conference on document …, 2021 - Springer
Table structure recognition is a challenging task due to the various structures and
complicated cell spanning relations. Previous methods handled the problem starting from …

Table detection in invoice documents by graph neural networks

P Riba, A Dutta, L Goldmann, A Fornés… - 2019 International …, 2019 - ieeexplore.ieee.org
Tabular structures in documents offer a complementary dimension to the raw textual data,
representing logical or quantitative relationships among pieces of information. In digital mail …

A YOLO-based table detection method

Y Huang, Q Yan, Y Li, Y Chen, X Wang… - 2019 International …, 2019 - ieeexplore.ieee.org
Due to various table layouts and styles, table detection is always a difficult task in the field of
document analysis. Inspired by the great progress of deep learning based methods on …

Entrant: A large financial dataset for table understanding

E Zavitsanos, D Mavroeidis, E Spyropoulou… - Scientific Data, 2024 - nature.com
Tabular data is a way to structure, organize, and present information conveniently and
effectively. Real-world tables present data in two dimensions by arranging cells in matrices …

Pytheas pattern-based table discovery in CSV files

C Christodoulakis, EB Munson, M Gabel… - Proceedings of the …, 2020 - dl.acm.org
CSV is a popular Open Data format widely used in a variety of domains for its simplicity and
effectiveness in storing and disseminating data. Unfortunately, data published in this format …

Table understanding: Problem overview

A Shigarov - Wiley Interdisciplinary Reviews: Data Mining and …, 2023 - Wiley Online Library
Tables are probably the most natural way to represent relational data in various media and
formats. They store a large number of valuable facts that could be utilized for question …