Document understanding dataset and evaluation (dude)

J Van Landeghem, R Tito… - Proceedings of the …, 2023 - openaccess.thecvf.com
We call on the Document AI (DocAI) community to re-evaluate current methodologies and
embrace the challenge of creating more practically-oriented benchmarks. Document …

Privacy-aware document visual question answering

R Tito, K Nguyen, M Tobaben, R Kerkouche… - arXiv preprint arXiv …, 2023 - arxiv.org
Document Visual Question Answering (DocVQA) is a fast growing branch of document
understanding. Despite the fact that documents contain sensitive or copyrighted information …

Beyond Document Page Classification: Design, Datasets, and Challenges

J Van Landeghem, S Biswas… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper highlights the need to bring document classification benchmarking closer to real-
world applications, both in the nature of data tested (X: multi-channel, multi-paged, multi …

Neural models for semantic analysis of handwritten document images

O Tüselmann, GA Fink - International Journal on Document Analysis and …, 2024 - Springer
Semantic analysis of handwritten document images offers a wide range of practical
application scenarios. A sequential combination of handwritten text recognition (HTR) and a …

DistilDoc: Knowledge distillation for visually-rich document applications

J Van Landeghem, S Maity, A Banerjee… - arXiv preprint arXiv …, 2024 - arxiv.org
This work explores knowledge distillation (KD) for visually-rich document (VRD) applications
such as document layout analysis (DLA) and document image classification (DIC). While …