Document Visual Question Answering (DocVQA) is a fast growing branch of document understanding. Despite the fact that documents contain sensitive or copyrighted information …
This paper highlights the need to bring document classification benchmarking closer to real- world applications, both in the nature of data tested (X: multi-channel, multi-paged, multi …
O Tüselmann, GA Fink - International Journal on Document Analysis and …, 2024 - Springer
Semantic analysis of handwritten document images offers a wide range of practical application scenarios. A sequential combination of handwritten text recognition (HTR) and a …
This work explores knowledge distillation (KD) for visually-rich document (VRD) applications such as document layout analysis (DLA) and document image classification (DIC). While …