Large Language Models for Page Stream Segmentation

H Heidenreich, R Dalvi, R Mukku, N Verma… - arXiv preprint arXiv …, 2024 - arxiv.org
Page Stream Segmentation (PSS) is an essential prerequisite for automated document
processing at scale. However, research progress has been limited by the absence of …

OpenPSS: An Open Page Stream Segmentation Benchmark

R Heusden, J Kamps, M Marx - … Conference on Theory and Practice of …, 2024 - Springer
In recent years, an increasing number of companies and institutions have begun the process
of digitizing their physical records to promote digital access and searchability of their …

[图书][B] Linking Theory and Practice of Digital Libraries: 28th International Conference on Theory and Practice of Digital Libraries, TPDL 2024, Ljubljana, Slovenia …

A Antonacopoulos, A Hinze, B Piwowarski, M Coustaty… - 2024 - books.google.com
This book constitutes the refereed proceedings of the 28th International Conference on
Linking Theory and Practice of Digital Libraries, TPDL 2024, held in Ljubljana, Slovenia …

Enticing Local Governments to Produce FAIR Freedom of Information Act Dossiers

M Marx, M Larooij, F Perasedillo, J Kamps - European Conference on …, 2023 - Springer
Government transparency is central in a democratic society, and increasingly governments
at all levels are required to publish records and data either proactively, or upon so-called …

Visual and textual feature fusion for document analysis

PMLL Drumond - 2024 - icts.unb.br
Diariamente é produzido um grande volume de documentos nas organizações industriais,
comerciais, governamentais, entre outras. Além disso, com o mercado competitivo na …