Icdar 2013 competition on book structure extraction

A Doucet, G Kazai, S Colutto… - 2013 12th International …, 2013 - ieeexplore.ieee.org
This paper summarizes the 3rd Book Structure Extraction competition that was run at the
ICDAR 2013. Its goal is to evaluate and compare automatic techniques for deriving structure …

Enhancing table of contents extraction by system aggregation

A Doucet, M Coustaty - 2017 14th IAPR international …, 2017 - ieeexplore.ieee.org
The OCR-ed books usually lack logical structure information, such as chapters, sections. To
enrich the navigation experience of users, several approaches have been proposed to …

Daniel@ fintoc-2019 shared task: toc extraction and title detection

E Giguet, G Lejeune - The Second Financial Narrative Processing …, 2019 - hal.science
We present different methods for the two tasks of the 2019 FinTOC challenge: Title Detection
and Table of Contents Extraction. For the Title Detection task we present different …

Daniel at the FinSBD-2 task: Extracting Lists and Sentences from PDF Documents: a model-driven end-to-end approach to PDF document analysis

E Giguet, G Lejeune - Second Workshop on Financial Technology and …, 2021 - hal.science
In this paper, we present the method we have designed and implemented for identifying lists
and sentences in PDF documents while participating to FinSBD-2 Financial Document …

Toc structure extraction from ocr-ed books

C Liu, J Chen, X Zhang, J Liu, Y Huang - … Workshop of the Initiative for the …, 2012 - Springer
This paper addresses the task of extracting the table of contents (TOC) from OCR-ed books.
Since the OCR process misses a lot of layout and structural information, it is incapable of …

Towards a semantic book search engine

S Khusro, I Ullah - … Conference on Open Source Systems & …, 2016 - ieeexplore.ieee.org
Traditional Information Retrieval (IR) methods were initially used for searching and ranking
web pages on the Web. These methods were progressively modified to exploit the …

The Book Structure Extraction Competition with the Resurgence software for part and chapter detection at Caen University

E Giguet, N Lucas - International Workshop of the Initiative for the …, 2010 - Springer
The GREYC Island team participated in the Structure Extraction Competition part of the INEX
Book track for the second time, with the Resurgence software. We used a minimal strategy …

In search of a semantic book search engine on the web: are we there yet?

I Ullah, S Khusro - Artificial Intelligence Perspectives in Intelligent Systems …, 2016 - Springer
Books being a valuable source of knowledge and learning, have always been searched for
on the Web. Traditional Web Information Retrieval (IR) techniques of searching and ranking …

Daniel@ FinTOC'2 Shared Task: Title Detection and Structure Extraction

E Giguet, G Lejeune, JB Tanguy - … of the 1st Joint Workshop on …, 2020 - aclanthology.org
We present our contributions for the 2020 FinTOC Shared Tasks: Title Detection and Table
of Contents Extraction. For the Structure Extraction task, we propose an approach that …

[PDF][PDF] Tag thunder: Web page skimming in non visual environment using concurrent speech

JM Lecarpentier, E Manishina, F Maurel… - Proceedings of the …, 2016 - researchgate.net
Skimming and scanning are two strategies generally used for speed reading. Skimming
allows a reader to get a first glance of a document; scanning is the process of searching for a …