Survey and empirical comparison of different approaches for text extraction from scholarly figures

F Böschen, T Beck, A Scherp - Multimedia Tools and Applications, 2018 - Springer
Different approaches have been proposed in the past to address the challenge of extracting
text from scholarly figures. However, until recently, no comparative evaluation of the different …

A comparison of approaches for automated text extraction from scholarly figures

F Böschen, A Scherp - … , MMM 2017, Reykjavik, Iceland, January 4-6, 2017 …, 2017 - Springer
So far, there has not been a comparative evaluation of different approaches for text
extraction from scholarly figures. In order to fill this gap, we have defined a generic pipeline …

A neural approach for text extraction from scholarly figures

D Morris, P Tang, R Ewerth - 2019 International Conference on …, 2019 - ieeexplore.ieee.org
In recent years, the problem of scene text extraction from images has received extensive
attention and significant progress. However, text extraction from scholarly figures such as …

OCR++: a robust framework for information extraction from scholarly articles

M Singh, B Barua, P Palod, M Garg… - arXiv preprint arXiv …, 2016 - arxiv.org
This paper proposes OCR++, an open-source framework designed for a variety of
information extraction tasks from scholarly articles including metadata (title, author names …

[PDF][PDF] Text extraction from comic books

A Ghorbel, JM Ogier, N Vincent - GREC 2015 Eleventh IAPR …, 2015 - researchgate.net
Comic books are one of different forms of storytelling and entertainments around the word.
Through years, comic books have been widely spread. This fact has encouraged document …

DeTEXT: A database for evaluating text extraction from biomedical literature figures

XC Yin, C Yang, WY Pei, H Man, J Zhang… - Plos one, 2015 - journals.plos.org
Hundreds of millions of figures are available in biomedical literature, representing important
biomedical experimental evidence. Since text is a rich source of information in figures …

Extracting bibliographical data for PDF documents with HMM and external resources

WF Hsiao, TM Chang, E Thomas - Program, 2014 - emerald.com
Purpose–The purpose of this paper is to propose an automatic metadata extraction and
retrieval system to extract bibliographical information from digital academic documents in …

Scanbank: A benchmark dataset for figure extraction from scanned electronic theses and dissertations

SY Kahu, WA Ingram, EA Fox, J Wu - arXiv preprint arXiv:2106.15320, 2021 - arxiv.org
We focus on electronic theses and dissertations (ETDs), aiming to improve access and
expand their utility, since more than 6 million are publicly available, and they constitute an …

[PDF][PDF] Methods for evaluating text extraction toolkits: An exploratory investigation

TB Allison, PM Herceg - 2015 - mitre.org
Text extraction tools are vital for obtaining the textual content of computer files and for using
the electronic text in a wide variety of applications, including search and natural language …

Mining text from natural scene and video images: A survey

P Shivakumara, A Alaei, U Pal - Wiley Interdisciplinary Reviews …, 2021 - Wiley Online Library
In computer terminology, mining is considered as extracting meaningful information or
knowledge from a large amount of data/information using computers. The meaningful …