[HTML][HTML] PDF text classification to leverage information extraction from publication reports

DDA Bui, G Del Fiol, S Jonnalagadda - Journal of biomedical informatics, 2016 - Elsevier
Objectives Data extraction from original study reports is a time-consuming, error-prone
process in systematic review development. Information extraction (IE) systems have the …

ClusTi: Clustering method for table structure recognition in scanned images

A Zucker, Y Belkada, H Vu, VN Nguyen - Mobile Networks and …, 2021 - Springer
Abstract OCR (Optical Character Recognition) for scanned paper invoices is very
challenging due to the variability of 19 invoice layouts, different information fields, large data …

[PDF][PDF] Parts that add up to a whole: a framework for the analysis of tables

A Silva - Edinburgh University, UK, 2010 - Citeseer
In a time when unstructured data grows exponentially throughout the world, tools are
required to extract benefit from it. Tables are an important under-exploited part of that data …

Candidate selection for the interview using github profile and user analysis for the position of software engineer

R Gajanayake, MHM Hiras… - … on advancements in …, 2020 - ieeexplore.ieee.org
Selecting the most suitable candidates for interviews is an important process for
organizations that can affect their overall work performance. Typically, recruiters check …

Metrics for evaluating performance in document analysis: application to tables

AC Silva - International Journal on Document Analysis and …, 2011 - Springer
Is an algorithm with high precision and recall at identifying table-parts also good at locating
tables? Several document analysis tasks require merging or splitting certain document …

[PDF][PDF] An agent-based approach to table recognition and interpretation

V Long - 2010 - science.mq.edu.au
The goal of this research is to improve methods for extracting and interpreting tabular data
embedded in printed documents. In pursuing this goal, this dissertation makes two main …

A conglomerate of multiple OCR table detection and extraction

S Pallavi, RR Pranesh, S Kumar - arXiv preprint arXiv:2010.08591, 2020 - arxiv.org
Information representation as tables are compact and concise method that eases searching,
indexing, and storage requirements. Extracting and cloning tables from parsable documents …

Automated recognition and extraction of tabular fields for the indexing of census records

R Clawson, K Bauer, G Chidester… - … and Retrieval XX, 2013 - spiedigitallibrary.org
We describe a system for indexing of census records in tabular documents with the goal of
recognizing the content of each cell, including both headers and handwritten entries. Each …

Automatic tabular data extraction and understanding

R Rastan - 2017 - unsworks.unsw.edu.au
Tables in documents are a widely-available and rich source of information, but not yet well-
utilised computationally because of the difficulty in automatically extracting their structure …

Chart detection and recognition in graphics intensive business documents

JP Svendsen - 2015 - dspace.library.uvic.ca
Document image analysis involves the recognition and understanding of document images
using computer vision techniques. The research described in this thesis relates to the …