作者
Sheikh Faisal Rashid, Abdullah Akmal, Muhammad Adnan, Ali Adnan Aslam, Andreas Dengel
发表日期
2017/11/9
研讨会论文
2017 14th IAPR International conference on document analysis and recognition (ICDAR)
卷号
1
页码范围
777-782
出版商
IEEE
简介
Tables are an easy way to represent information in a structural form. Table recognition is important for the extraction of such information from document images. Usually, modern OCR systems provide textual information coming from tables without recognizing actual table structure. However, recognition of table structure is important to get the contextual meaning of the contents. Table structure recognition in heterogeneous documents is challenging due to a variety of table layouts. It becomes harder where no physical rulings are present in a table. This work proposes a novel learning based methodology for the recognition of table contents in heterogeneous document images. Textual contents of documents are classified as table or non-table elements using a pre-trained neural network model. The output of the neural network is further enhanced by applying a contextual post processing on each element to correct …
引用总数
2018201920202021202220232024381388102
学术搜索中的文章
SF Rashid, A Akmal, M Adnan, AA Aslam, A Dengel - 2017 14th IAPR International conference on document …, 2017