查看文章

psu.edu 中的 [PDF]

Document image segmentation using discriminative learning over connected components

作者

Syed Saqib Bukhari, Mayce Ibrahim Ali Al Azawi, Faisal Shafait, Thomas M Breuel

发表日期

2010/6/9

图书

Proceedings of the 9th IAPR International Workshop on Document Analysis Systems

页码范围

183-190

简介

Segmentation of a document image into text and non-text regions is an important preprocessing step for a variety of document image analysis tasks, like improving OCR, document compression etc. Most of the state-of-the-art document image segmentation approaches perform segmentation using pixel-based or zone(block)-based classification. Pixel-based classification approaches are time consuming, whereas block-based methods heavily depend on the accuracy of block segmentation step. In contrast to the state-of-the-art document image segmentation approaches, our segmentation approach introduces connected component based classification, thereby not requiring a block segmentation beforehand. Here we train a self-tunable multi-layer perceptron (MLP) classifier for distinguishing between text and non-text connected components using shape and context information as a feature vector. Experimental …

引用总数

被引用次数：103

2010201120122013201420152016201720182019202020212022202320241 3 7 11 7 4 12 14 8 7 8 4 6 5 3

学术搜索中的文章

Document image segmentation using discriminative learning over connected components

SS Bukhari, MIA Al Azawi, F Shafait, TM Breuel - Proceedings of the 9th IAPR International Workshop on …, 2010

被引用次数：103 相关文章所有 11 个版本