查看文章

nust.edu.pk 中的 [PDF]

Improved document image segmentation algorithm using multiresolution morphology

作者

Syed Saqib Bukhari, Faisal Shafait, Thomas M Breuel

发表日期

2011/1/24

研讨会论文

Document recognition and retrieval XVIII

卷号

7874

页码范围

109-116

出版商

SPIE

简介

Page segmentation into text and non-text elements is an essential preprocessing step before optical character recognition (OCR) operation. In case of poor segmentation, an OCR classification engine produces garbage characters due to the presence of non-text elements. This paper describes modifications to the text/non-text segmentation algorithm presented by Bloomberg,¹ which is also available in his open-source Leptonica library.²The modifications result in significant improvements and achieved better segmentation accuracy than the original algorithm for UW-III, UNLV, ICDAR 2009 page segmentation competition test images and circuit diagram datasets.

引用总数

被引用次数：84

201120122013201420152016201720182019202020212022202320242 6 4 4 11 6 10 7 6 5 7 2 8 2

学术搜索中的文章

Improved document image segmentation algorithm using multiresolution morphology

SS Bukhari, F Shafait, TM Breuel - Document recognition and retrieval XVIII, 2011

被引用次数：84 相关文章所有 15 个版本