作者
Syed Saqib Bukhari, Faisal Shafait, Thomas M Breuel
发表日期
2009/7
期刊
Proceedings of Third International Workshop on Camera-Based Document Analysis and Recognition, Barcelona, Spain
页码范围
34-41
简介
Traditional OCR systems are designed for planar (dewarped) images and the accuracy is reduced when applied on warped images. Therefore, developing new OCR techniques for warped images or developing dewarping techniques are the possible solutions for improving OCR accuracy camera-captured documents. Among different types of dewarping techniques, curled textlines information based dewarping techniques are the most popular ones, but are sensitive to high degrees of curl and variable line spacing. In this paper we build a novel dewarping approach based on curled textlines information, which has been extracted using ridges based modified active contour model (coupledsnakes). Our dewarping approach is less sensitive different direction of curl and variable line spacing. Experimental results show that OCR error rate, from warped to dewarped documents, has been reduced from 5.15% to 1.92% on the dataset of CBDAR 2007 document image dewarping contest. We also report the performance of our method in comparison with other state-of-the-art methods.
引用总数
20082009201020112012201320142015201620172018201920202021202220231164664215244531
学术搜索中的文章
SS Bukhari, F Shafait, TM Breuel - Proceedings of Third International Workshop on …, 2009