作者
Sheikh Faisal Rashid, Syed Saqib Bukhari, Faisal Shafait, Thomas M Breuel
发表日期
2009/12/14
研讨会论文
2009 IEEE 13th International Multitopic Conference
页码范围
1-5
出版商
IEEE
简介
Orientation detection is an important preprocessing step for accurate recognition of text from document images. Many existing orientation detection techniques are based on the fact that in Roman script text ascenders occur more likely than descenders, but this approach is not applicable to document of other scripts like Urdu, Arabic, etc. In this paper, we propose a discriminative learning approach for orientation detection of Urdu documents with varying layouts and fonts. The main advantage of our approach is that it can be applied to documents of other scripts easily and accurately. Our approach is based on classification of individual connected component orientation in the document image, and then the orientation of the page image is determined via majority count. A convolutional neural network is trained as discriminative learning model for the labeled Urdu books dataset with four target orientations: 0, 90, 180 …
引用总数
20102011201220132014201520162017201820192020202120222023113322
学术搜索中的文章
SF Rashid, SS Bukhari, F Shafait, TM Breuel - 2009 IEEE 13th International Multitopic Conference, 2009