Authors
Guillaume Lazzara, Thierry Géraud
Publication date
2014/6
Journal
International Journal on Document Analysis and Recognition (IJDAR)
Volume
17
Issue
2
Pages
105-123
Publisher
Springer Berlin Heidelberg
Description
This work focuses on the most commonly used binarization method: Sauvola’s. It performs relatively well on classical documents; however, three main defects remain: the window parameter of Sauvola’s formula does not adapt automatically to the content, the method is not robust to low contrast, and it is not invariant to contrast inversion. Thus, on documents such as magazines, the content may not be retrieved correctly, even though correct retrieval is crucial for indexing purposes. In this paper, we describe an efficient multiscale implementation of Sauvola’s algorithm that guarantees good binarization for both small and large objects within a single document, without manually adjusting the window size to the content. We also describe how to implement it efficiently, step by step. This algorithm remains notably fast compared to the original one. For fixed parameters, text recognition rates and …
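For context, the classical single-scale Sauvola threshold that the abstract refers to is commonly written as T = m · (1 + k · (s / R − 1)), where m and s are the mean and standard deviation of the gray levels in a square window around each pixel. The sketch below is a minimal illustration of that baseline only, not the multiscale method of the paper. The function names, the window size, and the defaults k = 0.2 and R = 128 are assumptions chosen for illustration, and the integral-image approach is one common way to compute the window statistics.

```python
import numpy as np

def sauvola_threshold(image, window=15, k=0.2, R=128.0):
    """Per-pixel threshold T = m * (1 + k * (s / R - 1)).

    window, k and R are illustrative defaults (assumptions), not values
    taken from the paper. The window must be odd and smaller than the image.
    """
    if window % 2 == 0:
        raise ValueError("window must be odd")
    img = np.asarray(image, dtype=np.float64)
    h, w = img.shape
    pad = window // 2
    padded = np.pad(img, pad, mode="reflect")

    def integral(a):
        # Summed-area table with a leading row and column of zeros.
        return np.pad(a, ((1, 0), (1, 0))).cumsum(axis=0).cumsum(axis=1)

    def window_sum(ii):
        # Sum over each window x window neighbourhood in constant time.
        return (ii[window:window + h, window:window + w]
                - ii[:h, window:window + w]
                - ii[window:window + h, :w]
                + ii[:h, :w])

    n = float(window * window)
    mean = window_sum(integral(padded)) / n
    mean_sq = window_sum(integral(padded ** 2)) / n
    # Clip tiny negative values caused by floating-point rounding.
    std = np.sqrt(np.clip(mean_sq - mean ** 2, 0.0, None))
    return mean * (1.0 + k * (std / R - 1.0))

def sauvola_binarize(image, window=15, k=0.2, R=128.0):
    """Boolean mask that is True where a pixel is classified as dark foreground."""
    img = np.asarray(image, dtype=np.float64)
    return img <= sauvola_threshold(img, window, k, R)

# Synthetic page: light background with one small dark mark.
page = np.full((40, 40), 200.0)
page[18:21, 18:21] = 20.0
mask = sauvola_binarize(page)
```

Because the threshold always lies below the local mean, this formulation only looks for dark objects on a lighter background, and a single fixed `window` has to suit every object size on the page. Those are the kinds of limitations the abstract lists for the original method.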
Total citations
[Chart: citations per year, 2013 to 2024]
Scholar articles
G Lazzara, T Géraud - International Journal on Document Analysis and …, 2014