Authors
Ritu Garg, Ehtesham Hassan, Santanu Chaudhury, Madan Gopal
Publication date
2011/9/18
Conference
2011 International Conference on Document Analysis and Recognition
Pages
1215-1219
Publisher
IEEE
Description
In this paper, we propose a novel framework for segmenting documents with complex layouts. Segmentation is performed by a combination of clustering and conditional random field (CRF) based modeling. In the bottom-up stage, each pixel is assigned to a cluster plane based on its color intensity. A CRF-based discriminative model is then learned to extract local neighborhood information within each cluster (color) plane. The final category assignment is made by a top-level CRF based on the semantic correlation learned across clusters. The proposed framework has been extensively tested on multi-colored document images in which text overlaps graphics or images.
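The bottom-up stage described above (assigning each pixel to a cluster plane by color) can be illustrated with a minimal sketch. The abstract does not say which clustering algorithm is used, so plain k-means over RGB values is an assumption here, and the function name `cluster_planes` and its parameters `k`, `iters`, and `seed` are hypothetical. The CRF stages are not shown.

```python
import numpy as np

def cluster_planes(image, k=3, iters=10, seed=0):
    """Split an H x W x C image into k binary cluster planes.

    Uses plain k-means on pixel colours (an assumption; the paper's
    clustering method is not stated in the abstract).
    Requires at least k distinct colours in the image.
    """
    h, w, c = image.shape
    pixels = image.reshape(-1, c).astype(np.float64)
    rng = np.random.default_rng(seed)

    # Initialise the centres from k distinct pixel colours.
    unique = np.unique(pixels, axis=0)
    centres = unique[rng.choice(len(unique), size=k, replace=False)]

    for _ in range(iters):
        # Squared distance from every pixel to every centre: shape (n, k).
        d = ((pixels[:, None, :] - centres[None, :, :]) ** 2).sum(axis=2)
        labels = d.argmin(axis=1)
        # Move each centre to the mean colour of its members.
        for j in range(k):
            members = pixels[labels == j]
            if len(members):
                centres[j] = members.mean(axis=0)

    labels = labels.reshape(h, w)
    # One boolean mask per cluster: the "cluster plane" for that colour group.
    planes = [labels == j for j in range(k)]
    return labels, planes
```

Each returned plane is a boolean mask of the pixels that fall in one color cluster. In a pipeline like the one the abstract outlines, such planes would be the input on which a per-plane model is then learned.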
Total citations
[Chart: citations per year, 2012 to 2018]