作者
Junjie Zhang, Qi Wu, Chunhua Shen, Jian Zhang, Jianfeng Lu
发表日期
2018/3/9
期刊
IEEE Transactions on Multimedia
出版商
IEEE
简介
Deep convolution neural networks (CNNs) have demonstrated advanced performance on single-label image classification, and various progress also has been made to apply CNN methods on multilabel image classification, which requires annotating objects, attributes, scene categories, etc., in a single shot. Recent state-of-the-art approaches to the multilabel image classification exploit the label dependencies in an image, at the global level, largely improving the labeling capacity. However, predicting small objects and visual concepts is still challenging due to the limited discrimination of the global visual features. In this paper, we propose a regional latent semantic dependencies model (RLSD) to address this problem. The utilized model includes a fully convolutional localization architecture to localize the regions that may contain multiple highly dependent labels. The localized regions are further sent to the …
引用总数
2017201820192020202120222023202427333342463218
学术搜索中的文章