作者
Shang-Fu Chen, Yi-Chen Chen, Chih-Kuan Yeh, Yu-Chiang Frank Wang
发表日期
2018/4/27
研讨会论文
Thirty-Second AAAI Conference on Artificial Intelligence
简介
We propose a recurrent neural network (RNN) based model for image multi-label classification. Our model uniquely integrates and learning of visual attention and Long Short Term Memory (LSTM) layers, which jointly learns the labels of interest and their co-occurrences, while the associated image regions are visually attended. Different from existing approaches utilize either model in their network architectures, training of our model does not require pre-defined label orders. Moreover, a robust inference process is introduced so that prediction errors would not propagate and thus affect the performance. Our experiments on NUS-WISE and MS-COCO datasets confirm the design of our network and its effectiveness in solving multi-label classification problems.
引用总数
2017201820192020202120222023202415252435363216
学术搜索中的文章
SF Chen, YC Chen, CK Yeh, YC Wang - Proceedings of the AAAI conference on artificial …, 2018