H Fang, S Gupta, F Iandola, RK Srivastava, L Deng… - 2015 IEEE Conference on … - infona.pl
This paper presents a novel approach for automatically generating image descriptions:
visual detectors, language models, and multimodal similarity models learnt directly from a …