S Liu, Z Zhu, N Ye, S Guadarrama, K Murphy - arXiv e-prints, 2016 - ui.adsabs.harvard.edu
Current image captioning methods are usually trained via (penalized) maximum likelihood
estimation. However, the log-likelihood score of a caption does not correlate well with …