J Ji, C Xu, X Zhang, B Wang, X Song - IEEE Transactions on Image …, 2020 - dl.acm.org
Visual attention has been successfully applied in image captioning to selectively incorporate
the most relevant areas into the language generation procedure. However, the attention in …