Y Luo, J Ji, X Sun, L Cao, Y Wu, F Huang, CW Lin, R Ji - 2021 - scholar.archive.org
Descriptive region features extracted by object detection networks have played an important
role in the recent advancements of image captioning. However, they are still criticized for the …