M Shao, J Feng, J Wu, H Zhang… - Computers, Materials & …, 2023 - cdn.techscience.cn
… used to extract image features in image captioning, and the … and do not pay attention to
fine-grained details because of the … properly generates captions by fusing fine-grained features …