S Wu, Z Tang, Z Guo, W Zhang,
B Cui,
H Tang… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent developments of multi-modal large language models have demonstrated its strong
ability in solving vision-language tasks. In this paper, we focus on the product understanding …