S Zhang, Y Chen, Y Sun, F Wang, J Yang,
L Bai, S Gao - Neurocomputing, 2025 - Elsevier
The key to integrating visual language tasks is to establish a good alignment strategy.
Recently, visual semantic representation has achieved fine-grained visual understanding by …