M Wang,
Y Ma, B Cai, D Li, X He… - Available at SSRN …, 2024 - papers.ssrn.com
Video captioning, bridging computer vision and natural language, is crucial for various
knowledge-based systems in the age of video streaming. Recent advancements in video …