S Wu, Y Gao, W Yang, H Li… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
183 天前 - … • We propose MVVC, an end-to-end transformer-based video captioning model,
which … and autonomous driving. Experimental results show that our method achieves SOTA. …