Q Wang, J Du, S Zheng, Y Li, Y Wang… - … on Chinese Spoken …, 2022 - ieeexplore.ieee.org
In this paper, we propose two techniques, namely joint modeling and data augmentation, to
improve system performances for audio-visual scene classification (AVSC). We employ …