X Huang, S Wang, J Yan, K Tang… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Wake word spotting mainly focus on audio modality or audio-visual multimodal exploration.
The visual modality delivers stable outcomes under poor acoustic conditions, making visual …