The DKU Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge M Cheng, H Wang, Y Wang, M Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 13 | 2022 |
The WHU-Alibaba Audio-Visual Speaker Diarization System for the MISP 2022 Challenge M Cheng, H Wang, Z Wang, Q Fu, M Li ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 8 | 2023 |
The DKU Post-Challenge Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge: Deep Analysis H Wang, M Cheng, Q Fu, M Li ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 8 | 2023 |
SlideSpeech: A Large Scale Slide-Enriched Audio-Visual Corpus H Wang, F Yu, X Shi, Y Wang, S Zhang, M Li ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 6 | 2024 |
Generating TTS Based Adversarial Samples for Training Wake-Up Word Detection Systems Against Confusing Words H Wang, Y Jia, Z Zhao, X Wang, J Wang, M Li Proc. The Speaker and Language Recognition Workshop (Odyssey 2022), 402-406, 2022 | 3 | 2022 |
Hourglass-AVSR: Down-Up Sampling-Based Computational Efficiency Model for Audio-Visual Speech Recognition F Yu, H Wang, Z Ma, S Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 2 | 2024 |
Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer H Wang, M Cheng, Q Fu, M Li ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 2 | 2024 |
LCB-Net: Long-Context Biasing for Audio-Visual Speech Recognition F Yu, H Wang, X Shi, S Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
The WHU Wake Word Lipreading System for the 2024 Chat-Scenario Chinese Lipreading Challenge H Wang, C Li, F Su, J Liu, H Suo, M Li 2024 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 1-6, 2024 | | 2024 |