MLCA-AVSR: Multi-Layer Cross Attention Fusion Based Audio-Visual Speech Recognition H Wang, P Guo, P Zhou, L Xie ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 14 | 2024 |
The NPU-ASLP system for audio-visual speech recognition in MISP 2022 challenge P Guo, H Wang, B Mu, A Zhang, P Chen ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 7 | 2023 |
VE-KWS: Visual modality enhanced end-to-end keyword spotting A Zhang, H Wang, P Guo, Y Fu, L Xie, Y Gao, S Zhang, J Feng ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 7 | 2023 |
Unveiling the potential of LLM-based ASR on Chinese open-source datasets X Geng, T Xu, K Wei, B Mu, H Xue, H Wang, Y Li, P Guo, Y Dai, L Li, ... 2024 IEEE 14th International Symposium on Chinese Spoken Language Processing …, 2024 | 6 | 2024 |
ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge H Wang, P Guo, Y Li, A Zhang, J Sun, L Xie, W Chen, P Zhou, H Bu, X Xu, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 5 | 2024 |
The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023 H Wang, P Guo, W Chen, P Zhou, L Xie arXiv preprint arXiv:2401.06788, 2024 | 2 | 2024 |
Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder H Wang, P Guo, X Wan, H Zhou, L Xie ICMEW 2024-2024 International Conference on Multimedia and Expo Workshop, 2024 | 1 | 2024 |
CAMEL: Cross-Attention Enhanced Mixture-of-Experts and Language Bias for Code-Switching Speech Recognition H Wang, X Wan, N Zheng, K Liu, H Zhou, G Li, L Xie arXiv preprint arXiv:2412.12760, 2024 | | 2024 |
The NPU-ASLP System Description for Visual Speech Recognition in CNVSRC 2024 H Wang, L Xie arXiv preprint arXiv:2408.02369, 2024 | | 2024 |
An audio-quality-based multi-strategy approach for target speaker extraction in the MISP 2023 Challenge R Han, X Yan, W Xu, P Guo, J Sun, H Wang, Q Lu, N Jiang, L Xie ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
The NPU System for DASR Task of CHiME-7 Challenge B Mu, P Guo, H Wang, Y Li, Y Li, P Zhou, W Chen, L Xie Proc. CHiME 2023, 63-66, 2023 | | 2023 |