The ustc-nercslip systems for the chime-7 dasr challenge R Wang, M He, J Du, H Zhou, S Niu, H Chen, Y Yue, G Yang, S Wu, L Sun, ... arXiv preprint arXiv:2308.14638, 2023 | 10 | 2023 |
External Text Based Data Augmentation for Low-Resource Speech Recognition in the Constrained Condition of OpenASR21 Challenge G Zhong, H Song, R Wang, L Sun, D Liu, J Pan, X Fang, J Du, J Zhang, ... Proc. Interspeech 2022, 4860-4864, 2022 | 5 | 2022 |
The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction S Wu, C Wang, H Chen, Y Dai, C Zhang, R Wang, H Lan, J Du, CH Lee, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 3 | 2024 |
Quantum transfer learning using the large-scale unsupervised pre-trained model wavlm-large for synthetic speech detection R Wang, J Du, T Gao ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture G Yang, M He, S Niu, R Wang, Y Yue, S Qian, S Wu, J Du, CH Lee ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Multi-branch Network with Circle Loss Using Voice Conversion and Channel Robust Data Augmentation for Synthetic Speech Detection R Wang, J Du, C Wang Chinese Conference on Biometric Recognition, 613-620, 2022 | 1 | 2022 |
Quality-aware Masked Diffusion Transformer for Enhanced Music Generation C Li, R Wang, L Liu, J Du, Y Sun, Z Guo, Z Zhang, Y Jiang arXiv preprint arXiv:2405.15863, 2024 | | 2024 |
A Spatial Long-Term Iterative Mask Estimation Approach for Multi-Channel Speaker Diarization and Speech Recognition F Ma, Y Tu, M He, R Wang, S Niu, L Sun, Z Ye, J Du, J Pan, CH Lee ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
Implicit Enhancement of Target Speaker in Speaker-Adaptive ASR through Efficient Joint Optimization M Wu, H Tang, J Fan, R Wang, H Chen, Y Zhang, J Du, H Zhou, L Sun, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
Multitask frame-level learning for few-shot sound event detection L Zou, G Yan, R Wang, J Du, M Lei, T Gao, X Fang arXiv preprint arXiv:2403.11091, 2024 | | 2024 |
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition Y Dai, H Chen, J Du, R Wang, S Chen, H Wang, CH Lee Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | | 2024 |
Enhancing Privacy Preservation with Quantum Computing for Word-Level Audio-Visual Speech Recognition C Wang, J Du, H Chen, R Wang, CHH Yang, J Zhao, Y Ren, Q Li, CH Lee 2023 Asia Pacific Signal and Information Processing Association Annual …, 2023 | | 2023 |
The zxy System for OpenASR21 Challenge H Song, G Zhong, R Wang, C Wang, J Du, L Dai | | |