Ruoyu Wang
Title
Cited by
Year
The USTC-NERCSLIP systems for the CHiME-7 DASR challenge
R Wang, M He, J Du, H Zhou, S Niu, H Chen, Y Yue, G Yang, S Wu, L Sun, ...
arXiv preprint arXiv:2308.14638, 2023
10 citations, 2023
External Text Based Data Augmentation for Low-Resource Speech Recognition in the Constrained Condition of OpenASR21 Challenge
G Zhong, H Song, R Wang, L Sun, D Liu, J Pan, X Fang, J Du, J Zhang, ...
Proc. Interspeech 2022, 4860-4864, 2022
5 citations, 2022
The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction
S Wu, C Wang, H Chen, Y Dai, C Zhang, R Wang, H Lan, J Du, CH Lee, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
3 citations, 2024
Quantum transfer learning using the large-scale unsupervised pre-trained model WavLM-Large for synthetic speech detection
R Wang, J Du, T Gao
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
2 citations, 2023
Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture
G Yang, M He, S Niu, R Wang, Y Yue, S Qian, S Wu, J Du, CH Lee
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
1 citation, 2024
Multi-branch Network with Circle Loss Using Voice Conversion and Channel Robust Data Augmentation for Synthetic Speech Detection
R Wang, J Du, C Wang
Chinese Conference on Biometric Recognition, 613-620, 2022
1 citation, 2022
Quality-aware Masked Diffusion Transformer for Enhanced Music Generation
C Li, R Wang, L Liu, J Du, Y Sun, Z Guo, Z Zhang, Y Jiang
arXiv preprint arXiv:2405.15863, 2024
2024
A Spatial Long-Term Iterative Mask Estimation Approach for Multi-Channel Speaker Diarization and Speech Recognition
F Ma, Y Tu, M He, R Wang, S Niu, L Sun, Z Ye, J Du, J Pan, CH Lee
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
Implicit Enhancement of Target Speaker in Speaker-Adaptive ASR through Efficient Joint Optimization
M Wu, H Tang, J Fan, R Wang, H Chen, Y Zhang, J Du, H Zhou, L Sun, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
Multitask frame-level learning for few-shot sound event detection
L Zou, G Yan, R Wang, J Du, M Lei, T Gao, X Fang
arXiv preprint arXiv:2403.11091, 2024
2024
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition
Y Dai, H Chen, J Du, R Wang, S Chen, H Wang, CH Lee
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
2024
Enhancing Privacy Preservation with Quantum Computing for Word-Level Audio-Visual Speech Recognition
C Wang, J Du, H Chen, R Wang, CHH Yang, J Zhao, Y Ren, Q Li, CH Lee
2023 Asia Pacific Signal and Information Processing Association Annual …, 2023
2023
The zxy System for OpenASR21 Challenge
H Song, G Zhong, R Wang, C Wang, J Du, L Dai