RAMP: Retrieval-Augmented MOS Prediction via Confidence-based Dynamic Weighting | H Wang, S Zhao, X Zheng, Y Qin | Interspeech 2023, 2023 | 7 | 2023 |
Intermediate-Task Learning with Pretrained Model for Synthesized Speech MOS Prediction | H Wang, X Zheng, Y Qin | 2023 IEEE International Conference on Multimedia and Expo (ICME), 378-383, 2023 | 2 | 2023 |
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores | J Zhou, S Zhao, H Wang, TH Zhang, H Sun, X Wang, Y Qin | arXiv preprint arXiv:2406.03814, 2024 | 1 | 2024 |
ChildMandarin: A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5 | J Zhou, S Wang, S Zhao, J He, H Sun, H Wang, C Liu, A Kong, Y Guo, ... | arXiv preprint arXiv:2409.18584, 2024 | | 2024 |
M2R-Whisper: Multi-stage and Multi-scale Retrieval Augmentation for Enhancing Whisper | J Zhou, S Zhao, J He, H Wang, W Zeng, Y Chen, H Sun, A Kong, Y Qin | arXiv preprint arXiv:2409.11889, 2024 | | 2024 |
Uncertainty-Aware Mean Opinion Score Prediction | H Wang, S Zhao, J Zhou, X Zheng, H Sun, X Wang, Y Qin | arXiv preprint arXiv:2408.12829, 2024 | | 2024 |
Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition | H Sun, S Zhao, X Kong, X Wang, H Wang, J Zhou, Y Qin | arXiv preprint arXiv:2408.00325, 2024 | | 2024 |
LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation | W Guan, K Wang, W Zhou, Y Wang, F Deng, H Wang, L Li, Q Hong, Y Qin | arXiv preprint arXiv:2406.08203, 2024 | | 2024 |