Wavlm: Large-scale self-supervised pre-training for full stack speech processing S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ... IEEE Journal of Selected Topics in Signal Processing 16 (6), 1505-1518, 2022 | 1154 | 2022 |
Neural codec language models are zero-shot text to speech synthesizers C Wang, S Chen, Y Wu, Z Zhang, L Zhou, S Liu, Z Chen, Y Liu, H Wang, ... arXiv preprint arXiv:2301.02111, 2023 | 348 | 2023 |
Recall and learn: Fine-tuning deep pretrained language models with less forgetting S Chen, Y Hou, Y Cui, W Che, T Liu, X Yu Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 | 166 | 2020 |
Continuous speech separation with conformer S Chen, Y Wu, Z Chen, J Wu, J Li, T Yoshioka, C Wang, S Liu, M Zhou ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 130 | 2021 |
Beats: Audio pre-training with acoustic tokenizers S Chen, Y Wu, C Wang, S Liu, D Tompkins, Z Chen, F Wei arXiv preprint arXiv:2212.09058, 2022 | 126 | 2022 |
Large-scale self-supervised speech representation learning for automatic speaker verification Z Chen, S Chen, Y Wu, Y Qian, C Wang, S Liu, Y Qian, M Zeng ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 101 | 2022 |
Speak foreign languages with your own voice: Cross-lingual neural codec language modeling Z Zhang, L Zhou, C Wang, S Chen, Y Wu, S Liu, Z Chen, Y Liu, H Wang, ... arXiv preprint arXiv:2303.03926, 2023 | 83 | 2023 |
Microsoft speaker diarization system for the voxceleb speaker recognition challenge 2020 X Xiao, N Kanda, Z Chen, T Zhou, T Yoshioka, S Chen, Y Zhao, G Liu, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 70 | 2021 |
Unispeech-sat: Universal speech representation learning with speaker aware pre-training S Chen, Y Wu, C Wang, Z Chen, Z Chen, S Liu, J Wu, Y Qian, F Wei, J Li, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 69 | 2022 |
Speechx: Neural codec language model as a versatile speech transformer X Wang, M Thakker, Z Chen, N Kanda, SE Eskimez, S Chen, M Tang, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 37 | 2024 |
Mothernets: Rapid deep ensemble learning A Wasay, B Hentschel, Y Liao, S Chen, S Idreos Proceedings of Machine Learning and Systems 2, 199-215, 2020 | 36 | 2020 |
Speechlm: Enhanced speech pre-training with unpaired textual data Z Zhang, S Chen, L Zhou, Y Wu, S Ren, S Liu, Z Yao, X Gong, L Dai, J Li, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 35 | 2024 |
Why does self-supervised learning for speech recognition benefit speaker recognition? S Chen, Y Wu, C Wang, S Liu, Z Chen, P Wang, G Liu, J Li, J Wu, X Yu, ... arXiv preprint arXiv:2204.12765, 2022 | 33 | 2022 |
Improving self-supervised learning for speech recognition with intermediate layer supervision C Wang, Y Wu, S Chen, S Liu, J Li, Y Qian, Z Yang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 24* | 2022 |
Don’t shoot butterfly with rifles: Multi-channel continuous speech separation with early exit transformer S Chen, Y Wu, Z Chen, T Yoshioka, S Liu, J Li, X Yu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 22 | 2021 |
Supervision-guided codebooks for masked prediction in speech pre-training C Wang, Y Wang, Y Wu, S Chen, J Li, S Liu, F Wei arXiv preprint arXiv:2206.10125, 2022 | 18 | 2022 |
C2c-genda: Cluster-to-cluster generation for data augmentation of slot filling Y Hou, S Chen, W Che, C Chen, T Liu Proceedings of the AAAI Conference on Artificial Intelligence 35 (14), 13027 …, 2021 | 17 | 2021 |
Exploring wavlm on speech enhancement H Song, S Chen, Z Chen, Y Wu, T Yoshioka, M Tang, JW Shin, S Liu 2022 IEEE Spoken Language Technology Workshop (SLT), 451-457, 2023 | 14 | 2023 |
Investigation of practical aspects of single channel speech separation for ASR J Wu, Z Chen, S Chen, Y Wu, T Yoshioka, N Kanda, S Liu, J Li arXiv preprint arXiv:2107.01922, 2021 | 14 | 2021 |
Ultra fast speech separation model with teacher student learning S Chen, Y Wu, Z Chen, J Wu, T Yoshioka, S Liu, J Li, X Yu Interspeech 2021, 3026--3030, 2021 | 11 | 2021 |