SpEx: Multi-Scale Time Domain Speaker Extraction Network HL Chenglin Xu, Wei Rao, Eng Siong Chng IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2020 | 188* | 2020 |
Unsupervised domain adaptation via domain adversarial training for speaker recognition Q Wang, W Rao, S Sun, L Xie, ES Chng, H Li 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 162 | 2018 |
Single channel speech separation with constrained utterance level permutation invariant training using grid lstm C Xu, W Rao, X Xiao, ES Chng, H Li 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 83 | 2018 |
Boosting the performance of i-vector based speaker verification via utterance partitioning W Rao, MW Mak IEEE Trans. on Audio, Speech and Language Processing 21 (5), 1012-1022, 2013 | 62 | 2013 |
Optimization of speaker extraction neural network with magnitude and temporal spectrum approximation loss C Xu, W Rao, ES Chng, H Li ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 60 | 2019 |
The INTERSPEECH 2020 far-field speaker verification challenge X Qin, M Li, H Bu, W Rao, RK Das, S Narayanan, H Li arXiv preprint arXiv:2005.08046, 2020 | 56 | 2020 |
Utterance partitioning with acoustic vector resampling for GMM–SVM speaker verification MW Mak, W Rao Speech Communication 53 (1), 119-130, 2011 | 51 | 2011 |
Target speaker extraction for overlapped multi-talker speaker verification W Rao, C Xu, ES Chng, H Li arXiv preprint arXiv:1902.02546, 2019 | 41 | 2019 |
Tea-pse: Tencent-ethereal-audio-lab personalized speech enhancement system for icassp 2022 dns challenge Y Ju, W Rao, X Yan, Y Fu, S Lv, L Cheng, Y Wang, L Xie, S Shang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 34 | 2022 |
The ffsvc 2020 evaluation plan X Qin, M Li, H Bu, RK Das, W Rao, S Narayanan, H Li arXiv preprint arXiv:2002.00387, 2020 | 30 | 2020 |
The 2015 nist language recognition evaluation: the shared view of i2r, fantastic4 and singams KA Lee, H Li, L Deng, V Hautamäki, W Rao, X Xiao, A Larcher, H Sun, ... Interspeech 2016 2016, 3211-3215, 2016 | 30 | 2016 |
The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016 KA Lee, V Hautamäki, T Kinnunen, A Larcher, C Zhang, A Nautsch, ... Interspeech, 1328-1332, 2017 | 26 | 2017 |
Target speaker verification with selective auditory attention for single and multi-talker speech C Xu, W Rao, J Wu, H Li IEEE/ACM Transactions on audio, speech, and language processing 29, 2696-2709, 2021 | 24 | 2021 |
Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source. C Xu, X Xiao, S Sun, W Rao, ES Chng, H Li Interspeech, 1894-1898, 2017 | 24 | 2017 |
I4U submission to NIST SRE 2018: Leveraging from a decade of shared experiences KA Lee, V Hautamaki, T Kinnunen, H Yamamoto, K Okabe, V Vestman, ... arXiv preprint arXiv:1904.07386, 2019 | 23 | 2019 |
Interspeech 2021 conferencingspeech challenge: Towards far-field multi-channel speech enhancement for video conferencing W Rao, Y Fu, Y Hu, X Xu, Y Jv, J Han, Z Jiang, L Xie, Y Wang, ... arXiv preprint arXiv:2104.00960, 2021 | 19 | 2021 |
Tea-pse 2.0: Sub-band network for real-time personalized speech enhancement Y Ju, S Zhang, W Rao, Y Wang, T Yu, L Xie, S Shang 2022 IEEE Spoken Language Technology Workshop (SLT), 472-479, 2023 | 18 | 2023 |
A Shifted Delta Coefficient Objective for Monaural Speech Separation Using Multi-task Learning. C Xu, W Rao, ES Chng, H Li INTERSPEECH, 3479-3483, 2018 | 17 | 2018 |
Normalization of total variability matrix for i-vector/plda speaker verification W Rao, MW Mak, KA Lee 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 17 | 2015 |
Conferencingspeech challenge: Towards far-field multi-channel speech enhancement for video conferencing W Rao, Y Fu, Y Hu, X Xu, Y Jv, J Han, Z Jiang, L Xie, Y Wang, ... 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 16 | 2021 |