Spex: Multi-scale time domain speaker extraction network C Xu, W Rao, ES Chng, H Li IEEE/ACM transactions on audio, speech, and language processing 28, 1370-1384, 2020 | 157 | 2020 |
Spex+: A complete time domain speaker extraction network M Ge, C Xu, L Wang, ES Chng, J Dang, H Li arXiv preprint arXiv:2005.04686, 2020 | 131 | 2020 |
Progressive tandem learning for pattern recognition with deep spiking neural networks J Wu, C Xu, X Han, D Zhou, M Zhang, H Li, KC Tan IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (11), 7824 …, 2021 | 114 | 2021 |
SINGLE CHANNEL SPEECH SEPARATION WITH CONSTRAINED UTTERANCE LEVEL PERMUTATION INVARIANT TRAINING USING GRID LSTM C XU, WEI RAO, X XIAO, ENGS CHNG, H LI | 83 | 2018 |
Optimization of speaker extraction neural network with magnitude and temporal spectrum approximation loss C Xu, W Rao, ES Chng, H Li ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 60 | 2019 |
Time-Domain Speaker Extraction Network X Chenglin, R Wei, C Eng Siong, L Haizhou 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 58 | 2019 |
A deep neural network approach for sentence boundary detection in broadcast news. C Xu, L Xie, G Huang, X Xiao, E Chng, H Li INTERSPEECH, 2887-2891, 2014 | 50 | 2014 |
A study of learning based beamforming methods for speech recognition X Xiao, C Xu, Z Zhang, S Zhao, S Sun, S Watanabe, L Wang, L Xie, ... CHiME 2016 workshop, 26-31, 2016 | 48 | 2016 |
Muse: Multi-modal target speaker extraction with visual cues Z Pan, R Tao, C Xu, H Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 41 | 2021 |
Target speaker extraction for overlapped multi-talker speaker verification W Rao, C Xu, ES Chng, H Li arXiv preprint arXiv:1902.02546, 2019 | 41 | 2019 |
Multi-stage speaker extraction with utterance and frame-level reference signals M Ge, C Xu, L Wang, ES Chng, J Dang, H Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 40 | 2021 |
Selective listening by synchronizing speech with lips Z Pan, R Tao, C Xu, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1650-1664, 2022 | 39 | 2022 |
Representation learning with spectro-temporal-channel attention for speech emotion recognition L Guo, L Wang, C Xu, J Dang, ES Chng, H Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 34 | 2021 |
The 2015 nist language recognition evaluation: the shared view of i2r, fantastic4 and singams KA Lee, H Li, L Deng, V Hautamäki, W Rao, X Xiao, A Larcher, H Sun, ... Interspeech 2016 2016, 3211-3215, 2016 | 30 | 2016 |
Universal Speaker Extraction in the Presence and Absence of Target Speakers for Speech of One and Two Talkers. M Borsdorf, C Xu, H Li, T Schultz Interspeech, 1469-1473, 2021 | 26 | 2021 |
A bidirectional lstm approach with word embeddings for sentence boundary detection C Xu, L Xie, X Xiao Journal of Signal Processing Systems 90, 1063-1075, 2018 | 26 | 2018 |
The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016 KA Lee, V Hautamäki, T Kinnunen, A Larcher, C Zhang, A Nautsch, ... Interspeech, 1328-1332, 2017 | 26 | 2017 |
Neural Speaker Extraction with Speaker-Speech Cross-Attention Network. W Wang, C Xu, M Ge, H Li Interspeech, 3535-3539, 2021 | 25 | 2021 |
Target speaker verification with selective auditory attention for single and multi-talker speech C Xu, W Rao, J Wu, H Li IEEE/ACM Transactions on audio, speech, and language processing 29, 2696-2709, 2021 | 24 | 2021 |
Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source. C Xu, X Xiao, S Sun, W Rao, ES Chng, H Li Interspeech, 1894-1898, 2017 | 24 | 2017 |