Maximum F1-Score Discriminative Training Criterion for Automatic Mispronunciation Detection H Huang, H Xu, X Wang, W Silamu IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23 (4), 787-797, 2015 | 149 | 2015 |
E2e-based multi-task learning approach to joint speech and accent recognition J Zhang, Y Peng, P Van Tung, H Xu, H Huang, ES Chng Interspeech 2021, 2021 | 34 | 2021 |
A Lightweight Model Based on Separable Convolution for Speech Emotion Recognition. Y Zhong, Y Hu, H Huang, W Silamu Interspeech 11, 3331-3335, 2020 | 31 | 2020 |
A transfer learning approach to goodness of pronunciation based automatic mispronunciation detection H Huang, H Xu, Y Hu, G Zhou The Journal of the Acoustical Society of America 142 (5), 3165-3177, 2017 | 28 | 2017 |
Maximum F1-Score Discriminative Training for Automatic Mispronunciation Detection in Computer-Assisted Language Learning. H Hao, J Wang, H Abudureyimu INTERSPEECH, 815-818, 2012 | 22 | 2012 |
A gating context-aware text classification model with BERT and graph convolutional networks W Gao, H Huang Journal of Intelligent & Fuzzy Systems 40 (3), 4331-4343, 2021 | 20 | 2021 |
Semi-Supervised and Cross-Lingual Knowledge Transfer Learnings for DNN Hybrid Acoustic Models Under Low-Resource Conditions. H Xu, H Su, C Ni, X Xiao, H Huang, ES Chng, H Li INTERSPEECH, 1315-1319, 2016 | 20 | 2016 |
Minimum tag error for discriminative training of conditional random fields Y Xiong, J Zhu, H Huang, H Xu Information Sciences 179 (1-2), 169-179, 2009 | 20 | 2009 |
SRTNet: Time Domain Speech Enhancement Via Stochastic Refinement Z Qiu, M Fu, Y Yu, LL Yin, F Sun, H Huang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 13 | 2023 |
Discriminative incorporation of explicitly trained tone models into lattice based rescoring for Mandarin speech recognition H Huang, J Zhu Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE …, 2008 | 12 | 2008 |
Leveraging Phone Mask Training for Phonetic-Reduction-Robust E2E Uyghur Speech Recognition G Ma, P Hu, J Kang, S Huang, H Huang Interspeech 2021, 306-310, 2021 | 11 | 2021 |
Improving accent conversion with reference encoder and end-to-end text-to-speech W Li, B Tang, X Yin, Y Zhao, W Li, K Wang, H Huang, Y Wang, Z Ma arXiv preprint arXiv:2005.09271, 2020 | 11 | 2020 |
Minimum word error training for non-autoregressive transformer-based code-switching asr Y Peng, J Zhang, H Xu, H Huang, ES Chng ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 10 | 2022 |
Adversarial attack and defense on deep neural network-based voice processing systems: An overview X Chen, S Li, H Huang Applied Sciences 11 (18), 8450, 2021 | 9 | 2021 |
Mandarin tone modeling using recurrent neural networks H Huang, Y Hu, H Xu arXiv preprint arXiv:1711.01946, 2017 | 9 | 2017 |
Minimum phoneme error based filter bank analysis for speech recognition H Huang, J Zhu 2006 IEEE International Conference on Multimedia and Expo, 1081-1084, 2006 | 9 | 2006 |
Internal language model estimation based language model fusion for cross-domain code-switching speech recognition Y Peng, Y Liu, J Zhang, H Xu, Y He, H Huang, ES Chng arXiv preprint arXiv:2207.04176, 2022 | 8 | 2022 |
Kernel based non-linear feature extraction methods for speech recognition H Huang, J Zhu Sixth International Conference on Intelligent Systems Design and …, 2006 | 8 | 2006 |
Connectionist temporal classification loss for vector quantized variational autoencoder in zero-shot voice conversion X Kang, H Huang, Y Hu, Z Huang Digital Signal Processing 116, 103110, 2021 | 7 | 2021 |
Using deep time delay neural network for slot filling in spoken language understanding Z Zhang, H Huang, K Wang Symmetry 12 (6), 993, 2020 | 7 | 2020 |