A comparative study of robustness of deep learning approaches for VAD S Tong, H Gu, K Yu 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 60 | 2016 |
An investigation of deep neural networks for multilingual speech recognition training and adaptation S Tong, PN Garner, H Bourlard Interspeech 2017, 714-718, 2017 | 52 | 2017 |
Multilingual training and cross-lingual adaptation on CTC-based acoustic model S Tong, PN Garner, H Bourlard arXiv preprint arXiv:1711.10025, 2017 | 46 | 2017 |
Cross-lingual adaptation of a CTC-based multilingual acoustic model S Tong, PN Garner, H Bourlard Speech Communication 104, 39-46, 2018 | 32 | 2018 |
Analyzing uncertainties in speech recognition using dropout A Vyas, P Dighe, S Tong, H Bourlard ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 28 | 2019 |
An investigation of multilingual ASR using end-to-end LF-MMI S Tong, PN Garner, H Bourlard ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 23 | 2019 |
Pkwrap: a pytorch package for lf-mmi training of acoustic models S Madikeri, S Tong, J Zuluaga-Gomez, A Vyas, P Motlicek, H Bourlard arXiv preprint arXiv:2010.03466, 2020 | 21 | 2020 |
Phone-aware LSTM-RNN for voice conversion J Lai, B Chen, T Tan, S Tong, K Yu 2016 IEEE 13th International Conference on Signal Processing (ICSP), 177-182, 2016 | 20 | 2016 |
Evaluating VAD for automatic speech recognition S Tong, N Chen, Y Qian, K Yu 2014 12th International Conference on Signal Processing (ICSP), 2308-2314, 2014 | 19 | 2014 |
Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition Systems. SR Madikeri, BK Khonglah, S Tong, P Motlicek, H Bourlard, D Povey INTERSPEECH, 4746-4750, 2020 | 15 | 2020 |
Slot-triggered contextual biasing for personalized speech recognition using neural transducers S Tong, P Harding, S Wiesler ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 10 | 2023 |
Nasal speech sounds detection using connectionist temporal classification M Cernak, S Tong 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 8 | 2018 |
Fast Language Adaptation Using Phonological Information. S Tong, PN Garner, H Bourlard Interspeech, 2459-2463, 2018 | 8 | 2018 |
Multi-task joint-learning for robust voice activity detection Y Zhuang, S Tong, M Yin, Y Qian, K Yu 2016 10th International Symposium on Chinese Spoken Language Processing …, 2016 | 8 | 2016 |
Unbiased Semi-Supervised LF-MMI Training Using Dropout. S Tong, A Vyas, PN Garner, H Bourlard Interspeech, 1576-1580, 2019 | 7 | 2019 |
A Bayesian approach to recurrence in neural networks PN Garner, S Tong IEEE Transactions on Pattern Analysis and Machine Intelligence 43 (8), 2527-2537, 2020 | 5 | 2020 |
The SUMMA platform prototype R Liepins, U Germann, G Barzdins, A Birch, S Renals, S Weber, ... 15th EACL 2017 Software Demonstrations, 116-119, 2017 | 5 | 2017 |
Selective biasing with trie-based contextual adapters for personalised speech recognition using neural transducers P Harding, S Tong, S Wiesler | 3 | 2023 |
Hierarchical attention-based contextual biasing for personalized speech recognition using neural transducers S Tong, P Harding, S Wiesler 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 1 | 2023 |
Model-internal slot-triggered biasing for domain expansion in neural transducer ASR models E Lu, P Harding, KM Sathyendra, S Tong, X Fu, J Liu, FJC Chang, ... | 1 | 2023 |