Joint CTC-attention based end-to-end speech recognition using multi-task learning S Kim, T Hori, S Watanabe 2017 IEEE international conference on acoustics, speech and signal …, 2017 | 1044 | 2017 |
Hybrid CTC/attention architecture for end-to-end speech recognition S Watanabe, T Hori, S Kim, JR Hershey, T Hayashi IEEE Journal of Selected Topics in Signal Processing 11 (8), 1240-1253, 2017 | 875 | 2017 |
Multi-channel speech recognition: LSTMs all the way through H Erdogan, T Hayashi, JR Hershey, T Hori, C Hori, WN Hsu, S Kim, ... CHiME-4 workshop, 1-4, 2016 | 86 | 2016 |
Towards language-universal end-to-end speech recognition S Kim, ML Seltzer 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 83 | 2018 |
Multimodal transfer deep learning with applications in audio-visual recognition S Moon, S Kim, H Wang NIPS workshop 2015, 2015 | 74* | 2015 |
Contextualized streaming end-to-end speech recognition with trie-based deep biasing and shallow fusion D Le, M Jain, G Keren, S Kim, Y Shi, J Mahadeokar, J Chan, ... INTERSPEECH 2021, 2021 | 73 | 2021 |
Improved training for online end-to-end speech recognition systems S Kim, ML Seltzer, J Li, R Zhao INTERSPEECH, 2018 | 50 | 2018 |
Dialog-context aware end-to-end speech recognition S Kim, F Metze 2018 IEEE Spoken Language Technology Workshop (SLT), 434-440, 2018 | 48 | 2018 |
Environmental noise embeddings for robust speech recognition S Kim, B Raj, I Lane arXiv preprint arXiv:1601.02553, 2016 | 47 | 2016 |
Improving RNN transducer based ASR with auxiliary tasks C Liu, F Zhang, D Le, S Kim, Y Saraf, G Zweig 2021 IEEE Spoken Language Technology Workshop (SLT), 172-179, 2021 | 46 | 2021 |
Recurrent models for auditory attention in multi-microphone distance speech recognition S Kim, I Lane ICLR workshop 2016, 2015 | 33 | 2015 |
Gated embeddings in end-to-end speech recognition for conversational-context fusion S Kim, S Dalmia, F Metze ACL 2019, 2019 | 28 | 2019 |
Impact of nano-scale through-silicon vias on the quality of today and future 3D IC designs DH Kim, S Kim, SK Lim International Workshop on System Level Interconnect Prediction, 1-8, 2011 | 28 | 2011 |
Improved neural language model fusion for streaming recurrent neural network transducer S Kim, Y Shangguan, J Mahadeokar, A Bruguier, C Fuegen, ML Seltzer, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 24 | 2021 |
Semantic distance: A new metric for asr performance analysis towards spoken language understanding S Kim, A Arora, D Le, CF Yeh, C Fuegen, O Kalinli, ML Seltzer INTERSPEECH 2021, 2021 | 24 | 2021 |
End-to-End Speech Recognition with Auditory Attention for Multi-Microphone Distance Speech Recognition. S Kim, IR Lane, S Kim, I Lane Interspeech, 3867-3871, 2017 | 20 | 2017 |
Cross-attention end-to-end asr for two-party conversations S Kim, S Dalmia, F Metze INTERSPEECH 2019, 2019 | 18 | 2019 |
Evaluating user perception of speech recognition system quality with semantic distance metric S Kim, D Le, W Zheng, T Singh, A Arora, X Zhai, C Fuegen, O Kalinli, ... INTERSPEECH 2022, 2022 | 15 | 2022 |
Deliberation Model for On-Device Spoken Language Understanding D Le, A Shrivastava, P Tomasello, S Kim, A Livshits, O Kalinli, ML Seltzer INTERSPEECH 2022, 2022 | 12 | 2022 |
Situation informed end-to-end asr for chime-5 challenge S Kim, S Dalmia, F Metze CHiME5 workshop, 2018 | 9* | 2018 |