An online attention-based model for speech recognition R Fan, P Zhou, W Chen, J Jia, G Liu Proc. Interspeech 2019, 4390-4394, 2019 | 59 | 2019 |
CASS-NAT: CTC alignment-based single step non-autoregressive transformer for speech recognition R Fan, W Chu, P Chang, J Xiao ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 40 | 2021 |
Improving generalization of transformer for speech recognition with parallel schedule sampling and relative positional embedding P Zhou, R Fan, W Chen, J Jia arXiv preprint arXiv:1911.00203, 2019 | 30 | 2019 |
DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASR R Fan, A Alwan Proc. Interspeech 2022, 4900-4904, 2022 | 29 | 2022 |
Towards better domain adaptation for self-supervised models: A case study of child ASR R Fan, Y Zhu, J Wang, A Alwan IEEE Journal of Selected Topics in Signal Processing 16 (6), 1242-1252, 2022 | 25 | 2022 |
Fundamental frequency feature normalization and data augmentation for child speech recognition G Yeung, R Fan, A Alwan ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 25 | 2021 |
An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition R Fan, W Chu, P Chang, J Xiao, A Alwan Proc. Interspeech 2021, 3715-3719, 2021 | 19 | 2021 |
Bi-APC: Bidirectional autoregressive predictive coding for unsupervised pre-training and its application to children’s ASR R Fan, A Afshan, A Alwan ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 16 | 2021 |
Exploring the use of an unsupervised autoregressive model as a shared encoder for text-dependent speaker verification V Ravi, R Fan, A Afshan, H Lu, A Alwan Proc. Interspeech 2020, 766-770, 2020 | 14 | 2020 |
Low Resource German ASR with Untranscribed Data Spoken by Non-native Children--INTERSPEECH 2021 Shared Task SPAPL System J Wang, Y Zhu, R Fan, W Chu, A Alwan Proc. Interspeech 2021, 1279-1283, 2021 | 11 | 2021 |
A CTC alignment-based non-autoregressive transformer for end-to-end automatic speech recognition R Fan, W Chu, P Chang, A Alwan IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1436-1448, 2023 | 10 | 2023 |
LPC augment: an LPC-based ASR data augmentation algorithm for low and zero-resource children’s dialects A Johnson, R Fan, R Morris, A Alwan ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 10 | 2022 |
Fundamental frequency feature warping for frequency normalization and data augmentation in child automatic speech recognition G Yeung, R Fan, A Alwan Speech Communication 135, 1-10, 2021 | 10 | 2021 |
CNN-based audio front end processing on speech recognition R Fan, G Liu 2018 International Conference on Audio, Language and Image Processing …, 2018 | 8 | 2018 |
CTCBERT: Advancing hidden-unit BERT with CTC objectives R Fan, Y Wang, Y Gaur, J Li ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 7 | 2023 |
Towards better meta-initialization with task augmentation for kindergarten-aged speech recognition Y Zhu, R Fan, A Alwan ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 4 | 2022 |
Acoustic-aware non-autoregressive spell correction with mask sample decoding R Fan, G Ye, Y Gaur, J Li arXiv preprint arXiv:2210.08665, 2022 | 3 | 2022 |
Research on end-to-end speech recognition [D] R Fan Beijing University of Posts and Telecommunications, 2-5, 2019 | 2 | 2019 |
UniEnc-CASSNAT: An Encoder-only Non-autoregressive ASR for Speech SSL Models R Fan, NB Shankar, A Alwan IEEE Signal Processing Letters, 2024 | 1 | 2024 |
SOA: Reducing Domain Mismatch in SSL Pipeline by Speech Only Adaptation for Low Resource ASR NB Shankar, R Fan, A Alwan arXiv preprint arXiv:2406.10512, 2024 | | 2024 |