关注
Ruchao Fan
Ruchao Fan
Applied Scientist, Microsoft
在 microsoft.com 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
An online attention-based model for speech recognition
R Fan, P Zhou, W Chen, J Jia, G Liu
Proc. Interspeech 2019, 4390--4394, 2019
592019
CASS-NAT: CTC alignment-based single step non-autoregressive transformer for speech recognition
R Fan, W Chu, P Chang, J Xiao
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
402021
Improving generalization of transformer for speech recognition with parallel schedule sampling and relative positional embedding
P Zhou, R Fan, W Chen, J Jia
arXiv preprint arXiv:1911.00203, 2019
302019
DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASR
R Fan, A Alwan
Proc. Interspeech 2022, 4900--4904, 2022
292022
Towards better domain adaptation for self-supervised models: A case study of child ASR
R Fan, Y Zhu, J Wang, A Alwan
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1242-1252, 2022
252022
Fundamental frequency feature normalization and data augmentation for child speech recognition
G Yeung, R Fan, A Alwan
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
252021
An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
R Fan, W Chu, P Chang, J Xiao, A Alwan
Proc. Interspeech 2021, 3715--3719, 2021
192021
Bi-apc: Bidirectional autoregressive predictive coding for unsupervised pre-training and its application to children’s asr
R Fan, A Afshan, A Alwan
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
162021
Exploring the use of an unsupervised autoregressive model as a shared encoder for text-dependent speaker verification
V Ravi, R Fan, A Afshan, H Lu, A Alwan
Proc. Interspeech 2020, 766--770, 2020
142020
Low Resource German ASR with Untranscribed Data Spoken by Non-native Children--INTERSPEECH 2021 Shared Task SPAPL System
J Wang, Y Zhu, R Fan, W Chu, A Alwan
Proc. Interspeech 2021, 1279--1283, 2021
112021
A ctc alignment-based non-autoregressive transformer for end-to-end automatic speech recognition
R Fan, W Chu, P Chang, A Alwan
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1436-1448, 2023
102023
LPC augment: an LPC-based ASR data augmentation algorithm for low and zero-resource children’s dialects
A Johnson, R Fan, R Morris, A Alwan
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
102022
Fundamental frequency feature warping for frequency normalization and data augmentation in child automatic speech recognition
G Yeung, R Fan, A Alwan
Speech Communication 135, 1-10, 2021
102021
CNN-based audio front end processing on speech recognition
R Fan, G Liu
2018 International Conference on Audio, Language and Image Processing …, 2018
82018
CTCBERT: Advancing hidden-unit BERT with CTC objectives
R Fan, Y Wang, Y Gaur, J Li
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
72023
Towards better meta-initialization with task augmentation for kindergarten-aged speech recognition
Y Zhu, R Fan, A Alwan
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
42022
Acoustic-aware non-autoregressive spell correction with mask sample decoding
R Fan, G Ye, Y Gaur, J Li
arXiv preprint arXiv:2210.08665, 2022
32022
Research on end-to-end speech recognition [D]
R Fan
Beijing University of Posts and Telecommunications, 2-5, 2019
2*2019
UniEnc-CASSNAT: An Encoder-only Non-autoregressive ASR for Speech SSL Models
R Fan, NB Shankar, A Alwan
IEEE Signal Processing Letters, 2024
12024
SOA: Reducing Domain Mismatch in SSL Pipeline by Speech Only Adaptation for Low Resource ASR
NB Shankar, R Fan, A Alwan
arXiv preprint arXiv:2406.10512, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–20