Speech-transformer: a no-recurrence sequence-to-sequence model for speech recognition L Dong, S Xu, B Xu 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 1184 | 2018 |
Syllable-based sequence-to-sequence speech recognition with the transformer in mandarin chinese S Zhou, L Dong, S Xu, B Xu arXiv preprint arXiv:1804.10752, 2018 | 134 | 2018 |
Cif: Continuous integrate-and-fire for end-to-end speech recognition L Dong, B Xu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 119 | 2020 |
Self-attention aligner: A latency-control end-to-end model for asr using self-attention network and chunk-hopping L Dong, F Wang, B Xu ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 95 | 2019 |
A comparison of modeling units in sequence-to-sequence speech recognition with the transformer on mandarin chinese S Zhou, L Dong, S Xu, B Xu International Conference on Neural Information Processing, 210-220, 2018 | 68 | 2018 |
Extending recurrent neural aligner for streaming end-to-end speech recognition in mandarin L Dong, S Zhou, W Chen, B Xu arXiv preprint arXiv:1806.06342, 2018 | 38 | 2018 |
Improving end-to-end contextual speech recognition with fine-grained contextual knowledge selection M Han, L Dong, Z Liang, M Cai, S Zhou, Z Ma, B Xu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 35 | 2022 |
Cif-based collaborative decoding for end-to-end contextual speech recognition M Han, L Dong, S Zhou, B Xu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 21 | 2021 |
A comparison of label-synchronous and frame-synchronous end-to-end models for speech recognition L Dong, C Yi, J Wang, S Zhou, S Xu, X Jia, B Xu arXiv preprint arXiv:2005.10113, 2020 | 15 | 2020 |
Language-specific acoustic boundary learning for mandarin-english code-switching speech recognition Z Fan, L Dong, C Shen, Z Liang, J Zhang, L Lu, Z Ma arXiv preprint arXiv:2306.05279, 2023 | 5 | 2023 |
Sequence-level speaker change detection with difference-based continuous integrate-and-fire Z Fan, L Dong, M Cai, Z Ma, B Xu IEEE Signal Processing Letters 29, 1551-1554, 2022 | 5 | 2022 |
Boosting Character-Based Chinese Speech Synthesis via Multi-Task Learning and Dictionary Tutoring. Y Zou, L Dong, B Xu INTERSPEECH, 2055-2059, 2019 | 5 | 2019 |
Syllable-based acoustic modeling with CTC for multi-scenarios Mandarin speech recognition Y Zhao, L Dong, S Xu, B Xu 2018 International Joint Conference on Neural Networks (IJCNN), 1-8, 2018 | 4 | 2018 |
Token-level speaker change detection using speaker difference and speech content via continuous integrate-and-fire Z Fan, Z Liang, L Dong, Y Liu, S Zhou, M Cai, J Zhang, Z Ma, B Xu arXiv preprint arXiv:2211.09381, 2022 | 2 | 2022 |
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition Y Bai, J Chen, J Chen, W Chen, Z Chen, C Ding, L Dong, Q Dong, Y Du, ... arXiv preprint arXiv:2407.04675, 2024 | 1 | 2024 |
SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR Z Fan, L Dong, J Zhang, L Lu, Z Ma ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Cif-pt: Bridging speech and text representations for spoken language understanding via continuous integrate-and-fire pre-training L Dong, Z An, P Wu, J Zhang, L Lu, Z Ma arXiv preprint arXiv:2305.17499, 2023 | 1 | 2023 |
Method, apparatus, device, and storage medium for speaker change point detection D Linhao, Z Fan, Z Ma US Patent 12,039,981, 2024 | | 2024 |
Voice recognition method and apparatus, medium, and electronic device D Linhao, Z Ma US Patent App. 18/288,531, 2024 | | 2024 |
Method and device of generating acoustic features, speech model training, and speech recognition D Linhao, Z Ma US Patent App. 18/427,538, 2024 | | 2024 |