Improving Aggregation and Loss Function for Better Embedding Learning in End-to-End Speaker Verification System Z Gao, Y Song, IV McLoughlin, P Li, Y Jiang, LR Dai INTERSPEECH 2019, 361-365, 2019 | 82 | 2019 |
Paraformer: Fast and accurate parallel transformer for non-autoregressive end-to-end speech recognition Z Gao, S Zhang, I McLoughlin, Z Yan arXiv preprint arXiv:2206.08317, 2022 | 50 | 2022 |
An Effective Deep Embedding Learning Architecture for Speaker Verification Y Jiang, Y Song, IV McLoughlin, Z Gao, LR Dai INTERSPEECH 2019, 4040-4044, 2019 | 34 | 2019 |
An improved deep embedding learning method for short duration speaker verification Z Gao, Y Song, IV McLoughlin, W Guo, LR Dai INTERSPEECH 2018, 3578-3582, 2018 | 30 | 2018 |
San-m: Memory equipped self-attention for end-to-end speech recognition Z Gao, S Zhang, M Lei, I McLoughlin INTERSPEECH 2020, 6-10, 2020 | 29 | 2020 |
Streaming chunk-aware multihead attention for online end-to-end speech recognition S Zhang, Z Gao, H Luo, M Lei, J Gao, Z Yan, L Xie INTERSPEECH 2020, 2142-2146, 2020 | 29 | 2020 |
FunASR: A Fundamental End-to-End Speech Recognition Toolkit Z Gao, Z Li, J Wang, H Luo, X Shi, M Chen, Y Li, L Zuo, Z Du, Z Xiao, ... INERSPEECH 2023, 2023 | 19 | 2023 |
Lauragpt: Listen, attend, understand, and regenerate audio with gpt Q Chen, Y Chu, Z Gao, Z Li, K Hu, X Zhou, J Xu, Z Ma, W Wang, S Zheng, ... arXiv preprint arXiv:2310.04673, 2023 | 17 | 2023 |
Extremely Low Footprint End-to-End ASR System for Smart Device Z Gao, Y Yao, S Zhang, J Yang, M Lei, I McLoughlin INTERSPEECH 2021, 4548-4552, 2021 | 15 | 2021 |
Universal ASR: Unifying streaming and non-streaming ASR using a single encoder-decoder model Z Gao, S Zhang, M Lei, I McLoughlin arXiv preprint arXiv:2010.14099, 2020 | 14 | 2020 |
emotion2vec: Self-supervised pre-training for speech emotion representation Z Ma, Z Zheng, J Ye, J Li, Z Gao, S Zhang, X Chen arXiv preprint arXiv:2312.15185, 2023 | 9 | 2023 |
SeACo-Paraformer: A non-autoregressive ASR system with flexible and effective hotword customization ability X Shi, Y Yang, Z Li, Y Chen, Z Gao, S Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
An Embarrassingly Simple Approach for LLM with Strong ASR Capacity Z Ma, G Yang, Y Yang, Z Gao, J Wang, Z Du, F Yu, Q Chen, S Zheng, ... arXiv preprint arXiv:2402.08846, 2024 | 4 | 2024 |
MaLa-ASR: Multimedia-Assisted LLM-Based ASR G Yang, Z Ma, F Yu, Z Gao, S Zhang, X Chen arXiv preprint arXiv:2406.05839, 2024 | | 2024 |
Wav2vec‐MoE: An unsupervised pre‐training and adaptation method for multi‐accent ASR Y Lin, S Zhang, Z Gao, L Wang, Y Yang, J Dang Electronics Letters 59 (11), e12823, 2023 | | 2023 |
Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System X Shi, H Luo, Z Gao, S Zhang, Z Yan INERSPEECH 2023, 2023 | | 2023 |
Streaming End-to-End Speech Recognition Method, Apparatus and Electronic Device S Zhang, GAO Zhifu US Patent App. 17/976,464, 2023 | | 2023 |
Speech Processing method, Speech Encoder, Speech Decoder and Speech Recognition System S Zhang, GAO Zhifu, M Lei US Patent App. 17/951,569, 2023 | | 2023 |