A 5.1 pJ/neuron 127.3 us/inference RNN-based speech recognition processor using 16 computing-in-memory SRAM macros in 65nm CMOS R Guo, Y Liu, S Zheng, SY Wu, P Ouyang, WS Khwa, X Chen, JJ Chen, ... 2019 Symposium on VLSI Circuits, C120-C121, 2019 | 102 | 2019 |
Small-footprint keyword spotting with graph convolutional network X Chen, S Yin, D Song, P Ouyang, L Liu, S Wei 2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019 | 34 | 2019 |
Transformer with bidirectional decoder for speech recognition X Chen, S Zhang, D Song, P Ouyang, S Yin arXiv preprint arXiv:2008.04481, 2020 | 21 | 2020 |
Amphion: An open-source audio, music and speech generation toolkit X Zhang, L Xue, Y Wang, Y Gu, X Chen, Z Fang, H Chen, L Zou, C Wang, ... arXiv preprint arXiv:2312.09911, 2023 | 10 | 2023 |
LLaST: Improved end-to-end speech translation system leveraged by large language models X Chen, S Zhang, Q Bai, K Chen, S Nakamura arXiv preprint arXiv:2407.15415, 2024 | 3 | 2024 |
Sptts: Parallel speech synthesis without extra aligner model Z Zhao, X Chen, H Liu, X Wang, L Yang, J Wang 2021 Asia-Pacific Signal and Information Processing Association Annual …, 2021 | 2 | 2021 |
GLMSnet: Single channel speech separation framework in noisy and reverberant environments H Shi, X Chen, T Kong, S Yin, P Ouyang 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 1 | 2021 |
Transfer the linguistic representations from TTS to accent conversion with non-parallel data X Chen, J Pei, L Xue, M Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |