Waveform modeling and generation using hierarchical recurrent neural networks for speech bandwidth extension ZH Ling, Y Ai, Y Gu, LR Dai IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (5), 883-894, 2018 | 80 | 2018 |
Singing voice synthesis using deep autoregressive neural networks for acoustic modeling YH Yi, Y Ai, ZH Ling, LR Dai arXiv preprint arXiv:1906.08977, 2019 | 39 | 2019 |
A neural vocoder with hierarchical generation of amplitude and phase spectra for statistical parametric speech synthesis Y Ai, ZH Ling IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 839-851, 2020 | 35 | 2020 |
SampleRNN-based neural vocoder for statistical parametric speech synthesis Y Ai, HC Wu, ZH Ling 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 33 | 2018 |
MP-SENet: A speech enhancement model with parallel denoising of magnitude and phase spectra YX Lu, Y Ai, ZH Ling arXiv preprint arXiv:2305.13686, 2023 | 26 | 2023 |
Bddr: An effective defense against textual backdoor attacks K Shao, J Yang, Y Ai, H Liu, Y Zhang Computers & Security 110, 102433, 2021 | 23 | 2021 |
Neural speech phase prediction based on parallel estimation architecture and anti-wrapping losses Y Ai, ZH Ling ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 17 | 2023 |
DNN-based spectral enhancement for neural waveform generators with low-bit quantization Y Ai, JX Zhang, L Chen, ZH Ling ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 10 | 2019 |
APNet: An all-frame-level neural vocoder incorporating direct prediction of amplitude and phase spectra Y Ai, ZH Ling IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 2145-2157, 2023 | 9 | 2023 |
The USTC-NERCSLIP System for the Track 1.2 of Audio Deepfake Detection (ADD 2023) Challenge. H Wu, Z Li, L Xu, Z Zhang, W Zhao, B Gu, Y Ai, Y Lu, J Zhang, Z Ling, ... DADA@ IJCAI, 119-124, 2023 | 9 | 2023 |
Knowledge-and-data-driven amplitude spectrum prediction for hierarchical neural vocoders Y Ai, ZH Ling arXiv preprint arXiv:2004.07832, 2020 | 9 | 2020 |
APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding Y Ai, XH Jiang, YX Lu, HP Du, ZH Ling arXiv preprint arXiv:2402.10533, 2024 | 7 | 2024 |
A Light CNN with Split Batch Normalization for Spoofed Speech Detection Using Data Augmentation H Lin, Y Ai, Z Ling 2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022 | 5 | 2022 |
Reverberation modeling for source-filter-based neural vocoder Y Ai, X Wang, J Yamagishi, ZH Ling arXiv preprint arXiv:2005.07379, 2020 | 5 | 2020 |
Towards high-quality and efficient speech bandwidth extension with parallel amplitude and phase prediction YX Lu, Y Ai, HP Du, ZH Ling arXiv preprint arXiv:2401.06387, 2024 | 4 | 2024 |
Zero-shot personalized lip-to-speech synthesis with face image based voice control ZY Sheng, Y Ai, ZH Ling ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 4 | 2023 |
Incorporating ultrasound tongue images for audio-visual speech enhancement through knowledge distillation RC Zheng, Y Ai, ZH Ling arXiv preprint arXiv:2305.14933, 2023 | 4 | 2023 |
Denoising-and-dereverberation hierarchical neural vocoder for statistical parametric speech synthesis Y Ai, ZH Ling, WL Wu, A Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 2036-2048, 2022 | 4 | 2022 |
Denoising-and-dereverberation hierarchical neural vocoder for robust waveform generation Y Ai, H Li, X Wang, J Yamagishi, Z Ling 2021 IEEE Spoken Language Technology Workshop (SLT), 477-484, 2021 | 4 | 2021 |
Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment ZY Sheng, Y Ai, YN Chen, ZH Ling Proceedings of the 31st ACM International Conference on Multimedia, 8443-8452, 2023 | 3 | 2023 |