关注
Yang Ai
Yang Ai
在 ustc.edu.cn 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Waveform modeling and generation using hierarchical recurrent neural networks for speech bandwidth extension
ZH Ling, Y Ai, Y Gu, LR Dai
IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (5), 883-894, 2018
802018
Singing voice synthesis using deep autoregressive neural networks for acoustic modeling
YH Yi, Y Ai, ZH Ling, LR Dai
arXiv preprint arXiv:1906.08977, 2019
392019
A neural vocoder with hierarchical generation of amplitude and phase spectra for statistical parametric speech synthesis
Y Ai, ZH Ling
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 839-851, 2020
352020
SampleRNN-based neural vocoder for statistical parametric speech synthesis
Y Ai, HC Wu, ZH Ling
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
332018
MP-SENet: A speech enhancement model with parallel denoising of magnitude and phase spectra
YX Lu, Y Ai, ZH Ling
arXiv preprint arXiv:2305.13686, 2023
262023
Bddr: An effective defense against textual backdoor attacks
K Shao, J Yang, Y Ai, H Liu, Y Zhang
Computers & Security 110, 102433, 2021
232021
Neural speech phase prediction based on parallel estimation architecture and anti-wrapping losses
Y Ai, ZH Ling
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
172023
DNN-based spectral enhancement for neural waveform generators with low-bit quantization
Y Ai, JX Zhang, L Chen, ZH Ling
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
102019
APNet: An all-frame-level neural vocoder incorporating direct prediction of amplitude and phase spectra
Y Ai, ZH Ling
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 2145-2157, 2023
92023
The USTC-NERCSLIP System for the Track 1.2 of Audio Deepfake Detection (ADD 2023) Challenge.
H Wu, Z Li, L Xu, Z Zhang, W Zhao, B Gu, Y Ai, Y Lu, J Zhang, Z Ling, ...
DADA@ IJCAI, 119-124, 2023
92023
Knowledge-and-data-driven amplitude spectrum prediction for hierarchical neural vocoders
Y Ai, ZH Ling
arXiv preprint arXiv:2004.07832, 2020
92020
APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding
Y Ai, XH Jiang, YX Lu, HP Du, ZH Ling
arXiv preprint arXiv:2402.10533, 2024
72024
A Light CNN with Split Batch Normalization for Spoofed Speech Detection Using Data Augmentation
H Lin, Y Ai, Z Ling
2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022
52022
Reverberation modeling for source-filter-based neural vocoder
Y Ai, X Wang, J Yamagishi, ZH Ling
arXiv preprint arXiv:2005.07379, 2020
52020
Towards high-quality and efficient speech bandwidth extension with parallel amplitude and phase prediction
YX Lu, Y Ai, HP Du, ZH Ling
arXiv preprint arXiv:2401.06387, 2024
42024
Zero-shot personalized lip-to-speech synthesis with face image based voice control
ZY Sheng, Y Ai, ZH Ling
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
42023
Incorporating ultrasound tongue images for audio-visual speech enhancement through knowledge distillation
RC Zheng, Y Ai, ZH Ling
arXiv preprint arXiv:2305.14933, 2023
42023
Denoising-and-dereverberation hierarchical neural vocoder for statistical parametric speech synthesis
Y Ai, ZH Ling, WL Wu, A Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 2036-2048, 2022
42022
Denoising-and-dereverberation hierarchical neural vocoder for robust waveform generation
Y Ai, H Li, X Wang, J Yamagishi, Z Ling
2021 IEEE Spoken Language Technology Workshop (SLT), 477-484, 2021
42021
Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment
ZY Sheng, Y Ai, YN Chen, ZH Ling
Proceedings of the 31st ACM International Conference on Multimedia, 8443-8452, 2023
32023
系统目前无法执行此操作,请稍后再试。
文章 1–20