Asvspoof 2019: Future horizons in spoofed and fake audio detection M Todisco, X Wang, V Vestman, M Sahidullah, H Delgado, A Nautsch, ... Proc. Interspeech, 1008-1012, 2019 | 616 | 2019 |
Investigating RNN-based speech enhancement methods for noise-robust Text-to-Speech. C Valentini-Botinhao, X Wang, S Takaki, J Yamagishi SSW, 146-152, 2016 | 396 | 2016 |
ASVspoof 2019: a large-scale public database of synthetized, converted and replayed speech X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ... Computer Speech & Language, 101114, 2020 | 323 | 2020 |
ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection J Yamagishi, X Wang, M Todisco, M Sahidullah, J Patino, A Nautsch, ... Proc. 2021 Edition of the Automatic Speaker Verification and Spoofing …, 2021 | 254 | 2021 |
Zero-shot multi-speaker text-to-speech with state-of-the-art neural speaker embeddings E Cooper, CI Lai, Y Yasuda, F Fang, X Wang, N Chen, J Yamagishi ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 188 | 2020 |
Neural source-filter-based waveform model for statistical parametric speech synthesis X Wang, S Takaki, J Yamagishi ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 159 | 2019 |
A comparative study on recent neural spoofing countermeasures for synthetic speech detection X Wang, J Yamagishi Proc. Interspeech, 4259--4263, 2021 | 153 | 2021 |
Neural source-filter waveform models for statistical parametric speech synthesis X Wang, S Takaki, J Yamagishi IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 402-415, 2019 | 146 | 2019 |
ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech A Nautsch, X Wang, N Evans, TH Kinnunen, V Vestman, M Todisco, ... IEEE Transactions on Biometrics, Behavior, and Identity Science 3 (2), 252-265, 2021 | 140 | 2021 |
Speaker anonymization using x-vector and neural waveform models F Fang, X Wang, J Yamagishi, I Echizen, M Todisco, N Evans, ... Proc. SSW, 155-160, 2019 | 138 | 2019 |
Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System Using Deep Recurrent Neural Networks. C Valentini-Botinhao, X Wang, S Takaki, J Yamagishi Interspeech, 352-356, 2016 | 122 | 2016 |
Introducing the VoicePrivacy initiative N Tomashenko, BML Srivastava, X Wang, E Vincent, A Nautsch, ... Proc. Interspeech, 1693--1697, 2020 | 120 | 2020 |
Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language Y Yasuda, X Wang, S Takaki, J Yamagishi ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 104 | 2019 |
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation H Tak, M Todisco, X Wang, J Jung, J Yamagishi, N Evans Proc. Odyssey, 2022 | 95 | 2022 |
Tandem assessment of spoofing countermeasures and automatic speaker verification: Fundamentals T Kinnunen, H Delgado, N Evans, KA Lee, V Vestman, A Nautsch, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2195-2210, 2020 | 94 | 2020 |
ASVspoof 2021: Towards spoofed and deepfake speech detection in the wild X Liu, X Wang, M Sahidullah, J Patino, H Delgado, T Kinnunen, ... IEEE/ACM Transaction on Audio, Speech, and Language Processing (accepted), 2022 | 92 | 2022 |
The VoicePrivacy 2020 Challenge: Results and findings N Tomashenko, X Wang, E Vincent, J Patino, BML Srivastava, PG Noé, ... Computer Speech & Language 74, 101362, 2022 | 88 | 2022 |
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data J Lorenzo-Trueba, F Fang, X Wang, I Echizen, J Yamagishi, T Kinnunen Proc. Speaker Odyssey, 240-247, 2018 | 86 | 2018 |
Investigating self-supervised front ends for speech spoofing countermeasures X Wang, J Yamagishi Proc. Odyssey, 100-106, 2022 | 82 | 2022 |
A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis X Wang, J Lorenzo-Trueba, S Takaki, L Juvela, J Yamagishi 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 77 | 2018 |