Vqmivc: Vector quantization and mutual information-based unsupervised speech representation disentanglement for one-shot voice conversion D Wang, L Deng, YT Yeung, X Chen, X Liu, H Meng arXiv preprint arXiv:2106.10132, 2021 | 116 | 2021 |
Any-to-many voice conversion with location-relative sequence-to-sequence modeling S Liu, Y Cao, D Wang, X Wu, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1717-1728, 2021 | 80 | 2021 |
End-to-end accent conversion without using native utterances S Liu, D Wang, Y Cao, L Sun, X Wu, S Kang, Z Wu, X Liu, D Su, D Yu, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 41 | 2020 |
Improved end-to-end dysarthric speech recognition via meta-learning based model re-initialization D Wang, J Yu, X Wu, L Sun, X Liu, H Meng 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 40 | 2021 |
End-to-end voice conversion via cross-modal knowledge distillation for dysarthric speech reconstruction D Wang, J Yu, X Wu, S Liu, L Sun, X Liu, H Meng ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 39 | 2020 |
Speech emotion recognition using sequential capsule networks X Wu, Y Cao, H Lu, S Liu, D Wang, Z Wu, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3280-3291, 2021 | 24 | 2021 |
Learning soft mask with DNN and DNN-SVM for multi-speaker DOA estimation using an acoustic vector sensor D Wang, Y Zou, W Wang Journal of the Franklin Institute 355 (4), 1692-1709, 2018 | 24 | 2018 |
Accurate and robust device-free localization approach via sparse representation in presence of noise and outliers DS Wang, XS Guo, YX Zou 2016 IEEE International conference on digital signal processing (DSP), 199-203, 2016 | 19 | 2016 |
VCVTS: Multi-speaker video-to-speech synthesis via cross-modal knowledge transfer from voice conversion D Wang, S Yang, D Su, X Liu, D Yu, H Meng ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 11 | 2022 |
Fcl-taco2: Towards fast, controllable and lightweight text-to-speech synthesis D Wang, L Deng, Y Zhang, N Zheng, YT Yeung, X Chen, X Liu, H Meng ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 10 | 2021 |
Disentangled speech representation learning for one-shot cross-lingual voice conversion using ß-vae H Lu, D Wang, X Wu, Z Wu, X Liu, H Meng 2022 IEEE Spoken Language Technology Workshop (SLT), 814-821, 2023 | 7 | 2023 |
Learning explicit prosody models and deep speaker embeddings for atypical voice conversion D Wang, S Liu, L Sun, X Wu, X Liu, H Meng arXiv preprint arXiv:2011.01678, 2020 | 7 | 2020 |
Unsupervised domain adaptation for dysarthric speech detection via domain adversarial training and mutual information minimization D Wang, L Deng, YT Yeung, X Chen, X Liu, H Meng arXiv preprint arXiv:2106.10127, 2021 | 6 | 2021 |
Learning a robust DOA estimation model with acoustic vector sensor cues Y Zou, R Gu, D Wang, A Jiang, CH Ritz 2017 Asia-Pacific Signal and Information Processing Association Annual …, 2017 | 6 | 2017 |
A deep convolutional encoder-decoder model for robust speech dereverberation DS Wang, YX Zou, W Shi 2017 22nd International Conference on Digital Signal Processing (DSP), 1-5, 2017 | 5 | 2017 |
A robust DBN-vector based speaker verification system under channel mismatch conditions DS Wang, YX Zou, JH Liu, YC Huang 2016 IEEE International Conference on Digital Signal Processing (DSP), 94-98, 2016 | 5 | 2016 |
Speaker identity preservation in dysarthric speech reconstruction by adversarial speaker adaptation D Wang, S Liu, X Wu, H Lu, L Sun, X Liu, H Meng ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 4 | 2022 |
Joint Noise and Reverberation Adaptive Learning for Robust Speaker DOA Estimation with an Acoustic Vector Sensor. D Wang, Y Zou INTERSPEECH, 821-825, 2018 | 4 | 2018 |
Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction X Chen, Y Wang, X Wu, D Wang, Z Wu, X Liu, H Meng ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 3 | 2024 |
Near-field source localization in complex indoor environment using uniform circular array X Guo, B Li, L Chu, D Wang 2014 IEEE China Summit & International Conference on Signal and Information …, 2014 | 2 | 2014 |