Diffsound: Discrete diffusion model for text-to-sound generation D Yang, J Yu, H Wang, W Wang, C Weng, Y Zou, D Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1720-1733, 2023 | 204 | 2023 |
Environmental sound classification with parallel temporal-spectral attention H Wang, Y Zou, D Chong, W Wang Proc. Interspeech 2020, 821–825, 2020 | 55 | 2020 |
SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification H Wang, Y Zou, W Wang Proc. Interspeech 2021, 2021 | 54 | 2021 |
Contrastive self-supervised learning for text-independent speaker verification H Zhang, Y Zou, H Wang ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 48 | 2021 |
Masked spectrogram prediction for self-supervised audio pre-training H Wang, D Chong, P Zhou, Q Zeng ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2022 | 43* | 2022 |
Benchmarking large language models on cmexam-a comprehensive chinese medical exam dataset J Liu, P Zhou, Y Hua, D Chong, Z Tian, A Liu, H Wang, C You, Z Guo, ... Advances in Neural Information Processing Systems 36, 2024 | 37 | 2024 |
Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Textual Information Z Ye, H Wang, D Yang, Y Zou DCASE2021 Challenge, 2021 | 31* | 2021 |
Audio-Oriented Multimodal Machine Comprehension via Dynamic Inter-and Intra-modality Attention Z Huang, F Liu, X Wu, S Ge, H Wang, W Fan, Y Zou Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021), 2021 | 27 | 2021 |
Acoustic Scene Classification with Spectrogram Processing Strategies H Wang, Y Zou, D Chong Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), 2020 | 27* | 2020 |
A Mutual learning framework for Few-shot Sound Event Detection D Yang, H Wang, Y Zou, Z Ye, W Wang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 25 | 2022 |
Automated Audio Captioning with Temporal Attention H Wang, B Yang, Y Zou, D Chong DCASE2020 Challenge, 2020 | 14 | 2020 |
What affects the performance of convolutional neural networks for audio event classification H Wang, D Chong, D Huang, Y Zou 2019 8th International Conference on Affective Computing and Intelligent …, 2019 | 14 | 2019 |
NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS D Yang, S Liu, J Yu, H Wang, C Weng, Y Zou Proc. Interspeech 2023, 2022 | 13 | 2022 |
Few-shot Bioacoustic Event Detection: A Good Transductive Inference is All You Need D Yang, H Wang, Z Ye, Y Zou DCASE2021 Challenge, 2021 | 13 | 2021 |
Modeling label dependencies for audio tagging with graph convolutional network H Wang, Y Zou, D Chong, W Wang IEEE Signal Processing Letters 27, 1560-1564, 2020 | 13 | 2020 |
DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model H Wang, T Thebaud, J Villalba, M Sydnor, B Lammers, N Dehak, ... Proc. Interspeech 2023, 2023 | 9 | 2023 |
TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation H Wang, B Wu, L Chen, M Yu, J Yu, Y Xu, SX Zhang, C Weng, D Su, D Yu Proc. Interspeech 2021, 2021 | 9 | 2021 |
Detect what you want: Target sound detection H Wang, D Yang, Y Zou, C Weng DCASE2022 Workshop, 2021 | 8 | 2021 |
A global-local attention framework for weakly labelled audio tagging H Wang, Y Zou, W Wang ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 8 | 2021 |
Unsupervised Multi-Target Domain Adaptation for Acoustic Scene Classification D Yang, H Wang, Y Zou Proc. Interspeech 2021, 2021 | 7 | 2021 |