AudioLDM: Text-to-Audio Generation with Latent Diffusion Models H Liu, Z Chen, Y Yuan, X Mei, X Liu, D Mandic, W Wang, MD Plumbley International Conference on Machine Learning (ICML), 2023, 2023 | 274 | 2023 |
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining H Liu, Q Tian, Y Yuan, X Liu, X Mei, Q Kong, Y Wang, W Wang, Y Wang, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 68* | 2024 |
Audio Captioning Transformer X Mei, X Liu, Q Huang, MD Plumbley, W Wang Proceedings of the Detection and Classification of Acoustic Scenes and …, 2021 | 68 | 2021 |
Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning X Liu, T Iqbal, J Zhao, Q Huang, MD Plumbley, W Wang 2021 IEEE 31st International Workshop on Machine Learning for Signal …, 2021 | 57 | 2021 |
On Metric Learning for Audio-Text Cross-Modal Retrieval X Mei, X Liu, J Sun, MD Plumbley, W Wang INTERSPEECH 2022, 2022 | 48 | 2022 |
An Encoder-Decoder Based Audio Captioning System With Transfer and Reinforcement Learning X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ... Proceedings of the Detection and Classification of Acoustic Scenes and …, 2021 | 46 | 2021 |
Automated Audio Captioning: An Overview of Recent Progress and New Challenges X Mei, X Liu, MD Plumbley, W Wang EURASIP Journal on Audio, Speech, and Music Processing 2022 (1), 1-18, 2022 | 37 | 2022 |
Separate What You Describe: Language-Queried Audio Source Separation X Liu, H Liu, Q Kong, X Mei, J Zhao, Q Huang, MD Plumbley, W Wang INTERSPEECH 2022, 2022 | 36* | 2022 |
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration H Liu*, X Liu*, Q Kong, Q Tian, Y Zhao, DL Wang INTERSPEECH 2022, 2022 | 30* | 2022 |
Leveraging Pre-trained BERT for Audio Captioning X Liu, X Mei, Q Huang, J Sun, J Zhao, H Liu, MD Plumbley, V Kılıç, ... EUSIPCO 2022, 2022 | 28 | 2022 |
Neural Vocoder is All You Need for Speech Super-resolution H Liu, W Choi, X Liu, Q Kong, Q Tian, DL Wang INTERSPEECH 2022, 2022 | 27 | 2022 |
Diverse Audio Captioning via Adversarial Training X Mei, X Liu, J Sun, MD Plumbley, W Wang Proceedings of the IEEE International Conference on Acoustics, Speech, and …, 2022 | 25 | 2022 |
CL4AC: A Contrastive Loss for Audio Captioning X Liu*, Q Huang*, X Mei, T Ko, HL Tang, MD Plumbley, W Wang Proceedings of the Detection and Classification of Acoustic Scenes and …, 2021 | 25 | 2021 |
Token-Level Supervised Contrastive Learning for Punctuation Restoration Q Huang, T Ko, HL Tang, X Liu, B Wu INTERSPEECH 2021, 2021 | 22 | 2021 |
Language-based Audio Retrieval With Pre-trained Models X Mei, X Liu, H Liu, J Sun, MD Plumbley, W Wang DCASE2022 Challenge, Tech. Rep, 2022 | 20 | 2022 |
An Encoder-decoder Based Audio Captioning System with Transfer and Reinforcement Learning for DCASE Challenge 2021 Task 6 X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ... DCASE2021 Challenge, Tech. Rep, 2021 | 17* | 2021 |
Separate Anything You Describe X Liu, Q Kong, Y Zhao, H Liu, Y Yuan, Y Liu, R Xia, Y Wang, MD Plumbley, ... arXiv preprint arXiv:2308.05037, 2023 | 16* | 2023 |
Visually-Aware Audio Captioning with Adaptive Audio-Visual Attention X Liu, Q Huang, X Mei, H Liu, Q Kong, J Sun, S Li, T Ko, Y Zhang, ... INTERSPEECH 2023, 2023 | 15* | 2023 |
Low-Complexity CNNs for Acoustic Scene Classification A Singh, J King, X Liu, MD Plumbley, W Wang DCASE2022 Challenge, Tech. Rep, 2022 | 14 | 2022 |
SynthVSR: Scaling Up Visual Speech Recognition with Synthetic Supervision X Liu, E Lakomkin, K Vougioukas, P Ma, H Chen, R Xie, M Doulaty, ... 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR …, 2023 | 13 | 2023 |