关注
Xuenan Xu
标题
引用次数
引用次数
年份
Investigating local and global information for automated audio captioning with transfer learning
X Xu, H Dinkel, M Wu, Z Xie, K Yu
ICASSP 2021-2021 IEEE international conference on acoustics, speech and …, 2021
592021
Predicting tensile properties of AZ31 magnesium alloys by machine learning
X Xu, L Wang, G Zhu, X Zeng
Jom 72 (11), 3935-3942, 2020
502020
A CRNN-GRU Based Reinforcement Learning Approach to Audio Captioning.
X Xu, H Dinkel, M Wu, K Yu
DCASE, 225-229, 2020
492020
Voice activity detection in the wild: A data-driven approach using teacher-student training
H Dinkel, S Wang, X Xu, M Wu, K Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1542-1555, 2021
462021
Can audio captions be evaluated with image caption metrics?
Z Zhou, Z Zhang, X Xu, Z Xie, M Wu, KQ Zhu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
382022
The SJTU system for DCASE2022 challenge task 6: Audio captioning with audio-text retrieval pre-training
X Xu, Z Xie, M Wu, K Yu
Tech. Rep., DCASE2022 Challenge, 2022
342022
Audio-text retrieval in context
S Lou, X Xu, M Wu, K Yu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
252022
Text-to-audio grounding: Building correspondence between captions and sound events
X Xu, H Dinkel, M Wu, K Yu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
242021
Audio caption in a car setting with a sentence-level loss
X Xu, H Dinkel, M Wu, K Yu
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
202021
The SJTU system for DCASE2021 challenge task 6: Audio captioning based on encoder pre-training and reinforcement learning
X Xu, Z Xie, M Wu, K Yu
Proc. Conf. Detection Classification Acoust. Scenes Events, 1-4, 2021
162021
Sound-based construction activity monitoring with deep learning
W Xiong, X Xu, L Chen, J Yang
Buildings 12 (11), 1947, 2022
132022
A comprehensive survey of automated audio captioning
X Xu, M Wu, K Yu
arXiv preprint arXiv:2205.05357, 2022
132022
Automatic detection pipeline for accessing the motor severity of Parkinson’s disease in finger tapping and postural stability
N Yang, DF Liu, T Liu, T Han, P Zhang, X Xu, S Lou, HG Liu, AC Yang, ...
IEEE Access 10, 66961-66973, 2022
112022
Enhance temporal relations in audio captioning with sound event detection
Z Xie, X Xu, M Wu, K Yu
arXiv preprint arXiv:2306.01533, 2023
102023
Blat: Bootstrapping language-audio pre-training based on audioset tag-guided synthetic data
X Xu, Z Zhang, Z Zhou, P Zhang, Z Xie, M Wu, KQ Zhu
Proceedings of the 31st ACM International Conference on Multimedia, 2756-2764, 2023
92023
A Lightweight Framework for Online Voice Activity Detection in the Wild.
X Xu, H Dinkel, M Wu, K Yu
Interspeech, 371-375, 2021
92021
A large-scale dataset for audio-language representation learning
L Sun, X Xu, M Wu, W Xie
arXiv preprint arXiv:2309.11500, 2023
72023
Beyond the status quo: A contemporary survey of advances and challenges in audio captioning
X Xu, Z Xie, M Wu, K Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
62023
Diversity-controllable and accurate audio captioning based on neural condition
X Xu, M Wu, K Yu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
62022
Towards Weakly Supervised Text-to-Audio Grounding
X Xu, Z Ma, M Wu, K Yu
arXiv preprint arXiv:2401.02584, 2024
42024
系统目前无法执行此操作,请稍后再试。
文章 1–20