关注
Xuguang Duan
Xuguang Duan
在 mails.tsinghua.edu.cn 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Weakly supervised dense event captioning in videos
X Duan, W Huang, C Gan, J Wang, W Zhu, J Huang
Advances in Neural Information Processing Systems 31, 2018
1582018
Avqa: A dataset for audio-visual question answering on videos
P Yang, X Wang, X Duan, H Chen, R Hou, C Jin, W Zhu
Proceedings of the 30th ACM international conference on multimedia, 3480-3491, 2022
422022
Memor: A dataset for multimodal emotion reasoning in videos
G Shen, X Wang, X Duan, H Li, W Zhu
Proceedings of the 28th ACM international conference on multimedia, 493-502, 2020
342020
Disenbooth: Disentangled parameter-efficient tuning for subject-driven text-to-image generation
H Chen, Y Zhang, X Wang, X Duan, Y Zhou, W Zhu
arXiv preprint arXiv:2305.03374 3, 2023
282023
Disenbooth: Identity-preserving disentangled tuning for subject-driven text-to-image generation
H Chen, Y Zhang, S Wu, X Wang, X Duan, Y Zhou, W Zhu
arXiv preprint arXiv:2305.03374, 2023
252023
STDMANet: Spatio-temporal differential multiscale attention network for small moving infrared target detection
P Yan, R Hou, X Duan, C Yue, X Wang, X Cao
IEEE transactions on geoscience and remote sensing 61, 1-16, 2023
192023
Learning-to-ask: Knowledge acquisition via 20 questions
Y Chen, B Chen, X Duan, JG Lou, Y Wang, W Zhu, Y Cao
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge …, 2018
172018
Curriculum-nas: Curriculum weight-sharing neural architecture search
Y Zhou, X Wang, H Chen, X Duan, C Guan, W Zhu
Proceedings of the 30th ACM International Conference on Multimedia, 6792-6801, 2022
122022
Dynamic spatio-temporal modular network for video question answering
Z Qian, X Wang, X Duan, H Chen, W Zhu
Proceedings of the 30th ACM International Conference on Multimedia, 4466-4477, 2022
112022
Deeplogic: Joint learning of neural perception and logical reasoning
X Duan, X Wang, P Zhao, G Shen, W Zhu
IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (4), 4321-4334, 2022
102022
Multi-modal contextual graph neural network for text visual question answering
Y Liang, X Wang, X Duan, W Zhu
2020 25th International Conference on Pattern Recognition (ICPR), 3491-3498, 2021
82021
Decouple before interact: Multi-modal prompt learning for continual visual question answering
Z Qian, X Wang, X Duan, P Qin, Y Li, W Zhu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
72023
Watch, reason and code: Learning to represent videos using program
X Duan, Q Wu, C Gan, Y Zhang, W Huang, A Van Den Hengel, W Zhu
Proceedings of the 27th ACM International Conference on Multimedia, 1543-1551, 2019
72019
Curriculum-listener: Consistency-and complementarity-aware audio-enhanced temporal sentence grounding
H Chen, X Wang, X Lan, H Chen, X Duan, J Jia, W Zhu
Proceedings of the 31st ACM International Conference on Multimedia, 3117-3128, 2023
52023
DisenDreamer: Subject-Driven Text-to-Image Generation with Sample-aware Disentangled Tuning
H Chen, Y Zhang, X Wang, X Duan, Y Zhou, W Zhu
IEEE Transactions on Circuits and Systems for Video Technology, 2024
42024
Intra-and Inter-Modal Curriculum for Multimodal Learning
Y Zhou, X Wang, H Chen, X Duan, W Zhu
Proceedings of the 31st ACM International Conference on Multimedia, 3724-3735, 2023
32023
Parametric visual program induction with function modularization
X Duan, X Wang, Z Zhang, W Zhu
International Conference on Machine Learning, 5643-5658, 2022
22022
H2V4Sports: Real-Time Horizontal-to-Vertical Video Converter for Sports Lives via Fast Object Detection and Tracking
Y Han, K Li, Z Song, W Feng, X Cao, S Guo, X Wang, X Duan, W Zhu
Proceedings of the 31st ACM International Conference on Multimedia, 9376-9378, 2023
12023
Unsupervised Image Sequence Registration and Enhancement for Infrared Small Target Detection
R Hou, P Yan, X Duan, X Wang
IEEE Transactions on Geoscience and Remote Sensing, 2024
2024
Modularized parametric visual program induction algorithm, device, medium and product
W Zhu, X Wang, D Xuguang
US Patent App. 18/197,746, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–20