Siri: A simple selective retraining mechanism for transformer-based visual grounding M Qu, Y Wu, W Liu, Q Gong, X Liang, O Russakovsky, Y Zhao, Y Wei European Conference on Computer Vision, 546-562, 2022 | 25 | 2022 |
Learning to segment every referring object point by point M Qu, Y Wu, Y Wei, W Liu, X Liang, Y Zhao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 10 | 2023 |
RIO: A benchmark for reasoning intention-oriented objects in open environments M Qu, Y Wu, W Liu, X Liang, J Song, Y Zhao, Y Wei Advances in Neural Information Processing Systems 36, 2024 | 6 | 2024 |
Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention W Kang, M Qu, J Kini, Y Wei, M Shah, Y Yan arXiv preprint arXiv:2405.18295, 2024 | 4 | 2024 |
Actress: Active retraining for semi-supervised visual grounding W Kang, M Qu, Y Wei, Y Yan arXiv preprint arXiv:2407.03251, 2024 | 3 | 2024 |
ChatVTG: Video Temporal Grounding via Chat with Video Dialogue Large Language Models M Qu, X Chen, W Liu, A Li, Y Zhao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 3 | 2024 |
Single-Frame Supervision for Spatio-Temporal Video Grounding K Liu, M Qu, Y Liu, Y Wei, W Zhe, Y Zhao, W Liu IEEE Transactions on Pattern Analysis & Machine Intelligence, 1-17, 2024 | 2 | 2024 |
Supplementary Materials for “SiRi: A Simple Selective Retraining Mechanism for Transformer-based Visual Grounding” M Qu, Y Wu, W Liu, Q Gong, X Liang, O Russakovsky, Y Zhao, Y Wei | | |