UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection Y Liu, S Li, Y Wu, CW Chen, Y Shan, X Qie Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 134 | 2022 |
ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection Y Liu, J Yuan, CW Chen Proceedings of the 28th ACM International Conference on Multimedia, 4235-4243, 2020 | 91 | 2020 |
Learning to aggregate multi-scale context for instance segmentation in remote sensing images Y Liu, H Li, C Hu, S Luo, Y Luo, CW Chen IEEE Transactions on Neural Networks and Learning Systems, 2024 | 51* | 2024 |
Timestamps as Prompts for Geography-Aware Location Recommendation Y Luo, H Duan, Y Liu, F Chung Proceedings of the 32nd ACM International Conference on Information and …, 2023 | 6 | 2023 |
End-to-end personalized next location recommendation via contrastive user preference modeling Y Luo, Y Liu, F Chung, Y Liu, CW Chen arXiv preprint arXiv:2303.12507, 2023 | 5 | 2023 |
-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding Y Liu, J He, W Li, J Kim, D Wei, H Pfister, CW Chen arXiv preprint arXiv:2404.00801, 2024 | 4 | 2024 |
ET Bench: Towards Open-Ended Event-Level Video-Language Understanding Y Liu, Z Ma, Z Qi, Y Wu, Y Shan, CW Chen arXiv preprint arXiv:2409.18111, 2024 | | 2024 |