Occlude them all: Occlusion-aware attention network for occluded person re-id P Chen, W Liu, P Dai, J Liu, Q Ye, M Xu, Q Chen, R Ji Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 93 | 2021 |
Dual distribution alignment network for generalizable person re-identification P Chen, P Dai, J Liu, F Zheng, M Xu, Q Tian, R Ji Proceedings of the AAAI conference on artificial intelligence 35 (2), 1054-1062, 2021 | 51 | 2021 |
A challenger to gpt-4v? early explorations of gemini in visual expertise C Fu, R Zhang, H Lin, Z Wang, T Gao, Y Luo, Y Huang, Z Zhang, L Qiu, ... arXiv preprint arXiv:2312.12436, 2023 | 28 | 2023 |
Arm: Any-time super-resolution method B Chen, M Lin, K Sheng, M Zhang, P Chen, K Li, L Cao, R Ji European Conference on Computer Vision, 254-270, 2022 | 24 | 2022 |
Aha! adaptive history-driven attack for decision-based black-box models J Li, R Ji, P Chen, B Zhang, X Hong, R Zhang, S Li, J Li, F Huang, Y Wu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 19 | 2021 |
Open vocabulary object detection with proposal mining and prediction equalization P Chen, K Sheng, M Zhang, M Lin, Y Shen, S Lin, B Ren, K Li arXiv preprint arXiv:2206.11134, 2022 | 17 | 2022 |
Deep adversarial data augmentation with attribute guided for person re-identification Q Wu, P Dai, P Chen, Y Huang Signal, Image and Video Processing 15 (4), 655-662, 2021 | 16 | 2021 |
Multi-modal queried object detection in the wild Y Xu, M Zhang, C Fu, P Chen, X Yang, K Li, C Xu Advances in Neural Information Processing Systems 36, 2024 | 13 | 2024 |
Efficient decoder-free object detection with transformers P Chen, M Zhang, Y Shen, K Sheng, Y Gao, X Sun, K Li, C Shen European Conference on Computer Vision, 70-86, 2022 | 13 | 2022 |
Disentangling task-oriented representations for unsupervised domain adaptation P Dai, P Chen, Q Wu, X Hong, Q Ye, Q Tian, CW Lin, R Ji IEEE Transactions on Image Processing 31, 1012-1026, 2021 | 13 | 2021 |
Aligning and prompting everything all at once for universal visual perception Y Shen, C Fu, P Chen, M Zhang, K Li, X Sun, Y Wu, S Lin, R Ji Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 4 | 2024 |
MME: a comprehensive evaluation benchmark for multimodal large language models. CoRR abs/2306.13394 (2023) C Fu, P Chen, Y Shen, Y Qin, M Zhang, X Lin, Z Qiu, W Lin, J Yang, ... | 4 | 2023 |
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis C Fu, Y Dai, Y Luo, L Li, S Ren, R Zhang, Z Wang, C Zhou, Y Shen, ... arXiv preprint arXiv:2405.21075, 2024 | 3 | 2024 |
VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models C Zhou, M Zhang, P Chen, C Fu, Y Shen, X Zheng, X Sun, R Ji arXiv preprint arXiv:2406.10228, 2024 | | 2024 |
Cantor: Inspiring Multimodal Chain-of-Thought of MLLM T Gao, P Chen, M Zhang, C Fu, Y Shen, Y Zhang, S Zhang, X Zheng, ... arXiv preprint arXiv:2404.16033, 2024 | | 2024 |
Mme: A comprehensive evaluation benchmark for multimodal large language models C Fu, P Chen, Y Shen, Y Qin, M Zhang, X Lin, J Yang, X Zheng, K Li, ... arXiv preprint arXiv:2306.13394, 2023 | | 2023 |
Learning Task-oriented Disentangled Representations for Unsupervised Domain Adaptation P Dai, P Chen, Q Wu, X Hong, Q Ye, Q Tian, R Ji arXiv preprint arXiv:2007.13264, 2020 | | 2020 |
Video-based Person Re-identification with Two-stream Convolutional Network and Co-attentive Snippet Embedding P Chen, P Dai, Q Wu, Y Huang arXiv preprint arXiv:1905.11862, 2019 | | 2019 |