Jiannan Wu
Verified email at connect.hku.hk
Title
Cited by
Year
VisionLLM: Large language model is also an open-ended decoder for vision-centric tasks
W Wang, Z Chen, X Chen, J Wu, X Zhu, G Zeng, P Luo, T Lu, J Zhou, ...
Advances in Neural Information Processing Systems (NeurIPS), 2023
Cited by: 238; Year: 2023
Language as queries for referring video object segmentation
J Wu, Y Jiang, P Sun, Z Yuan, P Luo
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 116; Year: 2022
Universal instance perception as object discovery and retrieval
B Yan, Y Jiang, J Wu, D Wang, P Luo, Z Yuan, H Lu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 106; Year: 2023
InternVL: Scaling up vision foundation models and aligning for generic visual-linguistic tasks
Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, Z Muyan, Q Zhang, X Zhu, ...
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
Cited by: 96*; Year: 2024
Watch only once: An end-to-end video action detection framework
S Chen, P Sun, E Xie, C Ge, J Wu, L Ma, J Shen, P Luo
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by: 54; Year: 2021
Self-supervised video representation learning with motion-aware masked autoencoders
H Yang, D Huang, B Wen, J Wu, H Yao, Y Jiang, X Zhu, Z Yuan
arXiv preprint arXiv:2210.04154, 2022
Cited by: 12; Year: 2022
Development of an effective model for computing rightmost eigenvalues of power systems with inclusion of time delays
C Li, J Wu, C Duan, Z Du
IEEE Transactions on Power Systems 34 (6), 4216-4227, 2019
Cited by: 11; Year: 2019
Towards high-quality temporal action detection with sparse proposals
J Wu, P Sun, S Chen, J Yang, Z Qi, L Ma, P Luo
arXiv preprint arXiv:2109.08847, 2021
Cited by: 9; Year: 2021
Segment every reference object in spatial and temporal spaces
J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 7; Year: 2023
The first visual object tracking segmentation VOTS2023 challenge results
M Kristan, J Matas, M Danelljan, M Felsberg, HJ Chang, LČ Zajc, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 5; Year: 2023
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
C Ma, Y Jiang, J Wu, Z Yuan, X Qi
arXiv preprint arXiv:2404.13013, 2024
Cited by: 4; Year: 2024
Exploring transformers for open-world instance segmentation
J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 4; Year: 2023
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces
J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo
arXiv preprint arXiv:2312.15715, 2023
Cited by: 1; Year: 2023
A Simple Baseline for Open-World Tracking via Self-training
B Wang, T Li, J Wu, Y Jiang, H Lu, Y He
Proceedings of the 31st ACM International Conference on Multimedia, 2765-2774, 2023
Cited by: 1; Year: 2023
Multi-Level Contrastive Learning for Dense Prediction Task
Q Guo, Y Yu, Y Jiang, J Wu, Z Yuan, P Luo
arXiv preprint arXiv:2304.02010, 2023
Cited by: 1; Year: 2023
Method, apparatus, device, and medium for processing visual task by generic model
Y Jiang, B Yan, J Wu, Z Yuan
US Patent App. 18/531,091, 2024
Year: 2024
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
J Wu, M Zhong, S Xing, Z Lai, Z Liu, W Wang, Z Chen, X Zhu, L Lu, T Lu, ...
arXiv preprint arXiv:2406.08394, 2024
Year: 2024
Method, apparatus, device and medium for processing image using machine learning model
Y Jiang, J Wu, B Yan, Y Zehuan
US Patent App. 18/499,066, 2024
Year: 2024