Jiannan Wu 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	665	663
h 指数	8	8
i10 指数	7	7

440

220

110

330

20212022202320246 27 200 427

开放获取的出版物数量

查看全部

6 篇文章

1 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Ping Luo (羅平)Associate Professor, The University of Hong Kong在 hku.hk 的电子邮件经过验证
Zehuan YuanBytedance Inc.在 bytedance.com 的电子邮件经过验证
Yi JiangBytedance在 bytedance.com 的电子邮件经过验证
Wenhai Wang (王文海)CUHK | Shanghai AI Laboratory | NJU在 cuhk.edu.hk 的电子邮件经过验证
Zhe Chen (陈喆)PhD candidate, Nanjing University在 smail.nju.edu.cn 的电子邮件经过验证
Jifeng DaiAssociate Professor of EE, Tsinghua University; Adjuct Researcher of Shanghai AI Laboratory在 tsinghua.edu.cn 的电子邮件经过验证
Bin YanPhD student of Computer Vision, Dalian University of Technology在 mail.dlut.edu.cn 的电子邮件经过验证
Peize SunMeta, FAIR在 meta.com 的电子邮件经过验证

关注

Jiannan Wu

The University of Hong Kong

在 connect.hku.hk 的电子邮件经过验证 - 首页

Computer Vision Video Understanding Multimodal LLMs


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Visionllm: Large language model is also an open-ended decoder for vision-centric tasks W Wang, Z Chen, X Chen, J Wu, X Zhu, G Zeng, P Luo, T Lu, J Zhou, ... Advances in Neural Information Processing Systems (NeurIPS), 2023	238	2023
Language as queries for referring video object segmentation J Wu, Y Jiang, P Sun, Z Yuan, P Luo Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	116	2022
Universal instance perception as object discovery and retrieval B Yan, Y Jiang, J Wu, D Wang, P Luo, Z Yuan, H Lu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	106	2023
Internvl: Scaling up vision foundation models and aligning for generic visual-linguistic tasks Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, Z Muyan, Q Zhang, X Zhu, ... IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024	96*	2024
Watch only once: An end-to-end video action detection framework S Chen, P Sun, E Xie, C Ge, J Wu, L Ma, J Shen, P Luo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	54	2021
Self-supervised video representation learning with motion-aware masked autoencoders H Yang, D Huang, B Wen, J Wu, H Yao, Y Jiang, X Zhu, Z Yuan arXiv preprint arXiv:2210.04154, 2022	12	2022
Development of an effective model for computing rightmost eigenvalues of power systems with inclusion of time delays C Li, J Wu, C Duan, Z Du IEEE Transactions on Power Systems 34 (6), 4216-4227, 2019	11	2019
Towards high-quality temporal action detection with sparse proposals J Wu, P Sun, S Chen, J Yang, Z Qi, L Ma, P Luo arXiv preprint arXiv:2109.08847, 2021	9	2021
Segment every reference object in spatial and temporal spaces J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	7	2023
The first visual object tracking segmentation vots2023 challenge results M Kristan, J Matas, M Danelljan, M Felsberg, HJ Chang, LČ Zajc, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	5	2023
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models C Ma, Y Jiang, J Wu, Z Yuan, X Qi arXiv preprint arXiv:2404.13013, 2024	4	2024
Exploring transformers for open-world instance segmentation J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	4	2023
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo arXiv preprint arXiv:2312.15715, 2023	1	2023
A Simple Baseline for Open-World Tracking via Self-training B Wang, T Li, J Wu, Y Jiang, H Lu, Y He Proceedings of the 31st ACM International Conference on Multimedia, 2765-2774, 2023	1	2023
Multi-Level Contrastive Learning for Dense Prediction Task Q Guo, Y Yu, Y Jiang, J Wu, Z Yuan, P Luo arXiv preprint arXiv:2304.02010, 2023	1	2023
METHOD, APPARATUS, DEVICE, AND MEDIUM FOR PROCESSING VISUAL TASK BY GENERIC MODEL Y Jiang, B Yan, J Wu, Z Yuan US Patent App. 18/531,091, 2024		2024
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks J Wu, M Zhong, S Xing, Z Lai, Z Liu, W Wang, Z Chen, X Zhu, L Lu, T Lu, ... arXiv preprint arXiv:2406.08394, 2024		2024
Method, apparatus, device and medium for processing image using machine learning model Y Jiang, J Wu, B Yan, Y Zehuan US Patent App. 18/499,066, 2024		2024

系统目前无法执行此操作，请稍后再试。

文章 1–18

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用