所有版本 - 学术资源搜索

文章

学术资源搜索

获得 3 条结果（用时0.02秒）

Ferret: Refer and ground anything anywhere at any granularity

H You, H Zhang, Z Gan, X Du, B Zhang, Z Wang… - arXiv preprint arXiv …, 2023 - arxiv.org

We introduce Ferret, a new Multimodal Large Language Model (MLLM) capable of
understanding spatial referring of any shape or granularity within an image and accurately …

被引用次数：110 相关文章

Ferret: Refer and Ground Anything Anywhere at Any Granularity

H You, H Zhang, Z Gan, X Du, B Zhang, Z Wang… - The Twelfth International … - openreview.net

We introduce Ferret, a new Multimodal Large Language Model (MLLM) capable of
understanding spatial referring of any shape or granularity within an image and accurately …

Ferret: Refer and Ground Anything Anywhere at Any Granularity

H You, H Zhang, Z Gan, X Du, B Zhang… - arXiv e …, 2023 - ui.adsabs.harvard.edu

We introduce Ferret, a new Multimodal Large Language Model (MLLM) capable of
understanding spatial referring of any shape or granularity within an image and accurately …

高级搜索

QQ 群

Ferret: Refer and ground anything anywhere at any granularity

Ferret: Refer and Ground Anything Anywhere at Any Granularity

Ferret: Refer and Ground Anything Anywhere at Any Granularity

引用