Interpreting clip with sparse linear concept embeddings (splice)

文章

学术资源搜索

获得 4 条结果（用时0.01秒）

我的图书馆

Interpreting clip with sparse linear concept embeddings (splice)

在引用文章中搜索

[PDF] researchgate.net

Zero-shot urban function inference with street view images through prompting a pretrained vision-language model

W Huang, J Wang, G Cong - International Journal of Geographical …, 2024 - Taylor & Francis

Inferring urban functions using street view images (SVIs) has gained tremendous
momentum. The recent prosperity of large-scale vision-language pretrained models sheds …

相关文章所有 2 个版本

[PDF] arxiv.org

DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor

J Wu, Z Ni, H Wang, W Yang, Y Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org

Image deep features extracted by pre-trained networks are known to contain rich and
informative representations. In this paper, we present Deep Degradation Response (DDR) …

相关文章所有 2 个版本

[PDF] arxiv.org

I Bet You Did Not Mean That: Testing Semantic Importance via Betting

J Teneggi, J Sulam - arXiv preprint arXiv:2405.19146, 2024 - arxiv.org

Recent works have extended notions of feature importance to\emph {semantic concepts}
that are inherently interpretable to the users interacting with a black-box predictive model …

相关文章所有 2 个版本

[PDF] arxiv.org

Gentle-CLIP: Exploring Aligned Semantic In Low-Quality Multimodal Data With Soft Alignment

Z Song, Z Zang, Y Wang, G Yang, J Zheng… - arXiv preprint arXiv …, 2024 - arxiv.org

Multimodal fusion breaks through the barriers between diverse modalities and has already
yielded numerous impressive performances. However, in various specialized fields, it is …

相关文章所有 2 个版本

高级搜索

QQ 群

Interpreting clip with sparse linear concept embeddings (splice)

Zero-shot urban function inference with street view images through prompting a pretrained vision-language model

DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor

I Bet You Did Not Mean That: Testing Semantic Importance via Betting

Gentle-CLIP: Exploring Aligned Semantic In Low-Quality Multimodal Data With Soft Alignment

引用