所有版本 - 学术资源搜索

文章

学术资源搜索

获得 4 条结果（用时0.05秒）

Videocon: Robust video-language alignment via contrast captions

H Bansal, Y Bitton, I Szpektor… - Proceedings of the …, 2024 - openaccess.thecvf.com

Despite being (pre) trained on a massive amount of data state-of-the-art video-language
alignment models are not robust to semantically-plausible contrastive changes in the video …

被引用次数：5 相关文章

VideoCon: Robust Video-Language Alignment via Contrast Captions

H Bansal, Y Bitton, I Szpektor, KW Chang… - ICLR 2024 Workshop on … - openreview.net

Despite being (pre) trained on a massive amount of data, state-of-the-art video-language
alignment models are not robust to semantically-plausible contrastive changes in the video …

VideoCon: Robust Video-Language Alignment via Contrast Captions

H Bansal, Y Bitton, I Szpektor, KW Chang… - arXiv e …, 2023 - ui.adsabs.harvard.edu

Despite being (pre) trained on a massive amount of data, state-of-the-art video-language
alignment models are not robust to semantically-plausible contrastive changes in the video …

VideoCon: Robust Video-Language Alignment via Contrast Captions

H Bansal, Y Bitton, I Szpektor, KW Chang… - arXiv preprint arXiv …, 2023 - arxiv.org

Despite being (pre) trained on a massive amount of data, state-of-the-art video-language
alignment models are not robust to semantically-plausible contrastive changes in the video …

高级搜索

QQ 群

Videocon: Robust video-language alignment via contrast captions

VideoCon: Robust Video-Language Alignment via Contrast Captions

VideoCon: Robust Video-Language Alignment via Contrast Captions

VideoCon: Robust Video-Language Alignment via Contrast Captions

引用