Moving Off-the-Grid: Scene-Grounded Video Representations- 学术资源搜索

文章

学术资源搜索

我的图书馆

[引用][C] Moving offthe-grid: Scene-grounded video representations

S van Steenkiste, D Zoran, Y Yang, Y Rubanova… - 2024 - NeurIPS

被引用次数：2 相关文章

[PDF] arxiv.org

Moving Off-the-Grid: Scene-Grounded Video Representations

S van Steenkiste, D Zoran, Y Yang, Y Rubanova… - arXiv preprint arXiv …, 2024 - arxiv.org

Current vision models typically maintain a fixed correspondence between their
representation structure and image space. Each layer comprises a set of tokens arranged"
on-the-grid," which biases patches or tokens to encode information at a specific spatio (-
temporal) location. In this work we present Moving Off-the-Grid (MooG), a self-supervised
video representation model that offers an alternative approach, allowing tokens to move" off-
the-grid" to better enable them to represent scene elements consistently, even as they move …

被引用次数：1 相关文章所有 3 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果

高级搜索

QQ 群

[引用][C] Moving offthe-grid: Scene-grounded video representations

Moving Off-the-Grid: Scene-Grounded Video Representations

引用