Semantic attention flow fields for monocular dynamic scene decomposition

TAQ Nguyen, A Bourki, M Macudzinski… - arXiv preprint arXiv …, 2024 - arxiv.org

This review thoroughly examines the role of semantically-aware Neural Radiance Fields
(NeRFs) in visual scene understanding, covering an analysis of over 250 scholarly papers. It …

Cited by 6 - Related articles - All 3 versions

[PDF] thecvf.com

Wordepth: Variational language prior for monocular depth estimation

Z Zeng, D Wang, F Yang, H Park… - Proceedings of the …, 2024 - openaccess.thecvf.com

Three-dimensional (3D) reconstruction from a single image is an ill-posed problem
with inherent ambiguities, i.e., scale. Predicting a 3D scene from text description(s) is similarly …

Cited by 21 - Related articles - All 3 versions

[PDF] neurips.cc

Epic fields: Marrying 3d geometry and video understanding

V Tschernezki, A Darkhalil, Z Zhu… - Advances in …, 2024 - proceedings.neurips.cc

Neural rendering is fuelling a unification of learning, 3D geometry and video understanding
that has been waiting for more than two decades. Progress, however, is still hampered by a …

Cited by 21 - Related articles - All 13 versions

[PDF] arxiv.org

Gaufre: Gaussian deformation fields for real-time dynamic novel view synthesis

Y Liang, N Khan, Z Li, T Nguyen-Phuoc… - arXiv preprint arXiv …, 2023 - arxiv.org

We propose a method for dynamic scene reconstruction using deformable 3D Gaussians
that is tailored for monocular video. Building upon the efficiency of Gaussian splatting, our …

Cited by 35 - Related articles - All 3 versions

[PDF] arxiv.org

When Does Perceptual Alignment Benefit Vision Representations?

S Sundaram, S Fu, L Muttenthaler, NY Tamir… - arXiv preprint arXiv …, 2024 - arxiv.org

Humans judge perceptual similarity according to diverse visual attributes, including scene
layout, subject location, and camera pose. Existing vision models understand a wide range …

Cited by 2 - Related articles - All 3 versions

[PDF] github.io

Flowed Time of Flight Radiance Fields

M Okunev, M Mapeke, B Attal, C Richardt… - … on Computer Vision, 2025 - Springer

Flowed time of flight radiance fields (F-TöRF) is a method to correct for motion artifacts in
continuous-wave time of flight imaging (C-ToF). As C-ToF cameras must capture multiple …

Cited by 1 - Related articles - All 7 versions

DGD: Dynamic 3D Gaussians Distillation

I Labe, N Issachar, I Lang, S Benaim - European Conference on Computer …, 2025 - Springer

We tackle the task of learning dynamic 3D semantic radiance fields given a single
monocular video as input. Our learned semantic radiance field captures per-point semantics …

Grid4D: 4D Decomposed Hash Encoding for High-fidelity Dynamic Gaussian Splatting

J Xu, Z Fan, J Yang, J Xie - arXiv preprint arXiv:2410.20815, 2024 - arxiv.org

Recently, Gaussian splatting has received more and more attention in the field of static
scene rendering. Due to the low computational overhead and inherent flexibility of explicit …

[BOOK] Computer Vision - ECCV 2024: 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XXIV

A Leonardis - 2024 - books.google.com

The multi-volume set of LNCS books with volume numbers 15059 up to 15147 constitutes
the refereed proceedings of the 18th European Conference on Computer Vision, ECCV …

[PDF] DGD: Dynamic 3D Gaussians Distillation

I Labe, N Issachar, I Lang, S Benaim - fq.pkwyx.com

We tackle the task of learning dynamic 3D semantic radiance fields given a single
monocular video as input. Our learned semantic radiance field captures per-point semantics …
