Towards zero-shot scale-aware monocular depth estimation

L Yang, B Kang, Z Huang, X Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract This work presents Depth Anything a highly practical solution for robust monocular
depth estimation. Without pursuing novel technical modules we aim to build a simple yet …

被引用次数：148 相关文章所有 6 个版本

[PDF] thecvf.com

Repurposing diffusion-based image generators for monocular depth estimation

B Ke, A Obukhov, S Huang, N Metzger… - Proceedings of the …, 2024 - openaccess.thecvf.com

Monocular depth estimation is a fundamental computer vision task. Recovering 3D depth
from a single image is geometrically ill-posed and requires scene understanding so it is not …

被引用次数：51 相关文章所有 3 个版本

[PDF] acm.org

Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey

U Rajapaksha, F Sohel, H Laga, D Diepeveen… - ACM Computing …, 2024 - dl.acm.org

Estimating depth from single RGB images and videos is of widespread interest due to its
applications in many areas, including autonomous driving, 3D reconstruction, digital …

Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning

R Li, T Fischer, M Segu, M Pollefeys… - Proceedings of the …, 2024 - openaccess.thecvf.com

Recovering the 3D scene geometry from a single view is a fundamental yet ill-posed
problem in computer vision. While classical depth estimation methods infer only a 2.5 D …

被引用次数：1 相关文章所有 3 个版本

[PDF] thecvf.com

3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation

S Zhang, Y Zhang, Q Zheng, R Ma… - Proceedings of the …, 2024 - openaccess.thecvf.com

Text-driven 3D scene generation techniques have made rapid progress in recent years.
Their success is mainly attributed to using existing generative models to iteratively perform …

被引用次数：1 相关文章所有 3 个版本

[PDF] acm.org

Diffcad: Weakly-supervised probabilistic cad model retrieval and alignment from an rgb image

D Gao, D Rozenberszki, S Leutenegger… - ACM Transactions on …, 2024 - dl.acm.org

Perceiving 3D structures from RGB images based on CAD model primitives can enable an
effective, efficient 3D object-based representation of scenes. However, current approaches …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Zero-shot metric depth with a field-of-view conditioned diffusion model

S Saxena, J Hur, C Herrmann, D Sun… - arXiv preprint arXiv …, 2023 - arxiv.org

While methods for monocular depth estimation have made significant strides on standard
benchmarks, zero-shot metric depth estimation remains unsolved. Challenges include the …

被引用次数：4 相关文章所有 2 个版本

[PDF] thecvf.com

UniDepth: Universal Monocular Metric Depth Estimation

L Piccinelli, YH Yang, C Sakaridis… - Proceedings of the …, 2024 - openaccess.thecvf.com

Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks
in 3D perception and modeling. However the remarkable accuracy of recent MMDE methods …

被引用次数：1 相关文章所有 3 个版本

[PDF] thecvf.com

Do More With What You Have: Transferring Depth-Scale from Labeled to Unlabeled Domains

A Dana, N Carmel, A Shomer… - Proceedings of the …, 2024 - openaccess.thecvf.com

Transferring the absolute depth prediction capabilities of an estimator to a new domain is a
task with significant real-world applications. This task is specifically challenging when …

[PDF] arxiv.org

FreeReg: Image-to-point cloud registration leveraging pretrained diffusion models and monocular depth estimators

H Wang, Y Liu, B Wang, Y Sun, Z Dong… - arXiv preprint arXiv …, 2023 - arxiv.org

Matching cross-modality features between images and point clouds is a fundamental
problem for image-to-point cloud registration. However, due to the modality difference …

被引用次数：2 相关文章所有 3 个版本

高级搜索

QQ 群