Depth anything: Unleashing the power of large-scale unlabeled data

L Yang, B Kang, Z Huang, X Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract This work presents Depth Anything a highly practical solution for robust monocular
depth estimation. Without pursuing novel technical modules we aim to build a simple yet …

Repurposing diffusion-based image generators for monocular depth estimation

B Ke, A Obukhov, S Huang, N Metzger… - Proceedings of the …, 2024 - openaccess.thecvf.com
Monocular depth estimation is a fundamental computer vision task. Recovering 3D depth
from a single image is geometrically ill-posed and requires scene understanding so it is not …

Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey

U Rajapaksha, F Sohel, H Laga, D Diepeveen… - ACM Computing …, 2024 - dl.acm.org
Estimating depth from single RGB images and videos is of widespread interest due to its
applications in many areas, including autonomous driving, 3D reconstruction, digital …

Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning

R Li, T Fischer, M Segu, M Pollefeys… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recovering the 3D scene geometry from a single view is a fundamental yet ill-posed
problem in computer vision. While classical depth estimation methods infer only a 2.5 D …

3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation

S Zhang, Y Zhang, Q Zheng, R Ma… - Proceedings of the …, 2024 - openaccess.thecvf.com
Text-driven 3D scene generation techniques have made rapid progress in recent years.
Their success is mainly attributed to using existing generative models to iteratively perform …

Diffcad: Weakly-supervised probabilistic cad model retrieval and alignment from an rgb image

D Gao, D Rozenberszki, S Leutenegger… - ACM Transactions on …, 2024 - dl.acm.org
Perceiving 3D structures from RGB images based on CAD model primitives can enable an
effective, efficient 3D object-based representation of scenes. However, current approaches …

Zero-shot metric depth with a field-of-view conditioned diffusion model

S Saxena, J Hur, C Herrmann, D Sun… - arXiv preprint arXiv …, 2023 - arxiv.org
While methods for monocular depth estimation have made significant strides on standard
benchmarks, zero-shot metric depth estimation remains unsolved. Challenges include the …

UniDepth: Universal Monocular Metric Depth Estimation

L Piccinelli, YH Yang, C Sakaridis… - Proceedings of the …, 2024 - openaccess.thecvf.com
Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks
in 3D perception and modeling. However the remarkable accuracy of recent MMDE methods …

Do More With What You Have: Transferring Depth-Scale from Labeled to Unlabeled Domains

A Dana, N Carmel, A Shomer… - Proceedings of the …, 2024 - openaccess.thecvf.com
Transferring the absolute depth prediction capabilities of an estimator to a new domain is a
task with significant real-world applications. This task is specifically challenging when …

FreeReg: Image-to-point cloud registration leveraging pretrained diffusion models and monocular depth estimators

H Wang, Y Liu, B Wang, Y Sun, Z Dong… - arXiv preprint arXiv …, 2023 - arxiv.org
Matching cross-modality features between images and point clouds is a fundamental
problem for image-to-point cloud registration. However, due to the modality difference …