Multi-frame self-supervised depth with transformers

V Guizilini, R Ambruș, D Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
Multi-frame depth estimation improves over single-frame approaches by also leveraging
geometric relationships between images via feature matching, in addition to learning …

Mamo: Leveraging memory and attention for monocular video depth estimation

R Yasarla, H Cai, J Jeong, Y Shi… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose MAMo, a novel memory and attention framework for monocular video depth
estimation. MAMo can augment and improve any single-image depth estimation networks …

NVDS: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation

Y Wang, M Shi, J Li, C Hong, Z Huang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Video depth estimation aims to infer temporally consistent depth. One approach is to
finetune a single-image model on each video with geometry constraints, which proves …

Exploring efficiency of vision transformers for self-supervised monocular depth estimation

A Karpov, I Makarov - 2022 IEEE International Symposium on …, 2022 - ieeexplore.ieee.org
Depth estimation is a crucial task for the creation of depth maps, one of the most important
components for augmented reality (AR) and other applications. However, the most widely …

Monocular depth estimation: A thorough review

V Arampatzakis, G Pavlidis… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Estimation of depth in two-dimensional images is among the challenging topics in Computer
Vision. This is a well-studied but also an ill-posed problem, which has long been the focus of …

On the importance of accurate geometry data for dense 3D vision tasks

HJ Jung, P Ruhkamp, G Zhai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Learning-based methods to solve dense 3D vision problems typically train on 3D sensor
data. The respectively used principle of measuring distances provides advantages and …

Futuredepth: Learning to predict the future improves video depth estimation

R Yasarla, MK Singh, H Cai, Y Shi, J Jeong… - … on Computer Vision, 2025 - Springer
In this paper, we propose a novel video depth estimation approach, FutureDepth, which
enables the model to implicitly leverage multi-frame and motion cues to improve depth …

Self-supervised monocular depth estimation using hybrid transformer encoder

SJ Hwang, SJ Park, JH Baek, B Kim - IEEE Sensors Journal, 2022 - ieeexplore.ieee.org
Depth estimation using monocular camera sensors is an important technique in computer
vision. Supervised monocular depth estimation requires a lot of data acquired from depth …

Attention Mechanism Used in Monocular Depth Estimation: An Overview

Y Li, X Wei, H Fan - Applied Sciences, 2023 - mdpi.com
Monocular depth estimation (MDE), as one of the fundamental tasks of computer vision,
plays important roles in downstream applications such as virtual reality, 3D reconstruction …

Mono-ViFI: A Unified Learning Framework for Self-supervised Single and Multi-frame Monocular Depth Estimation

J Liu, L Kong, B Li, Z Wang, H Gu, J Chen - European Conference on …, 2025 - Springer
Self-supervised monocular depth estimation has gathered notable interest since it can
liberate training from dependency on depth annotations. In monocular video training case …