Unifying flow, stereo and depth estimation

H Xu, J Zhang, J Cai, H Rezatofighi… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
We present a unified formulation and model for three motion and 3D perception tasks:
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …

Towards zero-shot scale-aware monocular depth estimation

V Guizilini, I Vasiljevic, D Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Monocular depth estimation is scale-ambiguous, and thus requires scale supervision to
produce metric predictions. Even so, the resulting models will be geometry-specific, with …

Neo 360: Neural fields for sparse view synthesis of outdoor scenes

MZ Irshad, S Zakharov, K Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent implicit neural representations have shown great results for novel view synthesis.
However, existing methods require expensive per-scene optimization from many views …

Towards Deeply Unified Depth-aware Panoptic Segmentation with Bi-directional Guidance Learning

J He, Y Wang, L Wang, H Lu, B Luo… - Proceedings of the …, 2023 - openaccess.thecvf.com
Depth-aware panoptic segmentation is an emerging topic in computer vision which
combines semantic and geometric understanding for more robust scene interpretation …

Self-supervised monocular depth estimation with isometric-self-sample-based learning

G Cha, HD Jang, D Wee - IEEE Robotics and Automation …, 2022 - ieeexplore.ieee.org
Managing the dynamic regions in the photometric loss formulation has been a main issue for
handling the self-supervised depth estimation problem. Most previous methods have …

ReFiNe: Recursive Field Networks for Cross-Modal Multi-Scene Representation

S Zakharov, K Liu, A Gaidon, R Ambrus - ACM SIGGRAPH 2024 …, 2024 - dl.acm.org
The common trade-offs of state-of-the-art methods for multi-shape representation (a single
model" packing" multiple objects) involve trading modeling accuracy against memory and …

DeLiRa: Self-Supervised Depth, Light, and Radiance Fields

V Guizilini, I Vasiljevic, J Fang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Differentiable volumetric rendering is a powerful paradigm for 3D reconstruction and novel
view synthesis. However, standard volume rendering approaches struggle with degenerate …

Learning 3D Robotics Perception using Inductive Priors

MZ Irshad - arXiv preprint arXiv:2405.20364, 2024 - arxiv.org
Recent advances in deep learning have led to a data-centric intelligence ie artificially
intelligent models unlocking the potential to ingest a large amount of data and be really …

Ray-Patch: An Efficient Querying for Light Field Transformers

TB Martins, J Civera - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
In this paper we propose the Ray-Patch querying, a novel model to efficiently query
transformers to decode implicit representations into target views. Our Ray-Patch decoding …

3D Hand Pose Estimation in Egocentric Images in the Wild

A Prakash, R Tu, M Chang, S Gupta - arXiv preprint arXiv:2312.06583, 2023 - arxiv.org
We present WildHands, a method for 3D hand pose estimation in egocentric images in the
wild. This is challenging due to (a) lack of 3D hand pose annotations for images in the wild …