Deep learning-based depth estimation methods from monocular image and videos: A comprehensive survey

U Rajapaksha, F Sohel, H Laga, D Diepeveen… - ACM Computing …, 2024 - dl.acm.org
Estimating depth from single RGB images and videos is of widespread interest due to its
applications in many areas, including autonomous driving, 3D reconstruction, digital …

HouseCat6D-A Large-Scale Multi-Modal Category Level 6D Object Perception Dataset with Household Objects in Realistic Scenarios

HJ Jung, SC Wu, P Ruhkamp, G Zhai… - Proceedings of the …, 2024 - openaccess.thecvf.com
Estimating 6D object poses is a major challenge in 3D computer vision. Building on
successful instance-level approaches research is shifting towards category-level pose …

Monograspnet: 6-dof grasping with a single rgb image

G Zhai, D Huang, SC Wu, HJ Jung, Y Di… - … on Robotics and …, 2023 - ieeexplore.ieee.org
6-DoF robotic grasping is a long-lasting but un-solved problem. Recent methods utilize
strong 3D networks to extract geometric grasping representations from depth sensors …

Polarimetric pose prediction

D Gao, Y Li, P Ruhkamp, I Skobleva, M Wysocki… - … on Computer Vision, 2022 - Springer
Light has many properties that vision sensors can passively measure. Colour-band
separated wavelength and intensity are arguably the most commonly used for monocular 6D …

Rapid network adaptation: Learning to adapt neural networks using test-time feedback

T Yeo, OF Kar, Z Sodagar… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We propose a method for adapting neural networks to distribution shifts at test-time. In
contrast to training-time robustness mechanisms that attempt to anticipate the shift, we …

On the importance of accurate geometry data for dense 3D vision tasks

HJ Jung, P Ruhkamp, G Zhai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Learning-based methods to solve dense 3D vision problems typically train on 3D sensor
data. The respectively used principle of measuring distances provides advantages and …

Learning accurate 3d shape based on stereo polarimetric imaging

T Huang, H Li, K He, C Sui, B Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Shape from Polarization (SfP) aims to recover surface normal using the polarization
cues of light. The accuracy of existing SfP methods is affected by two main problems. First …

SP: Self-Supervised Polarimetric Pose Prediction

P Ruhkamp, D Gao, N Navab, B Busam - International Journal of …, 2024 - Springer
This paper proposes the first self-supervised 6D object pose prediction from multimodal
RGB+ polarimetric images. The novel training paradigm comprises (1) a physical model to …

Unsupervised spike depth estimation via cross-modality cross-domain knowledge transfer

J Liu, Q Zhang, X Li, J Li, G Wang, M Lu… - … on Robotics and …, 2024 - ieeexplore.ieee.org
Neuromorphic spike data, an upcoming modality with high temporal resolution, has shown
promising potential in autonomous driving by mitigating the challenges posed by high …

Wild tofu: Improving range and quality of indirect time-of-flight depth with rgb fusion in challenging environments

HJ Jung, N Brasch, A Leonardis… - … conference on 3D …, 2021 - ieeexplore.ieee.org
Indirect Time-of-Flight (I-ToF) imaging is a widespread way of depth estimation for mobile
devices due to its small size and affordable price. Previous works have mainly focused on …