Deep learning advances in computer vision with 3d data: A survey

A Ioannidou, E Chatzilari, S Nikolopoulos… - ACM computing …, 2017 - dl.acm.org
Deep learning has recently gained popularity achieving state-of-the-art performance in tasks
involving text, sound, or image processing. Due to its outstanding performance, there have …

Enhanced computer vision with microsoft kinect sensor: A review

J Han, L Shao, D Xu, J Shotton - IEEE transactions on …, 2013 - ieeexplore.ieee.org
With the invention of the low-cost Microsoft Kinect sensor, high-resolution depth and visual
(RGB) sensing has become available for widespread use. The complementary nature of the …

Unsupervised embedding learning via invariant and spreading instance feature

M Ye, X Zhang, PC Yuen… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
This paper studies the unsupervised embedding learning problem, which requires an
effective similarity measurement between samples in low-dimensional embedding space …

Real-world multiobject, multigrasp detection

FJ Chu, R Xu, PA Vela - IEEE Robotics and Automation Letters, 2018 - ieeexplore.ieee.org
A deep learning architecture is proposed to predict graspable locations for robotic
manipulation. It considers situations where no, one, or multiple object (s) are seen. By …

Multi-modal fusion network with multi-scale multi-path and cross-modal interactions for RGB-D salient object detection

H Chen, Y Li, D Su - Pattern Recognition, 2019 - Elsevier
Paired RGB and depth images are becoming popular multi-modal data adopted in computer
vision tasks. Traditional methods based on Convolutional Neural Networks (CNNs) typically …

Deep sliding shapes for amodal 3d object detection in rgb-d images

S Song, J Xiao - Proceedings of the IEEE conference on …, 2016 - openaccess.thecvf.com
We focus on the task of amodal 3D object detection in RGB-D images, which aims to
produce a 3D bounding box of an object in metric form at its full extent. We introduce Deep …

Learning rich features from RGB-D images for object detection and segmentation

S Gupta, R Girshick, P Arbeláez, J Malik - … 6-12, 2014, Proceedings, Part VII …, 2014 - Springer
In this paper we study the problem of object detection for RGB-D images using semantically
rich image and depth features. We propose a new geocentric embedding for depth images …

Discriminative unsupervised feature learning with convolutional neural networks

A Dosovitskiy, JT Springenberg… - Advances in neural …, 2014 - proceedings.neurips.cc
Current methods for training convolutional neural networks depend on large amounts of
labeled samples for supervised training. In this paper we present an approach for training a …

Rotationnet: Joint object categorization and pose estimation using multiviews from unsupervised viewpoints

A Kanezaki, Y Matsushita… - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Abstract We propose a Convolutional Neural Network (CNN)-based model``RotationNet,''
which takes multi-view images of an object as input and jointly estimates its pose and object …

Uncertainty-driven 6d pose estimation of objects and scenes from a single rgb image

E Brachmann, F Michel, A Krull… - Proceedings of the …, 2016 - openaccess.thecvf.com
In recent years, the task of estimating the 6D pose of object instances and complete scenes,
ie camera localization, from a single input image has received considerable attention …