作者
Po-Yi Chen, Alexander H Liu, Yen-Cheng Liu, Yu-Chiang Frank Wang
发表日期
2019
研讨会论文
Proceedings of the IEEE/CVF Conference on computer vision and pattern recognition
页码范围
2624-2632
简介
Monocular depth estimation is a challenging task in scene understanding, with the goal to acquire the geometric properties of 3D space from 2D images. Due to the lack of RGB-depth image pairs, unsupervised learning methods aim at deriving depth information with alternative supervision such as stereo pairs. However, most existing works fail to model the geometric structure of objects, which generally results from considering pixel-level objective functions during training. In this paper, we propose SceneNet to overcome this limitation with the aid of semantic understanding from segmentation. Moreover, our proposed model is able to perform region-aware depth estimation by enforcing semantics consistency between stereo pairs. In our experiments, we qualitatively and quantitatively verify the effectiveness and robustness of our model, which produces favorable results against the state-of-the-art approaches do.
引用总数
20192020202120222023202453863596129
学术搜索中的文章