查看文章

thecvf.com 中的 [PDF]

Monocular 3d object detection leveraging accurate proposals and shape reconstruction

作者

Jason Ku, Alex D Pon, Steven L Waslander

发表日期

2019

研讨会论文

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

页码范围

11867-11876

简介

We present MonoPSR, a monocular 3D object detection method that leverages proposals and shape reconstruction. First, using the fundamental relations of a pinhole camera model, detections from a mature 2D object detector are used to generate a 3D proposal per object in a scene. The 3D location of these proposals prove to be quite accurate, which greatly reduces the difficulty of regressing the final 3D bounding box detection. Simultaneously, a point cloud is predicted in an object centered coordinate system to learn local scale and shape information. However, the key challenge is how to exploit shape information to guide 3D localization. As such, we devise aggregate losses, including a novel projection alignment loss, to jointly optimize these tasks in the neural network to improve 3D localization accuracy. We validate our method on the KITTI benchmark where we set new state-of-the-art results among published monocular methods, including the harder pedestrian and cyclist classes, while maintaining efficient run-time.

引用总数

被引用次数：280

20182019202020212022202320241 12 62 65 61 52 26

学术搜索中的文章

Monocular 3d object detection leveraging accurate proposals and shape reconstruction

J Ku, AD Pon, SL Waslander - Proceedings of the IEEE/CVF conference on computer …, 2019

被引用次数：280 相关文章所有 7 个版本