Depth Estimation From Surface-Ground Correspondence for Monocular 3D Object Detection
Monocular 3D object detection has attracted great attention due to simplicity and low cost. However, object location recovery in the 3D space from a monocular image is challenging since the depth information is lost. How to estimate the instance depth is the core problem to be solved. Intuitively, the ground depth is continuous and global in essence, independent of the objects in the scene. Therefore the ground depth estimation can be more accurate and easier than the object depth estimation. Inspired by this, the authors propose to map a set of surface points of an object onto the ground plane and decompose the object depth solving problem into the ground depth estimation and surface point heights estimation. During the training stage, dense ground depth labels are provided by the ground truth (GT) surface depths of objects from LiDAR data. In the inference stage, surface depths are recovered through querying the ground depth map. As a result, a set of instance depth candidates are obtained and the final instance depth can be assembled according to their uncertainties. In addition, since most of the mapped ground points are occluded by the object which may mislead the network learning, they devise a depth expansion strategy to extend the ground depth labels. The proposed method MonoSGC achieves state-of-the-art (SOTA) performance on the KITTI and Waymo datasets. Ablation studies demonstrate the effectiveness of the proposed components. The code and model are released at https://github.com/JiYinshuai/MonoSGC.
- Record URL:
-
Availability:
- Find a library where document is available. Order URL: http://worldcat.org/oclc/41297384
-
Supplemental Notes:
- Copyright © 2024, IEEE.
-
Authors:
- Ji, Yinshuai
- Xu, Jinhua
- Publication Date: 2024-11
Language
- English
Media Info
- Media Type: Web
- Features: References;
- Pagination: pp 16312-16322
-
Serial:
- IEEE Transactions on Intelligent Transportation Systems
- Volume: 25
- Issue Number: 11
- Publisher: Institute of Electrical and Electronics Engineers (IEEE)
- ISSN: 1524-9050
- Serial URL: http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6979
Subject/Index Terms
- TRT Terms: Autonomous vehicles; Estimating; Laser radar; Machine learning; Mapping; Object detection
- Subject Areas: Data and Information Technology; Highways; Vehicles and Equipment;
Filing Info
- Accession Number: 01945553
- Record Type: Publication
- Files: TRIS
- Created Date: Feb 12 2025 8:59AM