25 October 2023 Three-dimensional object detection with spatial-semantic features of point clouds
Tianxiang Chen, Chao Han
Author Affiliations +
Abstract

Three-dimensional (3D) object detection is crucial for accurate recognition of autonomous driving roads, and the distribution of point clouds in 3D scenes becomes sparse with increasing distance, thus seriously affecting the sensor’s perception precision. To address this problem, we propose a two-stage 3D object detection network based on point and voxel feature fusion. In the first stage, a spatial semantic feature fusion module is designed to effectively fuse low-level spatial features and high-level semantic features to generate high-quality proposals. Then, an attention mechanism-based residual module is constructed to expand the receptive field and adaptively aggregate the voxel features in the 3D scene. At the same time, the sampled key points and voxel features are fused to extract the key information in the 3D scene. In the second stage, the graph network pooling module is introduced to construct local graphs on 3D proposals using key point features as nodes to estimate the confidence and location of objects more accurately. Experimental results on the KITTI dataset show that the detection precision is improved significantly in easy, moderate, and hard tasks.

© 2023 SPIE and IS&T
Tianxiang Chen and Chao Han "Three-dimensional object detection with spatial-semantic features of point clouds," Journal of Electronic Imaging 32(5), 053039 (25 October 2023). https://doi.org/10.1117/1.JEI.32.5.053039
Received: 18 May 2023; Accepted: 9 October 2023; Published: 25 October 2023
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Object detection

Voxels

Point clouds

Semantics

Feature fusion

Convolution

Feature extraction

Back to Top