TransRVNet: LiDAR Semantic Segmentation With Transformer
Effective and efficient 3D semantic segmentation from large-scale LiDAR point cloud is a fundamental problem in the field of autonomous driving. In this paper, the authors present Transformer-Range-View Network (TransRVNet), a novel and powerful projection-based CNN-Transformer architecture to infer point-wise semantics. First, a Multi Residual Channel Interaction Attention Module (MRCIAM) is introduced to capture channel-level multi-scale feature and model intra-channel, inter-channel correlations based on attention mechanism. Then, in the encoder stage, they use a well-designed Residual Context Aggregation Module (RCAM), including a residual dilated convolution structure and a context aggregation module, to fuse information from different receptive fields while reducing the impact of missing points. Finally, a Balanced Non-square-Transformer Module (BNTM) is employed as fundamental component of decoder to achieve locally feature dependencies for more discriminative feature learning by introducing the non-square shifted window strategy. Extensive qualitative and quantitative experiments conducted on challenging SemanticKITTI and SemanticPOSS benchmarks have verified the effectiveness of their proposed technique. Their TransRVNet presents superior performance over most existing state-of-the-art approaches. The source code and trained model are available at https://github.com/huixiancheng/TransRVNet.
- Record URL:
-
Availability:
- Find a library where document is available. Order URL: http://worldcat.org/oclc/41297384
-
Supplemental Notes:
- Copyright © 2023, IEEE.
-
Authors:
- Cheng, Hui-Xian
- Han, Xian-Feng
- Xiao, Guo-Qiang
- Publication Date: 2023-6
Language
- English
Media Info
- Media Type: Web
- Features: References;
- Pagination: pp 5895-5907
-
Serial:
- IEEE Transactions on Intelligent Transportation Systems
- Volume: 24
- Issue Number: 6
- Publisher: Institute of Electrical and Electronics Engineers (IEEE)
- ISSN: 1524-9050
- Serial URL: http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6979
Subject/Index Terms
- TRT Terms: Autonomous vehicles; Laser radar; Mathematical models; Neural networks
- Subject Areas: Data and Information Technology; Highways; Vehicles and Equipment;
Filing Info
- Accession Number: 01894601
- Record Type: Publication
- Files: TRIS
- Created Date: Sep 26 2023 9:08AM