基于Light-BotNet的激光点云分类研究-AET-电子技术应用

基于Light-BotNet的激光点云分类研究

2022年电子技术应用第6期

雷根华1，王蕾1，2，张志勇1

1.东华理工大学信息工程学院，江西南昌330013； 2.江西省核地学数据科学与系统工程技术研究中心，江西南昌330013

摘要： 三维点云在机器人与自动驾驶中都有着普遍的应用，深度学习在二维图像上的研究成果显著，但是如何利用深度学习识别不规则的三维点云，仍然是一个开放性的问题。目前大场景点云自身数据的复杂性，点云扫描距离的变化造成点的分布不均匀，噪声和异常点引起的挑战性依然存在。针对于现有的深度学习网络框架对于激光点云数据的分类效率不高以及分类精度低的问题，提出一种基于激光点云特征图像与Light-BotNet相结合的CNN-Transform框架。该框架在于通过对点云数据进行特征提取，以相邻的特征点构造点云特征图像作为网络框架的输入，最后以Light-BotNet为网络框架模型进行点云分类训练。实验结果表明，该方法与现有的多数点云分类方法相比，能够较好地提升激光点云的分类效率以及分类精度。

关键词： 点云特征图像 BotNet Transform CNN 激光点云分类

中图分类号： TP391
文献标识码： A
DOI：10.16157/j.issn.0258-7998.222725
中文引用格式： 雷根华，王蕾，张志勇. 基于Light-BotNet的激光点云分类研究[J].电子技术应用，2022，48(6)：84-88，97.
英文引用格式： Lei Genhua，Wang Lei，Zhang Zhiyong. Research on laser point cloud classification based on Light-BotNet[J]. Application of Electronic Technique，2022，48(6)：84-88，97.

Research on laser point cloud classification based on Light-BotNet

Lei Genhua1，Wang Lei1，2，Zhang Zhiyong1

1.School of Information Engineering，East China University of Technology，Nanchang 330013，China； 2.Jiangxi Engineering Technology Research Center of Nuclear Geoscience Data Science and System，Nanchang 330013，China

Abstract： Three dimensional point clouds are widely used in robots and automatic driving. The research results of deep learning on two-dimensional images are remarkable, but how to use deep learning to identify irregular three-dimensional point clouds is still an open problem. At present, due to the complexity of the data of the scenic spot cloud itself, the uneven distribution of points caused by the change of the scanning distance of the point cloud, and the challenges caused by noise and abnormal points still exist. Aiming at the problems of low classification efficiency and low classification accuracy of the existing deep learning Network framework for laser point cloud data, a CNN Transform framework based on laser point cloud feature image and Light-BotNet is proposed. The framework is to extract the features of point cloud data, construct the point cloud feature image with adjacent feature points as the input of the network framework, and finally take Light-BotNet as the network framework model for point cloud classification training. The experimental results show that compared with most existing point cloud classification methods, this method can better improve the classification efficiency and accuracy of laser point cloud.

Key words : point cloud feature image；BotNet；Transform；CNN；laser point cloud classification

0 引言

大多的深度学习点云分类方法都是采用卷积层与池化层交替实现的，卷积层中的神经元仅与上一层的部分区域相连接，学习局部特征，在点云数据特征提取时容易丢失部分特征，从而导致分类精度下降等问题。而Transform的提出则带来了一种新的思路，主要利用自我注意机制提取内在特征^[1-3]。Transform最初应用在自然语言处理(NLP)领域，并且取得了重大的成功，受到NLP中Transformer功能的启发，研究人员开始将Transformer应用在计算机视觉(CV)任务。研究发现CNN曾是视觉应用的基本组件^[4-5]，但Transformer正在显示其作为CNN替代品的能力。Chen等人^[6]训练序列变换器，以自回归预测像素，并在图像分类任务上与CNN取得竞争性结果。卷积操作擅长提取细节，但是在大数据量的大场景三维点云数据分类任务中，要掌握三维点云的全局信息往往需要堆叠很多个卷积层，而Transform中的注意力善于把握整体信息，但又需要大量的数据进行训练。

BotNet^[7]网络是伯克利与谷歌的研究人员在Convolution+Transformer组合方面一个探索，它采用混合方式同时利用了CNN的特征提取能力、Transformer的内容自注意力与位置自注意力机制，取得了优于纯CNN或者自注意力的性能，在ImageNet中取得了84.7%的精度。将CNN与Transform结合起来，达到取长补短的效果。BoTNet与ResNet^[8]网络框架的不同之处在于：ResNet^[8]框架在最后3个bottleneck blocks中使用的是3×3的空间卷积，而BotNet框架则是采用全局自我注意替代空间卷积。带自注意力模块的Bottleneck模块可以视作Transformer模块。

本文详细内容请下载：http://www.chinaaet.com/resource/share/2000004426。

作者信息：

雷根华1，王蕾1，2，张志勇1

(1.东华理工大学信息工程学院，江西南昌330013；

2.江西省核地学数据科学与系统工程技术研究中心，江西南昌330013)

原创声明：此内容为AET网站原创，未经授权禁止转载。

相关内容