一种基于改进Mask R-CNN模型的遥感图像目标识别方法-AET-电子技术应用

一种基于改进Mask R-CNN模型的遥感图像目标识别方法

信息技术与网络安全

余慧明，周志祥，彭杨，崔志斌

(武汉兴图新科电子股份有限公司平台产品部，湖北武汉430073)

摘要： 目标识别技术作为机器视觉领域的一个重要分支，在各个领域有着重要的应用。鉴于一般的目标识别模型在遥感图片中表现不佳，另外需要识别的目标数量较多，尺寸大小不一，因此在经典的目标识别Mask R-CNN模型的基础上，提出了一种融合了通道注意力机制和数据增强技术的模型Mask R-CNN模型。在输入图片数据时，首先通过Random-Batch images操作对数据进行增强处理，提高模型对尺寸大小不一的目标识别的准确率；然后提取特征时，将原Mask R-CNN模型中的FPN改进为BiFPN，使提取到的特征能更好体现原本的图片信息；在最后的Mask阶段，增加了通道注意力机制，使得模型更多地得到需要的信息。经过实验表明，此模型在遥感图像的特殊目标细粒度识别中有良好的表现。对于同一个数据集，其评价指标在各个方面都要优于其他对比算法。

关键词： 目标识别 MaskR-CNN 通道注意力数据增强

中图分类号： TP389.1
文献标识码： A
DOI： 10.19358/j.issn.2096-5133.2021.03.007
引用格式：余慧明，周志祥，彭杨，等. 一种基于改进Mask R-CNN模型的遥感图像目标识别方法[J].信息技术与网络安全，2021，40(3)：38-42，47.

A remote sensing image target recognition method based on improved Mask R-CNN model

Yu Huiming，Zhou Zhixiang，Peng Yang，Cui Zhibin

(Wuhan Xing Tu Xin Ke Co.，Ltd.，Platform Products Department，Wuhan 430073，China)

Abstract： As an important branch in the field of machine vision, target recognition technology has important applications in various fields. In view of the fact that the general target recognition model does not perform well in remote sensing images, the number of targets that need to be recognized is large, and the sizes are different, based on the classic target recognition Mask R-CNN model, a Mask R-CNN model fusing channel attention mechanism and data enhancement technology is proposed. When inputting image data, the data through the Random-Batch images operation to improve the accuracy of the model′s recognition of targets of different sizes is first enhanced; then, when extracting features, the FPN in the original Mask R-CNN model is improved to BiFPN, so that the extracted features can better reflect the original picture information; in the final Mask stage, the channel attention mechanism is added, so that the model can get more information. Experiments show that this model is used in remote sensing images. It has a good performance in the fine-grained recognition of special targets. For the same data set, its evaluation indicators are superior to other comparison algorithms in all aspects.

Key words : target recognition；Mask R-CNN；channel attention；data enhancement

0 引言

随着人工智能的兴起，深度学习[1]算法各个领域的优势被体现出来。对视频、图像中的多目标、细粒度的目标识别技术，可以方便人们在复杂的情景中快速定位到所需要的检测目标。随着场景的复杂度加深，基于基础模型的各种改进版本层出不穷。

在过去近10年中，目标识别技术又有了飞速的发展，从最开始的机器学习算法，再到目前主流的深度学习目标识别算法，如RCNN[2]、SSP-Net[3]、Fast R-CNN[4]、Faster R-CNN[5]，目标识别技术已经在各个领域都有了很好的应用。但是，由于数据集的制约，针对遥感图像的军事目标识别却是一个例外。另外，军事码头物体数量众多，需要检测的目标与其他物体交错相间，大大降低了模型的准确性。对于一些密集型的目标检测，PAN X[6]等人提出了一个由特征选择模块(Feature Selection Module，FSM)和动态优化头(Dynamic Refinement Head，DRH)组成的动态优化网络。FSM使神经元能够根据目标物体的形状和方向调整接受野，而DRH使模型能够以一种对象感知的方式动态地改进预测。何代毅[7]等人就提出了一种基于改进 Mask-RCNN[8]的建筑物自动提取方法,在网络的设计中添加了路径聚合网络和特征增强功能,通过监督和迁移学习的方式在Inria航空影像标签数据集中进行多线程迭代训练与模型优化学习,实现了建筑物的自动精确分割和提取。对于数据集缺乏等问题，林通[9]等人通过迁移姿态生成对抗网络生成姿态不同的行人图片，对数据集进行了扩充。

本文详细内容请下载:http://www.chinaaet.com/resource/share/2000003422

作者信息:

余慧明，周志祥，彭杨，崔志斌

(武汉兴图新科电子股份有限公司平台产品部，湖北武汉430073)

原创声明：此内容为AET网站原创，未经授权禁止转载。

相关内容