融合CNN与Transformer的遥感影像道路信息提取

曲海成; 王莹; 刘腊梅; 郝明

doi:10.6046/zrzyyg.2023237

融合CNN与Transformer的遥感影像道路信息提取

Information extraction of roads from remote sensing images using CNN combined with Transformer

摘要

摘要: 利用高分辨率遥感影像进行道路信息提取时, 深度神经网络很难同时学习影像全局上下文信息和边缘细节信息, 为此, 该文提出了一种同时学习全局语义信息和局部空间细节的级联神经网络。首先将输入的特征图分别送入到双分支编码器卷积神经网络(convolutional neural networks, CNN)和Transformer中, 然后, 采用了双分支融合模块(shuffle attention dual branch fusion block, SA-DBF)来有效地结合这2个分支学习到的特征, 从而实现全局信息与局部信息的融合。其中, 双分支融合模块通过细粒度交互对这2个分支的特征进行建模, 同时利用多重注意力机制充分提取特征图的通道和空间信息, 并抑制掉无效的噪声信息。在公共数据集Massachusetts道路数据集上对模型进行测试, 准确率(overall accuracy, OA)、交并比(intersection over union, IoU)和F₁等评价指标分别达到98.04%, 88.03%和65.13%; 与主流方法U-Net和TransRoadNet等进行比较, IoU分别提升了2.01个百分点和1.42个百分点, 实验结果表明所提出的方法优于其他的比较方法, 能够有效提高道路分割的精确度。

Abstract: Deep learning-based methods for information extraction of roads from high-resolution remote sensing images face challenges in extracting information about both global context and edge details. This study proposed a cascaded neural network for road segmentation in remote sensing images, allowing both types of information to be simultaneously learned. First, the input feature images were sent to encoders CNN and Transformer. Then, the characteristics learned by both branch encoders were effectively combined using the shuffle attention dual branch fusion (SA-DBF) module, thus achieving the fusion of global and local information. Using the SA-DBF module, the model of the features learned from both branches was established through fine-grained interaction, during which channel and spatial information in the feature images were efficiently extracted and invalid noise was suppressed using multiple attention mechanisms. The proposed network was evaluated using the Massachusetts Road dataset, yielding an overall accuracy rate (OA) of 98.04%, an intersection over union (IoU) of 88.03%, and an F1 score of 65.13%. Compared to that of mainstream methods U-Net and TransRoadNet, the IoU of the proposed network increased by 2.01 and 1.42 percentage points, respectively. Experimental results indicate that the proposed method outperforms all the methods compared and can effectively improve the accuracy of road segmentation.

HTML全文

参考文献(24)

施引文献

资源附件(0)