A method for information extraction of buildings from remote sensing images based on hybrid attention mechanism and Deeplabv3+
-
摘要: 在大量且复杂的遥感影像中提取建筑物信息是遥感智能应用的重要研究内容之一。针对复杂环境下的遥感影像建筑物提取不精准及小型建筑物易被忽略等问题, 文章提出了一种基于混合注意力机制和Deeplabv3+的遥感影像语义分割算法——SC-deep网络。该网络采用编码-解码结构, 利用主干残差注意力网络提取深层特征和浅层特征, 通过空洞空间金字塔池化模块和通道空间注意力模块聚合遥感影像的空间和通道信息权重, 有效利用了遥感影像建筑物的多尺度信息, 从而减少影像细节在训练中的损失。实验结果表明, 所提方法在Aerial imagery dataset数据集上的分割结果均优于其他主流分割网络, 能够有效识别并提取复杂建筑物边缘和小型建筑物, 表现出更优异的建筑物提取性能。Abstract: Extracting information about buildings from a large and complex set of remote sensing images has always been a hot research topic in the intelligent applications of remote sensing. To address issues such as inaccurate information extraction of buildings and the tendency to ignore small buildings within a complex environment in remote sensing images, this study proposed the SC-deep network-a semantic segmentation algorithm for remote sensing images based on a hybrid attention mechanism and Deeplabv3+. Utilizing an encoder-decoder structure, this network employs a backbone residual attention network to extract deep- and shallow-layer features. Meanwhile, this network aggregates the spatial and channel information weights in remote sensing images using a dilated space pyramid pool module and a channel-space attention module. These allow for effectively utilizing the multi-scale information of building structures in remote sensing images, thereby reducing the loss of image details during training. The experimental results indicate that the proposed method outperforms other mainstream segmentation networks on the Aerial imagery dataset. Overall, this method can effectively identify and extract the edges of complex buildings and small structures, exhibiting superior building extraction performance.
-
-
[1] 胡明洪, 李佳田, 姚彦吉, 等.结合多路径的高分辨率遥感影像建筑物提取SER-UNet算法[J].测绘学报, 2023, 52(5):808-817.
Hu M H, Li J T, Yao Y J, et al.SER-UNet algorithm for building extraction from high-resolution remote sensing image combined with multipath[J].Acta Geodaetica et Cartographica Sinica, 2023, 52(5):808-817.
[2] 吴炜, 骆剑承, 沈占锋, 等.光谱和形状特征相结合的高分辨率遥感图像的建筑物提取方法[J].武汉大学学报(信息科学版), 2012, 37(7):800-805.
Wu W, Luo J C, Shen Z F, et al.Building extraction from high resolution remote sensing imagery based on spatial-spectral method[J].Geomatics and Information Science of Wuhan University, 2012, 37(7):800-805.
[3] 贾士军, 王昆.融合颜色和纹理特征的彩色图像分割[J].测绘科学, 2014, 39(12):138-142, 147.
Jia S J, Wang K.Color image segmentation by integrating color and texture features[J].Science of Surveying and Mapping, 2014, 39(12):138-142, 147.
[4] Lagunas E, Amin M G, Ahmad F, et al.Pattern matching for building feature extraction[J].IEEE Geoscience and Remote Sensing Letters, 2014, 11(12):2193-2197.
[5] Gong J, Ji S.Photogrammetry and deep learning[J].Journal of Geodesy and Geoinformation Science, 2018(1):1-15.
[6] Shelhamer E, Long J, Darrell T.Fully convolutional networks for semantic segmentation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4):640-651.
[7] Ronneberger O, Fischer P, Brox T.U-net:Convolutional networks for biomedical image segmentation[C]//International Conference on Medical Image Computing and Computer-Assisted Intervention.Cham:Springer, 2015:234-241.
[8] Zhuo Z W, Tajbakhsh N, Liang J M, et al.Unet++:A nested U-Net architecture for medical image segmentation[EB/OL].(2018-09-20).[2022-05-20].https://arxiv.org/abs/1807.10165.
[9] Badrinarayanan V, Kendall A, Cipolla R.SegNet:A deep convolutional encoder-decoder architecture for image segmentation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12):2481-2495.
[10] Zhao H, Shi J, Qi X, et al.Pyramid scene parsing network[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Honolulu, HI, USA.IEEE, 2017:6230-6239.
[11] Chen L C, Papandreou G, Kokkinos I, et al.DeepLab:Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40(4):834-848.
[12] 季顺平, 魏世清.遥感影像建筑物提取的卷积神经元网络与开源数据集方法[J].测绘学报, 2019, 48(4):448-459.
Ji S P, Wei S Q.Building extraction via convolutional neural networks from an open remote sensing building dataset[J].Acta Geodaetica et Cartographica Sinica, 2019, 48(4):448-459.
[13] Yang H, Wu P, Yao X, et al.Building extraction in very high resolution imagery by dense-attention networks[J].Remote Sensing, 2018, 10(11):1768.
[14] 赵凌虎, 袁希平, 甘淑, 等.改进Deeplabv3+的高分辨率遥感影像道路提取模型[J].自然资源遥感, 2023, 35(1):107-114.doi:10.6046/zrzyyg.2021460.
Zhao L H, Yuan X P, Gan S, et al.An information extraction model of roads from high-resolution remote sensing images based on improved Deeplabv3+[J].Remote Sensing for Natural Resources, 2023, 35(1):107-114.doi:10.6046/zrzyyg.2021460.
[15] Xia L, Mi S, Zhang J, et al.Dual-stream feature extraction network based on CNN and transformer for building extraction[J].Remote Sensing, 2023, 15(10):2689.
[16] 郭文, 张荞.基于注意力增强全卷积神经网络的高分卫星影像建筑物提取[J].国土资源遥感, 2021, 33(2):100-107.doi:10.6046/gtzyyg.2020230.
Guo W, Zhang Q.Building extraction using high-resolution satellite imagery based on an attention enhanced full convolution neural network[J].Remote Sensing for Land and Resources, 2021, 33(2):100-107.doi:10.6046/gtzyyg.2020230.
[17] 吕少云, 李佳田, 阿晓荟, 等.Res_ASPP_UNet++:结合分离卷积与空洞金字塔的遥感影像建筑物提取网络[J].遥感学报, 2023, 27(2):502-519.
Lyu S Y, Li J T, A X H, et al.Res_ASPP_UNet++:Building an extraction network from remote sensing imagery combining depthwise separable convolution with atrous spatial pyramid pooling[J].National Remote Sensing Bulletin, 2023, 27(2):502-519.
[18] Chollet F.Xception:deep learning with depthwise separable convolutions[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Honolulu, HI, USA.IEEE, 2017:1800-1807.
[19] Woo S, Park J, Lee J Y, et al.Cbam:Convolutional block attention module[C]// Proceedings of the European conference on computer vision (ECCV).2018:3-19.
-
计量
- 文章访问数: 150
- PDF下载数: 23
- 施引文献: 0