基于混合注意力机制和Deeplabv3+的遥感影像建筑物提取方法

刘晨晨; 葛小三; 武永斌; 余海坤; 张蓓蓓

doi:10.6046/zrzyyg.2023295

基于混合注意力机制和Deeplabv3+的遥感影像建筑物提取方法

1. 河南理工大学测绘与国土信息工程学院, 焦作 454003;

2. 河南省测绘地理信息技术中心, 郑州 450003;

3. 河南省遥感院, 郑州 450003

基金项目:
国家自然科学基金项目“面向矿区地理协同设计的空间信息语义服务模式研究”(编号: 41572341)、河南省自然科学基金项目“深度学习支持下的灾损建筑物提取与检测研究”(编号: 222300420450)和河南省高等教育教学改革研究与实践项目(学位与研究生教育)“面向学科前沿的研究生创新能力提升路径研究与实践”(编号: 2021SJGLX100Y)共同资助

详细信息

作者简介: 刘晨晨(1998-), 男, 硕士研究生, 主要从事遥感影像处理与应用方面的研究。Email: 18203233036@163.com

通讯作者: 葛小三(1971-), 男, 博士, 教授, 主要从事时空数据智能处理与分析和地理信息服务方面的研究。Email: gexiaosan@163.com

中图分类号: TU198; |TP751

收稿日期: 2023-09-22

修回日期: 2024-01-26

A method for information extraction of buildings from remote sensing images based on hybrid attention mechanism and Deeplabv3+

1. School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454003, China;

2. Henan Surveying and Mapping Geographic Information Technology Center, Zhengzhou 450003, China;

3. Henan Remote Sensing Institute, Zhengzhou 450003, China

More Information

Corresponding author: GE Xiaosan

Received Date: 22 September 2023

Revised Date: 26 January 2024

摘要

摘要: 在大量且复杂的遥感影像中提取建筑物信息是遥感智能应用的重要研究内容之一。针对复杂环境下的遥感影像建筑物提取不精准及小型建筑物易被忽略等问题, 文章提出了一种基于混合注意力机制和Deeplabv3+的遥感影像语义分割算法——SC-deep网络。该网络采用编码-解码结构, 利用主干残差注意力网络提取深层特征和浅层特征, 通过空洞空间金字塔池化模块和通道空间注意力模块聚合遥感影像的空间和通道信息权重, 有效利用了遥感影像建筑物的多尺度信息, 从而减少影像细节在训练中的损失。实验结果表明, 所提方法在Aerial imagery dataset数据集上的分割结果均优于其他主流分割网络, 能够有效识别并提取复杂建筑物边缘和小型建筑物, 表现出更优异的建筑物提取性能。
- 多尺度信息 /
- 建筑物提取 /
- 语义分割 /
- 注意力机制 /
- 空洞卷积
Abstract: Extracting information about buildings from a large and complex set of remote sensing images has always been a hot research topic in the intelligent applications of remote sensing. To address issues such as inaccurate information extraction of buildings and the tendency to ignore small buildings within a complex environment in remote sensing images, this study proposed the SC-deep network-a semantic segmentation algorithm for remote sensing images based on a hybrid attention mechanism and Deeplabv3+. Utilizing an encoder-decoder structure, this network employs a backbone residual attention network to extract deep- and shallow-layer features. Meanwhile, this network aggregates the spatial and channel information weights in remote sensing images using a dilated space pyramid pool module and a channel-space attention module. These allow for effectively utilizing the multi-scale information of building structures in remote sensing images, thereby reducing the loss of image details during training. The experimental results indicate that the proposed method outperforms other mainstream segmentation networks on the Aerial imagery dataset. Overall, this method can effectively identify and extract the edges of complex buildings and small structures, exhibiting superior building extraction performance.
- multi-scale information /
- building extraction /
- semantic segmentation /
- attention mechanisms /
- dilated convolution

HTML全文

参考文献(19)

[1]	胡明洪, 李佳田, 姚彦吉, 等.结合多路径的高分辨率遥感影像建筑物提取SER-UNet算法[J].测绘学报, 2023, 52(5):808-817. Hu M H, Li J T, Yao Y J, et al.SER-UNet algorithm for building extraction from high-resolution remote sensing image combined with multipath[J].Acta Geodaetica et Cartographica Sinica, 2023, 52(5):808-817.
[2]	吴炜, 骆剑承, 沈占锋, 等.光谱和形状特征相结合的高分辨率遥感图像的建筑物提取方法[J].武汉大学学报(信息科学版), 2012, 37(7):800-805. Wu W, Luo J C, Shen Z F, et al.Building extraction from high resolution remote sensing imagery based on spatial-spectral method[J].Geomatics and Information Science of Wuhan University, 2012, 37(7):800-805.
[3]	贾士军, 王昆.融合颜色和纹理特征的彩色图像分割[J].测绘科学, 2014, 39(12):138-142, 147. Jia S J, Wang K.Color image segmentation by integrating color and texture features[J].Science of Surveying and Mapping, 2014, 39(12):138-142, 147.
[4]	Lagunas E, Amin M G, Ahmad F, et al.Pattern matching for building feature extraction[J].IEEE Geoscience and Remote Sensing Letters, 2014, 11(12):2193-2197.
[5]	Gong J, Ji S.Photogrammetry and deep learning[J].Journal of Geodesy and Geoinformation Science, 2018(1):1-15.
[6]	Shelhamer E, Long J, Darrell T.Fully convolutional networks for semantic segmentation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4):640-651.
[7]	Ronneberger O, Fischer P, Brox T.U-net:Convolutional networks for biomedical image segmentation[C]//International Conference on Medical Image Computing and Computer-Assisted Intervention.Cham:Springer, 2015:234-241.
[8]	Zhuo Z W, Tajbakhsh N, Liang J M, et al.Unet++:A nested U-Net architecture for medical image segmentation[EB/OL].(2018-09-20).[2022-05-20].https://arxiv.org/abs/1807.10165.
[9]	Badrinarayanan V, Kendall A, Cipolla R.SegNet:A deep convolutional encoder-decoder architecture for image segmentation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12):2481-2495.
[10]	Zhao H, Shi J, Qi X, et al.Pyramid scene parsing network[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Honolulu, HI, USA.IEEE, 2017:6230-6239.
[11]	Chen L C, Papandreou G, Kokkinos I, et al.DeepLab:Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40(4):834-848.
[12]	季顺平, 魏世清.遥感影像建筑物提取的卷积神经元网络与开源数据集方法[J].测绘学报, 2019, 48(4):448-459. Ji S P, Wei S Q.Building extraction via convolutional neural networks from an open remote sensing building dataset[J].Acta Geodaetica et Cartographica Sinica, 2019, 48(4):448-459.
[13]	Yang H, Wu P, Yao X, et al.Building extraction in very high resolution imagery by dense-attention networks[J].Remote Sensing, 2018, 10(11):1768.
[14]	赵凌虎, 袁希平, 甘淑, 等.改进Deeplabv3+的高分辨率遥感影像道路提取模型[J].自然资源遥感, 2023, 35(1):107-114.doi:10.6046/zrzyyg.2021460. Zhao L H, Yuan X P, Gan S, et al.An information extraction model of roads from high-resolution remote sensing images based on improved Deeplabv3+[J].Remote Sensing for Natural Resources, 2023, 35(1):107-114.doi:10.6046/zrzyyg.2021460.
[15]	Xia L, Mi S, Zhang J, et al.Dual-stream feature extraction network based on CNN and transformer for building extraction[J].Remote Sensing, 2023, 15(10):2689.
[16]	郭文, 张荞.基于注意力增强全卷积神经网络的高分卫星影像建筑物提取[J].国土资源遥感, 2021, 33(2):100-107.doi:10.6046/gtzyyg.2020230. Guo W, Zhang Q.Building extraction using high-resolution satellite imagery based on an attention enhanced full convolution neural network[J].Remote Sensing for Land and Resources, 2021, 33(2):100-107.doi:10.6046/gtzyyg.2020230.
[17]	吕少云, 李佳田, 阿晓荟, 等.Res_ASPP_UNet++:结合分离卷积与空洞金字塔的遥感影像建筑物提取网络[J].遥感学报, 2023, 27(2):502-519. Lyu S Y, Li J T, A X H, et al.Res_ASPP_UNet++:Building an extraction network from remote sensing imagery combining depthwise separable convolution with atrous spatial pyramid pooling[J].National Remote Sensing Bulletin, 2023, 27(2):502-519.
[18]	Chollet F.Xception:deep learning with depthwise separable convolutions[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Honolulu, HI, USA.IEEE, 2017:1800-1807.
[19]	Woo S, Park J, Lee J Y, et al.Cbam:Convolutional block attention module[C]// Proceedings of the European conference on computer vision (ECCV).2018:3-19.

施引文献

资源附件(0)

访问统计

计量

文章访问数: 150
PDF下载数: 23
施引文献: 0

中国自然资源航空物探遥感中心	主办
地质出版社	出版

基于混合注意力机制和Deeplabv3+的遥感影像建筑物提取方法

1. 河南理工大学测绘与国土信息工程学院, 焦作 454003; 2. 河南省测绘地理信息技术中心, 郑州 450003; 3. 河南省遥感院, 郑州 450003

作者简介: 刘晨晨(1998-), 男, 硕士研究生, 主要从事遥感影像处理与应用方面的研究。Email: 18203233036@163.com

通讯作者: 葛小三(1971-), 男, 博士, 教授, 主要从事时空数据智能处理与分析和地理信息服务方面的研究。Email: gexiaosan@163.com

A method for information extraction of buildings from remote sensing images based on hybrid attention mechanism and Deeplabv3+

1. School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454003, China; 2. Henan Surveying and Mapping Geographic Information Technology Center, Zhengzhou 450003, China; 3. Henan Remote Sensing Institute, Zhengzhou 450003, China

Corresponding author: GE Xiaosan

计量

出版历程

目录

1. 河南理工大学测绘与国土信息工程学院, 焦作 454003;

2. 河南省测绘地理信息技术中心, 郑州 450003;

3. 河南省遥感院, 郑州 450003

1. School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454003, China;

2. Henan Surveying and Mapping Geographic Information Technology Center, Zhengzhou 450003, China;

3. Henan Remote Sensing Institute, Zhengzhou 450003, China