Spatial downscaling methods for the 2-meter air temperature grid data based on multiple machine learning models
-
摘要: 高分辨率气象资料是精细化气象业务服务的重要数据基础,文章利用2020年1月—2021年3月逐小时的2 m气温网格数据,选取海拔、经度、纬度等地形因子,综合应用LightGBM(LGB)、XGBoost(XGB)、梯度提升树(gradient boosting tree, GBT)和随机森林(random forest, RF)4种机器学习方法,实现1 km分辨率的2 m气温网格数据降尺度至100 m,并对4种机器学习降尺度结果进行加权融合。将不同模型降尺度结果与双线性插值结果对比,结果表明: 各降尺度模型结果与站点观测值较为一致,LGB,XGB和RF模型与双线性插值降尺度结果空间结构相似,但更为精细; 各降尺度模型具有相同的时空误差分布特征,与双线性插值结果相比,LGB,XGB和GBT的数据精度均有明显提高,均方根误差(root mean square error,RMSE)分别降低了5.2%,4.1%和4.6%,而加权融合后的RMSE降低了5.9%,优于单一机器学习模型; LGB,XGB和GBT模型对不同地形条件下的降尺度结果均具有一定改善,尤其对高海拔地区(海拔在600 m以上)的改进效果更为显著,LGB,XGB和BGT和融合模型的相关系数分别提高了0.45%,0.40%,0.63%和0.66%,RMSE分别降低了9.1%,8.0%,12.7%和13.1%。研究显示,多种机器学习加权融合的降尺度模型兼顾了提升空间分辨率和保持数据精度两方面的要求,适用于研究区2 m气温数据的降尺度研究,为研制高分辨率数据产品具有一定参考意义。Abstract: High-resolution meteorological data serve as an important data basis for fine-scale meteorological services. Using the hourly 2-meter air temperature grid data from January 2020 to March 2021 and the terrain factors such as altitude, longitude, and latitude, this study aimed to enhance the resolution of 2-meter air temperature grid data with a resolution of 1 km to 100 m through downscaling based on four machine learning methods, namely LightGBM (LGB), XGBoost (XGB), gradient boosting tree (GBT), and random forest (RF). Then, this study conducted the weighted fusion of downscaling results of different models. Finally, the downscaling results of different models were compared with the bilinear interpolation results, and the results are as follows. The results of each downscaling model were relatively consistent with the observational data. Compared with the bilinear interpolation results, the results of the LGB, XGB, and RF models had similar spatial structures but were more detailed. All downscaling models yielded the same spatio-temporal distribution characteristics of errors. Compared with the bilinear interpolation results, the data of the LGB, XGB, and GBT models showed significantly higher precision, and their root mean square errors (RMSEs) decreased by 5.2%, 4.1%, and 4.6%, respectively. Meanwhile, the RMSE after weighted fusion decreased by 5.9%, which was higher than that of any single machine learning model. The downscaling results of the LGB, XGB, and GBT models were improved to a certain degree compared with the bilinear interpolation results under different terrain conditions, especially in high-altitude areas (above 600 m). The correlation coefficients of results of the LGB, XGB, and BGT models and model based on weighted fusion increased by 0.45%, 0.40%, 0.63%, and 0.66%, respectively, and their RMSEs decreased by 9.1%, 8.0%, 12.7%, and 13.1%, respectively. These results indicate that the downscaling model based on the weighted fusion of different machine learning methods can both improve spatial resolution and maintain data precision and, thus, is suitable for downscaling research on 2-meter air temperature data in the study area. This study can be used as a reference for developing high-resolution data products.
-
Key words:
- real-time grid product /
- 2-meter air temperature /
- downscaling /
- machine learning /
- weighted fusion
-
-
[1] 李显风, 张玮, 李芬, 等. 基于WebGIS的实况网格产品应用分析平台及关键技术[J]. 气象科技, 2020, 48(2):185-194.
[2] Li X F, Zhang W, Li F, et al. An WebGIS-based real-time application/analysis platform of grid products and key technology[J]. Meteorological Science and Technology, 2020, 48(2):185-194.
[3] 张玮, 李显风, 何瑶, 等. 基于江西省站点资料对智能网格实况分析产品的检验评估[J]. 江西师范大学学报(自然科学版), 2020, 44(4):403-410.
[4] Zhang W, Li X F, He Y, et al. The quality assessment of the intelligent grid real-time analysis products using the station data in Jiangxi Province[J]. Journal of Jiangxi Normal University (Nature Science), 2020, 44(4):403-410.
[5] 韩帅, 师春香, 姜志伟, 等. CMA高分辨率陆面数据同化系统(HRCLDAS-V1.0)研发及进展[J]. 气象科技进展, 2018, 8(1):102-108.
[6] Han S, Shi C X, Jiang Z W, et al. Development and progress of high resolution CMA land surface data assimilation system[J]. Advances in Meteorological Science and Technology, 2018, 8(1):102-108.
[7] 潘旸, 谷军霞, 徐宾, 等. 多源降水数据融合研究及应用进展[J]. 气象科技进展, 2018, 8(1):143-152.
[8] Pan Y, Gu J X, Xu B, et al. Advances in multi-source precipitation merging research[J]. Advances in Meteorological Science and Technology, 2018, 8(1):143-152.
[9] 师春香, 潘旸, 谷军霞, 等. 多源气象数据融合格点实况产品研制进展[J]. 气象学报, 2019, 77(4):774-783.
[10] Shi C X, Pan Y, Gu J X, et al. A review of multi-source meteorological data fusion products[J]. Acta Meteorologica Sinica, 2019, 77(4):774-783.
[11] 李显风, 张玮, 黄少平, 等. 江西省气象-水文-雷达小时雨量信息融合试验结果分析[J]. 暴雨灾害, 2020, 39(3):276-284.
[12] Li X F, Zhang W, Huang S P, et al. Analysis of fusion test results on hourly precipitation from meteorological and hydrological stations and Radar[J]. Torrential Rain and Disasters, 2020, 39(3):276-284.
[13] Kustas W P, Norman J M, Anderson M C, et al. Estimating subpixel surface temperatures and energy fluxes from the vegetation index-radiometric temperature relationship[J]. Remote Sensing of Environment, 2003, 85(4):429-440. [8 ] Agam N, Kustas W P, Anderson M C, et al. A vegetation index based technique for spatial sharpening of thermal imagery[J]. Remote Sensing of Environment, 2007, 107(4):545-558.
[14] 张晓. 天山中段卫星降水数据的降尺度研究及时空特征分析[D]. 兰州: 西北师范大学, 2013.
[15] Zhang X. Study on the downscaling algorithm of remote sensing precipitation and analysis of temporal-spatial characteristic in the middle section of Mount Tianshan[D]. Lanzhou: Northwest Normal University, 2013.
[16] Duan S B, Li Z L. Spatial downscaling of MODIS land surface temperatures using geographically weighted regression:Case study in northern China[J]. IEEE Transactions on Geoscience and Remote Sensing, 2016, 54(11):6458-6469.
[17] Yang G J, Pu R L, Huang W J, et al. A novel method to estimate subpixel temperature by fusing solar-reflec-tive and thermal-infrared remote-sensing data with an artificial neural network[J]. IEEE Transactions on Geoscience and Remote Sensing, 2010, 48(4):2170-2178.
[18] Hutengs C, Vohland M. Downscaling land surface temperatures at regional scales with random forest regression[J]. Remote Sensing of Environment, 2016, 178:127-141.
[19] 颜佳楠, 陈虹, 姚光林, 等. 基于XGBoost的LST空间降尺度方法[J]. 无线电工程, 2021, 51(12):1508-1516.
[20] Yan J N, Chen H, Yao G L, et al. A spatial downscaling method of land surface temperature based on XGBoost algorithm[J]. Radion Engineering, 2021, 51(12):1508-1516.
[21] 尹枷愿, 蔡宏, 田鹏举, 等. 喀斯特地区地表温度空间降尺度方法初探[J]. 地理与地理信息科学, 2021, 27(3):38-46.
[22] Yin J Y, Cai H, Tian P J, et al. Spatial downscaling research of the land surface temperature in karst region[J]. Geography and Geo-Information Science, 2021, 27(3):38-46.
[23] 徐彬仁, 魏瑗瑗. 基于随机森林算法对青藏高原TRMM降水数据进行空间统计降尺度研究[J]. 国土资源遥感, 2018, 30(3):181-188.doi:10.6046/gtzyyg.2018.03.25.
[24] Xu B R, Wei Y Y. Spatial statistics of TRMM precipitation in the Tibetan Plateau using random forest algorithm[J]. Remote Sensing for Land and Resources, 2018, 30(3):181-188.doi:10.6046/gtzyyg.2018.03.25.
[25] 张朝忙, 刘庆生, 刘高焕, 等. STRM3与ASTER GDEM数据处理及应用进展[J]. 地理与地理信息科学, 2012, 28(5):29-34.
[26] Zhang C M, Liu Q S, Liu G H, et al. Data processing and application progress of SRTM3 and ASTER GDEM[J]. Geography and Geo-Information Science, 2012, 28(5):29-34.
[27] 康俊锋, 黄烈星, 张春艳, 等. 多机器学习模型下逐小时PM2.5预测及对比分析[J]. 中国环境科学, 2020, 40(5):1895-1905.
[28] Kang J F, Huang L X, Zhang C Y, et al. Hourly PM2.5 prediction and comparative analysis under multi-machine learning model[J]. China Environment Science, 2020, 40(5):1895-1905.
[29] 胡占占, 陈传法, 胡保健. 基于时空XGBoost的中国区域PM2.5浓度遥感反演[J]. 环境科学学报, 2021, 41(10):4228-4237.
[30] Hu Z Z, Chen C F, Hu B J. Estimating PM2.5 concentrations across China based on space-time XGBoost approach[J]. Acta Scientiae Circumstan tiae, 2021, 41(10):4228-4237.
[31] 余东昌, 赵文芳, 聂凯, 等. 基于LightGBM算法的能见度预测模型[J]. 计算机应用, 2021, 41(4):1035-1041.
[32] Yu D C, Zhao W F, Nie K, et al. Visibility forecast model based on LightGBM algorithm[J]. Journal of Computer Application, 2021, 41(4):1035-1041.
-
计量
- 文章访问数: 1760
- PDF下载数: 161
- 施引文献: 0