摘要:
遥感图像数据中云和云阴影的存在是影响数据应用的主要原因,专家已经研发了多种去除云及其阴影的方法。在对不同目标像元光谱曲线分析的基础上,研究了基于随机森林(random forests,RF)分类器的云-云阴影-水体掩模建立方法。由于云阴影是阴影与地表物体的叠加,其光谱曲线与水体的光谱曲线之间存在细微的差别,这使得决策树( decision tree ,DT)分类方法不能非常有效地应对这种细微差别。 RF分类器是建立在多个DT分类结果集成的基础上,其算法原理保证了该算法的稳健性和有效性。研究结果表明:在样本容量较少时,RF算法比DT具有更好的分类效果;而在样本容量增大到250~400个像元时,2种方法的分类效果没有明显区别。这表明RF算法可以成功地用于建立云-云阴影-水体掩模,这将在遥感数据处理中得到更加广泛的应用。
Abstract:
Clouds and their shadows in the remote sensing images are the key factors that influence the application of the data in many fields .Several methods , such as constructing cloud mask , replacement of the pixels , linear mixture spectral analysis , and principal component analysis , have been proposed in the past decades to solve this problem.In this research, based on the analysis of spectral curve , the authors utilized Decision Tree (DT)classifier and Random Forest ( RF) classifier to obtain the cloud -shadow-water mask.There was little difference between the spectral curve of shadow and water due to the mixture of shadow and other surface materials such as vegetation and impervious surface .In this case , the DT classifier could not effectively distinguish shadow and water because the decision rule and threshold were determined by analyzing the spectral curves of different samples .RF classifier was based on the ensemble of the results derived from multiple decision tree classifiers , which was more robust than one decision tree classifier .In this study , when there were only a few training samples , results that were more accurate were derived from RF classifier compared with the results from DT classifier .When the size of training samples lay in the range of 250 and 400 , no significant difference was found between the results derived from these two algorithms.This indicates that RF classifier could be used to deduce the cloud -cloud shadow -water mask successfully .