Next Article in Journal
Using Choice Experiments as a Planning Tool for Reforestation after Extreme Events: The Case of the Vaia Windstorm in Italy
Next Article in Special Issue
Forest Canopy Water Content Monitoring Using Radiative Transfer Models and Machine Learning
Previous Article in Journal
Wildfire and Climate Impacts Tree Hollow Density in a Temperate Australian Forest
Previous Article in Special Issue
Forest Segmentation with Spatial Pyramid Pooling Modules: A Surveillance System Based on Satellite Images
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Automatic Extraction of the Spatial Distribution of Picea schrenkiana in the Tianshan Mountains Based on Google Earth Engine and the Jeffries–Matusita Distance

1
College of Geography and Remote Sensing Sciences, Xinjiang University, Urumqi 830017, China
2
College of Ecology and Environment, Xinjiang University, Urumqi 830017, China
3
Key Laboratory of Oasis Ecology, Xinjiang University, Urumqi 830017, China
4
Xinjiang Jinghe Observation and Research Station of Temperate Desert Ecosystem, Ministry of Education, Urumqi 830017, China
*
Authors to whom correspondence should be addressed.
Forests 2023, 14(7), 1373; https://doi.org/10.3390/f14071373
Submission received: 5 May 2023 / Revised: 15 June 2023 / Accepted: 30 June 2023 / Published: 4 July 2023

Abstract

:
As a distinct species in the Tianshan Mountains (TS) of Central Asia (CA), Picea schrenkiana plays a significant role in water purification, soil and water conservation, and climate regulation. In the context of climate change, rapidly and accurately obtaining its spatial distribution has critical decision-making significance for maintaining ecological security in the arid area of CA and the sustainable development of the “Silk Road Economic Belt”. However, conventional methods are extremely challenging to accomplish the high-resolution mapping of Picea schrenkiana in the TS, which is characterized by a wide range (9.97 × 105 km2) and complex terrain. The approach of geo-big data and cloud computing provides new opportunities to address this issue. Therefore, the purpose of this study is to propose an automatic extraction procedure for the spatial distribution of Picea schrenkiana based on Google Earth Engine and the Jeffries–Matusita (JM) distance, which considered three aspects: sample points, remote-sensing images, and classification features. The results showed that (1) after removing abnormal samples and selecting the summer image, the producer accuracy (PA) of Picea schrenkiana was improved by 2.95% and 0.24%–2.10%, respectively. (2) Both the separation obtained by the JM distance and the analysis results of eight schemes showed that spectral features and texture features played a key role in the mapping of Picea schrenkiana. (3) The JM distance can seize the classification features that are most conducive to the mapping of Picea schrenkiana, and effectively improve the classification accuracy. The PA and user accuracy of Picea schrenkiana were 96.74% and 96.96%, respectively. The overall accuracy was 91.93%, while the Kappa coefficient was 0.89. (4) The results show that Picea schrenkiana is concentrated in the middle TS and scattered in the remaining areas. In total, 85.7%, 66.4%, and 85.9% of Picea schrenkiana were distributed in the range of 1500–2700 m, 20–40°, and on shady slope and semi-shady slope, respectively. The automatic procedure adopted in this study provides a basis for the rapid and accurate mapping of the spatial distribution of coniferous forests in the complex terrain.

1. Introduction

As one of the most valuable terrestrial ecosystems, the forest ecosystem plays a vital role in maintaining global carbon balance and slowing down the rise in the greenhouse gas concentration [1]. The carbon stock of forest ecosystems accounts for about 33%–46% of terrestrial ecosystems [2]. Under the background of “carbon neutrality”, global changes caused by both natural and human factors directly or indirectly affect forest ecosystems, which has aroused widespread concern in the whole society [3]. The arid area of Central Asia (CA), which is connected to China by mountains and rivers, is a key hub zone in the construction of “the Silk Road Economic Belt” and a hot area concerned with the international community [4]. The Tianshan Mountains (TS) is one of the most important parts in the arid area of CA, in which the forest ecosystem has critical ecological values and functions such as carbon sink [5], water resource conservation, flood regulation, water purification, and soil and water conservation. The TS is highly essential for maintaining the ecological security of the downstream areas and the sustainable development of the whole arid zone [6]. The arid areas of CA, far from the sea and interspersed with mountains and basins, have light precipitation but intense evaporation, severe water shortages, and fragile ecosystems. Therefore, this region is a dramatic response region to global changes [7,8]. In the past half-century, although the trend of “warming and humidification” has occurred in the arid area of CA [9], it has not slowed down the ecological degradation in the region, because the positive ecological effect of increasing precipitation is not sufficient to offset the negative effect of rising temperatures [10]. In addition, the normalized difference vegetation index (NDVI) of the natural vegetation growing season in CA has declined at a rate of 0.0003 per year due to a sudden increase in the temperature since 1998 [10]. All of the above studies indicate that the natural vegetation in this area shows browning as a whole. Picea schrenkiana is a coniferous forest species with an altitude range of 1400–2800 m on the northern slopes of the TS, and is particularly sensitive to climate change [11]. Timely and accurate acquisition of its spatial distribution information is of great significance to ensure the sustainable development of the arid area of CA and “the Silk Road Economic Belt”.
Remote sensing is the most time-efficient and economical method for obtaining large spatial-scale land-cover information [12,13]. With the improvement in remote-sensing image resolution, remote-sensing technology plays a vital role in forest-cover extraction and forest-change monitoring [14]. In Vietnam, researchers utilized Landsat images from 1973 to 2020 to identify changes in the spatial distribution of mangroves in Thanh Hoa and Nghe An provinces, and analyzed the reasons for these changes [15]. The researchers first determined the range of cultivated land in the Guanzhong region based on the land-use map, and then distinguished between the grain/non-grain crops based on MODIS vegetation index data. Finally, the non-grain production of cultivated land was extracted [16]. The above studies on land-cover extraction and forest-change monitoring mainly used Landsat data with high spatial resolution and MODIS data with high temporal resolution. Landsat data are limited by the temporal resolution and rainy weather, and MODIS data are limited by the spatial resolution and mixed pixels, which when used separately do not meet the requirements for fine forest-cover extraction and forest-change monitoring. To solve this problem, some scholars [17,18] implement the method for integrating different remote-sensing data to make comprehensive use of the advantages of different sensors in time and space. However, the traditional data acquisition and data processing and analysis methods based on stand-alone off-line remote-sensing analysis software (ENVI and ERDAS) have great limitations [19], which cannot meet the needs of wide-area and high spatial resolution fine forest-cover extraction and forest-change monitoring.
The rapid development of remote-sensing cloud-computing platforms has become a new way to solve geographical and spatial-related problems. Researchers can directly invoke the huge amount of data on the platform, in conjunction with the algorithms provided by it, to implement online analysis and processing, as well as visualization operations, which considerably increase the efficiency of data analysis and processing. So far, Google Earth Engine (GEE) has been used in forestry [20], agriculture [21,22], wetlands [23], disaster monitoring [24,25], and land-use change [26,27]. In terms of vegetation information extraction, Liu et al. [28] selected Landsat images and used decision-tree classification to extract the spatial distribution information of mangroves in Yuxian County in Vietnam, and monitored the inter-annual dynamic changes in mangroves in the region from 1993 to 2017. Li et al. [29] selected Landsat OLI and MODIS NDVI data and fused spectral and phonological characteristics to extract distribution information on rubber forests in Thailand, Malaysia, and Indonesia. Xu et al. [30] used the GEE platform and multi-temporal Landsat images to extract information from citrus orchards in southern Jiangxi based on the random forest (RF) method, and the average OA reached 93.15%. The above results show that with the help of GEE, there are three main features for the extraction of vegetation information by domestic and foreign scholars. First, this vegetation is the most important cash crop in the region, such as rubber and citrus, which plays an essential role in the prosperity of farmers and the economic development of the region. Second, it has a major ecological value and ecological function. Third, it grows mainly in tropical and subtropical regions, such as Southeast Asia and Yunnan, Guangxi, Guangdong, and Fujian provinces.
At present, the research on Picea schrenkiana mainly includes nitrogen addition [31], individual genomics [32], stoichiometric characteristics [33], population dynamics [34], and biomass estimation [35,36]. However, the spatial distribution pattern of Picea schrenkiana in the TS is not clear, which has become the bottleneck for the scientific management and utilization of the plant resources. Therefore, we proposed an automatic extraction procedure for the spatial distribution of Picea schrenkiana based on GEE and the Jeffries–Matusita (JM) distance. The key research contents are as follows: (1) Can eliminating abnormal sample data improve the accuracy of land-cover classification in the TS of CA? (2) The difference in land-cover classification accuracy in the TS of CA under different seasonal images. (3) Feasibility analysis of using the JM distance to determine the importance of classification features. (4) Eight different scenarios were designed to determine the effects of different types of classification features and the best scenario was to extract the spatial distribution of Picea schrenkiana with a resolution of 10 m in the TS.

2. Materials and Methods

2.1. Study Area Description

This study was conducted in the TS (69°–95° E, 39°–46° N), with a total area of about 9.97 × 105 km2 (Figure 1). Due to its vast area and distance from the ocean, the TS is the farthest mountain system from the ocean in the world [37] and also the largest mountain system in the arid area of the world. The main topography of the TS consists of mountains, intermountain basins, and piedmont plains. The mean altitude of the ridge is about 4000 m and the highest peak has an altitude of about 7443.8 m [38]. Located in the northern temperate zone and far from the sea, the area is characterized by a temperate continental climate. The annual average temperature is about 5.0 °C [39], and the annual average precipitation ranges from 250 to 300 mm. The vegetation distribution of the TS has a typical vertical zone. From low altitude to high altitude, there are mountain steppe, meadow steppe, mountain coniferous forest, alpine steppe, alpine cushion plants, and snow-covered glaciers [33]. Picea schrenkiana is the dominant species in mountain conifer forests. According to the field survey data and existing literature (Table 1), the elevation range of Picea schrenkiana on the northern slope of the TS in CA is from 1400 to 2800 m. To ensure an accurate extraction of Picea schrenkiana, a 1300 m contour was drawn from ArcGIS software and SRTM data with a spatial resolution of 90 m. The field enclosed by the contour at 1300 m is defined as the region studied in this paper.

2.2. Remote-Sensing Data

Sentinel-1 satellites are equipped with a dual-polarization C-band synthetic aperture radar (SAR) instrument, which provides SAR images with high temporal and spatial resolution. Sentinel-1 data are interferometric wide swath mode (IW) images, which is the standard mode over land [43] and corresponds to COPERNICUS/S1_GRD in GEE. The Sentinel-1 Toolbox has been used to preprocess the dataset, including thermal noise removal, radiometric calibration, and terrain correction [44]. Here, we exclusively utilized the vertical transmit/horizontal receive (VH) and vertical transmit/vertical receive (VV) bands. We also used Sentinel-2 data from the GEE archive. Sentinel-2 consists of two satellites (Sentinel-2A and Sentinel-2B), each carrying a multi-spectral imager (MSI) to acquire data in 13 spectral bands along a 290 km sun-synchronous orbit path every 5 days. Since Sentinel-2 MSI, Level-2A data were not provided fully on GEE’s data pool, Level-1C data were employed in this work, which corresponds to COPERNICUS/S2 in GEE. The datasets are products of the top of the atmosphere (TOA) after radiometric calibration and terrain correction.
Due to the wide coverage of the study area, it is necessary to combine images taken at different times to obtain high-quality image data covering the study area. At the same time, considering the factors of vegetation growth seasons, the images are divided into the four seasons of spring, summer, autumn, and winter (Table 2). The ee. Filter.calendarRange function was used to obtain images in the time range of the four seasons. The shapefile for the study area was imported into the GEE platform and the ee. filterBounds filter was employed to obtain images of the region of interest. For Sentinel-2 images, CLOUDY_PIXEL_PERCENTAGE of the four seasons was set to different values to remove all Sentinel-2 images of poor quality and obtain the number of pixel-by-pixel high-quality observation images in the study area (Table 2). The results showed that during the study period, the number of high-quality Sentinel-2 images from the four seasons in the study area ranged from 1 to 288 (Figure 2). The ee. Reducer.median function was then utilized to create a single image by calculating the median of all values on each pixel across the stack of all images. Finally, the Sentinel-1 composite image and high-quality Sentinel-2 composite image covering the study area were achieved for subsequent studies.

2.3. Training and Validation Sample Points

The accuracy of classification results is intimately related to the representativeness and quantity of samples [45]. Under realistic conditions for the TS and available land-cover data, the land-cover categories in this region were divided into seven types to separate Picea schrenkiana from the rest of the land uses. These include Picea schrenkiana, grassland, cropland, built, bare land, snow and ice, and waterbody. Due to the large extent of the study area, visual interpretation of a large number of sample points not only requires a lot of work, but also fails to properly distinguish between land-cover classes. To address this issue, we combined visual interpretation using Google Earth VHRI with semi-automatic extraction of sample points. On the one hand, the Picea schrenkiana samples were collected from VHRI on Google Earth. On the other hand, the non-Picea schrenkiana data (e.g., grassland, cropland, built, bare land, snow and ice, and waterbody) were collected by the semi-automatic extraction of sample points method. Specifically, semi-automatic extraction of sample points refers to the use of two tools (Create Random Points tool and Extract Multi Values to Points tool) in ArcGIS software and ESA World Cover data to obtain a large number of samples more accurately and quickly. Furthermore, the coordinates of 99 field survey Picea schrenkiana samples (Figure 1) were collected through GPS in the summer of 2019. Finally, a total of 8159 samples (Table 3) were collected, and 70% of these (5866) were used for model training and 30% (2293) for validation.

2.4. Classification Feature Input

Seven categories of features are applied for classification in this study (Table 4). These include radar features, spectral features, red-edge features, spectral indices, texture features, and terrain features. Multiple lines of evidence showed that texture features, especially texture features based on SAR data, can avoid the phenomenon of “ same spectrum foreign objects” and “ same object foreign spectrum”, to improve the accuracy of remote-sensing image classification [46]. A gray-level co-occurrence matrix (GLCM), proposed by Haralick in 1973 [47], is the most common statistical analysis method for calculating texture features at present. Therefore, this study used the algorithm glcmTexture provided by GEE to compute the texture metrics with a window size of 3 × 3 and an offset of 1 × 1 from the GLCM around each pixel of Sentinel-1 bands (VV and VH) [48]. Considering that Picea schrenkiana is a single group tree species on the north slope of the TS at an altitude of 1400–2800 m, the ee. Algorithms. Terrain and ee. Image (“USGS/SRTMGL1_003”) function provided by the GEE platform was used to calculate terrain features.

2.5. The Method for Eliminating Abnormal Samples

The Box-plot developed by American statisticians in 1977 [49] is a common method to check whether there are outliers in a group of data. Compared with the Grubbs method, 3σ method, and Z-score method, the Box-plot has a wider application because it does not require the data to obey normal distribution to determine outliers. The 1st quartile (Q1), 2nd quartile (Q2), and 3rd quartile (Q3) are the numbers placed at 25%, 50%, and 75% of a set of data in ascending order, respectively [50] (Figure 3). The difference between Q1 and Q3 is called the interquartile range (IQR). The criteria for identifying outliers in the Box-plot are whether the data are greater than Q3 + 1.5IQR or less than Q1 − 1.5IQR in a group of data [51]. Minimum and maximum are the minimum and maximum values except for outliers. In the process of visual interpretation, Google Earth VHRI and semi-automatic extraction of sample points were used to determine land-cover categories. It is impossible to ensure that all sample points are pure and typical, which leads to the existence of outliers in the value of the extracted classification feature. For a specific sample, if the number of outliers in the 13 extraction values based on spectral features is greater than or equal to 2, the sample is considered to be an abnormal sample. Therefore, the outliers of the extracted values are identified based on the Box-plot and then the abnormal samples are judged and eliminated. By comparing the Kappa score and OA before and after eliminating abnormal samples, the influence of abnormal samples on classification accuracy was analyzed.

2.6. The Method for Classification Feature Optimization

Considering that some features selected in this study could not satisfy the requirement to distinguish Picea schrenkiana from other land-cover categories, redundant information existed in classification features [52], which affected the accuracy of mapping the spatial distribution of Picea schrenkiana. For this problem, the purpose of separability analysis is to evaluate the ability of each feature to distinguish Picea schrenkiana from other land-cover categories. There are three common methods to determine the separability between different categories: the separability index (SI) [53,54], the transformed divergence (TD) [55], and the JM distance [56,57]. Compared with the SI and TD, the JM distance can quantitatively evaluate the separability between different categories. Thus, the JM distance is used to evaluate the ability of each feature to distinguish Picea schrenkiana from other categories, to determine the best feature combination to extract Picea schrenkiana. The JM value is between 0 and 2, and its size represents the degree of separation of the seven land-cover categories under the selected features. When JM = 2, it indicates that seven types of land-cover categories are entirely separated under the selected classification features [58,59]. The expression for the JM distance is as follows:
JM = 2 1 e B
where B represents the Bhattacharyya distance on a feature dimension. On the premise that normal distribution is satisfied, the Bhattacharyya distance between samples of different land-cover types is:
B = 1 8 ( e 1 e 2 ) 2   2 δ 1 2 + δ 2 2 + 1 2 l n δ 1 2 + δ 2 2 2 δ 1 · δ 2
where ek represents the mean value of a certain type of feature; δ k 2 represents the variance of a certain type of feature, where (k = 1, 2).

2.7. Random Forest-Based Classifier

RF algorithm is a non-parametric machine-learning algorithm of multiple decision-tree combinations proposed by Breiman Leo and Adele Cutler in 2001 [60]. In contrast to machine-learning algorithms such as decision trees and support vector machines, the RF algorithm is stable and effective in processing data multi-collinearity and dimensionality [61,62] without an over-fitting phenomenon [63]. Correspondingly, the RF algorithm was chosen and used in this study to map the spatial distribution of Picea schrenkiana in the TS of the CA.
The dominant construction procedure of the RF algorithm is as follows: based on the bootstrap sampling method, 2/3 of the samples are randomly chosen from the original sample set to construct the decision-tree model. Assuming that the features of each sample are M dimensions, m feature subsets (m << M) are randomly extracted from them, and the optimal feature is selected from these m features to split the node of the decision tree. The above procedure is iterated n times to obtain a random forest composed of n decision trees, and the optimal result is chosen from the n decision trees by voting for the classification result for each classification sample. In this study, a decision tree of size 50 was selected as optimal.

2.8. Description of Different Scenarios

Eight scenarios (Table 5) were designed to illustrate the effects of different types of classification features and the optimal classification features conducive to the extraction of Picea schrenkiana. We selected the best scenario and optimized the parameters of the classifier to extract the spatial distribution of Picea schrenkiana with a resolution of 10 m in the TS.

2.9. The Method for Accuracy Assessment

Accuracy evaluation is performed by constructing a confusion matrix based on the 2293 validation sample points and classification results from this study. The purpose of constructing the confusion matrix is to obtain accuracy assessment indexes, such as producer accuracy (PA), user accuracy (UA), the OA, and the Kappa score. The PA is the ratio of the number of pixels correctly assigned by the classifier to the actual total number of references in the category, which is used to measure the omission error of each category. The UA represents the ratio between the number of pixels correctly assigned to this category and the total number of pixels classified into this category by the classifier, which is used to measure the commission error of this category. The OA and Kappa scores are used to evaluate the overall classification accuracy of all categories. The four evaluation indices are calculated as follows:
PA = N i j N j × 100 %
UA = N i j N i × 100 %
OA = i = 1 q N i i N × 100 %
Kappa = [ N i = 1 q N i i i = 1 q ( N i N i ) ] [ N 2 i = 1 q ( N i N i ) ]
Nij is the value of row i and column j in the confusion matrix, N∗j is the sum of the j column of the confusion matrix, Ni∗ is the sum of row i of the confusion matrix, N is the total number of validation sample points, and q is the number of rows and columns of the confusion matrix.

3. Results

3.1. The Result of Eliminating Abnormal Samples

A total of 656 anomalous samples were selected by using the method described in Section 2.5. These include 117 Picea schrenkiana, 235 grassland, 56 cropland, 56 built, 139 bare lands, 24 snow and ice, and 29 waterbody. In this study, Picea schrenkiana was taken as an example to draw scatter plots of the extraction values of Picea schrenkiana samples in the spectral features (BLUE, GREEN, and RED) before and after the removal of abnormal samples. It can be seen from Figure 4 that after the removal of abnormal samples, the dispersion degree of the extraction values of Picea schrenkiana samples in the spectral features (BLUE, GREEN, and RED) decreased (Figure 4). In addition, the accuracy assessment index also changed. After the removal of abnormal samples, the OA and Kappa scores increased by 1.43% and 0.02, respectively, and the PA of Picea schrenkiana increased by 2.95%, but its UA decreased by 1.68% (Table 6).

3.2. Classification Results of Single Seasonal Images

The classification accuracy of different seasonal images is different; the OA and Kappa score was higher in spring and summer than in autumn and winter (Table 7). The difference in the classification accuracy between different seasonal images is mainly caused by three land-cover categories: cropland, snow and ice, and waterbody. The main reason is that compared with other land-cover categories (Picea schrenkiana, grassland, built, and bare land), the seasonal variation in the three land-cover categories (cropland, snow and ice, and waterbody) is more obvious. The PA of Picea schrenkiana was the highest in summer and the lowest in winter (96.04% and 95.34, respectively), while the UA was the highest in spring and the lowest in winter (96.71% and 96.24%, respectively). Although the summer UA of Picea is lower than the spring UA, the summer PA of Picea schrenkiana increases by 0.24%–2.10% compared to the other three seasons. In addition, the OA and Kappa scores increased by 0.48%–5.28% and 0.01–0.07, respectively, in the summer compared to the other three seasons. As a result, the summer was used as the optimal sextant for mapping the spatial distribution of Picea schrenkiana. This conclusion is in accordance with the actual situation, as a great variety of plants are in a period of vigorous growth during the summer, and the physical properties and chlorophyll of the different plants are significantly different.

3.3. The Results of Feature Optimization

The JM distance of different classification features is considerably different (Table 8). The feature with the largest JM distance is RED, while the smallest feature is VH CORR and VV CORR. The maximum distance is approximately 50 times the minimum distance. Based on the JM distances of the 42 categorical features in order from highest to lowest, the variation in the Kappa score was observed and the best combination of categorical features for mapping the spatial distribution of Picea schrenkiana was determined accordingly. With the increase in the number of features involved in classification, the Kappa score in the early stage (1–7 classification features) increased considerably, rapidly increasing from 0.410 to 0.821 (Table 8). In the middle period (8–17 classification features), the increased range of the Kappa score decreased significantly, showing a steady upward trend. In the later period (18–42 classification features), the Kappa score fluctuated slightly, fluctuating around 0.88. When the number of classification features used was 32 and 42, the Kappa score was 0.894 and 0.898, respectively. Considering that the difference between them is only 0.004, the first 32 classification features are selected as the final results of the feature optimization in this study.
The JM distances of the first 32 features were classified and analyzed (Table 9). The spectral features and texture features show high separation, while the radar features and terrain features show low separation. Among the spectral features, RED, GREEN, and BLUE have a high separation. The spectral features play an important role in forest mapping and are more prominent than other features in distinguishing Picea schrenkiana from other land-cover categories. In addition, the texture features play a key role in mapping the spatial distribution of Picea schrenkiana, as Picea schrenkiana interleaves with other land-cover categories such as grassland and bare land in patches.

3.4. Accuracy Analysis under Different Scenarios

Among the six scenarios (scenario 1- scenario 6), the PA and UA of mapping the spatial distribution of Picea schrenkiana were between 95.57%–97.44% and 95.40%–96.27%, respectively (Table 10). The OA of scenario 1 based on Sentinel-2 bands was 88.75%, and the Kappa score was 0.85. Based on the spectral features, the texture features (scenario 2), spectral indices (scenario 3), red-edge indices (scenario 4), radar features (scenario 5), and terrain features (scenario 6) were added, respectively, and the OA and Kappa scores were slightly increased. The OA scores increased by 0.39%, 1.09%, 0.7%, 0.61%, and 2.35%, while the Kappa scores increased by 0.01, 0.02, 0.01, 0.01, and 0.03, respectively. These results indicated that the spectral features had the best identification ability for Picea schrenkiana, while the other features had no obvious influence. However, the texture and terrain features still played a positive role in the land-use classification of the TS.
Compared with the other seven scenarios, scenario 7 obtained the best classification effect after optimizing all features by using the JM distance, and the OA and Kappa scores were 91.93% and 0.89 (Table 10), respectively. Compared to scenario 8, where all features participate in the classification, the OA and Kappa scores in scenario 7 are improved by 0.09% and 0.001, respectively, and the number of features in scenario 7 are less. Therefore, scenario 7 was selected as the final scheme for mapping the spatial distribution of Picea schrenkiana in the TS of CA.

3.5. Spatial Distribution of Picea schrenkiana in the Best Scenario

By comparing and analyzing different classification scenarios in Section 3.4, scenario 7 achieves the optimal classification effect, and its confusion matrix is shown in Table 11. Picea schrenkiana and open water have the best mapping results, with both the PA and UA above 95%. The PA and UA of grassland, built, and bare land are both above 85%. The mapping accuracy of cropland and snow and ice is not high and the PA of cropland is only 73.68%. The spatial distribution of Picea schrenkiana in the optimal classification scenario (scenario 7) is shown in the figure and six places are randomly selected for detailed display. Picea schrenkiana is concentrated in the middle TS and scattered in the remaining areas (Figure 5). In addition, the pattern boundary of Picea schrenkiana based on the method presented in this paper has a strong consistency with the optical image (Figure 6).
A total of seven intervals with a separation of 300 m were set in the elevation distribution range of Picea schrenkiana, and the variation characteristics of Picea schrenkiana with altitude were observed. The distribution area of Picea schrenkiana first increases and then decreases with altitude (Figure 7, Elevation). Picea schrenkiana occupies the largest distribution area in the altitude range of 2100 to 2400 m, accounting for 29.7%. In total, 75.7% of the Picea schrenkiana is distributed in a range of elevation from 1800 to 2700 m. This study shows that Picea schrenkiana distributes in the TS of CA with a slope range of 0–69°. A total of seven slope ranges were set at intervals of 10° to observe the change in Picea schrenkiana distribution. In total, 33.9% of the Picea schrenkiana was distributed in the slope range of 30–40°, accounting for the largest proportion of all the intervals, and 66.4% of the Picea schrenkiana was distributed in the slope range of 20–40° (Figure 7, Slope). According to the classification method for aspects in forestry investigation, four aspects (semi-shady slope, shady slope, semi-sunny slope, and sunny slope) are set. Only 14.1% of the Picea schrenkiana were distributed in sunny and semi-sunny slopes, while 85.9% of the Picea schrenkiana were distributed in shady and semi-shady slopes (Figure 7, Aspect).

4. Discussion

4.1. The Influence of Feature Selection on Classification Results

In this study, the JM distance was used to explore the best classification features of land-use classification and map the spatial distribution of Picea schrenkiana in the TS. SWIR2, GREEN, BlUE, RDED1, CIRRU, SWIR1, MNDVI, AEROS, VH ASM, VH ENT, LSWI, MNDWI, EVI, and VAPOR were the top classification features in a sequence of importance (Table 8). Jiang et al. [42] selected an area of 10,000 km2 in the TS for the remote-sensing identification of mountain coniferous forest and screened the ELEVATION, NDVI, spectral slope of red to near-infrared band, BLUE, RED, SWIR1. SLOPE is the most important classification feature of mountainous coniferous forest in the TS. SWIR1, RED, BLUE, SWIR1, and MNDVI (NDVI) in this study were consistent with Jiang’s research conclusion. The important reasons for BLUE and SWIR1 are that the bare land of the study area is large and the two classification features are sensitive to the soil background [64]. RED and NDVI are sensitive to vegetation and play a significant role in distinguishing vegetation from other land covers. ELEVATION and SLOPE, two categorical features, are of low importance in this study, which is inconsistent with the conclusions of Jiang et al. The main reason is that this study focuses on the importance of different features for mapping the spatial distribution of Picea schrenkiana. Therefore, only the JM distance between Picea schrenkiana and other land-cover types (grassland, cropland, built, bare land, snow and ice, and waterbody) was calculated. The distance between each of the two categories was not calculated. The immediate consequence of this practice would be to obscure the topographic features of Picea schrenkiana‘s distribution in the TS. In addition, the JM distance algorithm only evaluates the classification features based on the separation degree between different categories and does not consider the correlation between different features [65]. As a result, the classification features selected based on the JM distance selection may still have strong correlations, affecting the classification accuracy. At the same time, VH ASM and VH ENT are also of high importance in this study, because VH can obtain more vegetation structure information than VV [66].

4.2. The Spatial Distribution of Picea schrenkiana

The statistical results of this study showed that Picea schrenkiana was concentrated in the middle TS and scattered in other areas (Figure 5). Topographic factors such as elevation, slope, and aspect affect the spatial distribution of vegetation through the redistribution of vegetation habitat elements such as heat and water. The biomass of Picea schrenkiana showed a unimodal trend with the increase in altitude [35,67], and the biomass of Picea schrenkiana reached its peak at an altitude of 2100–2400 m, indicating that the altitude range was the most suitable for Picea schrenkiana to grow. The results of this study also show that the distribution area of Picea schrenkiana first increases and then decreases with the increase in altitude. The distribution area of Picea schrenkiana occupies the largest proportion in the altitude range of 2100 to 2400 m, accounting for 29.7% (Figure 7, Elevation). This is consistent with previous findings. Picea schrenkiana occupies the largest distribution area of 33.9% in the slope range of 30–40°, and 66.4% of the Picea schrenkiana is distributed in the slope range of 20–40° (Figure 7, Slope). In addition, 85.9% of the Picea schrenkiana is distributed in the shade slope and semi-shade slope (Figure 7, Aspect). These are consistent with the descriptions in the literature [68].
To sum up, although this paper has achieved good results in mapping the spatial distribution of Picea schrenkiana in the TS of CA (Figure 5), the impact of shadows was a major challenge for this work. In this area, the effect of removing shadows by terrain correction is not obvious, and the method for identifying the actual object type in the shadow according to the difference in the distribution position of shadows on the two images will have a large workload [42]. How to avoid the influence of shadows and choose the optimal feature selection algorithm [65] to more accurately identify the spatial distribution pattern of Picea schrenkiana in the TS is a problem worthy of attention in future studies.

5. Conclusions

The advantage of this work is being able to map the spatial distribution of Picea schrenkiana with 10 m resolution in the entire TS (9.97 × 105 km2) by using GEE, the JM distance, the latest available satellite sensors, and a large number of training samples, overcoming the problem of insufficient computing capability for the large-scale monitoring of Picea schrenkiana. By using the Box-plot method to eliminate abnormal samples, the PA of Picea schrenkiana is increased by 2.95%, and the OA and Kappa scores of land-cover classification in the TS of CA are increased by 1.43% and 0.02. Compared to the other three seasons, the PA of Picea schrenkiana in summer increased by 0.24%–2.10%, the OA and Kappa scores of land-cover classification in the TS of CA by 0.48%–5.28% and 0.01–0.07. The separation obtained by the JM distance showed that spectral features and texture features played a key role in the mapping of Picea schrenkiana. The JM distance can seize the classification features that are most conducive to the mapping of Picea schrenkiana. The best classification features are RED, SWIR2, GREEN, BlUE, RDED1, CIRRU, SWIR1, MNDVI, AEROS, VH ASM, VH ENT, LSWI, MNDWI, EVI, VAPOR, RDED2, NDTI, NIR, VV ASM, RDED4, RDED3, NDre2, NDre1, NDVIre1, NDVI, VV ENT, VH, NDVIre3, NDI45, SLOPE, VV, and ELEVATION. Picea schrenkiana has a PA of 96.74%, and OA and Kappa scores of 91.93% and 0.89, respectively, for land-cover classification in the TS. Picea schrenkiana is concentrated in the middle TS and scattered in the remaining areas. In total, 85.7% of the Picea schrenkiana was distributed in the altitude range of 1500–2700 m, 66.4% of the Picea schrenkiana was distributed in the slope range of 20–40°, and 85.9% of the Picea schrenkiana was distributed in shady and semi-shady slopes.

Author Contributions

Conceptualization, Z.X. and C.X.; methodology, Z.X. and C.X.; software, F.X. and T.Y.; validation, F.X.; formal analysis, F.X. and Z.X.; investigation, Z.X., C.X., F.X. and T.Y.; resources, Z.X.; data curation, Z.X., C.X., F.X. and T.Y.; writing—original draft preparation, F.X.; writing—review and editing, Z.X., C.X. and F.X.; visualization, F.X.; supervision, Z.X.; project administration, Z.X.; funding acquisition, Z.X. and C.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (42067062) and the Natural Science Foundation of Xinjiang Uygur Autonomous Region (2022D01B234, 2022D01C399, and 2022E01052).

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Woodwell, G.M.; Whitaker, R.H.; Reiners, W.A.; Likens, G.E.; Botkin, D.B. The Biota and the World Carbon Budget: The terrestrial biomass appears to be a net source of carbon dioxide for the atmosphere. Science 1978, 199, 141–146. [Google Scholar] [CrossRef] [PubMed]
  2. Liu, W.; Wang, X.; Lu, F.; Ouyang, Z. Regional and global estimates of carbon stocks and carbon sequestration capacity in forest ecosystems: A review. Chin. J. Appl. Ecol. 2015, 26, 2881–2890. [Google Scholar]
  3. Prichard, S.J.; Stevens-Rumann, C.S.; Hessburg, P.F. Tamm Review: Shifting global fire regimes: Lessons from reburns and research needs. For. Ecol. Manag. 2017, 396, 217–233. [Google Scholar] [CrossRef]
  4. Pritchard, H.D. Asia’s glaciers are a regionally important buffer against drought. Nature 2017, 545, 169–174. [Google Scholar] [CrossRef] [PubMed]
  5. Xu, W.; Yang, L.; Chen, X.; Gao, Y.; Wang, L. Carbon storage, spatial distribution and the influence factors in Tianshan forests. Chin. J. Plant Ecol. 2016, 40, 364–373. [Google Scholar]
  6. Zhang, B.; Zhou, C.; Chen, S. The Geo-info-spectrum of Montane Altitudinal Belts in China. Acta Geogr. Sin. 2003, 58, 163–171. [Google Scholar]
  7. Li, C.; Zhang, C.; Luo, G.; Chen, X.; Maisupova, B.; Madaminov, A.A.; Han, Q.; Djenbaev, B.M. Carbon stock and its responses to climate change in Central Asia. Glob. Chang. Biol. 2015, 21, 1951–1967. [Google Scholar] [CrossRef]
  8. Wang, T.; Ren, G.; Chen, F.; Yuan, Y. An analysis of precipitation variations in the west-central Tianshan Mountains over the last 300 years. Quat. Int. 2015, 358, 48–57. [Google Scholar] [CrossRef]
  9. Li, Z.; Chen, Y.; Fang, G.; Li, Y. Multivariate assessment and attribution of droughts in Central Asia. Sci. Rep. 2017, 7, 1316. [Google Scholar] [CrossRef] [Green Version]
  10. Li, Z.; Chen, Y.; Li, W.; Deng, H.; Fang, G. Potential impacts of climate change on vegetation dynamics in Central Asia. J. Geophys. Res. Atmos. 2015, 120, 12345–12356. [Google Scholar] [CrossRef]
  11. Jiao, L.; Chen, K.; Wang, S.-J.; Liu, X.-P. Stability evaluation of radial growth of Picea schrenkiana in different age groups in response to climate change in the eastern Tianshan Mountains. J. Mt. Sci. 2020, 17, 1735–1748. [Google Scholar] [CrossRef]
  12. Gong, P. Remote sensing of environmental change over China: A review. Chin. Sci. Bull. 2012, 57, 2793–2801. [Google Scholar] [CrossRef] [Green Version]
  13. Weiss, M.; Jacob, F.; Duveiller, G. Remote sensing for agricultural applications: A meta-review. Remote Sens. Environ. 2020, 236, 111402. [Google Scholar] [CrossRef]
  14. Potapov, P.V.; Turubanova, S.A.; Hansen, M.C.; Adusei, B.; Broich, M.; Altstatt, A.; Mane, L.; Justice, C.O. Quantifying forest cover loss in Democratic Republic of the Congo, 2000–2010, with Landsat ETM+ data. Remote Sens. Environ. 2012, 122, 106–116. [Google Scholar] [CrossRef]
  15. Hao, S.; Chen, Y.; Liu, H.; Zhu, X.; Dawa, Z.; Li, W. Object-oriented Forest Classification of Linzhi County based on CART Decision Tree with Texture Information. Remote Sens. Technol. Appl. 2017, 32, 386–394. [Google Scholar]
  16. Zhang, D.; Yang, W.; Kang, D.; Zhang, H. Spatial-temporal characteristics and policy implication for non-grain production of cultivated land in Guanzhong Region. Land Use Policy 2023, 125, 106466. [Google Scholar] [CrossRef]
  17. Feng, G.; Masek, J.; Schwaller, M.; Hall, F. On the blending of the Landsat and MODIS surface reflectance: Predicting daily Landsat surface reflectance. IEEE Trans. Geosci. Remote Sens. 2006, 44, 2207–2218. [Google Scholar] [CrossRef]
  18. Walker, J.J.; de Beurs, K.M.; Wynne, R.H.; Gao, F. Evaluation of Landsat and MODIS data fusion products for analysis of dryland forest phenology. Remote Sens. Environ. 2012, 117, 381–393. [Google Scholar] [CrossRef]
  19. Baumann, P.; Mazzetti, P.; Ungar, J.; Barbera, R.; Wagner, S. Big Data Analytics for Earth Sciences: The EarthServer approach. Int. J. Digit. Earth 2015, 9, 3–29. [Google Scholar] [CrossRef]
  20. Liu, C.; Xiong, T.; Gong, P.; Qi, S. Improving large-scale moso bamboo mapping based on dense Landsat time series and auxiliary data: A case study in Fujian Province, China. Remote Sens. Lett. 2017, 9, 1–10. [Google Scholar] [CrossRef]
  21. Wang, S.; Di Tommaso, S.; Deines, J.M.; Lobell, D.B. Mapping twenty years of corn and soybean across the US Midwest using the Landsat archive. Sci. Data 2020, 7, 307. [Google Scholar] [CrossRef]
  22. Tan, S.; Wu, B.; Zhang, X. Mapping paddy rice in the Hainan Province using both Google Earth Engine and remote sensing images. J. Geo-Inf. Sci. 2019, 21, 937–947. [Google Scholar]
  23. Amani, M.; Mahdavi, S.; Afshar, M.; Brisco, B.; Huang, W.; Mohammad Javad Mirzadeh, S.; White, L.; Banks, S.; Montgomery, J.; Hopkinson, C. Canadian Wetland Inventory using Google Earth Engine: The First Map and Preliminary Results. Remote Sens. 2019, 11, 842. [Google Scholar] [CrossRef] [Green Version]
  24. Venkatappa, M.; Sasaki, N.; Han, P.; Abe, I. Impacts of droughts and floods on croplands and crop production in Southeast Asia—An application of Google Earth Engine. Sci. Total Environ. 2021, 795, 148829. [Google Scholar] [CrossRef] [PubMed]
  25. Guo, S.; Du, P.; Meng, Y.; Wang, X.; Tang, P.; Lin, C.; Xia, J. Dynamic monitoring on flooding situation in the Middle and Lower Reaches of the Yangtze River Region using Sentinel-1A time series. Natl. Remote Sens. Bull. 2021, 25, 2127–2141. [Google Scholar] [CrossRef]
  26. Zhang, X.; Liu, L.; Chen, X.; Gao, Y.; Jiang, M. Automatically Monitoring Impervious Surfaces Using Spectral Generalization and Time Series Landsat Imagery from 1985 to 2020 in the Yangtze River Delta. J. Remote Sens. 2021, 2021, 9873816. [Google Scholar] [CrossRef]
  27. Yang, L.; Zhang, J.; Gong, E.; Liu, M.; Ren, J.; Wang, Y. Analysis of spatio-temporal land-use patterns and the driving forces in Xi’an City using GEE and multi-source data. Trans. Chin. Soc. Agric. Eng. 2022, 38, 279–288. [Google Scholar]
  28. Liu, K.; Peng, L.; LI, X.; Tan, M.; Wang, S. Monitoring the Inter-annual Change of Mangroves based on the Google Earth Engine. J. Geo-Inf. Sci. 2019, 21, 731–739. [Google Scholar]
  29. Li, W.; Zhang, J.; Liu, S.; Chen, X.; Zou, H.; Bai, R.; Lyu, R. GEE-based Extraction of Rubber Forest Distribution in Main Producing Areas of Southeast Asia. Chin. J. Trop. Crops 2021, 42, 3345–3350. [Google Scholar]
  30. Xu, H.; Liu, C.; Wang, J.; QI, S. Study on extraction of citrus orchard in Gannan region based on google earth engine platform. J. Geo-Inf. Sci. 2018, 20, 396–404. [Google Scholar]
  31. Zhang, H.; Cong, L.; Liu, X.; Shao, K.; Li, X.-Z.; Li, R.-X. Soil Enzyme Activity in Picea schrenkiana and Its Relationship with Environmental Factors in the Tianshan Mountains, Xinjiang. Huanjing Kexue 2021, 42, 403–410. [Google Scholar] [PubMed]
  32. Li, D.-L.; Yang, Y.; Yang, S.; Chen, Y.-K. The complete chloroplast genome sequence of Keteleeria hainanensis (Pinaceae). Mitochondrial DNA Part B 2019, 4, 2934–2935. [Google Scholar] [CrossRef] [Green Version]
  33. Li, L.; Chang, Y.; Xu, Z. Stoichiometric characteristics of Picea schrenkiana forests with a hydrothermal gradient and their correlation with soil physicochemical factors on Tianshan Mountain. Acta Ecol. Sin. 2018, 38, 8139–8148. [Google Scholar]
  34. Wang, T.; Ren, H.; Ma, K. A preliminary study on Picea schrenkiana population dynamics in the central Tianshan Mountains, northwestern China. Ecol. Environ. 2006, 3, 564–571. [Google Scholar]
  35. Luo, Q.; Xu, Z.; Xu, Z. Estimation and spatial pattern analysis of biomass of Picea schrenkiana forests. Acta Ecol. Sin. 2020, 40, 5288–5297. [Google Scholar]
  36. Ding, X.; Xu, Z.; Wang, Y. Application of MaxEnt Model in Biomass Estimation: An Example of Spruce Forest in the Tianshan Mountains of the Central-Western Part of Xinjiang, China. Forests 2023, 14, 953. [Google Scholar] [CrossRef]
  37. Chen, Y.; Li, Z.; Fang, G.; Deng, H. Impact of climate change on water resources in the Tianshan Mountians, Central Asia. Acta Geogr. Sin. 2017, 72, 18–26. [Google Scholar]
  38. Cai, X.; Li, Z.; Zhang, H.; Xu, C. Vulnerability of glacier change in Chinese Tianshan Mountains. Acta Geogr. Sin. 2021, 76, 2253–2268. [Google Scholar]
  39. Liu, C.; Ding, L. A Primary Calculation of Temperature and Precipitation in Tianshan Mountains, China. J. Glaciol. Geocryol. 1988, 10, 151–159. [Google Scholar]
  40. Lan, J.; Xiao, Z.; Li, J.; Zhang, Y. Biomass allocation and allometric growth of Picea schrenkiana in Tianshan Mountains. J. Zhejiang AF Univ. 2020, 37, 416–423. [Google Scholar]
  41. Li, Z.; Luo, Q.; Xu, Z. Effects of stand density on the biomass allocation and tree height-diameter allometric growth of Picea schrenkiana forest on the northern slope of the western TS. Arid Zone Res. 2021, 38, 545–552. [Google Scholar]
  42. Jiang, J.; Zhu, W.; Qiao, K.; Jiang, Y. An Identification Method for Mountains Coniferous in Tian- shan with Sentinel-2 Data. Remote Sens. Technol. Appl. 2021, 36, 847–856. [Google Scholar]
  43. Torres, R.; Snoeij, P.; Geudtner, D.; Bibby, D.; Davidson, M.; Attema, E.; Potin, P.; Rommen, B.; Floury, N.; Brown, M.; et al. GMES Sentinel-1 mission. Remote Sens. Environ. 2012, 120, 9–24. [Google Scholar] [CrossRef]
  44. Gulácsi, A.; Kovács, F. Sentinel-1-Imagery-Based High-Resolution Water Cover Detection on Wetlands, Aided by Google Earth Engine. Remote Sens. 2020, 12, 1614. [Google Scholar] [CrossRef]
  45. Zhang, L.; Gong, Z.; Wang, Q.; Jin, D.; Wang, X. Wetland mapping of Yellow River Delta wetlands based on multi-feature optimization of Sentinel-2 images. J. Remote Sens. 2019, 23, 313–326. [Google Scholar]
  46. Erasmi, S.; Twele, A. Regional land cover mapping in the humid tropics using combined optical and SAR satellite data—A case study from Central Sulawesi, Indonesia. Int. J. Remote Sens. 2009, 30, 2465–2478. [Google Scholar] [CrossRef]
  47. Haralick, R.M. Textural features for image classification. IEEE Trans.Syst. Man Cybern. 1973, 3, 610–621. [Google Scholar] [CrossRef] [Green Version]
  48. Stromann, O.; Nascetti, A.; Yousif, O.; Ban, Y. Dimensionality Reduction and Feature Selection for Object-Based Land Cover Classification based on Sentinel-1 and Sentinel-2 Time Series Using Google Earth Engine. Remote Sens. 2019, 12, 76. [Google Scholar] [CrossRef] [Green Version]
  49. David, F.N.; Tukey, J.W. Exploratory Data Analysis. Biometrics 1977, 33, 768. [Google Scholar] [CrossRef]
  50. Frigge, M.; Hoaglin, D.C.; Iglewicz, B. Some Implementations of the Boxplot. Am. Stat. 1989, 43, 50–54. [Google Scholar]
  51. Grönroos, C. The perceived service quality concept—A mistake? Manag. Serv. Qual. 2001, 11, 150–152. [Google Scholar] [CrossRef]
  52. Bellman, R. Dynamic Programming. Science 1966, 153, 34–37. [Google Scholar] [CrossRef]
  53. Somers, B.; Asner, G.P. Multi-temporal hyperspectral mixture analysis and feature selection for invasive species mapping in rainforests. Remote Sens. Environ. 2013, 136, 14–27. [Google Scholar] [CrossRef]
  54. Hu, Q.; Sulla-Menashe, D.; Xu, B.; Yin, H.; Tang, H.; Yang, P.; Wu, W. A phenology-based spectral and temporal feature selection method for crop mapping from satellite time series. Int. J. Appl. Earth Obs. Geoinf. 2019, 80, 218–229. [Google Scholar] [CrossRef]
  55. Chen, L.; Jin, Z.; Michishita, R.; Cai, J.; Yue, T.; Chen, B.; Xu, B. Dynamic monitoring of wetland cover changes using time-series remote sensing imagery. Ecol. Inform. 2014, 24, 17–26. [Google Scholar] [CrossRef]
  56. Wardlow, B.D.; Egbert, S.L.; Kastens, J.H. Analysis of time-series MODIS 250 m vegetation index data for crop classification in the U.S. Central Great Plains. Remote Sens. Environ. 2007, 108, 290–310. [Google Scholar] [CrossRef] [Green Version]
  57. Arvor, D.; Jonathan, M.; Meirelles, M.S.P.; Dubreuil, V.; Durieux, L. Classification of MODIS EVI time series for crop mapping in the state of Mato Grosso, Brazil. Int. J. Remote Sens. 2011, 32, 7847–7871. [Google Scholar] [CrossRef]
  58. Vanniel, T.; McVicar, T.; Datt, B. On the relationship between training sample size and data dimensionality: Monte Carlo analysis of broadband multi-temporal classification. Remote Sens. Environ. 2005, 98, 468–480. [Google Scholar] [CrossRef]
  59. Murakami, T.; Ogawa, S.; Ishitsuka, N.; Kumagai, K.; Saito, G. Crop discrimination with multitemporal SPOT/HRV data in the Saga Plains, Japan. Int. J. Remote Sens. 2010, 22, 1335–1348. [Google Scholar] [CrossRef]
  60. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
  61. Rodriguez-Galiano, V.F.; Ghimire, B.; Rogan, J.; Chica-Olmo, M.; Rigol-Sanchez, J.P. An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS J. Photogramm. Remote Sens. 2012, 67, 93–104. [Google Scholar] [CrossRef]
  62. Belgiu, M.; Drăguţ, L. Random forest in remote sensing: A review of applications and future directions. ISPRS J. Photogramm. Remote Sens. 2016, 114, 24–31. [Google Scholar] [CrossRef]
  63. Teluguntla, P.; Thenkabail, P.S.; Oliphant, A.; Xiong, J.; Gumma, M.K.; Congalton, R.G.; Yadav, K.; Huete, A. A 30-m landsat-derived cropland extent product of Australia and China using random forest machine learning algorithm on Google Earth Engine cloud computing platform. ISPRS J. Photogramm. Remote Sens. 2018, 144, 325–340. [Google Scholar] [CrossRef]
  64. Tateishi, T.P.R.; Ishiyama, T.; Honda, Y. Relationships between percent vegetation cover and vegetation indices. Int. J. Remote Sens. 1998, 19, 3519–3535. [Google Scholar]
  65. Zhu, M.; Li, G.; Peng, Z. Remote sensing crop classification method based on feature selection. Sci. Surv. Mapp. 2022, 47, 122–128. [Google Scholar]
  66. Ning, X.; Chang, W.; Wang, H.; Zhang, H.; Zhu, Q. Extraction of marsh wetland in Heilongjiang Basin based on GEE and multi-source remote sensing data. Natl. Remote Sens. Bull. 2022, 26, 386–396. [Google Scholar] [CrossRef]
  67. Lei, Z.; Pengtao, Y.; Yanhui, W.; Shunli, W.; Xiande, L. Biomass Change of Middle Aged Forest of Qinghai Spruce along an Altitudinal Gradient on the North Slope of Qilian Mountains. Sci. Silvae Sin. 2015, 51, 4–10. [Google Scholar]
  68. Wang, X. Picea schrenkiana survey on the north slope of Tianshan Mountains. Xinjiang Agric. Sci. 1963, 7, 285–289. [Google Scholar]
Figure 1. Location of the study area.
Figure 1. Location of the study area.
Forests 14 01373 g001
Figure 2. Statistics of Sentinel-2 valid remote-sensing data from 2019 to 2020. (a) Spring; (b) Summer; (c) Autumn; (d) Winter.
Figure 2. Statistics of Sentinel-2 valid remote-sensing data from 2019 to 2020. (a) Spring; (b) Summer; (c) Autumn; (d) Winter.
Forests 14 01373 g002
Figure 3. The structure of the Box-plot.
Figure 3. The structure of the Box-plot.
Forests 14 01373 g003
Figure 4. Scatter plot of extraction values of Picea schrenkiana samples in Sentinel-2 bands (Blue, Green, Red) before and after eliminating abnormal samples. (The horizontal axis represents the number of samples, the vertical axis represents the band extraction value; (a,c,e) represent the scatter plots of extracted values of Picea schrenkiana in bands BLUE, GREEN, and RED before eliminating outliers, respectively; (b,d,f) represent the scatter plots of extracted values of Picea schrenkiana in bands BLUE, GREEN, and RED after eliminating outliers, respectively).
Figure 4. Scatter plot of extraction values of Picea schrenkiana samples in Sentinel-2 bands (Blue, Green, Red) before and after eliminating abnormal samples. (The horizontal axis represents the number of samples, the vertical axis represents the band extraction value; (a,c,e) represent the scatter plots of extracted values of Picea schrenkiana in bands BLUE, GREEN, and RED before eliminating outliers, respectively; (b,d,f) represent the scatter plots of extracted values of Picea schrenkiana in bands BLUE, GREEN, and RED after eliminating outliers, respectively).
Forests 14 01373 g004
Figure 5. Land-cover classification (a) and spatial distribution of Picea schrenkiana (b) in the Tianshan Mountains of Central Asia. The study selected six different regions to display the extraction results of Picea schrenkiana, with A–F representing the positions of six different regions.
Figure 5. Land-cover classification (a) and spatial distribution of Picea schrenkiana (b) in the Tianshan Mountains of Central Asia. The study selected six different regions to display the extraction results of Picea schrenkiana, with A–F representing the positions of six different regions.
Forests 14 01373 g005
Figure 6. Local classification results of Picea schrenkiana and their corresponding Sentinel-2 images. (AF) represents the extraction results of Picea schrenkiana in six different regions, respectively.
Figure 6. Local classification results of Picea schrenkiana and their corresponding Sentinel-2 images. (AF) represents the extraction results of Picea schrenkiana in six different regions, respectively.
Forests 14 01373 g006
Figure 7. Distribution of Picea schrenkiana in different elevations, slopes, and aspect.
Figure 7. Distribution of Picea schrenkiana in different elevations, slopes, and aspect.
Forests 14 01373 g007aForests 14 01373 g007b
Table 1. The main elevation range of Picea schrenkiana on the northern slope of the Tianshan Mountains.
Table 1. The main elevation range of Picea schrenkiana on the northern slope of the Tianshan Mountains.
ElevationReference
1500–2700 mWang, Ren et al. [34]
1600–2800 mLi, Chang et al. [33]
1600–2800 mLan, Xiao et al. [40]
1500–2800 mLuo, Xu et al. [35]
1500–2800 mLi, Luo et al. [41]
1400–2800 mJiang, Zhu et al. [42]
Table 2. Sentinel-1/2 image selection schedule.
Table 2. Sentinel-1/2 image selection schedule.
SeasonsYearsMonthsCloudy _Pixel_Percentage
Spring2019–20204–630
Summer2019–20207–920
Autumn2019–202010–1235
Winter2019–20201–360
Table 3. The situation of Sample Collection.
Table 3. The situation of Sample Collection.
TypePicea schrenkianaGrasslandCroplandBuiltBare LandSnow and IceWaterbody
Numbers152430365335171704275570
Table 4. Summary of classification features applied in this research.
Table 4. Summary of classification features applied in this research.
CategoriesFeaturesCentral Wavelength/Indices FormulaData Source
Radar featuresVVvertical transmit/vertical receiveSentinel-1
VHvertical transmit/horizontal receiveSentinel-1
Spectral featuresAEROS443 nmSentinel-2
BLUE490 nmSentinel-2
GREEN560 nmSentinel-2
RED665 nmSentinel-2
RDED1705 nmSentinel-2
RDED2740 nmSentinel-2
RDED3783 nmSentinel-2
NIR842 nmSentinel-2
RDED4865 nmSentinel-2
VAPOR940 nmSentinel-2
CIRRU1375 nmSentinel-2
SWIR11610 nmSentinel-2
SWIR22190 nmSentinel-2
Red-edge featuresNDVIre1(RDED4 − RDED1)/(RDED4 + RDED1)Sentinel-2
NDVIre2(RDED4 − RDED2)/(RDED4 + RDED2)Sentinel-2
NDVIre3(RDED4 − RDED3)/(RDED4 + RDED3)Sentinel-2
NDre1(RDED2 − RDED1)/(RDED2 + RDED1)Sentinel-2
NDre2(RDED3 − RDED1)/(RDED3 + RDED1)Sentinel-2
Spectral indicesNDVI(NIR RED)/(NIR + RED)Sentinel-2
EVI2.5 × (NIR RED)/(NIR + 6.0 × RED −7.5 × BLUE + 1)Sentinel-2
MNDVI(RED GREEN)/(RED + GREEN)Sentinel-2
NDWI(GREEN NIR)/(GREEN + NIR)Sentinel-2
LSWI(NIR SWIR1)/(NIR + SWIR1)Sentinel-2
MNDWI(GREEN SWIR1)/(GREEN + SWIR1)Sentinel-2
NDTI(SWIR1 SWIR2)/(SWIR1 + SWIR2)Sentinel-2
NDI45(RDED1 RED)/(RDED1 + RED)Sentinel-2
Texture featuresVV ASMVV Angular Second MomentSentinel-1
VV CONVV ContrastSentinel-1
VV CORRVV CorrelationSentinel-1
VV SVARVV Sum VarianceSentinel-1
VV ENTVV EntropySentinel-1
VH ASMVH Angular Second MomentSentinel-1
VHCONVH ContrastSentinel-1
VH CORRVH CorrelationSentinel-1
VH SVARVH Sum VarianceSentinel-1
VH ENTVH EntropySentinel-1
Terrain featuresELEELEVATIONSRTM
SLOSLOPESRTM
ASPASPECTSRTM
SHAHILL SHADESRTM
Table 5. Eight scenarios for mapping the spatial distribution of Picea schrenkiana.
Table 5. Eight scenarios for mapping the spatial distribution of Picea schrenkiana.
ScenariosRadar FeaturesSpectral FeaturesRed-Edge FeaturesSpectral IndicesTexture FeaturesTerrain FeaturesFeatures
Optimization
S1
S2
S3
S4
S5
S6
S7
S8
Table 6. Classification accuracy of abnormal samples before and after removal.
Table 6. Classification accuracy of abnormal samples before and after removal.
TypePicea schrenkianaGrasslandCroplandBuiltBare LandSnow and IceWaterbodyOAKappa
BeforePA/%93.9389.9366.8881.4085.8970.0091.5287.10%0.83
UA/%95.4384.5279.8590.3283.4778.8796.79
AfterPA/%96.8889.8962.9480.6986.9483.1394.8088.53%0.85
UA/%93.7587.6581.0881.8285.7890.7995.35
Note: Darker red indicates lower values, while darker green indicates higher values.
Table 7. Classification accuracy of single seasonal images.
Table 7. Classification accuracy of single seasonal images.
TypePicea
schrenkiana
GrasslandCroplandBuiltBare LandSnow and IceWaterbodyOAKappa
SpringPA/%95.8093.8579.6188.1088.2473.5696.8991.19%0.88
UA/%96.7189.9190.9893.2887.8881.0197.50
SummerPA/%96.0494.9174.3489.6887.6385.0696.8991.67%0.89
UA/%96.2688.5291.8791.1391.5390.2498.73
AutumnPA/%93.9491.4875.0092.0684.7970.1187.5888.36%0.85
UA/%95.9585.3290.4885.9385.4882.4398.60
WinterPA/%95.3493.1469.7487.3078.0962.0780.7586.39%0.82
UA/%96.2480.9788.3388.7184.0687.1098.48
Note: Darker red indicates lower values, while darker green indicates higher values.
Table 8. The JM distance of features and the accuracy of classification results of different numbers of features.
Table 8. The JM distance of features and the accuracy of classification results of different numbers of features.
FeatureJM DistanceKappaNumbers
RED1.010.411
SWIR20.940.6082
GREEN0.90.6593
BLUE0.890.734
RDED10.870.7645
CIRRU0.820.7936
SWIR10.810.8217
MNDVI0.720.8218
AEROS0.70.8289
VH ASM0.680.82910
VH ENT0.60.83611
LSWI0.60.83712
MNDWI0.590.84813
EVI0.580.85614
VAPOR0.580.85815
RDED20.560.86416
NDTI0.540.86917
NIR0.520.86918
VV ASM0.520.86219
RDED40.510.86820
RDED30.510.87121
NDre20.490.87322
NDre10.490.86923
NDVIre10.470.8724
NDVI0.470.86625
VV ENT0.460.86326
VH0.390.87327
NDVIre30.30.8728
NDI450.30.86929
SLOPE0.280.88630
VV0.260.8931
ELEVATION0.240.89432
VV CON0.230.88933
VV SVAR0.220.8934
NDVIre20.180.88435
ASPECT0.130.89436
VH CON0.110.89237
VH SVAR0.10.89438
NDWI0.090.8939
HILL SHADE0.070.89440
VH CORR0.020.8941
VV CORR0.020.89842
Note: Darker red indicates lower values, while darker green indicates higher values.
Table 9. Jeffries–Matusita distance between different types of features.
Table 9. Jeffries–Matusita distance between different types of features.
Type of FeatureThe Feature with the
Highest Degree of
Separation (Ranking)
The Feature with the
Lowest Degree of
Separation (Ranking)
The Average
JM Distance
Spectral featuresRED (1)RDED3 (21)0.74
Texture featuresVH ASM (10)VV ENT (26)0.57
Spectral indicesMNDVI (8)NDI45 (29)0.54
Red-edge featuresNDre2 (22)NDVIre3 (28)0.44
Radar featuresVH (27)VV (31)0.32
Terrain featuresSLOPE (30)ELEVATION (32)0.26
Note: Darker red indicates lower values, while darker green indicates higher values.
Table 10. Classification accuracy statistics under different scenarios.
Table 10. Classification accuracy statistics under different scenarios.
TypePicea schrenkianaGrasslandCroplandBuiltBare LandSnow and IceWaterbodyOAKappa
Scenario 1PA/%96.2792.9061.8475.486.0081.6195.0388.75%0.85
UA/%96.2785.3383.1986.3688.1587.6596.23
Scenario 2PA/%95.5792.962.576.1988.0382.7694.4189.14%0.86
UA/%95.7986.3685.5984.9687.5091.1496.82
Scenario 3PA/%96.5094.0865.7975.4087.6381.6195.0389.84%0.87
UA/%95.8387.1785.4784.8288.8993.4296.84
Scenario 4PA/%96.7493.7363.8279.3785.8081.6195.0389.45%0.86
UA/%95.6286.4688.1882.6489.6287.6596.23
Scenario 5PA/%96.7492.5466.4578.5787.0280.4695.0389.36%0.86
UA/%95.4086.7086.3286.8487.7389.7496.84
Scenario 6PA/%97.4493.7374.3484.9288.2479.3196.2791.10%0.88
UA/%95.8789.1990.4089.1788.4194.5297.48
Scenario 7PA/%96.7495.5073.6891.2787.6383.9195.6591.93%0.89
UA/%96.9689.1794.1289.1591.7286.998.09
Scenario 8PA/%97.2095.6273.0386.5188.0383.9195.6591.84%0.89
UA/%96.5388.7993.2888.6291.7591.2598.72
Note: Darker red indicates lower values, while darker green indicates higher values.
Table 11. Confusion matrix of scenario 7.
Table 11. Confusion matrix of scenario 7.
Picea schrenkianaGrasslandCroplandBuiltBare LandSnow and IceWaterbody
Picea schrenkiana4151300000
Grassland138073344305
Cropland051122000
Built027115500
Bare Land01805432142
Snow and Ice000011730
Waterbody100020154
PA/%96.7495.573.6891.2787.6383.9195.65
UA/%96.9689.1794.1289.1591.7286.9098.09
OA = 91.93%Kappa = 0.89
Note: Darker red indicates lower values, while darker green indicates higher values.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xu, F.; Xu, Z.; Xu, C.; Yu, T. Automatic Extraction of the Spatial Distribution of Picea schrenkiana in the Tianshan Mountains Based on Google Earth Engine and the Jeffries–Matusita Distance. Forests 2023, 14, 1373. https://doi.org/10.3390/f14071373

AMA Style

Xu F, Xu Z, Xu C, Yu T. Automatic Extraction of the Spatial Distribution of Picea schrenkiana in the Tianshan Mountains Based on Google Earth Engine and the Jeffries–Matusita Distance. Forests. 2023; 14(7):1373. https://doi.org/10.3390/f14071373

Chicago/Turabian Style

Xu, Fujin, Zhonglin Xu, Changchun Xu, and Tingting Yu. 2023. "Automatic Extraction of the Spatial Distribution of Picea schrenkiana in the Tianshan Mountains Based on Google Earth Engine and the Jeffries–Matusita Distance" Forests 14, no. 7: 1373. https://doi.org/10.3390/f14071373

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop