Next Article in Journal
Techno-Economic Evaluation of Ozone Application to Reduce Sludge Production in Small Urban WWTPs
Next Article in Special Issue
RSEI-Based Modeling of Ecological Security and Its Spatial Impacts on Soil Quality: A Case Study of Dayu, China
Previous Article in Journal
Key Aspects for IT-Services Integration in Urban Transit Service of Medium-Sized Cities: A Qualitative Exploratory Study in Colombia
Previous Article in Special Issue
Analysis of China’s Urban Innovation Connection Network Evolution: A Case Study of Henan Province
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Spatially Balanced Sampling for Validation of GlobeLand30 Using Landscape Pattern-Based Inclusion Probability

1
College of Surveying and Geo-Informatics, Tongji University, Shanghai 200092, China
2
Shanghai Institute of Intelligent Science and Technology, Tongji University, Shanghai 200092, China
3
Shanghai Key Laboratory of Space Mapping and Remote Sensing for Planetary Exploration, Tongji University, Shanghai 200092, China
*
Author to whom correspondence should be addressed.
Sustainability 2022, 14(5), 2479; https://doi.org/10.3390/su14052479
Submission received: 26 January 2022 / Revised: 17 February 2022 / Accepted: 17 February 2022 / Published: 22 February 2022
(This article belongs to the Special Issue Urban Expansion and Its Effect on Soil Sustainability)

Abstract

:
Global and local land-cover mapping products provide important data on land surface. However, the accuracy of land-cover products is the key issue for their further scientific application. There has been neglect of the relationship between inclusion probability and spatial heterogeneity in traditional spatially balanced sampling. The aim of this paper was to propose an improved spatially balanced sampling method using landscape pattern-based inclusion probability. Compared with other global land-cover datasets, Globeland30 has the advantages of high resolution and high classification accuracy. A two-stage stratified spatially balanced sampling scheme was designed and applied to the regional validation of GlobeLand30 in China. In this paper, the whole area was divided into three parts: the Tibetan Plateau region, the Northwest China region, and the East China region. The results show that 7242 sample points were selected, and the overall accuracy of GlobeLand30-2010 in China was found to be 80.46%, which is close to the third-party assessment accuracy of GlobeLand30. This method improves the representativeness of samples, reduces the classification error of remote sensing, and provides better guidance for biodiversity and sustainable development of environment.

1. Introduction

Land cover is the natural basis for the survival and development of human beings, and it affects the energy and material circulation of the Earth. It is also intrinsically related to global climate change, biological habitat protection, sustainable practices suitable for increasing soil protection and reducing pollution, and so on [1,2,3,4,5,6,7]. Global-scale land-cover mapping and monitoring is a complex remote sensing project. At present, the United States and the European Union have successively developed five sets of more commonly used global land-cover products, namely: (1) IGBP-DISCover (International Geosphere–Biosphere Program’s Data and Information System Cover) produced by the U.S. Geological Survey [8,9]; (2) UMD (University of Maryland) produced by the University of Maryland [10]; (3) MODIS (Moderate resolution Imaging Spectro-radiometer) produced by Boston University [11,12]; (4) GLC2000 (Global Land Cover 2000) produced by the European Commission’s Joint Research Centre [13]; (5) GlobCover (Global Land Cover Product) produced by the European Space Agency [14]. In order to support the development of global change research, China completed the world’s first 30 m global land-cover product GlobeLand30 in 2014. Someone generated nine LC maps of China from seven global LC datasets (IGBP DISCover, UMD, GLC, MCD12Q1, GLCNMO, CCI-LC, and GlobeLand30), while Globeland30 had the highest accuracy [15]. In addition, there were some other high-precision land-cover products made in China for classification accuracy evaluation [16,17,18].
The key issue for the further scientific application of land-cover products is accuracy. Without validation, any land-cover product, either at a global scale or at a regional scale, is only an unconfirmed hypothesis [19]. It is therefore necessary to ensure product quality by assessing the similarities between the classification results and the real land surface through quantitative means. At present, basic accuracy assessment is carried out by means of a confusion matrix, which is derived from the spatial comparison between classified data and reference data. A confusion matrix can be constructed by pixel-to-pixel comparison or by a sampling survey. Reference data of a whole region are not usually available, so that a sampling strategy is needed for selecting samples from the study area. The sample size (number of samples) can be determined according to the available resources/time/cost or by a rigorous statistical model [20,21]. Stratified sampling and cluster sampling are the most widely used sampling methods for land-cover product validation [22,23,24,25,26]. However, how to consider the land-cover spatial autocorrelation in the sample distribution and increase the flexibility of the sample size is still one of the main challenges for land-cover product validation [27,28,29]. Most populations are in fact distributed over space but many sampling designs, such as simple random sampling (SRS), do not incorporate the spatial aspect into the design [30]. The spatial distribution of a population represents an important tool in sampling designs that use the geographical coordinates of the units in the frame as auxiliary information [31]. In particular, our interest is focused on probability samples that are well spread over the population in every dimension, which in the recent literature is defined as spatially balanced samples [32]. Spatially balanced sampling can produce evenly distributed samples, and it also allows the user to use a priori knowledge to create an efficient sampling scheme [28,33]. This method has been applied to natural and environmental resource surveying [34,35,36].
The most common simple random sampling (SRS), although simple in theory, has low levels of error and of computational complexity of the target estimation, and it is difficult to operate in practice. For example, it needs a sampling frame containing all the basic units in the population. In addition, the variability among SRS samples is very high, and the spatial distribution of samples is uneven, resulting in poor representativeness of samples and unsatisfactory estimation accuracy; although cluster sampling can save costs in a large-scale investigation, it requires small differences among primary units (PSU) and large differences among secondary units (SSU), which is very difficult to achieve. The variability between the actually collected samples is high, and many redundant samples with high similarity will be obtained. Compared with the methods above, systematic sampling can obtain samples with uniform spatial distribution, but it cannot obtain unbiased estimation of estimators, and the adaptability of systematic sampling is very poor. If one is to increase or decrease samples, one must increase or decrease the grid density, which is inflexible. The principle of stratified sampling is to keep the intra-layer variance as small as possible and the inter-layer variance as large as possible, which makes its estimation accuracy higher and the variability between samples less. However, there is also the problem of uneven spatial distribution of samples; spatial balanced sampling is a random, equiprobable, efficient, low-cost, well-balanced, and adaptable sampling method. However, spatial balanced sampling still gives the same inclusion probability to the patches with different spatial heterogeneity. This cannot improve the phenomenon that the land-covers with broken spatial structures are more wrong than the classification results with regular spatial structures. In this paper, the idea of unequal inclusion probability is introduced on the basis of spatial balanced sampling through the landscape shape index. The inclusion probability of spatial units in patches is directly proportional to the degree of spatial heterogeneity. The higher the spatial heterogeneity of patches (the larger the landscape shape index), the higher the inclusion probability of spatial units in patches. The reverse is also true. This makes up for the defect of spatially balanced sampling and makes the sampling design more reasonable.
The classification error of remote sensing images follows certain rules: for those objects with a complete structure and large area, the classification error is small; for structural fragmentation and fragmentation, there is a greater classification error. In view of this phenomenon, the sampling method proposed in this paper can reasonably consider the relationship between the inclusion probability and the degree of spatial heterogeneity. By improving the inclusion probability of patches with strong heterogeneity and severe fragmentation, and increasing the number of their sample points, a more reasonable accuracy evaluation can be obtained. Considering the spatial characteristics of the land-cover data in a 30 m resolution, we propose a two-stage stratified spatially balanced sampling scheme. In order to ensure the representativeness of the sample points, the landscape shape index (LSI) was introduced into the unequal inclusion probability design in spatially balanced sampling. Then, in the regional validation of GlobeLand30 in China, we considered the validation in three parts, namely, sampling design, response design, and accuracy assessment. In addition, we also studied the factors affecting the product accuracy, including reference data, sampling methods, classification methods and phenological conditions. Finally, we provide suggestions and guidance for the improvement of land-cover products.

2. Methods

2.1. Spatially Balanced Sampling

At present, the most popular algorithm for spatially balanced sampling is the generalized random tessellation stratified (GRTS) method. Reversed randomized quadrant-recursive raster (RRQRR) is a refinement of or variation on GRTS. The idea behind GRTS is to map the two-dimensional locations into one dimension while preserving some spatial order. The sample is then selected in one dimension, using systematic pps sampling, and mapped back into two dimensions [37,38]. A primary difference between the RRQRR and GRTS algorithms is that GRTS is feature based, so that quadrant-recursion occurs only in areas where features are located and only to a level k that resolves features in the sampling frame so that the sum of inclusion probabilities in a cell is less than 1. In contrast, RRQRR uses a raster to provide a discrete, but fine-grained, approximation of features—which can include any combination of geographic feature types. Again, the RRQRR and GRTS approaches differ. In GRTS, inclusion probabilities are used as weights, before reversing the Morton addresses. In RRQRR, inclusion probabilities are relative inclusion probabilities that specify the probability that a given location (raster cell) will be selected, relative to other locations [38]. Since GRTS and RRGRR share a general resemblance, with only minor differences, we take RRQRR as an example and list the main steps of RRQRR below (see Figure 1):
  • Morton code
Divide the study area into four parts, and then assign every unit a sequencing number of 1, 2, 3, or 4, which follows the “N” or “Z” order, thus obtaining the first layer L 1 . Do the same in the four parts of L 1 , thus obtaining the second layer L 2 . Go on until all the units in the study area are included. From the bottom to the top, each unit has a series of layer numbers which make a unique code, i.e., the Morton code.
2.
Linear address
Starting from the top-left corner, the Morton code of each unit is converted into a one-dimensional linear address which also follows the same “N” or “Z” order.
3.
Reversed Morton code
Reverse the layer numbers of each unit, and the reversed Morton code is made. For example, the reversed Morton code of M 1234 is M 4321 . At the same time, the linear address of the corresponding unit is also changed.
4.
Hierarchical random sorting
Firstly, undertake random sorting of the four parts of L 1 , then carry on random sorting of all the four pars of L 2 until all four parts of L k are finished. At the same time, the linear address of each unit is changed again.
5.
Sampling
In order to ensure that samples of different importance have different probabilities of being chosen, an inclusion probability raster is introduced (the value of the raster means the possibility of a unit being sampled relative to others). By the filtering calculation between the inclusion probability and the random raster ([0, 1] random even distribution), units with a higher inclusion probability than random probability are saved. According to the new linear address produced by Step 4, randomly choose a number as a starting point, and then extract n consecutive units. Spatially balanced sampling is thus achieved.
The inclusion probability raster plays a decisive role in spatially balanced sampling. For different applications, the factors considered in the design of the inclusion probability raster are distinct. For example, in field surveys, accessibility is the key factor influencing the design of the sampling method, and road network data are usually introduced. By defining the accessibility of the road buffers in different ranges, points far from the road have a small possibility of being chosen, thus avoiding the no-response sampling units. For land-cover products of a 30 m resolution, the landscape in transitional areas often exhibits high fragmentation. If patches of different fragmentation degrees are given an equal inclusion probability, the distribution of the sample points in spatially balanced sampling will be as shown in Figure 2. In contrast, if the patches are given different inclusion probabilities according to the complexity, the distribution of the sample points in spatially balanced sampling will be as shown in Figure 3. Therefore, the sampling method proposed in this paper reasonably considers the relationship between the inclusion probability of samples and the degree of spatial heterogeneity. The patches with large heterogeneity can improve the sampling probability of samples. Otherwise, reduce the inclusion probability of the samples.

2.2. Using the Landscape Pattern-Based Inclusion Probability to Improve Spatially Balanced Sampling

As is well known, the classification errors of remotely sensed images are not evenly distributed, but follow certain rules. For spatial objects with a more regular structure and larger area, the classification results are likely to be better, while objects with more fragmented and sporadic structures are more likely to be misclassified. Therefore, this paper introduces the concept of patches to landscape patterns, regarding land-cover types as patches or corridors with different sizes and shapes. Considering the relationship between landscape complexity and classification error, a landscape which is made up of complex, heterogeneous, and broken patches will have a larger inclusion probability than a landscape which is composed of simple, uniform, and regular patches. The complexity of the landscape can be measured by the LSI, which is calculated by the perimeter ratio of a patch to its equal size square. The calculation of the LSI is as shown in Equation (1):
LSI = 0.25 E A ,
where E means the perimeter of the patch, A means the area of the patch, and the range of the LSI is equal to or larger than 1. When the LSI equals 1, this indicates that the shape of the patch is the simplest square, and when the LSI value increases, the shape of the patch becomes more complex and irregular.
Since the inclusion probability indicates the possibility of a unit being sampled relative to other units in the population, it is necessary to normalize the LSI. The steps are as follows:
  • Calculate the LSI values of all the water patches in the area and draw the corresponding frequency distribution histogram, as shown in Figure 4. Affected by the spatial distribution of the land cover on the surface, the number of broken patches is large but the area proportion is small, while the number of complex patches is small but the area proportion is large. Therefore, the frequency histogram usually shows a left-sided distribution, and the statistics such as mean, median, and mode are small and not suitable for stratification.
  • Therefore, in order to narrow the gap between the patches, they are clustered into four groups according to the LSI values, as shown in Figure 5b.
  • The higher the landscape fragmentation, the greater the probability of misclassification. Therefore, the unequal inclusion probability is introduced according to the fragmentation of the patches. As shown in Figure 5c, the patches are divided into four layers, and the inclusion probabilities of 1, 0.8, 0.6, and 0.4 are given, respectively, according to the LSI. The patches with strong heterogeneity and large fragmentation have more samples, thus improving the representativeness of the sample points.

3. Experiment

There are three basic components of the accuracy assessment of land-cover products, namely, sampling design, response design, and analysis [39,40,41]. Sampling design refers to how to select the typical sample points from the study area. Response design means how to interpret the reference label of the sample points through a ground survey or visual interpretation of finer images. Analysis involves comparing the difference between the classification and reference data and then expressing it in quantitative ways, so that users can better understand the classification result. Figure 6 depicts the overall design of the experiments. The three main parts are described in detail in the following.

3.1. Sampling Design

Land cover usually has an uneven distribution on the Earth’s surface, and study areas vary widely, thus making it difficult for the determination of sample size, sample allocation, and the spatial layout of sample points. To ensure the representativeness of sample points, a zoning strategy is introduced. In this paper, a comprehensive physical geographical regionalization is used to divide the whole area into three parts: the Tibetan Plateau region, the Northwest China region, and the East China region. The sampling design adopts a top-down design and divides the whole process into two stages. In the first stage, spatial stratification is made and some primary sample units are randomly chosen from each strata. In the second stage, secondary sample units are further chosen from the primary sample units.
In the first stage, 1° × 1° grids serve as the sampling framework under the original map sheet (5° × 6°). At the same time, the broken grids on the edge are merged into a new grid, ensuring that each grid is of the same size. Stratified random sampling is then employed with the sampling ratio of five grids per map sheet, and the spatial stratification is as shown in Figure 7.
In the second stage, the land-cover types are used as strata for their differences in classification accuracy. In addition, considering the characteristics of classification error, the LSI is introduced into the design of the inclusion probability raster to ensure the spatial representativeness and uniformity of the sample points. About 50 sampling points are drawn from each grid, and then sample allocation of the different land-cover types is made according to the proportion of the square root of the area.

3.2. Response Design

Visual interpretation is the main method of sample judgment in this paper. When loading the sample points in Google Earth, first of all, we browsed the available images in this area and selected the best-quality images which had obvious characteristics of the land-cover features and which were taken around the vegetation growth period of the year 2010. We then drew a 3 × 3 pixel block (90 m × 90 m rectangle), with the sampling point as the center. When the sampling unit fell into a homogeneous area, i.e., only one land-cover type in the box, we recorded the land-cover type as the reference label. When the sampling unit lay in an heterogeneous area, i.e., more than one land-cover type in the box, we recorded the primary label and secondary label according to the area size. In addition, in order to facilitate the follow-up analysis, the level of degree of trust of the judgment results was scored (1 for uncertain, 2 for basically certain, and 3 for very certain).

3.3. Accuracy Assessment

The accuracy assessment in this paper is mainly based on the confusion matrix and common accuracy measures, such as overall accuracy, producer’s accuracy, user’s accuracy, and the Kappa coefficient [41,42,43,44]. These indexes describe the classification accuracy from different aspects, and they are also simple and feasible. The overall accuracy is one of the most popular measures, and it indicates the percentage of correctly classified pixels in the population, as shown in Equation (2), where n means the total number of pixels, m means the number of land-cover types, and n ii means the number of correctly classified pixels in the diagonal.
OA = i = 1 m n ii n ,
In addition, the accuracy of each land-cover type can be calculated in the same way as proposed in Equations (3) and (4), where n + i and n i + represent the marginal sum of columns and the marginal sum of rows, respectively. The producer’s accuracy represents the probability of each land-cover type being correctly classified in the reference data. Similarly, the user’s accuracy means the probability of each land-cover type being correctly classified in the classification data.
PA i = n ii n + i , i = 1 , 2 , , m ,
UA i = n ii n i + , i = 1 , 2 , , m ,
Kappa analysis is a multivariate statistical analysis method which can reflect whether the classification result is statistically significant compared to a random classification result [45]. Differing from the overall accuracy, Kappa analysis takes a comprehensive consideration of all the elements in the confusion matrix (not only the elements in the diagonal, but also the omission and commission errors off the diagonal). These indexes should usually be calculated at the same time, in order to obtain more information about the classification.
Kappa = n i = 1 m n ii i = 1 m n i + n + i n 2 i = 1 m n i + n + i ,

4. Results

4.1. Selected Samples

As shown in Figure 5, the city of Suzhou in China, which is located in the Taihu River Basin, is rich in water resources (Figure 5a). We take the water as an example to show how LSI-based spatially balanced sampling works.
As shown in Figure 5d, the inclusion probability raster was set up, and 300 spatially balanced sample points were extracted using the RRQRR algorithm.
China was selected as the validation region due to its vast territory, large latitude range, complex terrain, diverse climate, rich vegetation species, and complex spatial distribution. GlobeLand30-2010 for China includes nine types of land cover, namely, cropland, forest, grass, shrub, wetland, water, artificial, bare land, and snow and ice (the land cover in China, not including tundra). The reference data refer to the dataset representing the ground truth, which can be used to judge whether the classification is right or wrong. This paper takes Google Earth as the main reference data source.
Based on the two-stage stratified spatially balanced sampling method, a total of 160 primary sample units and 7490 sample points were selected in China. The sample size calculation is shown in Table 1 and the spatial distribution of each region is shown in Figure 8.
According to the degree of trust system in the sample judgment results, there were 6336 points with a high level of trust, accounting for 84.59% of the total. There were 906 points in the middle level of degree of trust, with most in the Tibetan Plateau area (416 in the Tibetan Plateau, 274 in Northwest China, and 216 in East China). The remaining points were of low quality and could be excluded from the confusion matrix (248 in total, 124 in the Tibetan Plateau, 20 in Northwest China, and 104 in East China).
According to the results of the 7490 sample points, about 6.84% of the total samples lacked high-resolution images, and the images were characterized by a dim tone, blurred texture, and no shadow in Google Earth. A statistical analysis of the capture time of the images of all the sample points was conducted. From the statistics of the year distribution, the time span was quite long, with a range of 17 years. As shown in Table 2, 1.50% of the total images were recorded early in 2000–2002 and 2003–2005, and 88.10% of the total were recorded in 2006–2008, 2009–2011, and 2012–2014. From the statistics of the yearly distribution, each month was relatively balanced, except for October and December, as shown in Table 3. For the period of vegetation growth, only 24.73% of the images were taken between June and September.

4.2. Accuracy Assessment

As a result of location or registration errors, more than one label was recorded in the 90 m × 90 m sampling unit. The confusion matrix is based on hard classification results, which means that only one reference label corresponds to the sample point, so that it is necessary to define the consistency between the classification label and the reference label. When compared with the classification data, the classification is correct as long as the classified label is the same as the primary label or secondary label. Otherwise, the primary label is considered the reference land-cover type. According to the confusion matrix based on the 7242 sample points (sample points with a lower level of trust are not included), the overall accuracy of GlobeLand30 in China was 80.46% and the Kappa coefficient was 0.76. The sum of each row in Table 4 is the classified number of the object class, and each column is the real number of the object class. As shown in Table 4, the highest user’s accuracy was 88.50% (snow and ice), while the lowest was 53.00% (shrub). The highest producer’s accuracy was 89.80% (cropland), while the lowest was 56.12% (wetland).
The confusion between GlobeLand30 and the reference data is established on the overlay analysis. As shown is Figure 9a, the horizontal axis represents the land-cover type in the reference data, and the vertical axis represents the percentage of land cover i in the reference data misclassified as other land-cover types, which corresponds to the omission errors. Similarly, as shown in Figure 9b, the horizontal axis represents the land-cover type in GlobeLand30, and the vertical axis represents the percentage of land cover i in GlobeLand30 classified as other land-cover types, which corresponds to the commission errors. When the land-cover type described in the vertical axis is inconsistent with what is in the horizontal axis, confusion between land-cover types occurs. As shown in Figure 9a,b, confusion between forest/grass/shrub, wetland/water, and grass/bare land are important factors affecting the classification accuracy of GlobeLand30.
In addition, according to the regional statistics, the classification accuracies of the three regions are shown in Figure 10a–c. In terms of vertical comparison, the classification accuracy of cultivated land in the three regions was the most stable, and the user’s and producer’s accuracies were all about 80%. The reason for this was the image characteristics of cropland, which show an obvious striped texture in the images after cultivation and harvesting. The cropland accounts for a small proportion in the Tibetan Plateau and Northwest China regions, and it is easily affected by image quality, resulting in a relatively low producer’s accuracy, while the irrigated lands in East China are easily confused with wetland and water body. The seasonal rivers in the Tibetan Plateau and Northwest China regions are easily confused with wetlands. In addition, economic crops such as tea and fruit trees in the East China region, which should be classified as cropland, are easily confused with forest, while in the Northwest China region, they are confused with pasture. From a horizontal perspective, the classification accuracy of forest and shrub in the Northwest China region was poor, which is closely related to the vegetation morphology (low, sparsely populated, low canopy coverage) and a sparse distribution pattern in this arid region.
The results from the confusion matrix based on 7242 sample points with a high degree of trust show that the overall accuracy of GlobeLand30 in China was 80.46% and the Kappa consistency coefficient was 0.76, which is close to the 84.2% in Wang et al. [45], but the total number of samples used in this paper was 7490, which is slightly less than the number of samples (8400) in Wang et al. (See Figure 11). Compared with stratified sampling, the improved spatial balanced sampling proposed in this paper gives higher inclusion probability to areas with high spatial heterogeneity, and distributes more samples while ensuring the uniformity of samples in this area, so as to improve the classification accuracy. As shown in Figure 8, the samples of the sampling design method in this paper were more evenly distributed, especially in places where the fragmentation degree of patches is high, and the samples were denser. In addition, the omission errors of shrubs, wetlands, and artificial land in this paper were obviously less than under stratified random sampling. Yang et al. [15] evaluated the accuracy of GlobeLand30 products in China by collecting 600 m × 600 m validation sample units (VSUs) in homogeneous areas. Although its accuracy was 82.39%, which is slightly higher than in this paper, this method does not consider the heterogeneity of landscape (which is also one of the defects of the verification unit mentioned in this paper). For example, most rivers in China are small patches and are long and thin, so that this 600 m × 600 m verification unit excludes these patches, which makes the estimated classification accuracy of the corresponding land-cover higher, and the accuracy of this paper is comparable to this higher accuracy. In addition, VSUs were selected according to normal distribution, and the distribution of samples was not more uniform than that collected in this paper.

5. Discussion and Conclusions

In this paper, we evaluated the accuracy of the Chinese GlobeLand30 product using a two-stage stratified spatially balanced sampling plan, obtaining 7242 valid samples and 248 invalid samples according to the degree of trust in sample judgment.
For sampling design, compared with simple random sampling (SRS), cluster sampling, systematic sampling, and stratified sampling, spatial balanced sampling is a random, equiprobable, efficient, low-cost, well-balanced, and adaptable sampling method. We used landscape pattern-based inclusion probability to improve the spatially balanced sampling in this paper. The improved spatially balanced sampling provides a higher inclusion probability to areas with high spatial heterogeneity, and distributes more samples while ensuring the uniformity of samples in this area, so as to improve the classification accuracy.
For accuracy analysis, the results from the confusion matrix based on 7242 sample points with a high degree of trust show that the overall accuracy of GlobeLand30 in China was 80.46% and the Kappa consistency coefficient was 0.76, which is close to the third-party assessment accuracy of GlobeLand30.
In this paper, the sampling method was applied to the regional validation of GlobeLand30 in China. Compared with other scholars’ research, the results show that the method is effective and reasonable.
Besides the classification error, the raw remote sensing images and the phenological conditions also influenced the accuracy of GlobeLand30. There are two main factors that can influence the validation accuracy of GlobeLand30: (1) the quality of the reference data, i.e., the coverage and the time distribution of the high-resolution images may not be comprehensive enough to support all the validation points; and (2) the sample unit design, i.e., when the sample falls on the adjacent area due to positioning or matching error, it will be judged as another type of surface coverage, and the positioning error will be confused as a classification error. According to the survey results of the 7490 sample points, 12.71% of the 3 × 3 sample units (952 in total) were mixed with two or more kinds of land cover, and mixed units were easily found in cropland, forest, grass, and bare land.
Table 2 shows the distribution of the reference data of sample points in the acquisition years. The reference data acquired around 2010 accounted for 88.10% of all the reference data, that is, the phase of most reference data was close to that of Global30 products, which provided guarantees for the verification of land cover data products. This is because the extent to which the time phase is reasonable and rich is one of the key factors of image reference data quality. The land cover will of course change constantly with time, for example, cultivated land becomes building area, deforestation, lakes dry up, and so on. If there is no image of the corresponding year, it is easy to make the wrong judgment because of the change of land cover. Moreover, the land cover shows seasonal and periodic changes, such as seasonal changes of water and grassland, and melting of snow and ice. The occurrence of these phenomena will have an impact on both remote sensing mapping and subsequent accuracy evaluation. Therefore, the rationality and richness of image phase are also of great significance to the verification of land cover data products. Table 3 summarizes the distribution of all reference data in each month in the past 17 years. The number of reference data in other months is relatively balanced except for individual months, which ensures that the sample points have corresponding images in each month and prevents most of the reference images from being concentrated in a few months. Otherwise, for some seasonal land cover, the time phase is not rich enough, which will lead to classification errors.

Author Contributions

H.X., F.W. and X.T. conceived the study, supervised the experiments, and edited the manuscript. The other co-authors contributed to the sampling experiment, analysis, and discussion. All authors contributed significantly and participated sufficiently to take responsibility for this research. Moreover, all authors are in agreement with the submitted and accepted versions of the publication. All authors have read and agreed to the published version of the manuscript.

Funding

The work described in the paper was substantially supported by the National Natural Science Foundation of China (Project Nos. 41631178, and 41822106), National Key Research and Development of China (Project No. 2018YFB0505400), the Dawn Scholar of Shanghai (Project No. 18SG22), the State Key Laboratory of Disaster Reduction in Civil Engineering (Project No. SLDRCE19-B-35), and the Fundamental Research Funds for the Central Universities of China.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Chen, J.; Chen, J.; Liao, A.; Cao, X.; Chen, L.; Chen, X.; He, C.; Han, G.; Peng, S.; Lu, M.; et al. Global land cover mapping at 30 m resolution: A POK-based operational approach. ISPRS J. Photogramm. Remote Sens. 2015, 103, 7–27. [Google Scholar] [CrossRef] [Green Version]
  2. Nguyen, K.A.; Liou, Y.A. Mapping global eco-environment vulnerability due to human and nature disturbances. MethodsX 2019, 6, 862–875. [Google Scholar] [CrossRef] [PubMed]
  3. Talukdar, S.; Pal, S. Effects of damming on the hydrological regime of Punarbhaba river basin wetlands. Ecol. Eng. 2019, 135, 61–74. [Google Scholar] [CrossRef]
  4. Chen, Z.; Wang, L.; Wei, A.; Gao, J.; Lu, Y.; Zhou, J. Land-use change from arable lands to orchards reduced soil erosion and increased nutrient loss in a small catchment. Sci. Total Environ. 2019, 648, 1097–1104. [Google Scholar] [CrossRef]
  5. Dong, W.H.; Liu, Z.; Zhang, L.J.; Tang, Q.H.; Liao, H.; Li, X.E. Assessing Heat Health Risk for Sustainability in Beijing’s Urban Heat Island. Sustainability 2014, 6, 7334–7357. [Google Scholar] [CrossRef] [Green Version]
  6. Recanatesi, F.; Clemente, M.; Grigoriadis, E.; Ranalli, F.; Zitti, M.; Salvati, L. A Fifty-Year Sustainability Assessment of Italian Agro-Forest Districts. Sustainability 2016, 8, 32. [Google Scholar] [CrossRef] [Green Version]
  7. Torre, I.; Jaime-González, C.; Díaz, M. Habitat Suitability for Small Mammals in Mediterranean Landscapes: How and Why Shrubs Matter. Sustainability 2022, 14, 1562. [Google Scholar] [CrossRef]
  8. Loveland, T.R.; Belward, A.S. The IGBP-DIS global 1 km land cover data set, DISCover: First results. Int. J. Remote Sens. 1997, 18, 3289–3295. [Google Scholar] [CrossRef]
  9. Loveland, T.R.; Reed, B.C.; Brown, J.F. Development of a Global Land Cover Characteristics Database and IGBP DISCover from 1 km AVHRR Data. Int. J. Remote Sens. 2000, 21, 1303–1330. [Google Scholar] [CrossRef]
  10. Hansen, M.C.; Defries, R.S.; Townshend, J.R.G. Global Land Cover Classification at 1km Spatial Resolution Using a Classification Tree Approach. Int. J. Remote Sens. 2000, 21, 1331–1364. [Google Scholar] [CrossRef]
  11. Friedl, M.A.; Mciver, D.K.; Hodges, J.C.F. Global Land Cover Mapping from MODIS: Algorithms and Early Results. Remote Sens. Environ. 2002, 83, 287–302. [Google Scholar] [CrossRef]
  12. Friedl, M.A.; Sulla-Menashe, D.; Tan, B. MODIS Collection 5 Global Land Cover: Algorithm Refinements and Characterization of New Datasets. Remote Sens. Environ. 2010, 114, 168–182. [Google Scholar] [CrossRef]
  13. Bartholomé, E.; Belward, A.S. GLC2000: A New Approach to Global Land Cover Mapping from Earth Observation Data. Int. J. Remote Sens. 2005, 26, 1959–1977. [Google Scholar] [CrossRef]
  14. Bicheron, P.; Defourny, P.; Brockmann, C.; Schouten, L.; Vancutsem, C.; Huc, M.; Bontemps, S.; Leroy, M.; Achard, F.; Herold, M. Globcover-Products Description and Validation Report; JRC49240; European Space Agency: Toulouse, France, 2008. [Google Scholar]
  15. Yang, Y.K.; Xiao, P.F.; Feng, X.Z.; Li, H.X. Accuracy assessment of seven global land cover datasets over China. ISPRS J. Photogramm. Remote Sens. 2017, 125, 156–173. [Google Scholar] [CrossRef]
  16. Ji, L.Y.; Gong, P.; Geng, X.R.; Zhao, Y.C. Improving the Accuracy of the Water Surface Cover Type in the 30 m FROM-GLC Product. Remote Sens. 2015, 7, 13507–13527. [Google Scholar] [CrossRef] [Green Version]
  17. Xu, Y.D.; Yu, L.; Feng, D.L.; Peng, D.L.; Li, C.C.; Huang, X.M.; Lu, H.; Gong, P. Comparisons of three recent moderate resolution African land cover datasets: CGLS-LC100, ESA-S2-LC20, and FROM-GLC-Africa30. Int. J. Remote Sens. 2019, 40, 6185–6202. [Google Scholar] [CrossRef]
  18. Gong, P.; Li, X.C.; Wang, J.; Baia, Y.Q.; Chen, B.; Hu, T.Y.; Liu, X.P.; Xu, B.; Yang, J.; Zhang, W.; et al. Annual maps of global artificial impervious area (GAIA) between 1985 and 2018. Remote Sens. Environ. 2020, 236, 111510. [Google Scholar] [CrossRef]
  19. Strahler, A.H.; Boschetti, L.; Foody, G.M.; Friedl, M.A.; Hansen, M.C.; Herold, M.; Mayaux, P.; Morisette, J.T.; Stehman, S.V.; Woodcock, C.E. Global Land Cover Validation: Recommendations for Evaluation and Accuracy Assessment of Global Land Cover Maps; EUR 22156 EN; Office for Official Publication of the European Communities: Luxembourg, 2006. [Google Scholar]
  20. Stehman, S.V.; Foody, G.M. Key issues in rigorous accuracy assessment of land cover products. Remote Sens. Environ. 2019, 231, 111199. [Google Scholar] [CrossRef]
  21. Tong, X.H.; Wang, Z.H.; Xie, H.; Liang, D.; Jiang, Z.Q.; Li, J.C.; Li, J. Designing a two-rank acceptance sampling plan for quality inspection of geospatial data products. Comput. Geosci. 2011, 37, 1570–1583. [Google Scholar] [CrossRef]
  22. Scepan, J. Thematic validation of high-resolution global land cover data sets. Photogramm. Eng. Remote Sens. 1999, 65, 1051–1060. [Google Scholar]
  23. Tsendbazar, N.E.; Herold, M.; Li, L.; Tarko, A.; de Bruin, S.; Masiliunas, D.; Lesiv, M.; Fritz, S.; Buchhorn, M.; Smets, B.; et al. Towards operational validation of annual global land cover maps. Remote Sens. Environ. 2021, 266, 112686. [Google Scholar] [CrossRef]
  24. Mayaux, P.; Eva, H.; Gallego, J.; Strahler, A.H.; Herold, M.; Agrawal, S.; Naumov, S.; De Miranda, E.E.; Di Bella, C.M.; Ordoyne, C.; et al. Validation of the global land cover 2000 map. IEEE Trans. Geosci. Remote Sens. 2006, 44, 1728–1739. [Google Scholar] [CrossRef] [Green Version]
  25. Song, X.P.; Potapov, P.V.; Krylov, A.; King, L.A.; Di Bell, C.M.; Hudson, A.; Khan, A.; Adusei, B.; Stehman, S.V.; Hansen, M.C. National-scale soybean mapping and area estimation in the United States using medium resolution satellite imagery and field survey. Remote Sens. Environ. 2017, 190, 383–395. [Google Scholar] [CrossRef]
  26. Zhao, Y.; Gong, P.; Yu, L.; Hu, L.; Li, X.; Li, C.; Zhang, H.; Zheng, Y.; Wang, J.; Zhao, Y.; et al. Towards a common validation sample set for global land-cover mapping. Int. J. Remote Sens. 2014, 35, 4795–4814. [Google Scholar] [CrossRef]
  27. Benedetti, R.; Piersimoni, F.; Postiglione, P. Advanced Methods to Design Samples for Land Use/Land Cover Surveys. In Topics on Methodological and Applied Statistical Inference; Di Battista, T., Moreno, E.W., Eds.; Springer: Heidelberg/Berlin, Germany, 2016; pp. 31–42. [Google Scholar]
  28. Benedetti, R.; Piersimoni, F.; Postiglione, P. Spatially Balanced Sampling: A Review and A Reappraisal. Int. Stat. Rev. 2017, 85, 439–454. [Google Scholar] [CrossRef]
  29. Xie, H.; Tong, X.; Meng, W.; Liang, D.; Wang, Z.; Shi, W. A Multilevel Stratified Spatial Sampling Approach for the Quality Assessment of Remote-Sensing-Derived Products. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2015, 8, 4699–4713. [Google Scholar] [CrossRef]
  30. Grafstrom, A.; Lundstrom, N.L.P.; Schelin, L. Spatially balanced sampling through the pivotal method. Biometrics. 2012, 68, 514–520. [Google Scholar] [CrossRef] [PubMed]
  31. Benedetti, R.; Postiglione, P.; Piersimoni, F. Alternative and complementary approaches to spatially balanced samples. Metron 2017, 75, 249–264. [Google Scholar] [CrossRef]
  32. Benedetti, R.; Piersimoni, F. A spatially balanced design with probability function proportional to the within sample distance. Biom. J. 2017, 59, 1067–1084. [Google Scholar] [CrossRef]
  33. Stevens, D.L.; Olsen, A.R. Spatially balanced sampling of natural resources. J. Am. Stat. Assoc. 2004, 99, 262–278. [Google Scholar] [CrossRef]
  34. Thompson, W.L.; Miller, A.E.; Mortenson, D.C.; Woodward, A. Developing effective sampling designs for monitoring natural resources in Alaskan national parks: An example using simulations and vegetation data. Biol. Conserv. 2011, 144, 1270–1277. [Google Scholar] [CrossRef]
  35. Dickson, M.M.; Tillé, Y. Ordered spatial sampling by means of the traveling salesman problem. Comput. Stat. 2016, 31, 1359–1372. [Google Scholar] [CrossRef]
  36. Van Dam-Bates, P.; Gansell, O.; Robertson, B. Using balanced acceptance sampling as a master sample for environmental surveys. Methods Ecol. Evol. 2018, 9, 1718–1726. [Google Scholar] [CrossRef] [Green Version]
  37. Grafstrom, A. Spatially correlated Poisson sampling. J. Stat. Plan. Infer. 2012, 142, 139–147. [Google Scholar] [CrossRef] [Green Version]
  38. Theobald, D.M.; Stevens, D.L.; White, D. Using GIS to Generate Spatially Balanced Random Survey Designs for Natural Resource Applications. Environ. Manag. 2007, 40, 134–146. [Google Scholar] [CrossRef] [PubMed]
  39. Stehman, S.V.; Czaplewski, R.L. Design and analysis for thematic map accuracy assessment: Fundamental principles. Remote Sens. Environ. 1998, 64, 331–344. [Google Scholar] [CrossRef]
  40. Stehman, S.V.; Fonte, C.C.; Foody, G.M.; See, L. Using volunteered geographic information (VGI) in design-based statistical inference for area estimation and accuracy assessment of land cover. Remote Sens. Environ. 2018, 212, 47–59. [Google Scholar] [CrossRef] [Green Version]
  41. Brennan, R.L.; Prediger, D.J. Coefficient Kappa: Some Uses, Misuses, and Alternatives. Educ. Psychol. Meas. 1981, 41, 687–699. [Google Scholar] [CrossRef]
  42. Gao, Y.; Liu, L.Y.; Zhang, X.; Chen, X.D.; Mi, J.; Xie, S. Consistency Analysis and Accuracy Assessment of Three Global 30-m Land-Cover Products over the European Union Using the LUCAS Dataset. Remote Sens. 2020, 12, 3479. [Google Scholar] [CrossRef]
  43. Brovelli, A.M.; Molinari, M.E.; Hussein, E.; Chen, J.; Li, R. The First Comprehensive Accuracy Assessment of GlobeLand30 at a National Level: Methodology and Results. Remote Sens. 2015, 7, 4191–4212. [Google Scholar] [CrossRef] [Green Version]
  44. Foody, G.M. Explaining the unsuitability of the kappa coefficient in the assessment and comparison of the accuracy of thematic maps obtained by image classification. Remote Sens. Environ. 2020, 239, 111630. [Google Scholar] [CrossRef]
  45. Wang, Y.; Zhang, J.X.; Liu, D.; Yang, W.J.; Zhang, W.L. Accuracy Assessment of GlobeLand30 2010 Land Cover over China Based on Geographically and Categorically Stratified Validation Sample Data. Remote Sens. 2018, 10, 1213. [Google Scholar] [CrossRef] [Green Version]
Figure 1. RRQRR algorithm of spatially balanced sampling. (a) Morton code; (b) linear address; (c) reversed Morton code; (d) reversed linear address; (e) hierarchical random sorting; (f) spatially balanced sampling; (g) the “N” order; (h) the “N” or “Z” order.
Figure 1. RRQRR algorithm of spatially balanced sampling. (a) Morton code; (b) linear address; (c) reversed Morton code; (d) reversed linear address; (e) hierarchical random sorting; (f) spatially balanced sampling; (g) the “N” order; (h) the “N” or “Z” order.
Sustainability 14 02479 g001aSustainability 14 02479 g001b
Figure 2. Spatially balanced sampling of equal inclusion probability raster. (a) Study area; (b) equal inclusion probability; (c) filtering calculation; (d) spatially balanced sampling.
Figure 2. Spatially balanced sampling of equal inclusion probability raster. (a) Study area; (b) equal inclusion probability; (c) filtering calculation; (d) spatially balanced sampling.
Sustainability 14 02479 g002
Figure 3. Spatially balanced sampling of unequal inclusion probability raster. (a) Study area; (b) unequal inclusion probability; (c) filtering calculation; (d) spatially balanced sampling.
Figure 3. Spatially balanced sampling of unequal inclusion probability raster. (a) Study area; (b) unequal inclusion probability; (c) filtering calculation; (d) spatially balanced sampling.
Sustainability 14 02479 g003aSustainability 14 02479 g003b
Figure 4. Frequency distribution histogram of the LSI.
Figure 4. Frequency distribution histogram of the LSI.
Sustainability 14 02479 g004
Figure 5. Example of using the landscape pattern-based inclusion probability to improve spatially balanced sampling. (a) Test area; (b) spatial clustering based on the LSI; (c) inclusion probability raster based on the LSI; (d) spatially balanced sampling based on the LSI.
Figure 5. Example of using the landscape pattern-based inclusion probability to improve spatially balanced sampling. (a) Test area; (b) spatial clustering based on the LSI; (c) inclusion probability raster based on the LSI; (d) spatially balanced sampling based on the LSI.
Sustainability 14 02479 g005
Figure 6. Overall design of the experiments.
Figure 6. Overall design of the experiments.
Sustainability 14 02479 g006
Figure 7. Spatial stratification based on regular grids.
Figure 7. Spatial stratification based on regular grids.
Sustainability 14 02479 g007
Figure 8. Spatial distribution of points based on two-stage stratified spatially balanced sampling.
Figure 8. Spatial distribution of points based on two-stage stratified spatially balanced sampling.
Sustainability 14 02479 g008
Figure 9. Confusion analysis between GlobeLand30 and the reference data: (a) Omission error; (b) Commission error.
Figure 9. Confusion analysis between GlobeLand30 and the reference data: (a) Omission error; (b) Commission error.
Sustainability 14 02479 g009aSustainability 14 02479 g009b
Figure 10. Classification accuracy in different regions of China: (a) Tibetan Plateau; (b) Northwest China; (c) East China.
Figure 10. Classification accuracy in different regions of China: (a) Tibetan Plateau; (b) Northwest China; (c) East China.
Sustainability 14 02479 g010aSustainability 14 02479 g010b
Figure 11. Comparison of the results of the sampling scheme in this paper and in Wang et al. (2018): (a) comparison of overall accuracy; (b) comparison of the number of samples.
Figure 11. Comparison of the results of the sampling scheme in this paper and in Wang et al. (2018): (a) comparison of overall accuracy; (b) comparison of the number of samples.
Sustainability 14 02479 g011
Table 1. Sample distribution based on two-stage stratified spatially balanced sampling.
Table 1. Sample distribution based on two-stage stratified spatially balanced sampling.
RegionArea RatioPrimary Sampling UnitsSampling Points
Tibetan Plateau28.68%472198
Northwest China26.30%421967
East China45.01%713325
Total11607490
Table 2. Statistics of the yearly distribution of the images.
Table 2. Statistics of the yearly distribution of the images.
RegionYear
2000–20022003–20052006–20082009–20112012–20142015–2017No ImageTotal
Tibetan Plateau229591369494272182198
Northwest China333611039620761351967
East China738164135014421651593325
Total12100284375825562685127490
Table 3. Statistics of the monthly distribution of the images.
Table 3. Statistics of the monthly distribution of the images.
RegionMonth
123456789101112No ImageTotal
Tibetan Plateau991982341176325491041112441695672182198
Northwest China4871177156149115195176215276971571351967
East China2051673053472382251641862873532614281593325
Total35243671662045036540846661387352711525127490
Table 4. Confusion matrix of GlobeLand30 in China.
Table 4. Confusion matrix of GlobeLand30 in China.
CroplandForestGrassShrubWetlandWaterArtificialBare LandSnow and IceTotal
Cropland12865325817302201432
Forest71113255185931211306
Grass14319714455010141616552045
Shrub414106110450162
Wetland7318355606098
Water12522342560150317
Artificial346530227150326
Bare land25181589292117671406
Snow and Ice0914002025100150
Total1582142417462007830632214711137242
UA (%)81.2979.4982.7653.0070.5183.6684.1679.9588.50-
PA (%)89.8086.6870.6665.4356.1280.7683.1383.6466.67-
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Xie, H.; Wang, F.; Gong, Y.; Tong, X.; Jin, Y.; Zhao, A.; Wei, C.; Zhang, X.; Liao, S. Spatially Balanced Sampling for Validation of GlobeLand30 Using Landscape Pattern-Based Inclusion Probability. Sustainability 2022, 14, 2479. https://doi.org/10.3390/su14052479

AMA Style

Xie H, Wang F, Gong Y, Tong X, Jin Y, Zhao A, Wei C, Zhang X, Liao S. Spatially Balanced Sampling for Validation of GlobeLand30 Using Landscape Pattern-Based Inclusion Probability. Sustainability. 2022; 14(5):2479. https://doi.org/10.3390/su14052479

Chicago/Turabian Style

Xie, Huan, Fang Wang, Yali Gong, Xiaohua Tong, Yanmin Jin, Ang Zhao, Chao Wei, Xinyi Zhang, and Shicheng Liao. 2022. "Spatially Balanced Sampling for Validation of GlobeLand30 Using Landscape Pattern-Based Inclusion Probability" Sustainability 14, no. 5: 2479. https://doi.org/10.3390/su14052479

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop