Clustering Arid Rangelands Based on NDVI Annual Patterns and Their Persistence

Sanz, Ernesto; Sotoca, Juan José Martín; Saa-Requejo, Antonio; Díaz-Ambrona, Carlos H.; Ruiz-Ramos, Margarita; Rodríguez, Alfredo; Tarquis, Ana M.

doi:10.3390/rs14194949

Open AccessArticle

Clustering Arid Rangelands Based on NDVI Annual Patterns and Their Persistence

by

Ernesto Sanz

^1,2,*

,

Juan José Martín Sotoca

^1,2,

Antonio Saa-Requejo

^1,3

,

Carlos H. Díaz-Ambrona

^1,4

,

Margarita Ruiz-Ramos

^1,4,

Alfredo Rodríguez

^1,5

and

Ana M. Tarquis

^1,2

¹

CEIGRAM, Universidad Politécnica de Madrid, 28040 Madrid, Spain

²

Grupo de Sistemas Complejos, Universidad Politécnica de Madrid, 28040 Madrid, Spain

³

Evaluación de Recursos Naturales, ETSI Agronómica, Alimentaria y Biosistemas, Universidad Politécnica de Madrid, 28040 Madrid, Spain

⁴

AgSystems, ETSI Agronómica, Alimentaria y Biosistemas, Universidad Politécnica de Madrid, 28040 Madrid, Spain

⁵

Departamento de Análisis Económico y Finanzas, Universidad de Castilla-La Mancha, 45071 Toledo, Spain

^*

Author to whom correspondence should be addressed.

Remote Sens. 2022, 14(19), 4949; https://doi.org/10.3390/rs14194949

Submission received: 19 July 2022 / Revised: 28 September 2022 / Accepted: 29 September 2022 / Published: 4 October 2022

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Rangeland ecosystems comprise more than a third of the global land surface, sustaining essential ecosystem services and livelihoods. In Spain, Southeast Spain includes some of the driest regions; accordingly, rangelands from Murcia and Almeria provinces were selected for this study. We used time series metrics and the Hurst Exponent from rescale range and detrended fluctuation analysis to cluster different rangeland dynamics to classify temporally and spatially diverse rangelands. The metrics were only calculated for three time periods that showed significant NDVI changes: March to April, April to July, and September to December. Detrended fluctuation analysis was not previously employed to cluster vegetation. This study used it to improve rangeland classification. K-means and unsupervised random forest were used to cluster the pixels using time series metrics and Hurst exponents. The best clustering results were obtained when unsupervised random forest was used with the Hurst exponent calculated with detrended fluctuation analysis. We used the Silhouette Index to evaluate the clustering results and a spatial comparison with topographical data. Our results show that adding the Hurst exponent, calculated with detrended fluctuation analysis, provided a better classification when clustering NDVI time series, while classifications without the Hurst exponent or with the Hurst exponent calculated with the rescale range method showed lower silhouette values. Overall, this shows the importance of using detrending when calculating the Hurst exponent on vegetation time series, and its usefulness in studying rangeland dynamics for management and research.

Keywords:

NDVI; multiscaling; vegetation dynamics; rangelands; detrended fluctuation analysis; random forest

1. Introduction

Ecosystems were considered complex systems with non-linear dynamics in space and time for more than three decades [1,2,3,4]. However, only recent research focuses on tackling the complexity of ecosystem temporal dynamics with various methodologies [5,6,7,8,9,10,11,12]. As an eco-social system, rangelands comprise 30–40% of the Earth’s landmass, supporting approximately 1 billion people [13,14]; this makes them suitable land types to study ecosystem dynamics with significant human activity effects. This type of land is heavily affected by land degradation, affecting 73% of all rangelands [15,16,17,18,19]. Land degradation reduces biological productivity, ecosystem functions, and complexity [20,21].

Climate change and social-economic trends are some of the main challenges in rangeland conservation, often with interactive and synergy responses [22,23]. An integrated approach to land management is required to address these issues. Understanding the dynamics and characteristics of rangelands is a vital part of their conservation [4,24,25,26]. The Normalized Differentiated Vegetation Index (NDVI) was widely used to monitor, assess, and classify vegetation [27,28,29,30,31]. Moreover, more recently, supervised and unsupervised machine learning was used to classify rangeland pixels based on values of NDVI at the spatial level. Summary statistics of NDVI time series were also used to study spatiotemporal data [32,33,34,35]. The unsupervised classification does not require labelled data. K-means and ISODATA algorithms are commonly used in unsupervised land cover and crop classification. However, these algorithms are susceptible to outliers, high dimensionality, and noise. Unsupervised Random Forest (URF) was previously used with other biological data, such as genomic sequence data [36] and vegetation [37]. Different metrics to measure the fitness of the cluster were developed. The Silhouette Index is an excellent internal cluster validation metric, more robust than other metrics, such as the Rand Index and Dun Index [38,39,40,41].

A method to study the complexity of time series is their fractal character using the Hurst exponent [42]. This method was developed to measure the persistence (H > 0.5) or antipersistence (H < 0.5) of a time series. This analysis can be calculated using the Rescaled Range (R/S) method, named the Hurst Index (HI, [42]). Another method uses detrended fluctuation analyses (DFA), which removes tendencies of the time series before calculating the Hurst exponent (H2), the generalised Hurst exponent for q = 2 [43]. Both methods were used in long-term ecosystem dynamics on vegetation [8,10,44,45]. Another application was to localise changes in those dynamics, such as those affected by fire [46]. When a time series is persistent, the trend of that time series will continue in the same direction. However, if a time series is antipersistent, the trend will be followed by the opposite (e.g., if the trend were increasing, it would be followed by a descent). If the Hurst exponent is close to 0.5, the time series will follow a random process, such as a random walk.

The Hurst exponent was applied to the NDVI time series to quantify the long-term memory as well as their trend. Long-term memory is affected by land use, land changes, and climate change, making it useful for rangeland managers. Topographical variables were also linked to Hurst exponent values [5,6]. Several authors used it to map rangelands or other vegetation, and comment on their connection with slope and elevation values [5,8,10,47,48,49,50]. However, to our knowledge, integrating the system persistence characteristics with the NDVI annual pattern to classify rangelands was not yet accomplished. The Hurst exponent represents vegetation dynamics and NDVI time-series summary statistics represent the vegetation types. This research aims to provide new insights into a spatially complex eco-social system, where aridity, land degradation, and climate change restrict agricultural practices and ecosystem services [51,52]. Clustering rangeland pixels in arid areas can be used to prioritise field visits where different vegetation dynamics and trends are found.

The present study attempted rangeland classification, including the Hurst exponent. Two Hurst exponent methods (HI and H2) were used to evaluate the influence of the DFA in capturing vegetation dynamics. Additionally, two different machine learning methods (k-means and URF) were applied to decide which provided a more accurate outcome based on the Silhouette Index.

2. Materials and Methods

2.1. Area of Study

Three agricultural regions of Southeast Spain were selected (Figure 1): Los Velez in the province of Almeria, and the Northwest and Northeast in the province of Murcia, which will be called Murcia-NW and Murcia-NE, respectively, for clarity. These three regions have a Mediterranean arid climate with an average annual precipitation of less than 300 mm, although with regional variations [53]. The spatial resolution used was 250 m/pixel. This spatial resolution matches the resolution used by most stakeholders in the Spanish agricultural insurance system. The pixel selection was provided by ENtidad Estatal de Seguros Agrarios (ENESA, Ministerio de Agricultura, Pesca y Alimentación, Government of Spain), using the Sistema de Información Geográfico de Parcelas Agrícolas (SIGPAC [54]) and the Mapa Forestal Español (MFE, Spanish Forest Map). Firstly, pixels categorised as rangeland were selected using the SIGPAC. Secondly, using the previous selection, pixels with a tree coverage higher than 15% were discarded to ensure a low tree coverage, based on the MFE. Three thousand six hundred and fifty-four (3654) pixels of rangelands were selected, consisting of grasslands, shrublands, and open woodlands.

The three regions are mainly located in mountainous areas. The Murcia-NE region is mainly a mix of grassland and shrubland; Murcia-NW is dominated by sparse woodland mixed with shrubs; and in Los Velez, grasslands and shrublands are the primary vegetation with minimal areas of sparse woodland. These regions include areas with different aspects and changing slopes and elevations.

2.2. Data Collection

2.2.1. NDVI Data

The NDVI data were collected from MOD09A1.006, using the AppEEARS tool [55], and downloading the RED (band 1) and NIR (band 2) values for the target areas. This tool has a 250 m spatial resolution, collecting a set of 3654 pixels and an 8-day temporal resolution from the beginning of 2000 to 2019, a total of 20 years of data. R software [56] was used for each pixel series to calculate the NDVI, using Equation (1) below. The 8-day temporal resolution was transformed to a 10-day resolution as used by the Spanish indexed agricultural insurance.

NDVI = 100 \times \frac{NIR - RED}{NIR + RED}

(1)

The possible NDVI values range from 0 to 100. The obtained NDVI values were then checked for quality. The data were deleted if they were not categorised as ideal quality (quality values in band from AppEEARS, less than 0.01%). The gaps were filled using running averages with a gap interval of seven dates. The time series were then smoothed using the Savitzky–Golay method [57], with a window size of 9 selected, based on the best-fitted outputs.

2.2.2. GIS Data

A Digital Elevation Model (DEM, 10 m resolution) was downloaded from the Copernicus website [58], and ArcGIS software v. 10.8.1 [59] was used to calculate the slope based on the DEM. These two datasets, and the variables used in the clustering analysis (Hurst exponent and NDVI summary statistics), were used to compare the clustering results through boxplots for a visual comparison.

2.3. Fractal Analysis

2.3.1. Rescale Ranged Hurst Exponent

Hurst Index analysis was used to analyse the persistence of NDVI in each area [42]. For this index, the package “pracma” (version 1.9.9) [60] was used in R Software. This index splits the time series into τ subseries. Each subseries calculates the mean and cumulative sum of the mean to calculate the range (R(τ)). This range is divided by each subseries standard deviation (S(τ)). The Hurst exponent (HI) is then calculated using the following formula and by averaging each subseries, where c is a constant of proportionality, τ is the time span, and H is the Hurst scaling exponent.

\frac{R (τ)}{S (τ)} = {c τ}^{H}

(2)

2.3.2. Multifractal Detrended Fluctuation Analysis

A Mann–Kendall test [61,62] was applied to the whole temporal series of each pixel. Since most of the NDVI series presented a trend, Multifractal Detrended Fluctuation Analysis (MF-DFA) was used following [43], developed to calculate multifractal properties after removing trends in the time series. The main feature of multifractals is that they are characterised by high variability over wide ranges of temporal or spatial scales associated with intermittent fluctuations and long-range power-law correlations.

The MF-DFA operates on x(i), where

i = 1, 2, \dots, N

, with N being the series length;

\bar{x}

represents the mean value, and x(i) are increments of a random walk process around the average

\bar{x}

. The integration of the signal, therefore, provides what is called the ‘trajectory’ or ‘profile’:

y (i) = \sum_{k = 1}^{i} [x (k) - \bar{x}]

(3)

Furthermore, the integration will reduce the level of measurement noise present in observational and finite records. Next, the integrated series was divided into N_s = int (N/s), the integer part of non-overlapping segments of equal lengths s. The local trend was then calculated for each Ns segment by a least-squares fit, and then the variance was determined:

F^{2} (s, ν) = \frac{1}{s} \sum_{i = 1}^{s} {y [(ν - 1) s + i] - y_{ν} (i)}^{2}

(4)

for each segment ν, where

ν = 1, \dots, N_{S}

. Here,

y_{ν}

(i) is the fitting curve in segment ν. In this case study, a line was chosen. After detrending the series, the average was performed over all segments to obtain the 2nd-order fluctuation function:

F_{q} (s) = {\frac{1}{2 N_{s}} \sum {[F^{2} (s, ν)]}^{\frac{q}{2}}}^{\frac{1}{q}}

(5)

H(q) is the generalised MF-DFA exponent in the function of q. H(q) was calculated for the time scales where the fluctuation functions increased linearly to allow detrending calculations, starting at 32 days. Observing Equations (4) and (5), in the case that q = 2, the equation will be:

F_{2} (s) \propto s^{H (2)}

(6)

Therefore, H2 = H(2) is the Hurst index estimated using MF-DFA as it was used by [63]. In this study, given that only one exponent (H2) was used, this method will be referred to as DFA.

2.4. Variable Selection for Clustering

Summary statistics of the NDVI time series (quartiles 1, 2, 3, and variances) were calculated to analyse vegetation dynamics, similarly to [34,35]. However, the statistics were calculated at different year moments (phases) where NDVI behaves differently across the year. Three periods were chosen when the NDVI experienced more significant changes: Phase 2 (March and the first two ten-day periods of April), Phase 3 (from the last ten-day period of April to the last ten-day period of July), and Phase 5 (September to December) following [64]. For these three periods, the mentioned summary statistics were calculated. The Hurst exponent was then calculated for the whole NDVI time series using two methods, R/S and DFA. Afterwards, clustering techniques were used on the selected summary statistics independently, and with each of the Hurst exponents. The results were compared to topographical data: elevation and slope.

Among all summary statistics and the Hurst exponents, a correlation matrix was applied to select variables that did not have a strong correlation (i.e., <0.75). Principal component analysis was run when strong correlations were present to select the most explanatory variables. Upon selection, clustering analyses were run and compared.

2.5. Clustering

Clustering was made using two unsupervised machine learning methods (k-means and URF). The Silhouette Index [39] was used to compare the different classification results and select the best option based on the partition and all proximities for all objects. The Silhouette Index was calculated for clusters A and C, following Equation (7):

SI (i) = \frac{b (i) - a (i)}{\max {a (i), b (i)}}

(7)

where a(i) is the average dissimilarity i to all other objects of cluster A, and b(i) is the minimum average dissimilarity of i to the centroid of cluster C.

To study the differences and similarities between the clusters, the adjusted Rand Index was used [65] from the R package “fossil v. 0.4.0” [66], which determines whether two clusters are similar to each other using a contingency table of the two clusters making an all pair-wise comparison.

2.5.1. K-Means

K-means was developed by Stuart Lloyd in 1957 and published in 1982 [67]. It is a non-hierarchical technique, and one of the simplest methods to solve clustering problems. James MacQueen first coined this method as k-means in 1967 [68]. This algorithm starts clustering by randomly assigning a K number of centroids. Secondly, it calculates the distance between the data points, and the closest centroid minimises the sum of the square as in Equation (8):

d (x, y) = \frac{1}{2} \sum_{i} {(x_{i} - y_{i})}^{2}

(8)

The algorithm repeats this process by adjusting the centroids based on the calculated distance, iterating a set number, and converging in a fixed point [69]. In this paper, Hartigan and Wong’s method was used [70] with the R package “stats v. 3.6.2” [56]. This method reassigns point by point, considering the shift in the means after the reassignment of previous points, and it may reassign a point even if it already has an assigned centre.

2.5.2. Unsupervised Random Forest

Random forest [71] is a tree-based ensemble method, i.e., methods that generate many classifiers and aggregate their results. It uses bootstrap aggregating (bagging [72]) to calculate a large number of trees based on the fed predictor variables and to select the most voted trees. Random forest is a non-parametric method that builds each tree using a deterministic algorithm based on the three main variables: (1) the number of trees (nt); (2) the number of predictors tested on each node (m); and (3) the minimal size for each node (nodesize). A third of the bootstrap is omitted in each node and is considered out-of-the-bag (OOB) data. These data are used to obtain a classification rate for each node. The variable importance is calculated for the averaged final tree based on the OOB data and their classification rate. Each tree presents a different variable importance, but these are averaged [73]. The R package “randomForest v. 4.6-14” was used [74] to calculate the RF as an unsupervised method, utilising the proximity matrix as predictor variables.

3. Results

3.1. Variable Selection Approach

All summary statistics between the three selected phases presented a robust linear correlation (>0.75) except for the three variances (Figure 2). Principal component analyses (PCA) were performed with our statistic variables, including the Hurst exponents (HI and H2). Moreover, among those variables with a strong correlation, quartile 3 of phase 5 was chosen for its higher explanatory power (Table A1 and Table A2). The three variances and quartile 3 of phase 5 were used separately with HI and H2, and with neither of them.

3.2. Clustering Analysis

3.2.1. K-Means

The k-mean analyses were applied, using the aforementioned selected variables, for three and four clusters based on the elbow method. The elbow method is a heuristic method to determine the number of clusters in a dataset [75], as shown in Figure 3. K-means clustering was different when three (three-cluster analyses) and four (four-cluster analyses) clusters were used. However, for each cluster number, the results were identical whether no Hurst exponent, H2 or HI were used. The clustering results presented an adjusted Rand Index of 1 among the three-cluster analyses and an adjusted Rand Index of 0.84 when comparing the results of three- and four-cluster analyses. The fourth cluster showed very few pixels with a low Silhouette Index for this cluster, as shown in Figure 4. The Silhouette Index was the same in all k-means analyses with three- and four-cluster analyses (Table 1).

3.2.2. Unsupervised Random Forest

Using the elbow method with the partitioning around medoids method showed a similar graphic as using the k-means method, indicating that three and four clusters may be the most appropriate to use (Figure 5).

The URF has more variables that affect the results: the number of trees (nt) and several variables (m) used for splitting branches. Three or four clusters were used and H2, HI, and no Hurst exponent analyses, were calculated. For each combination, URF was calculated for different nt and m to obtain the analysis with the highest Silhouette Index (Figure A4, Figure A5 and Figure A6). Compared with k-means, URF showed higher variability between the results, whether using H2, HI, or no Hurst exponent. The silhouette values from URF were consistently higher when three groups were used for the three analyses regarding the Hurst exponent (Table 1). When four clusters were used in our analyses, the additional fourth cluster showed a low Silhouette Index for that cluster (Figure 6). Therefore, only the URF clustering for three clusters will be discussed with and without the Hurst exponents, focusing on the cluster with the highest Silhouette Index (H2).

The clustering results were more similar between the use of HI and no Hurst exponent than when H2 or HI was used, presenting 0.82 and 0.74 in the adjusted Rand Index, respectively. For all cases, cluster 1 was the most predominant, and cluster 2 had a higher NDVI and variance, while the opposite can be said for cluster 3. These differences were more remarkable when H2 was used. The difference in Hurst exponent (HI or H2, respectively) between the three clusters was more evident when H2 was used. The major differences in clustering among these three analyses were found in cluster 2, that with the highest H2 and NDVI (Figure 7 and Figure 8). These distinct pixels were found mainly in the Murcia-NW region (Figure 9, Figure A4, Figure A5 and Figure A6).

3.2.3. Cluster Characterisation

The Hurst exponent from DFA showed a stronger linear correlation with elevation and slope than HI. The same occurred with the selected variables used for the clustering analyses (Table 2), the variances from phases 5 and 2 (those with the highest correlation to H2 and HI, respectively). These correlations were reflected in the clustering process. When URF with H2 was used, slope and elevation were more heavily differentiated for clusters 2 and 3. These differences were not found when k-means was used since the clustering outcome was the same when H2 substituted HI, or no Hurst exponent was used. Furthermore, slope and elevation showed a more considerable overlap between the clusters on the three-cluster analyses when k-means was used (Figure 8).

When H2 was used, the three-clusters analyses presented more significant differences. These differences are shown in their dynamics, as seen in the variances calculated separately for each cluster, phase, and NDVI (Figure 10), where some pixels were distinct. These differences were still found when all pixels were averaged for each cluster (Figure 8 and Figure 10). These differences in NDVI are reflected in the type of vegetation found dominating each pixel. Cluster 1, where we found the majority of pixels, reflects a great variation from woodlands to grasslands. In this region with an arid climate, patchy landscapes with different vegetation are typical and they can occur along an ecological continuum, rather than as well-defined and separated ecosystems [76,77]. Cluster 2 shows a vast majority of woodland, while cluster 3 consists mainly of grassland (Table 3), despite cluster 1 having both grassland and woodland, as reflected by an intermediate average NDVI for cluster 1. Pixels from cluster 2 are those with higher NDVI representing thicker forests, unlike the more dispersed forests with shrubs found in cluster 1.

The Mann–Kendall test was performed for all the pixels and the area. Although all three clusters showed that most pixels had a significant positive trend, cluster 2 had 90% of the pixels in that category, while clusters 1 and 3 only had 67% and 64%, respectively (Table 4).

4. Discussion

The link between elevation and the Hurst exponent was previously reviewed by Peng [5], who found a good relationship between HI and elevation. In our study, the stronger correlation of H2 with NDVI time-series variances, compared with HI, suggests the importance of detrending in fractal analyses when studying vegetation time series. Differences between R/S and DFA were previously reported [50], as DFA is less affected by size effects or spurious correlation of non-stationary time series [50,78]. Our results support these findings, highlighting the relevance of detrending, especially when studying different vegetation types. Limited differences in pixel clustering were found in both methods of calculating the Hurst exponent in areas dominated by grasslands, suggesting that a tendency is not present in this NDVI series probably due to the grazing effect on these areas. On the other hand, more significant differences in areas with more trees were found. In this case, grazing does not limit the vegetation growth of trees, showing a trend in their vegetation time series.

Arid rangelands are spatially heterogeneous [4,26], and land degradation and overgrazing can affect the landscape creating a grassland/woodland continuum [79,80]. This effect is reflected in the overlapping clusters, showing that discrete areas can have similar vegetation. However, differences among the majority of the pixels of each cluster in persistence, elevation, and slope were found. In further research, other factors relating to elevation and slope could be considered, such as availability for machine use in agriculture (easier on flatter areas), rainfall, soil depth, or erosion. These factors should be considered in land management.

Clustering vegetation dynamics and comparing those clusters with vegetation type illustrate the tendencies related to each vegetation. Understanding these processes is key to the spatiotemporal interactions between human and natural systems [18,19]. Most pixels were categorised as antipersistent and with a significantly increasing trend. Land managers should make special efforts to avoid further land degradation. Pixels categorised as the least antipersistent and with an increasing NDVI trend (as no persistent pixels were found) can be used as reference. These pixels can be studied to see if different management practices are in place leading to differences in persistence and NDVI trends.

The variability in arid areas was expected since minor changes in slope, rainfall, or other characteristics, mean a significant difference in water availability and plant growth [81,82]. Using URF to study rangelands can improve our understanding of the area even when fieldwork is unavailable, highlighting areas with different dynamics, crucial when monitoring vegetation. These techniques can also cluster a more extensive range of land uses, not only be limited to rangeland, since they will have more distinctive spectral signatures. Further research should be made in other arid areas to contrast whether this method can allow us to analyse previous land classification, prioritise areas for future surveys, and improve management action.

This study includes several limitations. (1) MODIS spatial resolution is much larger than most land plots in this region. Despite remote-sensing data with a higher spatial resolution, 250 m spatial resolution was chosen, as it is used for indexed agricultural insurance in Spain. (2) Ground field visits were not possible to formally validate our results, and the Silhouette Index was used to compare the clustering results. Future steps could be to formally visit different areas for each cluster to validate these results. However, this study aids the body of research [5,6,8,44,45,50] supporting the use of persistence (DFA) and trends (Mann–Kendall) for vegetation series, using these techniques in arid rangelands to aid rangeland managers and policymakers.

5. Conclusions

Two methods (R/S and DFA) were used to calculate the Hurst exponent (HI and H2). The results were compared using two clustering methods, with summary statistics from the NDVI time series. The combination providing the best results was obtained based on the Silhouette Index and cluster characteristics. URF with the Hurst exponent from DFA (H2) showed the best outcome, compared with URF performed with the Hurst exponent calculated with R/S (HI), URF made without the Hurst exponent, and all the k-means results.

URF found differences when different Hurst exponent methods were used, while k-means found no differences. URF with H2 showed greater differences between areas with higher tree coverage and those with a mix of grassland and shrubland. Additionally, the H2 time series presented a stronger linear correlation with slope and elevation, an essential aspect of vegetation dynamics in arid environments.

Detrended fluctuation analyses produced significant differences when calculating the Hurst exponent in time series that presented a tendency. Detrending time series can allow for a better understanding of the dynamics of vegetation time series, as well as rangeland evolution and future trends. Rangeland persistence was a key aspect to consider in rangeland management and research. Thus, future research should explore more rangeland, and other land uses, and compare different land management practices.

Author Contributions

Conceptualization, A.M.T., A.S.-R., C.H.D.-A. and E.S.; methodology, A.M.T., A.S.-R., E.S. and E.S.; formal analysis, A.M.T., J.J.M.S., A.S.-R., M.R.-R. and E.S.; writing—original draft preparation, E.S.; writing—review and editing, E.S., J.J.M.S., A.S.-R., C.H.D.-A., M.R.-R., A.R. and A.M.T., visualization, A.M.T., A.R. and E.S.; supervision, A.M.T.; funding acquisition, A.M.T. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially funded by Boosting Agricultural Insurance based on Earth Observation data–BEACON project under agreement No. 821964, funded under H2020_EU, DT-SPACE-01-EO-2018-2020 and the Ministerio de Ciencia e Innovación (grant no. AGRISOST-CM S2018/BAA-4330). The authors also acknowledge support from Project No. PID2021-122711NB-C21 of the Ministerio de Ciencia, Innovación y Universidades of Spain.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Acknowledgments

The data provided by ENESA, the Ministerio de Agricultura, Pesca y Alimentación is greatly appreciated.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Appendix A. Principal Component Analyses Made to Select the Most Explanatory Variables

Table A1. Three first principal components from the PCA for all the NDVI time-series variables and H2. Background color highlights those variables with higher explanatory power for each principal component.

Variables	PC1	PC2	PC3
NDVI_H2	0.14	−0.27	0.84
Var_Ph2	0.13	−0.59	−0.22
Var_Ph3	0.13	−0.56	−0.37
Var_Ph5	0.23	−0.38	0.17
Quartile_1_Ph2	0.32	0.01	0.03
Quartile_1_Ph3	0.31	0.19	−0.13
Quartile_1_Ph5	0.31	0.16	001
Median_Ph2	0.32	0.05	0.2
Median_Ph3	0.31	0.14	−0.14
Median_Ph5	0.31	0.11	0.04
Quartile_3_Ph2	0.32	−0.01	0.01
Quartile_3_Ph3	0.32	0.08	−0.17
Quartile_3_Ph5	0.33	0.05	0.07
Standard deviation	3.10	1.36	0.95
Proportion of Variance	0.74	0.14	0.07
Cumulative Proportion	0.74	0.88	0.94

Table A2. After removing the variables with a strong correlation for the summary statistics with the least explanatory power, the first principal components from the PCA for the selected variables are shown. Those variables with higher explanatory power for each principal component are highlighted.

Variables	PC1	PC2	PC3
NDVI_H2	0.36	−0.54	0.74
Var_Ph5	0.47	0.45	0.13
Var_Ph3	0.43	0.54	0.07
Var_Ph2	0.54	−0.13	−0.16
Quartile_3_Ph5	0.12	−0.43	−0.64
Standard deviation	1.72	1.02	0.76
Proportion of Variance	0.59	0.21	0.12
Cumulative Proportion	0.59	0.79	0.91

Appendix B. The Silhouette Indexes Calculated for All URF Changing Mtry and Number of Trees for Three Clusters

Figure A1. Silhouette indexes for three clusters for URF using Hurst exponent calculated with Rescaled Range method. “m” represents the number of predictors tested on each node.

Figure A2. Silhouette indexes for three clusters for URF using Hurst exponent calculated with Detrended fluctuation analysis. “m” represents the number of predictors tested on each node.

Figure A3. Silhouette indexes for three clusters for URF not using any Hurst exponent. “m” represents the number of predictors tested on each node.

Appendix C. Maps of the Clusters Using URF with HI and H2 for the Three Study Provinces

Figure A4. Comparison of the clustering results for URF using HI (a) and H2 (b) in the agricultural region of Murcia-NE. Cluster 1 is pink, cluster 2 is green, and cluster 3 is blue.

Figure A5. Comparison of the clustering results for URF using HI (a) and H2 (b) in the agricultural region of Murcia-NW. Cluster 1 is pink, cluster 2 is green, and cluster 3 is blue.

Figure A6. Comparison of the clustering results for URF using HI (a) and H2 (b) in the agricultural region of Los Vélez (Almería). Cluster 1 is pink, cluster 2 is green, and cluster 3 is blue.

References

Levin, S.A. Ecosystems and the Biosphere as Complex Adaptive Systems. Ecosystems 1998, 1, 431–436. [Google Scholar] [CrossRef] [Green Version]
Levin, S.A. The Problem of Pattern and Scale in Ecology: The Robert H. MacArthur Award Lecture. Ecology 1992, 73, 1943–1967. [Google Scholar] [CrossRef]
Emanuel, W.R.; Shugart, H.H.; Stevenson, M.P. Climatic Change and the Broad-Scale Distribution of Terrestrial Ecosystem Complexes. Clim. Chang. 1985, 7, 29–43. [Google Scholar] [CrossRef]
Vetter, S. Rangelands at Equilibrium and Non-Equilibrium: Recent Developments in the Debate. J. Arid Environ. 2005, 62, 321–341. [Google Scholar] [CrossRef]
Peng, J.; Liu, Z.; Liu, Y.; Wu, J.; Han, Y. Trend Analysis of Vegetation Dynamics in Qinghai–Tibet Plateau Using Hurst Exponent. Ecol. Indic. 2012, 14, 28–39. [Google Scholar] [CrossRef]
Tong, S.; Zhang, J.; Bao, Y.; Lai, Q.; Lian, X.; Li, N.; Bao, Y. Analyzing Vegetation Dynamic Trend on the Mongolian Plateau Based on the Hurst Exponent and Influencing Factors from 1982–2013. J. Geogr. Sci. 2018, 28, 595–610. [Google Scholar] [CrossRef] [Green Version]
Almeida-Ñauñay, A.F.; Benito, R.M.; Quemada, M.; Losada, J.C.; Tarquis, A.M. The Vegetation–Climate System Complexity through Recurrence Analysis. Entropy 2021, 23, 559. [Google Scholar] [CrossRef]
Sanz, E.; Saa-Requejo, A.; Díaz-Ambrona, C.H.; Ruiz-Ramos, M.; Rodríguez, A.; Iglesias, E.; Esteve, P.; Soriano, B.; Tarquis, A.M. Generalized Structure Functions and Multifractal Detrended Fluctuation Analysis Applied to Vegetation Index Time Series: An Arid Rangeland Study. Entropy 2021, 23, 576. [Google Scholar] [CrossRef]
Bruzzone, O.; Easdale, M.H. Archetypal Temporal Dynamics of Arid and Semi-Arid Rangelands. Remote Sens. Environ. 2021, 254, 112279. [Google Scholar] [CrossRef]
Rivas-Tabares, D.; Saa-Requejo, A.; Martín-Sotoca, J.J.; Tarquis, A.M. Multiscaling NDVI Series Analysis of Rainfed Cereal in Central Spain. Remote Sens. 2021, 13, 568. [Google Scholar] [CrossRef]
Liu, X.; Zhang, J.; Zhu, X.; Pan, Y.; Liu, Y.; Zhang, D.; Lin, Z. Spatiotemporal Changes in Vegetation Coverage and Its driving Factors in the Three-River Headwaters Region during 2000–2011. J. Geogr. Sci. 2014, 24, 288–302. [Google Scholar] [CrossRef]
Ndayisaba, F.; Guo, H.; Bao, A.; Guo, H.; Karamage, F.; Kayiranga, A. Understanding the Spatial-Temporal Vegetation Dynamics in Rwanda. Remote Sens. 2016, 8, 129. [Google Scholar] [CrossRef] [Green Version]
Food and Agriculture Organization. Review of Evidence on Drylands Pastoral Systems and Climate Change: Implications and Opportunities for Mitigation and Adaptation. In Land and Water Discussion Paper 8; Citeseer: Rome, Italy, 2009; p. 38. ISBN 978-92-5-106413-9. [Google Scholar]
Reid, W.V.; Mooney, H.A.; Cropper, A.; Capistrano, D.; Carpenter, S.R.; Chopra, K.; Dasgupta, P.; Dietz, T.; Duraiappah, A.K.; Hassan, R. Millennium Ecosystem Assessment Synthesis Report; WRI (World Resources Institute): Washington, DC, USA, 2005. [Google Scholar]
United Nations Conference on Environment and Development. Agenda 21; United Nations Conference on Environment and Development: New York, NY, USA, 1992. [Google Scholar]
United Nations. Elaboration of an International Convention to Combat Desertification in Countries Experiencing Serious Droughts and/or Desertification Particularly in Africa; UNEP: Geneva, Switzerland, 1994. [Google Scholar]
Kapalanga, T.S. A Review of Land Degradation Assessment Methods. In Land Restoration Training Programme; Citeseer: Rome, Italy, 2008; Volume 2011, p. 68. [Google Scholar]
Aide, T.M.; Grau, H.R.; Graesser, J.; Andrade-Nuñez, M.J.; Aráoz, E.; Barros, A.P.; Campos-Cerqueira, M.; Chacon-Moreno, E.; Cuesta, F.; Espinoza, R. Woody Vegetation Dynamics in the Tropical and Subtropical Andes from 2001 to 2014: Satellite Image Interpretation and Expert Validation. Glob. Chang. Biol. 2019, 25, 2112–2126. [Google Scholar] [CrossRef]
Woodward, F.I.; Lomas, M.R. Vegetation Dynamics–Simulating Responses to Climatic Change. Biol. Rev. 2004, 79, 643–670. [Google Scholar] [CrossRef] [PubMed]
Warren, A. Land Degradation Is Contextual. Land Degrad. Dev. 2002, 13, 449–459. [Google Scholar] [CrossRef]
Lambin, E.F.; Turner, B.L.; Geist, H.J.; Agbola, S.B.; Angelsen, A.; Bruce, J.W.; Coomes, O.T.; Dirzo, R.; Fischer, G.; Folke, C.; et al. The Causes of Land-Use and Land-Cover Change: Moving beyond the Myths. Glob. Environ. Chang. 2001, 11, 261–269. [Google Scholar] [CrossRef]
Gartzia, M.; Fillat, F.; Pérez-Cabello, F.; Alados, C.L. Influence of Agropastoral System Components on Mountain Grassland Vulnerability Estimated by Connectivity Loss. PLoS ONE 2016, 11, e0155193. [Google Scholar] [CrossRef] [Green Version]
Herrera, P.M.; Davies, J. Governance of the Rangelands in a Changing World. In The Governance of Rangelands; Routledge: Abingdon-on-Thames, UK, 2014; pp. 54–66. ISBN 1315768011. [Google Scholar]
Robinson, N.P.; Allred, B.W.; Naugle, D.E.; Jones, M.O. Patterns of Rangeland Productivity and Land Ownership: Implications for Conservation and Management. Ecol. Appl. 2019, 29, e01862. [Google Scholar] [CrossRef]
Perrings, C.; Walker, B. Conservation in the Optimal Use of Rangelands. Ecol. Econ. 2004, 49, 119–128. [Google Scholar] [CrossRef]
Bird, S.B.; Herrick, J.E.; Wander, M.M.; Wright, S.F. Spatial Heterogeneity of Aggregate Sility and Soil Carbon in Semi-Arid Rangeland. Environ. Pollut. 2002, 116, 445–455. [Google Scholar] [CrossRef]
Evans, J.P.; Geerken, R. Classifying Rangeland Vegetation Type and Coverage Usitabng a Fourier Component Based Similarity Measure. Remote Sens. Environ. 2006, 105, 1–8. [Google Scholar] [CrossRef]
Ünal, E.; Mermer, A.; Yildiz, H. Assessment of Rangeland Vegetation Condition from Time Series NDVI Data. J. Field Crops Cent. Res. Inst. 2014, 23, 14–21. [Google Scholar] [CrossRef] [Green Version]
Huang, F.; Wang, P. Vegetation Change of Ecotone in West of Northeast China Plain Using Time-Series Remote Sensing Data. Chin. Geogr. Sci. 2010, 20, 167–175. [Google Scholar] [CrossRef]
Wang, Y.; Zang, S.; Tian, Y. Mapping Paddy Rice with the Random Forest Algorithm Using MODIS and SMAP Time Series. Chaos Solitons Fractals 2020, 140, 110116. [Google Scholar] [CrossRef]
Mangiarotti, S.; Sharma, A.K.; Corgne, S.; Hubert-Moy, L.; Ruiz, L.; Sekhar, M.; Kerr, Y. Can the Global Modeling Technique Be Used for Crop Classification? Chaos Solitons Fractals 2018, 106, 363–378. [Google Scholar] [CrossRef]
Fathizad, H.; Tazeh, M.; Kalantari, S.; Shojaei, S. The Investigation of Spatiotemporal Variations of Land Surface Temperature Based on Land Use Changes Using NDVI in Southwest of Iran. J. Afr. Earth Sci. 2017, 134, 249–256. [Google Scholar] [CrossRef]
Ahmed, K.R.; Akter, S.; Marandi, A.; Schüth, C. A Simple and Robust Wetland Classification Approach by Using Optical Indices, Unsupervised and Supervised Machine Learning Algorithms. Remote Sens. Appl. Soc. Environ. 2021, 23, 100569. [Google Scholar] [CrossRef]
Triscowati, D.W.; Sartono, B.; Kurnia, A.; Domiri, D.D.; Wijayanto, A.W. Multitemporal Remote Sensing Data for Classification of Food Crops Plant Phase Using Supervised Random Forest. In Proceedings of the Sixth Geoinformation Science Symposium; International Society for Optics and Photonics, Yogyakarta, Indonesia, 26–27 August 2019; Volume 11311, p. 1131102. [Google Scholar]
Uehara, T.D.T.; Soares, A.R.; Quevedo, R.P.; Körting, T.S.; Fonseca, L.M.G.; Adami, M. Land Cover Classification of an Area Susceptible to Landslides Using Random Forest and NDVI Time Series Data. In Proceedings of the IGARSS 2020—2020 IEEE International Geoscience and Remote Sensing Symposium, Waikoloa, HI, USA, 26 September–2 October 2020; pp. 1345–1348. [Google Scholar]
Breiman, L.; Cutler, A. Random Forests Manual V4; Technical Report; UC Berkeley: Berkeley, NJ, USA, 2003. [Google Scholar]
Peerbhay, K.; Mutanga, O.; Lottering, R.; Ismail, R. Mapping Solanum Mauritianum Plant Invasions Using WorldView-2 Imagery and Unsupervised Random Forests. Remote Sens. Environ. 2016, 182, 39–48. [Google Scholar] [CrossRef]
Lopez, C.; Tucker, S.; Salameh, T.; Tucker, C. An Unsupervised Machine Learning Method for Discovering Patient Clusters Based on Genetic Signatures. J. Biomed. Inform. 2018, 85, 30–39. [Google Scholar] [CrossRef]
Rousseeuw, P.J. Silhouettes: A Graphical Aid to the Interpretation and Validation of Cluster Analysis. J. Comput. Appl. Math. 1987, 20, 53–65. [Google Scholar] [CrossRef]
Bezdek, J.C.; Pal, N.R. Some New Indexes of Cluster Validity. IEEE Trans. Syst. Man Cybern. Part B 1998, 28, 301–315. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Frey, B.J.; Dueck, D. Clustering by Passing Messages between Data Points. Science 2007, 315, 972–976. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Hurst, H.E. Long-Term Storage Capacity of Reservoirs. Trans. Am. Soc. Civ. Eng. 1951, 116, 770–799. [Google Scholar] [CrossRef]
Kantelhardt, J.W.; Zschiegner, S.A.; Koscielny-Bunde, E.; Havlin, S.; Bunde, A.; Stanley, H.E. Multifractal Detrended Fluctuation Analysis of Nonstationary Time Series. Phys. A Stat. Mech. Its Appl. 2002, 316, 87–114. [Google Scholar] [CrossRef] [Green Version]
Igbawua, T.; Zhang, J.; Yao, F.; Ali, S. Long Range Correlation in Vegetation Over West Africa From 1982 to 2011. IEEE Access 2019, 7, 119151–119165. [Google Scholar] [CrossRef]
Kalisa, W.; Igbawua, T.; Henchiri, M.; Ali, S.; Zhang, S.; Bai, Y.; Zhang, J. Assessment of Climate Impact on Vegetation Dynamics over East Africa from 1982 to 2015. Sci. Rep. 2019, 9, 16865. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Ba, R.; Song, W.; Lovallo, M.; Lo, S.; Telesca, L. Analysis of Multifractal and Organization/Order Structure in Suomi-NPP VIIRS Normalized Difference Vegetation Index Series of Wildfire Affected and Unaffected Sites by Using the Multifractal Detrended Fluctuation Analysis and the Fisher-Shannon Analysis. Entropy 2020, 22, 415. [Google Scholar] [CrossRef] [Green Version]
Emamian, A.; Rashki, A.; Kaskaoutis, D.G.; Gholami, A.; Opp, C.; Middleton, N. Assessing Vegetation Restoration Potential under Different Land Uses and Climatic Classes in Northeast Iran. Ecol. Indic. 2021, 122, 107325. [Google Scholar] [CrossRef]
Liu, Y.; Hou, E.; Yue, H. Dynamic Monitoring and Trend Analysis of Vegetation Change in Shendong Mining Area Based on MODIS. Remote Sens. Land Resour. 2017, 132–137. [Google Scholar] [CrossRef]
Zhang, X.; Liu, R.; Gan, F.; Wang, W.; Ding, L.; Yan, B. Evaluation of Spatial-Temporal Variation of Vegetation Restoration in Dexing Copper Mine Area Using Remote Sensing Data. In Proceedings of the IGARSS 2020—2020 IEEE International Geoscience and Remote Sensing Symposium, Waikoloa, HI, USA, 26 September–2 October 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 2013–2016. [Google Scholar]
Guo, X.; Zhang, H.; Yuan, T.; Zhao, J.; Xue, Z. Detecting the Temporal Scaling Behavior of the Normalized Difference Vegetation Index Time Series in China Using a Detrended Fluctuation Analysis. Remote Sens. 2015, 7, 12942–12960. [Google Scholar] [CrossRef]
Martinez-Mena, M.; Castillo, V.; Albaladejo, J. Hydrological and Erosional Response to Natural Rainfall in a Semi-arid Area of South-east Spain. Hydrol. Process. 2001, 15, 557–571. [Google Scholar] [CrossRef]
Dargie, T.C.D. On the Integrated Interpretation of Indirect Site Ordinations: A Case Study Using Semi-Arid Vegetation in Southeastern Spain. Vegetatio 1984, 55, 37–55. [Google Scholar] [CrossRef]
Barceló, A.M.; Nunes, L.F. Atlas Climático Ibérico—Iberian Climate Atlas 1971–2000. AEMET: Madrid, Spain, 2009; ISBN 9788478370795. [Google Scholar]
Fondo Español de Garantía Agraria Visor SigPac (FEGA). Madrid. Spain. Available online: http://www.fega (accessed on 20 January 2022).
Team, A. Application for Extracting and Exploring Analysis Ready Samples (AppEEARS). NASA EOSDIS Land Processes Distributed Active Archive Center (LP DAAC), USGS/Earth Resources Observation and Science (EROS) Center: Sioux Falls, SD, USA. 2020. Available online: https://lpdaacsvc.cr.usgs.gov/appeears/ (accessed on 2 June 2020).
R Core Team. R: A Language and Environment for Statistical Computing; R Core Team: Vienna, Austria, 2021. [Google Scholar]
Savitzky, A.; Golay, M.J.E. Smoothing and Differentiation of Data by Simplified Least Squares Procedures. Anal. Chem. 1964, 36, 1627–1639. [Google Scholar] [CrossRef]
EU-DEM v1.1. Available online: https://land.copernicus.eu/imagery-in-situ/eu-dem/eu-dem-v1.1?tab=download (accessed on 4 March 2022).
ESRI ArcGIS Desktop. Release 10.8.1; ESRI: Redlands, CA, USA, 2020. [Google Scholar]
Borchers, H.W.; Borchers, M.H.W. Package ‘Pracma’. 2019. Available online: https://CRAN.R-project.org/package=pracma (accessed on 4 March 2022).
Zhou, Z.; Ding, Y.; Shi, H.; Cai, H.; Fu, Q.; Liu, S.; Li, T. Analysis and Prediction of Vegetation Dynamic Changes in China: Past, Present and Future. Ecol. Indic. 2020, 117, 106642. [Google Scholar] [CrossRef]
Kendall, M.G. Rank Correlation Methods; Charles Griffin & Co.: London, UK, 1975. [Google Scholar]
Li, X.; Lanorte, A.; Lasaponara, R.; Lovallo, M.; Song, W.; Telesca, L. Fisher–Shannon and Detrended Fluctuation Analysis of MODIS Normalized Difference Vegetation Index (NDVI) Time Series of Fire-Affected and Fire-Unaffected Pixels. Geomat. Nat. Hazards Risk 2017, 8, 1342–1357. [Google Scholar] [CrossRef]
Sanz, E.; Saa-Requejo, A.; Díaz-Ambrona, C.H.; Ruiz-Ramos, M.; Rodríguez, A.; Iglesias, E.; Esteve, P.; Soriano, B.; Tarquis, A.M. Normalized Difference Vegetation Index Temporal Responses to Temperature and Precipitation in Arid Rangelands. Remote Sens. 2021, 13, 840. [Google Scholar] [CrossRef]
Hubert, L.; Arabie, P. Comparing Partitions. J. Classif. 1985, 2, 193–218. [Google Scholar] [CrossRef]
Vavrek, M.J. Fossil: Palaeoecological and Palaeogeographical Analysis Tools. Palaeontol. Electron. 2011, 14, 16. [Google Scholar]
Lloyd, S. Least Squares Quantization in PCM. IEEE Trans. Inf. Theory 1982, 28, 129–137. [Google Scholar] [CrossRef] [Green Version]
MacQueen, J. Some Methods for Classification and Analysis of Multivariate Observations. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Oakland, CA, USA, 1 January 1967; Volume 1, pp. 281–297. [Google Scholar]
MacKay, D. An Example Inference Task: Clustering. In Information Theory, Inference and Learning Algorithms; Cambridge University Press: Cambridge, UK, 2003; Volume 20, pp. 284–292. [Google Scholar]
Hartigan, J.A.; Wong, M.A. Algorithm AS 136: A k-Means Clustering Algorithm. J. R. Stat. Soc. Ser. Appl. Stat. 1979, 28, 100–108. [Google Scholar] [CrossRef]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Breiman, L. Bagging Predictors. Mach. Learn. 1996, 24, 123–140. [Google Scholar] [CrossRef] [Green Version]
Breiman, L.; Cutler, A. Random Forests-Classification Description. Available online: http://stat-www.berkeley.edu/users/breiman/RandomForests/cc_home.htm (accessed on 21 January 2022).
Liaw, A.; Wiener, M. Classification and Regression by RandomForest. R News 2002, 2, 18–22. [Google Scholar]
Ng, A. Clustering with the K-Means Algorithm. Mach. Learn. 2012.
Ludwig, J.A.; Tongway, D.J. Viewing Rangelands as Landscape Systems. In Rangeland Desertification; Springer: Amsterdam, The Netherlands, 2000; pp. 39–52. [Google Scholar]
Tongway, D.J.; Ludwig, J.A. The Nature of Landscape Dysfunction in Rangelands. In Landscape Ecology: Function and Management: Principles from Australia’s Rangelands; CSIRO Publishing: Melbourne, Australia, 1997. [Google Scholar]
Coronado, A.V.; Carpena, P. Size Effects on Correlation Measures. J. Biol. Phys. 2005, 31, 121–133. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Schwinning, S.; Parsons, A.J. The Stability of Grazing Systems Revisited: Spatial Models and the Role of Heterogeneity. Funct. Ecol. 1999, 13, 737–747. [Google Scholar] [CrossRef]
Martens, S.N.; Breshears, D.D.; Meyer, C.W. Spatial Distributions of Understory Light along the Grassland/Forest Continuum: Effects of Cover, Height, and Spatial Pattern of Tree Canopies. Ecol. Model. 2000, 126, 79–93. [Google Scholar] [CrossRef]
Stavi, I.; Ungar, E.D.; Lavee, H.; Sarah, P. Grazing-Induced Spatial Variability of Soil Bulk Density and Content of Moisture, Organic Carbon and Calcium Carbonate in a Semi-Arid Rangeland. Catena 2008, 75, 288–296. [Google Scholar] [CrossRef]
Le Houérou, H.N.; Bingham, R.L.; Skerbek, W. Relationship between the Variability of Primary Production and the Variability of Annual Precipitation in World Arid Lands. J. Arid Environ. 1988, 15, 1–18. [Google Scholar] [CrossRef]

Figure 1. Location of the study area. (a) Selected Provinces. (b) Selected agricultural regions of Almeria and Murcia. (c) Selected pixels in three agricultural regions of Almería and Murcia. Source basemap: Invierno 2020. Gobierno de España y Comunidad Autónoma de Murcia. CC-BY 4.0 scne.es 2020.

Figure 2. Correlation matrix of all variables tested for the three study regions. Large, dark blue circles indicate a high correlation, while small, light blue circles indicate a low correlation. Ph2/3/5 stands for Phase 2/3/5, and Var for variance.

Figure 3. Elbow method on the selected variables using k-means clustering. Three- and four-cluster analyses were performed.

Figure 4. Silhouette plots for all pixels using k-means for three (a) and four clusters (b), showing the Silhouette Index (y-axis) for all pixels for each cluster represented on the x-axis with H2. The same results were obtained when k-means were run with HI or without HI/H2.

Figure 5. Elbow method of the selected variables using partitioning around medoids clustering, where three and four clusters were selected.

Figure 6. Silhouette plots for the different analyses performed with URF. On the left are those performed with H2, from DFA; the analyses performed with HI, from R/S, are on the right. The top graphics are for three clusters and the bottom graphics are for four. Silhouette plots show the Silhouette Index (y-axis) for all pixels for each cluster represented on the x-axis.

Figure 7. Comparison of H2 and HI for all clusters when URF was used with H2 (top) and HI (bottom) in all study areas.

Figure 8. Slope and elevation comparison for all clusters when URF (a) and k-means (b) were used with H2 in all study areas.

Figure 9. (a) On the top are the clustering results of URF in the Murcia-NW when HI (a) or H2 (b) was used, showing cluster 1 and 2 present in this region, while cluster three was not present in this area. (c) compares the differences in clustering when HI (bottom) and H2 (top) were used in URF for all the study areas.

Figure 10. Time series of cluster prototypical pixels in: (a) the cluster type (selected based on their vegetation type: mixed shrubland for cluster 1, open woodland for cluster 2 and grassland for cluster 3); (b) the average of each cluster; and (c) the variances for each cluster and phase, based on the NDVI dynamics following [64].

Table 1. Average Silhouette Indexes for 3 and 4 clusters for k-means and optimised URF.

Analysis	K-Means		Unsupervised Random Forest
Analysis	3 Clusters	4 Clusters	3 Clusters	4 Clusters
Without H2/HI	0.33	0.34	0.51	0.49
With H2	0.33	0.34	0.62	0.47
With HI	0.33	0.34	0.50	0.45

Table 2. Correlations between H2 and HI with elevation, slope, and variances from phases 5 and 2.

Hurst Exponent	Elevation	Slope	Var_Ph5	Var_Ph2
H2	−0.81	−0.53	0.54	0.29
HI	−0.25	−0.07	0.05	0.21

Table 3. Percentages of vegetation type of the selected pixels, based on the National Forest Map. Results for each cluster are based on URF with H2.

Cluster	Woodland	Shrubland	Grassland
1	48.0	7.7	44.3
2	99.7	0.3	0.0
3	15.5	4.4	80.1

Table 4. Percentages of Mann–Kendall results for each cluster based on URF with H2.

Significance	Cluster 1	Cluster 2	Cluster 3
Significant decrease	6.2%	0 %	2.2%
Not significant	26.5%	10%	34.3%
Significant increase	67.3%	90%	63.5%

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Sanz, E.; Sotoca, J.J.M.; Saa-Requejo, A.; Díaz-Ambrona, C.H.; Ruiz-Ramos, M.; Rodríguez, A.; Tarquis, A.M. Clustering Arid Rangelands Based on NDVI Annual Patterns and Their Persistence. Remote Sens. 2022, 14, 4949. https://doi.org/10.3390/rs14194949

AMA Style

Sanz E, Sotoca JJM, Saa-Requejo A, Díaz-Ambrona CH, Ruiz-Ramos M, Rodríguez A, Tarquis AM. Clustering Arid Rangelands Based on NDVI Annual Patterns and Their Persistence. Remote Sensing. 2022; 14(19):4949. https://doi.org/10.3390/rs14194949

Chicago/Turabian Style

Sanz, Ernesto, Juan José Martín Sotoca, Antonio Saa-Requejo, Carlos H. Díaz-Ambrona, Margarita Ruiz-Ramos, Alfredo Rodríguez, and Ana M. Tarquis. 2022. "Clustering Arid Rangelands Based on NDVI Annual Patterns and Their Persistence" Remote Sensing 14, no. 19: 4949. https://doi.org/10.3390/rs14194949

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Clustering Arid Rangelands Based on NDVI Annual Patterns and Their Persistence

Abstract

1. Introduction

2. Materials and Methods

2.1. Area of Study

2.2. Data Collection

2.2.1. NDVI Data

2.2.2. GIS Data

2.3. Fractal Analysis

2.3.1. Rescale Ranged Hurst Exponent

2.3.2. Multifractal Detrended Fluctuation Analysis

2.4. Variable Selection for Clustering

2.5. Clustering

2.5.1. K-Means

2.5.2. Unsupervised Random Forest

3. Results

3.1. Variable Selection Approach

3.2. Clustering Analysis

3.2.1. K-Means

3.2.2. Unsupervised Random Forest

3.2.3. Cluster Characterisation

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Acknowledgments

Conflicts of Interest

Appendix A. Principal Component Analyses Made to Select the Most Explanatory Variables

Appendix B. The Silhouette Indexes Calculated for All URF Changing Mtry and Number of Trees for Three Clusters

Appendix C. Maps of the Clusters Using URF with HI and H2 for the Three Study Provinces

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI