Next Article in Journal
Relative Merits of Optimal Estimation and Non-Linear Retrievals of Sea-Surface Temperature from MODIS
Next Article in Special Issue
Self-Supervised Denoising for Real Satellite Hyperspectral Imagery
Previous Article in Journal
Graph-Based Deep Multitask Few-Shot Learning for Hyperspectral Image Classification
Previous Article in Special Issue
Hyperspectral Image Denoising via Adversarial Learning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Reconstruction of Vegetation Index Time Series Based on Self-Weighting Function Fitting from Curve Features

1
State Key Laboratory of Remote Sensing Science, Jointly Sponsored by Beijing Normal University and Aerospace Information Research Institute of Chinese Academy of Sciences, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China
2
Beijing Engineering Research Center for Global Land Remote Sensing Products, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China
3
Academy of Disaster Reduction and Emergency Management, Ministry of Emergency Management & Ministry of Education, School of National Safety and Emergency Management, Beijing Normal University, Beijing 100875, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2022, 14(9), 2247; https://doi.org/10.3390/rs14092247
Submission received: 10 February 2022 / Revised: 26 April 2022 / Accepted: 29 April 2022 / Published: 7 May 2022
(This article belongs to the Special Issue Remote Sensing Image Denoising, Restoration and Reconstruction)

Abstract

:
Vegetation index (VI) time series derived from satellite sensors have been widely used in the estimation of vegetation parameters, but the quality of VI time series is easily affected by clouds and poor atmospheric conditions. The function fitting method is a widely used effective noise reduction technique for VI time series, but it is vulnerable to noise. Thus, ancillary data about VI quality are utilized to alleviate the interference of noise. However, this approach is limited by the availability, accuracy, and application rules of ancillary data. In this paper, we aimed to develop a new reconstruction method that does not require ancillary data. Based on the assumptions that VI time series follow the gradual growth and decline pattern of vegetation dynamics, and that clouds or poor atmospheric conditions usually depress VI values, we proposed a reconstruction method for VI time series based on self-weighting function fitting from curve features (SWCF). SWCF consists of two major procedures: (1) determining a fitting weight for each VI point based on the curve features of the VI time series and (2) implementing the weighted function fitting to reconstruct the VI time series. The double logistic function, double Gaussian function, and polynomial function were tested in SWCF based on a simulated dataset. The results indicate that the weighted function fitting with SWCF outperformed the corresponding unweighted function fitting with the root-mean-square error (RMSE) significantly reduced by 26.82–52.44% (p < 0.05), and it also outperformed the Savitzky–Golay filtering with the RMSE significantly reduced by 13.98–54.04% (p < 0.05) for 270 sample points selected in mid-high latitudes of the Northern Hemisphere. Moreover, SWCF showed excellent robustness and applicability in regional applications.

1. Introduction

Vegetation index time series (e.g., normalized-difference vegetation index, NDVI [1]) derived from satellite sensors have been widely used in the estimation of land surface vegetation parameters (e.g., fractional vegetation coverage, leaf area index, vegetation phenology, and vegetation productivity) and the monitoring of land cover and land use change [2,3,4,5,6]. However, the quality of vegetation index (VI) time series is easily affected by the disturbances caused by cloud contamination and atmospheric conditions [7]. Although the maximum value composition [8] and the cloud detection identification algorithm [9] are often applied for the preprocessing of VI products, significant residual noise still remains [10]. Therefore, a number of noise-reduction techniques have been developed to reduce noise and reconstruct high-quality VI time series.
According to the reconstruction principle, the noise-reduction techniques can be divided into four categories [11]: (1) temporal filtering methods, such as Savitzky–Golay filtering [12], mean-value iterative filtering (MVI) [10], moving average filtering (MA) [13], changing-weight filtering [14], best index slope extraction (BISE) [15], and modified BISE [16]; (2) temporal function fitting methods, such as the double logistic function fitting [17], double Gaussian function fitting [18], and polynomial function fitting [19]; (3) frequency-based methods, such as the harmonic analysis of time series (HANTS) [20], fast Fourier transform (FFT) [21], and wavelet transform (WT) [22]; and (4) spatiotemporal methods, such as the spatial–temporal Savitzky–Golay (STSG) method [23]. Although these methods have been successfully applied, certain drawbacks greatly limit their applications. For the frequency-based methods, they assume that the high frequency components are usually noise. They may reduce some reasonable high values when there are frequent variations, so they will fail to maintain vegetation signals completely. For the spatiotemporal methods, for example, STSG uses spatially adjacent pixels to restore the VI of target pixels, but it will fail when large spatially continuous clouds occur. This case happens often on high spatial resolution images, thus greatly limiting the application scope of STSG. For the temporal filtering methods and some frequency-based methods, there are usually no objective rules used for setting the key parameters such as the width of the smoothing window and the degree of the smoothing polynomial for the Savitzky–Golay filtering, the sliding period and acceptable threshold value for BISE, the size of moving window for MA, the key threshold value for MVI, and the control parameters for HANTS. Researchers need to empirically determine the reasonable values for these key parameters according to their study area, which may result in new uncertainties.
For the temporal function fitting methods, their key parameters can be automatically optimized by iterative nonlinear least square fitting, which has less dependence on manual settings. Moreover, the temporal function fitting methods have the advantage of expressing the VI time series as a function, which helps to interpret vegetation signals. However, the function fitting methods are vulnerable to noisy VI points contaminated by clouds and poor atmospheric conditions. Therefore, ancillary data about VI quality are often used to alleviate the interference of noise before fitting. Ancillary data are used to (1) replace the VI points identified as clouds in ancillary data with the linear interpolation values of their neighboring VI points [3,24] and (2) set a fitting weight for each VI point according to the signal-to-noise ratio indicated in ancillary data [25,26,27]. These approaches can effectively improve the performance of the function fitting methods, but meanwhile, are greatly limited by the availability, accuracy, and application rules of ancillary data. Ancillary data are generated by some noise detection algorithms and have limited accuracy [12,25,28]. In particular, the accuracy of ancillary data is not sufficiently high when the background is ice or snow in identifying clouds. For example, the accuracy for the C code based on the Function of Mask algorithm (CFMASK) in identifying clouds is 64.09% in this case [29]. Moreover, some VI datasets may lack available ancillary data about VI quality, such as the GIMMS NDVI dataset [30]. Furthermore, it varies greatly in the marker rules of VI quality for different ancillary data, for which it is difficult to set a uniform weighting method for different datasets [25,26].
In fact, the VI time-series data itself contains the quality information of each data point. That is, good-quality VI points usually follow the gradual process of vegetation dynamics, while suddenly dropping VI points are more likely to be contaminated by clouds and poor atmospheric conditions [12,31]. This pattern provides a new idea to estimate the signal-to-noise ratio of each VI data point, thus avoiding the limitations of ancillary data. Therefore, this study proposes a reconstruction method based on self-weighting function fitting from curve features (SWCF) to reduce noise in VI time series. SWCF utilizes the curve features of VI time series to generate the weight for each VI data point and then implements the weighted function fitting, which is expected to reconstruct high-quality VI time series without using ancillary data.

2. Materials and Methods

2.1. Data Sources

The MOD09GA (version 6) dataset was used to test the newly proposed SWCF. It included daily 500 m surface reflectance data and a 1 km cloud flag as ancillary data. The red and near-infrared reflectance bands were used to calculate NDVI, and the cloud flags were utilized to indicate whether the pixel was contaminated by clouds. NDVI and cloud flag data were used for the simulation experiment in the pixel-level evaluation (Section 2.3.1) and the region-level evaluation (Section 2.3.2). The MOD09A1 (Version 6) product, the 8 day maximum synthetic product derived from the MOD09GA dataset, was used to obtain the demonstrated NDVI time series curve in Section 2.2.1 for the clear introduction to the new method. The demonstrated NDVI time series curve consists of 46 VI points for one year.
The MCD12Q1 (version 6, 500 m) data product provided the annual global land covers identified as the IGBP (International Geosphere Biosphere Programme) classification. IGBP classes for the period of 2004–2019 were used for the selection of vegetation samples. These two datasets are both acquired from the NASA’s Land Processes Distributed Active Archive Center (LP DAAC, https://lpdaac.usgs.gov, accessed on 1 August 2021).

2.2. Algorithm of SWCF

Ideally, a noise-free VI time series changes gradually and continuously, which is consistent with the vegetation dynamics of growth and decline [12,31]. Therefore, SWCF is based on two assumptions: (1) that a yearly VI time series primarily indicates the seasonal dynamics of vegetation and (2) that clouds and poor atmospheric conditions usually tend to depress VI values, causing sudden drops in VI time series. SWCF utilizes the curve features of VI time series to determine a fitting weight for each VI data point and then implements the weighted function fitting to reconstruct the VI time series. The double logistic function, double Gaussian function, and polynomial function (Appendix B) are used for fitting in SWCF. Apparently, the core point in SWCF is how to obtain an appropriate fitting weight indicating the confidence level for each VI data point.

2.2.1. Self-Weighting from the VI Curve Features

SWCF exploits the signal-to-noise ratio of each data point in the VI time series to generate its weight (Figure 1). In line with the two assumptions mentioned above, during one vegetation growing season, VI points consistent with the vegetation variation pattern usually have higher signal-to-noise ratios, while suddenly dropping VI points incompatible with the gradual process are inclined to be degraded by noise to varying degrees, owning relatively lower signal-to-noise ratios. The weight is introduced as an indicator of the signal-to-noise ratio, with a higher signal-to-noise ratio leading to a higher weight.
(1)
Definition of gradually changing points and suddenly dropping points
Following the growth and decline of vegetation during one growing season, the VI temporal profiles will rise and fall monotonically, while clouds and poor atmospheric conditions will cause drops in VI time series, thereby disturbing the trend. Based on this VI temporal change pattern, VI points rising monotonically from both ends to the peak are defined as gradually changing points, while others are defined as suddenly dropping points (Figure 2). The gradually changing points tend to contain actual vegetation signals and the suddenly dropping points are more likely to be disturbed by noise to varying extents.
The specific algorithm is described as follows. As shown in Figure 2, the VI time series is divided into the rising and falling parts (green curve and brown curve, respectively) by the VI peak point. In the rising part, starting from the first point to the peak point, if the VI value is larger than or equal to the maximum VI value of all previous points, then it will be defined as the gradually changing point; otherwise, it will be defined as the suddenly dropping point. Similarly, in the falling part, the gradually changing points and the suddenly dropping points are defined starting from the last point to the peak point. In Figure 2, the light blue solid points are cloud-contaminated VI marked in ancillary data, and DOY denotes the Julian day of year.
(2)
Calculation of the weights for the VI time series
The weight is set to 1 for the gradually changing point (Equation (1)).
W GCP = 1
where WGCP is the weight of the gradually changing point.
For the suddenly dropping point, the weight is determined by two features: one is the magnitude of the drop (Δh), and the other is the temporal proximity to the VI peak point (P) (Figure 3). The weight of the suddenly dropping point is determined by the following four steps:
(I) Linear stretching of the original VI time series. Since the data value range of the VI time series differs greatly in pixels, the linear stretching is adopted to uniformly transform the original VI time series to the same data value range to ensure that various curves with different amplitudes can be processed automatically (Equation (2)).
V I LS   = V I V I min V I max V I min × R
where VILS denotes the stretched VI value, VI is the original VI value, R (Range) is the maximum value of VILS, and VImax and VImin indicate the maximum and minimum values of the original VI time series, respectively. The optimal stretching range R will be determined by experimental tests (see details in Section 2.3.3).
(II) Definition of Δh. Δh is defined as the vertical distance to the straight line formed by two adjacent stretched gradually changing points (Equation (3)). Δh reflects the deviation of a suddenly dropping point from the vegetation seasonal change trend and is used to estimate the signal-to-noise ratio. As shown in Figure 3, A and B are suddenly dropping points (red solid points); Δh1 and Δh2 are their vertical distances to the straight line formed by two adjacent stretched gradually changing points, respectively.
Δ h = ( V I LS _ GCP 1 + ( D O Y SDP D O Y GCP 1 ) × ( V I LS _ GCP 2 V I LS _ GCP 1 D O Y GCP 2 D O Y GCP 1 ) ) V I LS _ SDP
where Δh is the dropping magnitude of the suddenly dropping point; VILS_SDP, VILS_GCP1 and VILS_GCP2 are the VILS values of the suddenly dropping point and its adjacent gradually changing points; and DOYSDP, DOYGCP1 and DOYGCP2 are the DOYs of the suddenly dropping point and its adjacent gradually changing points. For the VILS data points that span to the next year, their DOYs should be added with 365. Generally, a larger Δh indicates a lower signal-to-noise ratio.
(III) Definition of P. P shows the temporal proximity to the VILS peak, which can be calculated by their DOYs (Equation (4)). As shown in Figure 3, P1 and P2 represent the temporal proximity of A and B to M, respectively; M is the peak VILS point; DOYA, DOYB and DOYM are the DOYs of A, B, and M, respectively; and DOYmin and DOYmax are the DOYs of the first and last VI points, respectively.
P = { D O Y SDP D O Y min D O Y m D O Y min ,     when   D O Y SDP < D O Y m D O Y max D O Y SDP D O Y max D O Y m ,     when   D O Y SDP > D O Y m
where P is the temporal proximity of the suddenly dropping point to the peak point; DOYSDP and DOYm are the DOYs of the suddenly dropping point and peak VILS point, respectively; and DOYmin and DOYmax represent the minimum and maximum values of the DOY series, respectively. For the VI data points that span to the next year, their DOYs should be added with 365. Obviously, P ranges from 0 to 1, and a larger P indicates that the suddenly dropping point is closer to the peak of vegetation growth.
(IV) Calculation of the weight for the suddenly dropping point. For the suddenly dropping point, higher noise levels and closer temporal proximity to the peak of growth lead to relatively lower weights. Since Δh and P are two independent variables, we take the multiplication combination of these two to comprehensively calculate the weights in Equation (5):
W SDP = { 1 Δ h × P ,     Δ h × P < 1 0 ,     Δ h × P 1
where WSDP is the weight of the suddenly dropping point; Δh is the dropping magnitude of suddenly dropping point; and P is the temporal proximity of the suddenly dropping point to the peak VI point. Figure 4 shows the weight series for the VI time series.

2.2.2. Weighted Function Fitting

The double logistic function, double Gaussian function, and polynomial function (Appendix B) are used for fitting in SWCF. The parameters for these functions are optimized with the weighted least squares fitting process. It should be noted that SWCF emphasizes the importance of vegetation signals and lowers the weights of contaminated data points, which distinguishes SWCF from the unweighted function fitting methods.

2.2.3. Software Implementations

In this study, the self-weighting method and the function fitting methods are implemented in Google Earth Engine, and the code is available on the website provided in Appendix C. Other common softwares for remote sensing processing (e.g., Python, Matlab and RStudio) are also convenient for the implementation of SWCF.

2.3. Evaluation of SWCF

2.3.1. Evaluation at the Pixel Level

Similar to the evaluation techniques for noise-reduction methods [32,33,34], a simulation experiment was conducted to compare SWCF with other noise-reduction methods. At each test pixel, a reference daily NDVI time series was constructed from the average of multiyear noise-free daily NDVI data. Then, simulated noise would be added to the reference NDVI time series to test the performance of all methods.
(1)
Preselection of test pixels
According to the MCD12Q1 IGBP land cover types, we randomly selected 50 test pixels with a constant IGBP class during 2004–2019 for each land cover type. These types included evergreen needleleaf forests, deciduous needleleaf forests, deciduous broadleaf forests, mixed forests, shrublands, savannas, and grasslands across mid-high latitudes of the Northern Hemisphere.
(2)
Construction of the reference NDVI curve
A. Division of growing season
Since the annual vegetation cycle may not coincide with the calendar year, that is, the growing season may span two years, the actual start date of a growing season must be determined. To highlight the seasonal changes in vegetation, the 8-day maximum value composition was performed on daily NDVI time series to reduce noise levels, and then the 7-point Gaussian filtering was applied to generate a long-term NDVI time series in 2004–2019. Two adjacent growing seasons can be divided by the DOY corresponding to the lowest NDVI point in every year. As shown in Figure 5a, the green points are the minimum NDVI values between adjacent growing seasons from 2004 to 2007, and Date1, Date2, and Date3 are the corresponding dates, respectively. Here, we took the average multiyear DOYs in 2004–2019 as the average start date of all growing seasons.
B. Construction of reference NDVI time series
The reference daily NDVI time series was constructed from the temporal average of multiyear cloud-free NDVI values on the same date in all growing seasons. Since the cloud flag data cannot include all cases in which NDVI data points are contaminated, the 7-point Gaussian filtering was applied to remove potential noisy data and produce a smoother curve (Figure 5b).
(3)
Classifications of the reference NDVI time series
When the environmental conditions differ a lot, even for the same land cover type, the curve shapes of NDVI time series would be quite different [35]. To comprehensively evaluate the performance of SWCF, here we defined 9 curve types (see Table 1) based on the width and amplitude of a growing season derived from the reference NDVI time series (Appendix A). As shown in Table 1, A1W1, A1W2, A1W3, A2W1, A2W2, A2W3, A3W1, A3W2 and A3W3 are curve types of the reference NDVI time series, classified by the width and amplitude of a growing season. The threshold values of 210 and 260 are the 33rd and 66th percentile values of the widths of growing season derived from the reference NDVI time series for all preselected sample points, respectively, and the threshold values of 0.3 and 0.5 are the 33rd and 66th percentile values of the amplitudes of growing season derived from the reference NDVI time series for all preselected sample points, respectively. These nine curve types included almost all possible curve shapes of any land cover type and explicitly reflected the vegetation growth states. Then 30 samples were randomly selected for each curve type from previous preselected test pixels (Figure 6). These selected samples were used to evaluate the performance of different reconstruction methods.
(4)
Construction of the noise NDVI time series
A noise NDVI time series was generated by introducing random noise to a reference NDVI curve. At each test pixel, we simulated a group of random noise according to the actual occurrence of noise. Then, they were added to reduce the NDVI values of the reference NDVI time series based on the reality that clouds and atmospheric interference tend to degrade NDVI.
The noise simulation experiment consisted of the determinations of the number, location, and intensity of noise. The number depended on the average of annual cloudy days in 2004–2019. In a descending order of cloud probability per day, the corresponding DOYs were selected as the noise introduction locations (Figure 7a). That is, for a day, the higher probability of cloud occurrence made it more likely to be selected. For an NDVI temporal profile of the same day over multiple years, we assumed that its level of noise followed the normal distribution. Therefore, the noise intensity should be within 0–3 times its standard deviation. At each location, a random number within 0–3 times its standard deviation was independently generated as the noise intensity.
At each noise introduction location, we subtracted the noise intensity from the NDVI value of the reference NDVI time series (Equation (6)), thus generating a noise NDVI time-series (Figure 7b).
N D V I _ n o i s e i = N D V I _ r e f i N o i s e i
where NDVI_noise and NDVI_ref represent the noise NDVI time series and the reference NDVI time series, respectively. Noise is the simulated noise, and i is the DOY of the noise introduction location.
(5)
Reconstruction of the noise NDVI time series
At each test pixel, the noise NDVI time series was reconstructed by the weighted function fitting methods with SWCF, unweighted function fitting methods and Savitzky–Golay filtering, respectively (Appendix B).
(6)
Evaluation of the reconstructed NDVI time series
At each test pixel, the qualitative and quantitative analyses were carried out to evaluate the reconstructed NDVI time series. First, we gave a visual inspection of the overlap degrees between the reference NDVI time series and the reconstructed NDVI time series. Then, the root-mean-square error (RMSE) was used to quantitatively evaluate the performance of each method. The RMSE reflects the error of the reconstructed time series relative to the reference time series (Equation (7)).
RMSE = i = 1 365 ( N D V I _ r e f i N D V I _ f i t i ) 2 365
where RMSE is the root-mean-square error of the reconstructed NDVI time series, NDVI_ref and NDVI_fit represent the reference NDVI time series and the reconstructed NDVI time series, respectively, and i is the ordinal number of points in the yearly NDVI time series. A smaller RMSE indicates a better capacity to restore the noisy NDVI to its true value. For each curve shape type (Table 1), the paired sample t-test is used to analyze whether the RMSE values for these methods are significantly different.

2.3.2. Evaluation at the Region Level

To test the applicability of SWCF applied in large regions, we selected a test area (the geographic center is at 96.3°E, 33.8°N) on the Tibetan Plateau which is easily affected by clouds but rarely disturbed by human beings. The test area includes 782 × 345 MOD09GA 500 m pixels, and its major land cover type is natural grasslands according to IGBP classifications. The NDVI image on 5 August 2018 was considerably contaminated by clouds. For comparison, the weighted double logistic function fitting with SWCF and the unweighted double logistic function fitting were applied to reconstruct this NDVI image, respectively. Then, we visually inspected their reconstructed NDVI images by referencing the average cloud-free NDVI images on the same day in 2017 and 2019.

2.3.3. Determination of the Optimal Stretching Range for SWCF

In SWCF, the VI time series should be linearly stretched to a uniform range (0–R). The optimal R was determined according to the performance of SWCF at different values of R. Similar to the evaluation at the pixel level, we selected a new batch of samples (Figure 8) and constructed a reference NDVI time series and a noise NDVI time series for each of them. For all test pixels, when R varied from 0–30, the reduction rates of RMSE values (Equation (8)) were calculated for the weighted function fitting with SWCF relative to the unweighted function fitting. Obviously, the maximum reduction rate indicated the optimal R for SWCF.
D = R M S E un R M S E w _ R R M S E un × 100
where D is the reduction rate of RMSEw_R, RMSEun denotes the RMSE for unweighted function fitting, and RMSEw_R represents the RMSE for weighted function fitting with SWCF.

3. Results

3.1. The Optimal Stretching Range of VI Time Series

Figure 9 shows the reduction rates of RMSE values when R varies from 2 to 28 in steps of 2. When R varied from 2 to 10, the RMSE reduction rates for the weighted double logistic function fitting, double Gaussian function fitting, and polynomial function fitting with SWCF showed obvious increases, which indicated that the reconstruction accuracy improved rapidly. When R varied from 10 to 28, the RMSE reduction rates for the weighted double logistic function fitting and polynomial function fitting with SWCF decreased gradually, while those for the weighted double Gaussian function fitting with SWCF did not change obviously. Overall, the weighted function fitting with SWCF showed the best performance when R was equal to 10. Therefore, the optimal stretching range of VI time series is set to 10 for SWCF.

3.2. The Reconstruction Performance at the Pixel Level

Figure 10 shows the NDVI time series reconstructed by the weighted double logistic function fitting with SWCF. For the 9 curve shape types, the reference NDVI time series and the reconstructed time series by SWCF had the highest degree of overlap. In particular, they were quite close in width and amplitude. SWCF could retain the noise-free NDVI well and restore most noisy NDVIs to their true values. In contrast, the unweighted double logistic function fitting and Savitzky–Golay filtering were greatly affected by noise, and their reconstructed NDVI time series showed a large displacement from the reference NDVI time series. In addition, the Savitzky–Golay filtering may introduce some new noise when reconstructing noise NDVI time series with a high noise level (e.g., the curve shape types A1W2 and A1W3 in Figure 10).
As we can see from Table 2, for the nine curve shape types, the RMSE values for the weighted double logistic function fitting, polynomial function fitting and double Gaussian function fitting with SWCF were all significantly reduced (p < 0.05) by 33.95–52.44%, 29.63–49.42%, and 26.82–51.75% compared with those for the corresponding unweighted ones, respectively, and also significantly reduced (p < 0.05) by 26.87–54.04%, 16.90–48.82%, and 13.98–51.18% compared with those for the Savitzky–Golay filtering, respectively. On the whole, the weighted double logistic function fitting with SWCF showed the best performance on reconstructing NDVI time series of various shapes.

3.3. The Reconstruction Performance at the Regional Level

In comparison with the unweighted double logistic function fitting, the weighted double logistic function fitting with SWCF reconstructed a NDVI image closer to that of the same day in the adjacent years (Figure 11). Obviously, the weighted double logistic function fitting with SWCF could restore most cloud-contaminated NDVIs to their actual values, showing great robustness and effectiveness in regional applications (Appendix C).

4. Discussion

4.1. The Advantages and Potential Applications of SWCF

SWCF is free from ancillary data. SWCF exploits the signal-to-noise ratio information contained in the VI time series itself to generate weight series for fitting. Distinguished from the existing weighting methods [25,26,27], SWCF is free from auxiliary data, thus avoiding potential uncertainties in ancillary data.
SWCF has a strong self-adaptive capacity. In the pixel-level evaluation, SWCF is applicable for 270 selected samples containing all kinds of vegetation types across the mid-high latitudes of the Northern Hemisphere. In addition, the weighted double logistic function fitting with SWCF shows excellent robustness and applicabilities in a grassland test area on the Tibetan Plateau including 782 × 345,500 m pixels from MOD09GA data.
SWCF can reconstruct high-quality VI time series. Evaluation at the pixel level indicates that the weighted function fitting with SWCF outperforms the corresponding unweighted function fitting with the RMSE significantly reduced by 26.82–52.44% (p < 0.05), and it also outperforms the Savitzky–Golay filtering with the RMSE significantly reduced by 13.98–54.04% (p < 0.05). In the regional evaluation, compared with the unweighted double logistic function fitting, the weighted double logistic function fitting with SWCF shows better performance in preserving the original noise-free NDVIs and restoring most cloud-contaminated NDVIs to their actual values.
SWCF can contribute in VI-based studies. The VI time series are important data sources for the estimation of land surface vegetation parameters (e.g., fractional vegetation cover, leaf area index, vegetation phenology, and vegetation productivity) and the monitoring of land cover and land use change [2,3,4,5,6]. SWCF is able to generate high-quality VI time series, thus having great application potential in VI-based studies mentioned above. For example, the cloud-contaminated VI will result in the underestimation of fractional vegetation cover when using the pixel dichotomy model [36]. SWCF can be used to reconstruct the VI time series and restore the cloud-contaminated VI to the actual value, thus contributing in the accurate estimation of fractional vegetation cover.

4.2. Limitations and Prospects

SWCF may have difficulty in fitting VI time series with an irregular or inconspicuous temporal profile due to the limitations of function models. Moreover, proper initial values for function parameters are necessary in nonlinear function fitting [26,27]. In regional applications, a set of fixed initial parameter values may not be applicable for reconstructing VI time series with different curve shapes.
SWCF is designed based on a single vegetation growth cycle. Therefore, it is necessary to first divide the long-term VI time series into each single vegetation growth cycle before applying SWCF. For example, when reconstructing a whole year VI time series with the vegetation growth cycle spanning two years, the VI time series for this year and the adjacent two years should be used to cover a whole vegetation growth cycle. Then, SWCF can be applied for each complete vegetation growth cycle. Finally, we can extract the VI time series of this year from the reconstructed results. Similarly, this strategy can also be applied for the reconstruction of VI time series with multiple growth cycles in a single year.
SWCF assumes that higher VI values usually have higher signal-to-noise ratios, which may sometimes be invalid. For example, satellite data transmission errors may cause abnormally high NDVI values [15], and excessive atmospheric correction may also lead to spurious EVI high values in cloud-contaminated pixels [37]. Therefore, some priori knowledge is required to preprocess the VI time-series data before applying SWCF. For example, an NDVI value that increases more than 0.4 in 20 days should be removed because such a large increase is unlikely to be caused by vegetation changes [12], and spurious EVI values larger than NDVI should be rejected because EVI should be smaller than NDVI under normal conditions [38]. These approaches can help to identify high-value noisy VIs, thus ensuring the effectiveness of applying SWCF.

5. Conclusions

This study proposed a new reconstruction method based on self-weighting function fitting from curve features (SWCF) to reduce noise in the vegetation index time series. SWCF is free from ancillary data. For the vegetation index time series of various shapes, the weighted function fitting with SWCF reduced the root-mean-square error significantly by 26.82–52.47% (p < 0.05) compared with the corresponding unweighted function fitting methods and reduced the root-mean-square error significantly by 13.98–54.04% (p < 0.05) compared with the Savitzky–Golay filtering method. Moreover, SWCF was strongly robust and effective in regional applications. It showed an excellent denoising and self-adaptive capacity and could be applicable to reconstruct high-quality vegetation index time series.

Author Contributions

W.Z. and B.H. were responsible for the conceptualization of the study and analysis of the data; Z.X. and C.Z. were responsible for validation; H.Z. and P.L. contributed in writing and reviewing. All authors have read and agreed to the published version of the manuscript.

Funding

This study is funded by the Second Tibetan Plateau Scientific Expedition and Research Program (STEP) (Grant No. 2019QZKK0606).

Data Availability Statement

Data is available upon request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Definition of the Width and Amplitude of a Vegetation Growth Cycle

The amplitude of a vegetation growth cycle is defined as the difference between the peak NDVI values and the average of preseasonal and postseasonal background NDVI values (Figure A1, Equation (A1)).
Amp = N D V I max 0.5 × ( N D V I min 1 + N D V I min 2 )
where Amp is the amplitude of a vegetation growth cycle, NDVImin1 is the average of NDVI values for the first 30 days in the reference NDVI time series, NDVImin2 is the average of NDVI values for the last 30 days in the reference NDVI time series, and NDVImax is the average of the first five maximum NDVI values in the reference NDVI time series. The amplitude indicates the extent of the seasonal changes in vegetation during a growing season.
The width of a vegetation growth cycle is defined by reference to the calculation of the length of the growing season (Figure A1) [25,35]. The start of a season is defined from the reference NDVI time series as the point in time for which the NDVI value has increased by 20% of the distance between the preseasonal background NDVI values and the peak NDVI values, above the pre-seasonal background NDVI values. The end of a season is defined in a similar way. Then, the length of the growing season is defined as the difference between the end date of a season and the start date of a season. Here, the width of a vegetation growth cycle is equal to the length of the growing season.
Figure A1. Definition of the width and amplitude of a vegetation growth cycle from the NDVI timeseries curve. NDVImin1 is the average of NDVI values for the first 30 days in the reference NDVI time series, NDVImin2 is the average of NDVI values for the last 30 days in the reference NDVI time series, and NDVImax is the average of the first five maximum NDVI values in the reference NDVI time series.
Figure A1. Definition of the width and amplitude of a vegetation growth cycle from the NDVI timeseries curve. NDVImin1 is the average of NDVI values for the first 30 days in the reference NDVI time series, NDVImin2 is the average of NDVI values for the last 30 days in the reference NDVI time series, and NDVImax is the average of the first five maximum NDVI values in the reference NDVI time series.
Remotesensing 14 02247 g0a1

Appendix B. Existing Reconstruction Methods Used in This Study

(1)
Double logistic function fitting
The VI time series data are fitted by the double logistic function (Equation (A2)) [17].
f ( t ) = v 0 + v 1 1 + exp ( m 1 + n 1 t ) v 2 1 + exp ( m 2 + n 2 t )
where t is the DOY of the VI. v0, v1, v2, m1, n1, m2, and n2 are the seven function parameters to be optimized. v0 is the preseasonal background VI value, v1 is the difference between the peak VI value and the preseasonal background VI value, and v2 is the difference between the peak VI value and the postseasonal background VI value. The pair-parameters (i.e., m1 and n1, m2 and n2) capture the trends of the green-up and senescence phases of vegetation growth, respectively.
(2)
Double Gaussian function fitting
Two Gaussian functions are overlaid to model the VI time series (Equation (A3)) [18], and each Gaussian function is determined by the peak height ai, peak position bi, and peak width ci.
f ( t ) = a 1 exp ( ( t b 1 c 1 ) 2 ) + a 2 exp ( ( t b 2 c 2 ) 2 )
where t is the DOY of the VI. a1, b1, c1, a2, b2 and c2 are the function parameters to be optimized.
(3)
Polynomial function fitting
A polynomial function is used to fit the VI time series (Equation (A4)) [19].
f ( t ) = a 0 + a 1 t + a 2 t 2 + + a n t n
where t is the DOY of the VI. a0, a1, …, an are the function parameters to be optimized, and n is set to 6 in this paper.
(4)
Savitzky–Golay filtering
Savitzky–Golay filtering is a weighted moving average filter for smoothing VI time series [39]. The VI values within each moving window are fitted by a higher-order polynomial function, and the original VI value at the center of the sliding window is replaced by the fitted value. The general equation of fitting progress is shown in Equation (A5):
Y j * = i = m i = m C i × Y j + i N
where Y is the original VI value, Y* is the resultant VI value, Ci is the coefficient given by the Savitzky–Golay filtering, m is the half-width of the moving window, N is the size of the moving window (2m + 1), and j is the running index of the original ordinate data table. Ci can be calculated from the equations presented by [40]. In this paper, the size of the moving window is set to 91 days and the degree of the polynomial function is set to 6 [12].

Appendix C

The SWCF method is programmed in Google Earth Engine, and the code is available on the website (https://code.earthengine.google.com/52c85a1da4449c7d8582bf0a2711b4d9, accessed on 15 December 2021).

References

  1. Tucker, C.J. Red and photographic infrared linear combinations for monitoring vegetation. Remote Sens. Environ. 1979, 8, 127–150. [Google Scholar] [CrossRef] [Green Version]
  2. Defries, R.S.; Townshend, J.R.G. NDVI-derived land cover classifications at a global scale. Int. J. Remote Sens. 1994, 15, 3567–3586. [Google Scholar] [CrossRef]
  3. Zhang, X.; Friedl, M.A.; Schaaf, C.B.; Strahler, A.H.; Hodges, J.C.F.; Gao, F.; Reed, B.C.; Huete, A. Monitoring vegetation phenology using MODIS. Remote Sens. Environ. 2003, 84, 471–475. [Google Scholar] [CrossRef]
  4. Wang, J.; Rich, P.M.; Price, K.P.; Kettle, W.D. Relations between NDVI and tree productivity in the central Great Plains. Int. J. Remote Sens. 2004, 25, 3127–3138. [Google Scholar] [CrossRef]
  5. Wang, Q.; Adiku, S.; Tenhunen, J.; Granier, A. On the relationship of NDVI with leaf area index in a deciduous forest site. Remote Sens. Environ. 2005, 94, 244–255. [Google Scholar] [CrossRef]
  6. Lyon, J.G.; Yuan, D.; Lunetta, R.S.; Elvidge, C.D. A change detection experiment using vegetation indices. Photogramm. Eng. Remote Sens. 1998, 64, 143–150. [Google Scholar]
  7. Kobayashi, H.; Dye, D.G. Atmospheric conditions for monitoring the long-term vegetation dynamics in the Amazon using normalized difference vegetation index. Remote Sens. Environ. 2005, 97, 519–525. [Google Scholar] [CrossRef]
  8. Holben, B.N. Characteristics of maximum-value composite images from temporal AVHRR data. Int. J. Remote Sens. 1986, 7, 1417–1434. [Google Scholar] [CrossRef]
  9. Stowe, L.L.; McClain, E.P.; Carey, R.; Pellegrino, P.; Gutman, G.G.; Davis, P.; Long, C.; Hart, S. Global distribution of cloud cover derived from NOAA/AVHRR operational satellite data. Adv. Space Res. 1991, 11, 51–54. [Google Scholar] [CrossRef]
  10. Ma, M.; Veroustraete, F. Reconstructing pathfinder AVHRR land NDVI time-series data for the Northwest of China. Adv. Space Res. 2006, 37, 835–840. [Google Scholar] [CrossRef]
  11. Li, S.; Xu, L.; Jing, Y.; Yin, H.; Li, X.; Guan, X. High-quality vegetation index product generation: A review of NDVI time series reconstruction techniques. Int. J. Appl. Earth Observ. Geoinf. 2021, 105, 102640. [Google Scholar] [CrossRef]
  12. Chen, J.; Jönsson, P.; Tamura, M.; Gu, Z.; Matsushita, B.; Eklundh, L. A simple method for reconstructing a high-quality NDVI time-series data set based on the Savitzky-Golay filter. Remote Sens. Environ. 2004, 91, 332–344. [Google Scholar] [CrossRef]
  13. Jönsson, P.; Eklundh, L. TIMESAT—A program for analyzing time-series of satellite sensor data. Comput. Geosci. 2004, 30, 833–845. [Google Scholar] [CrossRef] [Green Version]
  14. Zhu, W.; Pan, Y.; He, H.; Wang, L.; Mou, M.; Liu, J. A Changing-Weight Filter Method for Reconstructing a High-Quality NDVI Time Series to Preserve the Integrity of Vegetation Phenology. IEEE Trans. Geosci. Remote Sens. 2012, 50, 1085–1094. [Google Scholar] [CrossRef]
  15. Viovy, N.; Arino, O.; Belward, A.S. The Best Index Slope Extraction (BISE): A method for reducing noise in NDVI time-series. Int. J. Remote Sens. 1992, 13, 1585–1590. [Google Scholar] [CrossRef]
  16. Lovell, J.L.; Graetz, R.D. Filtering Pathfinder AVHRR Land NDVI data for Australia. Int. J. Remote Sens. 2001, 22, 2649–2654. [Google Scholar] [CrossRef]
  17. Beck, P.S.A.; Atzberger, C.; Høgda, K.A.; Johansen, B.; Skidmore, A.K. Improved monitoring of vegetation dynamics at very high latitudes: A new method using MODIS NDVI. Remote Sens. Environ. 2006, 100, 321–334. [Google Scholar] [CrossRef]
  18. Li, M.; Sheng, Y. Study on application of Gaussian fitting algorithm to building model of spectral analysis. Guang Pu Xue Yu Guang Pu Fen Xi = Guang Pu 2008, 28, 2352–2355. [Google Scholar]
  19. Piao, S.; Fang, J.; Zhou, L.; Ciais, P.; Zhu, B. Variations in satellite-derived phenology in China’s temperate vegetation. Glob. Change Biol. 2006, 12, 672–685. [Google Scholar] [CrossRef]
  20. Roerink, G.J.; Menenti, M.; Verhoef, W. Reconstructing cloudfree NDVI composites using Fourier analysis of time series. Int. J. Remote Sens. 2000, 21, 1911–1917. [Google Scholar] [CrossRef]
  21. Menenti, M.; Azzali, S.; Verhoef, W.; van Swol, R. Mapping agroecological zones and time-lag in vegetation growth by means of Fourier-analysis of time-series of NDVI images. Adv. Space Res. 1993, 13, 233–237. [Google Scholar] [CrossRef]
  22. Lu, X.; Liu, R.; Liu, J.; Liang, S. Removal of noise by wavelet method to generate high quality temporal data of terrestrial MODIS products. Photogramm. Eng. Remote Sens. 2007, 73, 1129–1139. [Google Scholar] [CrossRef] [Green Version]
  23. Cao, R.; Chen, Y.; Shen, M.; Chen, J.; Zhou, J.; Wang, C.; Yang, W. A simple method to improve the quality of NDVI time-series data by integrating spatiotemporal information with the Savitzky-Golay filter. Remote Sens. Environ. 2018, 217, 244–257. [Google Scholar] [CrossRef]
  24. Zhang, X. Reconstruction of a complete global time series of daily vegetation index trajectory from long-term AVHRR data. Remote Sens. Environ. 2015, 156, 457–472. [Google Scholar] [CrossRef]
  25. Jönsson, P.; Eklundh, L. Seasonality extraction by function fitting to time-series of satellite sensor data. IEEE Trans. Geosci. Remote Sens. 2002, 40, 1824–1832. [Google Scholar] [CrossRef]
  26. Yang, Y.; Luo, J.; Huang, Q.; Wu, W.; Sun, Y. Weighted Double-Logistic Function Fitting Method for Reconstructing the High-Quality Sentinel-2 NDVI Time Series Data Set. Remote Sens. 2019, 11, 2342. [Google Scholar] [CrossRef] [Green Version]
  27. Fisher, J.I.; Mustard, J.F.; Vadeboncoeur, M.A. Green leaf phenology at Landsat resolution: Scaling from the field to the satel-lite. Remote Sens. Environ. 2006, 100, 265–279. [Google Scholar] [CrossRef]
  28. Claverie, M.; Ju, J.; Masek, J.G.; Dungan, J.L.; Vermote, E.F.; Roger, J.; Skakun, S.V.; Justice, C. The Harmonized Landsat and Sentinel-2 surface reflectance data set. Remote Sens. Environ. 2018, 219, 145–161. [Google Scholar] [CrossRef]
  29. Foga, S.; Scaramuzza, P.L.; Guo, S.; Zhu, Z.; Dilley, R.D.; Beckmann, T.; Schmidt, G.L.; Dwyer, J.L.; Joseph Hughes, M.; Laue, B. Cloud detection algorithm comparison and validation for operational Landsat data products. Remote Sens. Environ. 2017, 194, 379–390. [Google Scholar] [CrossRef] [Green Version]
  30. Julien, Y.; Sobrino, J.A. Comparison of cloud-reconstruction methods for time series of composite NDVI data. Remote Sens. Environ. 2010, 114, 618–625. [Google Scholar] [CrossRef]
  31. Reed, B.C.; Brown, J.F.; VanderZee, D.; Loveland, T.R.; Merchant, J.W.; Ohlen, D.O. Measuring phenological variability from satellite imagery. J. Veg. Sci. 1994, 5, 703–714. [Google Scholar] [CrossRef]
  32. Rahman, M.S.; Di, L.; Shrestha, R.; Yu, E.G.; Lin, L.; Kang, L.; Deng, M. Comparison of selected noise reduction techniques for MODIS daily NDVI: An empirical analysis on corn and soybean. In Proceedings of the 2016 Fifth International Conference on Agro-Geoinformatics (Agro-Geoinformatics), Tianjin, China, 18–20 July 2016; pp. 1–5. [Google Scholar]
  33. Atkinson, P.M.; Jeganathan, C.; Dash, J.; Atzberger, C. Inter-comparison of four models for smoothing satellite sensor time-series data to estimate vegetation phenology. Remote Sens. Environ. 2012, 123, 400–417. [Google Scholar] [CrossRef]
  34. Hird, J.N.; McDermid, G.J. Noise reduction of NDVI time series: An empirical comparison of selected techniques. Remote Sens. Environ. 2009, 113, 248–258. [Google Scholar] [CrossRef]
  35. White, M.A.; de Beurs, K.M.; Didan, K.; Inouye, D.W.; Richardson, A.D.; Jensen, O.P.; O’Keefe, J.; Zhang, G.; Nemani, R.R.; van Leeuwen, W.J.D.; et al. Intercomparison, interpretation, and assessment of spring phenology in North America estimated from remote sensing for 1982–2006. Global Chang. Biol. 2009, 15, 2335–2359. [Google Scholar] [CrossRef]
  36. Qi, J.; Marsett, R.C.; Moran, M.S.; Goodrich, D.C.; Heilman, P.; Kerr, Y.H.; Dedieu, G.; Chehbouni, A.; Zhang, X.X. Spatial and temporal dynamics of vegetation in the San Pedro River basin area. Agric. For. Meteorol. 2000, 105, 55–68. [Google Scholar] [CrossRef] [Green Version]
  37. Justice, C.O.; Román, M.O.; Csiszar, I.; Vermote, E.F.; Wolfe, R.E.; Hook, S.J.; Friedl, M.; Wang, Z.; Schaaf, C.B.; Miura, T.; et al. Land and cryosphere products from Suomi NPP VIIRS: Overview and status. J. Geophys. Res. Atmos. 2013, 118, 9753–9765. [Google Scholar] [CrossRef] [Green Version]
  38. Jiang, Z.; Huete, A.; Didan, K.; Miura, T. Development of a two-band enhanced vegetation index without a blue band. Remote Sens. Environ. 2008, 112, 3833–3845. [Google Scholar] [CrossRef]
  39. Savitzky, A.; Golay, M. Smoothing and differentiation of data by simplified least squares procedures. Anal. Chem. 1964, 36, 1627–1639. [Google Scholar] [CrossRef]
  40. Madden, H.H. Comments on the Savitzky-Golay convolution method for least-squares-fit smoothing and differentiation of digital data. Anal. Chem. 1978, 50, 1383–1386. [Google Scholar] [CrossRef]
Figure 1. The implementation flowchart for self-weighting from the VI curve features.
Figure 1. The implementation flowchart for self-weighting from the VI curve features.
Remotesensing 14 02247 g001
Figure 2. Illustration of the definition of the gradually changing points and suddenly dropping points in a VI time series curve.
Figure 2. Illustration of the definition of the gradually changing points and suddenly dropping points in a VI time series curve.
Remotesensing 14 02247 g002
Figure 3. Determination of the features of the suddenly dropping points.
Figure 3. Determination of the features of the suddenly dropping points.
Remotesensing 14 02247 g003
Figure 4. Generated weight series for the original VI time series. For comparison, the points labeled as cloud contamination in ancillary data are marked with a blue background.
Figure 4. Generated weight series for the original VI time series. For comparison, the points labeled as cloud contamination in ancillary data are marked with a blue background.
Remotesensing 14 02247 g004
Figure 5. Division of the growing season and the constructed reference NDVI time series. (a) The division of the growing season, and (b) the constructed reference NDVI time series.
Figure 5. Division of the growing season and the constructed reference NDVI time series. (a) The division of the growing season, and (b) the constructed reference NDVI time series.
Remotesensing 14 02247 g005
Figure 6. Distribution of 270 sample points for the 9 curve types classified by the width and amplitude of a growing season.
Figure 6. Distribution of 270 sample points for the 9 curve types classified by the width and amplitude of a growing season.
Remotesensing 14 02247 g006
Figure 7. Construction of the noise NDVI time series. (a) Histogram of the cloud probability on a day-by-day basis at a sample site for the years 2004–2019, and (b) the noise NDVI time series. In (b), the black curve is the reference NDVI time series, the blue curve is the noise NDVI time series, and the gray area is the day-by-day NDVI standard deviation of a sample point in 2004–2019.
Figure 7. Construction of the noise NDVI time series. (a) Histogram of the cloud probability on a day-by-day basis at a sample site for the years 2004–2019, and (b) the noise NDVI time series. In (b), the black curve is the reference NDVI time series, the blue curve is the noise NDVI time series, and the gray area is the day-by-day NDVI standard deviation of a sample point in 2004–2019.
Remotesensing 14 02247 g007
Figure 8. Distribution of the test samples for determining the optimal R.
Figure 8. Distribution of the test samples for determining the optimal R.
Remotesensing 14 02247 g008
Figure 9. RMSE reduction rate of reconstructed NDVI time series under different linear stretch ranges (0-R). The overall change is the average reduction rate of the three functions.
Figure 9. RMSE reduction rate of reconstructed NDVI time series under different linear stretch ranges (0-R). The overall change is the average reduction rate of the three functions.
Remotesensing 14 02247 g009
Figure 10. Comparison of the reconstruction performance for different methods. A1W1, A1W2, A1W3, A2W1, A2W2, A2W3, A3W1, A3W2, and A3W3 refer to different types of NDVI curve shapes (Table 1). The main vegetation type corresponding to each curve shape type is indicated in the upper right corner of each subplot. ENF, DNF, DBF, MF, Shrub, Savannas, and Grass represent evergreen needleleaf forests, deciduous needleleaf forests, deciduous broadleaf forests, mixed forests, shrublands, savannas, and grasslands, respectively.
Figure 10. Comparison of the reconstruction performance for different methods. A1W1, A1W2, A1W3, A2W1, A2W2, A2W3, A3W1, A3W2, and A3W3 refer to different types of NDVI curve shapes (Table 1). The main vegetation type corresponding to each curve shape type is indicated in the upper right corner of each subplot. ENF, DNF, DBF, MF, Shrub, Savannas, and Grass represent evergreen needleleaf forests, deciduous needleleaf forests, deciduous broadleaf forests, mixed forests, shrublands, savannas, and grasslands, respectively.
Remotesensing 14 02247 g010
Figure 11. Application effect of the weighted double logistic function fitting with SWCF in the Tibetan Plateau region. (a) The NDVI image contaminated by clouds in the test area on 5 August 2018; (b) the average of NDVI images on 5 August 2017, and 2019; (c) the NDVI image reconstructed by the unweighted double logistic function fitting; (d) the NDVI image reconstructed by the weighted double logistic function fitting with SWCF. The white area in (c,d) is the part where the reconstruction fails.
Figure 11. Application effect of the weighted double logistic function fitting with SWCF in the Tibetan Plateau region. (a) The NDVI image contaminated by clouds in the test area on 5 August 2018; (b) the average of NDVI images on 5 August 2017, and 2019; (c) the NDVI image reconstructed by the unweighted double logistic function fitting; (d) the NDVI image reconstructed by the weighted double logistic function fitting with SWCF. The white area in (c,d) is the part where the reconstruction fails.
Remotesensing 14 02247 g011
Table 1. Classification of the reference NDVI time-series curves.
Table 1. Classification of the reference NDVI time-series curves.
AmplitudeWidth
Width ≤ 210 d210 d < Width ≤ 260 dWidth > 260 d
Amplitude ≤ 0.3A1W1A1W2A1W3
0.3 < Amplitude ≤ 0.5A2W1A2W2A2W3
Amplitude > 0.5 A3W1A3W2A3W3
Table 2. Comparison of Reconstruction Quality of Different Methods.
Table 2. Comparison of Reconstruction Quality of Different Methods.
Curve
Shape Types
Double Logistic Function FittingPolynomial Function FittingDouble Gaussian Function FittingSavitzky–Golay
Filtering
UnweightedWeightedRMSERMSEUnweightedWeightedRMSERMSEUnweightedWeightedRMSERMSE
RMSERMSEReductionReductionRMSERMSEReductionReductionRMSERMSEReductionReductionRMSE
Average
AverageAverageRate *
(%)
Rate #
(%)
AverageAverageRate *
(%)
Rate #
(%)
AverageAverageRate *
(%)
Rate #
(%)
A1W10.0574 0.0273 52.44 54.04 0.0601 0.0290 51.75 51.18 0.0601 0.0304 49.42 48.82 0.0594
A1W20.0662 0.0350 47.13 43.55 0.0630 0.0335 46.83 45.97 0.0652 0.0392 39.88 36.77 0.0620
A1W30.0667 0.0402 39.73 37.48 0.0629 0.0325 48.33 49.46 0.0779 0.0416 46.60 35.30 0.0643
A2W10.0922 0.0496 46.20 44.02 0.0969 0.0610 37.05 31.15 0.1023 0.0678 33.72 23.48 0.0886
A2W20.1051 0.0593 43.58 41.05 0.1058 0.0614 41.97 38.97 0.1144 0.0685 40.12 31.91 0.1006
A2W30.1157 0.0703 39.24 38.01 0.1136 0.0643 43.40 43.30 0.1177 0.0728 38.15 35.80 0.1134
A3W10.1022 0.0675 33.95 26.87 0.1085 0.0794 26.82 13.98 0.1090 0.0767 29.63 16.90 0.0923
A3W20.1072 0.0689 35.73 29.91 0.1073 0.0704 34.39 28.38 0.1127 0.0784 30.43 20.24 0.0983
A3W30.1196 0.0771 35.54 27.74 0.1122 0.0687 38.77 35.61 0.1168 0.0784 32.88 26.52 0.1067
Thirty sample points were selected for each curve type. For each sample point, the VI time series curve contained 365 VI points. * represents the RMSE reduction rate (%) of the weighted fitting reconstruction results relative to the unweighted fitting ones for the same curve shape type and function. # represents the RMSE reduction rate (%) of the weighted fitting reconstruction results relative to the Savitzky–Golay filtering method for the same curve shape type. A1W1, A1W2, A1W3, A2W1, A2W2, A2W3, A3W1, A3W2, and A3W3 refer to different curve shape types of NDVI time series (Table 1).
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Zhu, W.; He, B.; Xie, Z.; Zhao, C.; Zhuang, H.; Li, P. Reconstruction of Vegetation Index Time Series Based on Self-Weighting Function Fitting from Curve Features. Remote Sens. 2022, 14, 2247. https://doi.org/10.3390/rs14092247

AMA Style

Zhu W, He B, Xie Z, Zhao C, Zhuang H, Li P. Reconstruction of Vegetation Index Time Series Based on Self-Weighting Function Fitting from Curve Features. Remote Sensing. 2022; 14(9):2247. https://doi.org/10.3390/rs14092247

Chicago/Turabian Style

Zhu, Wenquan, Bangke He, Zhiying Xie, Cenliang Zhao, Huimin Zhuang, and Peixian Li. 2022. "Reconstruction of Vegetation Index Time Series Based on Self-Weighting Function Fitting from Curve Features" Remote Sensing 14, no. 9: 2247. https://doi.org/10.3390/rs14092247

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop