Next Article in Journal
A Perifacial EMG Acquisition System for Facial-Muscle-Movement Recognition
Next Article in Special Issue
NIR Spectral Inversion of Soil Physicochemical Properties in Tea Plantations under Different Particle Size States
Previous Article in Journal
Research on Three-Dimensional Shape Curve Reconstruction Technology for a Scraper Conveyor on an Intelligent Working Face
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Regional Inversion of Soil Heavy Metal Cr Content in Agricultural Land Using Zhuhai-1 Hyperspectral Images

1
School of Architecture and Urban Planning, Guangdong University of Technology, Guangzhou 510090, China
2
International Research Center of Big Data for Sustainable Development Goals, Beijing 100094, China
3
Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
4
School of Land and Space Information, Yunnan Land and Resources Vocational College, Kunming 652501, China
*
Author to whom correspondence should be addressed.
Sensors 2023, 23(21), 8756; https://doi.org/10.3390/s23218756
Submission received: 6 September 2023 / Revised: 22 October 2023 / Accepted: 25 October 2023 / Published: 27 October 2023
(This article belongs to the Special Issue Soil Sensing and Mapping for a Sustainable Future)

Abstract

:
With the development of hyperspectral imaging technology, the potential for utilizing hyperspectral images to accurately estimate heavy metal concentrations in regional soil has emerged. Currently, soil heavy metal inversion based on laboratory hyperspectral data has demonstrated a commendable level of accuracy. However, satellite images are susceptible to environmental factors such as atmospheric and soil background, presenting a significant challenge in the accurate estimation of soil heavy metal concentrations. In this study, typical chromium (Cr)-contaminated agricultural land in Shaoguan City, Guangdong Province, China, was taken as the study area. Soil sample collection, Cr content determination, laboratory spectral measurements, and hyperspectral satellite image collection were carried out simultaneously. The Zhuhai-1 hyperspectral satellite image spectra were corrected to match laboratory spectra using the direct standardization (DS) algorithm. Then, the corrected spectra were integrated into an optimal model based on laboratory spectral data and sample Cr content data for regional inversion of soil heavy metal Cr content in agricultural land. The results indicated that the combination of standard normal variate (SNV)+ uninformative variable elimination (UVE)+ support vector regression (SVR) model performed best with laboratory spectral data, achieving a high accuracy with an R2 of 0.97, RMSE of 5.87, MAE of 4.72, and RPD of 4.04. The DS algorithm effectively transformed satellite hyperspectral image data into spectra resembling laboratory measurements, mitigating the impact of environmental factors. Therefore, it can be applied for regional inversion of soil heavy metal content. Overall, the study area exhibited a low-risk level of Cr content in the soil, with the majority of Cr content values falling within the range of 36.21–76.23 mg/kg. Higher concentrations were primarily observed in the southeastern part of the study area. This study can provide useful exploration for the promotion and application of Zhuhai-1 image data in the regional inversion of soil heavy metals.

1. Introduction

Chromium (Cr) is a prevalent soil pollutant. Excessive Cr release into soil not only adversely impacts crop quality and human health but also jeopardizes the safety of the entire ecosystem [1,2]. Therefore, conducting a monitoring study on soil heavy metal Cr content is of great importance. Traditional field monitoring methods for soil heavy metal content are inefficient and expensive. These methods typically focus on monitoring specific points or soil profiles, making it challenging to accurately assess the spatial distribution of heavy metal content on a regional scale. However, with the development of hyperspectral imaging technology, it is possible to acquire large-scale and high-spectral-resolution remote-sensing images, which presents an opportunity for efficiently monitoring heavy metal pollution across extensive areas [3,4,5].
Currently, there are two main approaches for regional inversion of soil heavy metal content based on remote-sensing images: one that excludes the use of ground-measured laboratory spectral (GMLS) data and another that incorporates such data. The first approach involves the construction of models directly using soil sample heavy metal content data and remote sensing [6]. For instance, Guo et al. [7] utilized soil metal content data in conjunction with hyperspectral images from the GF-5 satellite to develop a random forest (RF) model for estimating the distribution of soil zinc (Zn) and nickel (Ni) concentrations. Yang Lingyu et al. [8] employed soil heavy metal content data and Hyperion images to create a partial least squares regression (PLSR) model, which they used to estimate the distribution of arsenic (As), lead (Pb), zinc (Zn), and cadmium (Cd). In a similar vein, Baihan et al. [9] used measured soil sample data and GF-5 hyperspectral satellite images to construct an extreme gradient boosting (XGBoost) inversion model, aided by genetic algorithm features, to assess the pollution status of heavy metal copper (Cu) in the Daxigou mining area of Shanxi Province, China. Liu et al. [10] employed hyperspectral images to accurately map the distribution of heavy metal contents (specifically As, Cd, and Hg) in the soil of Conghua District, Guangzhou, China. In addition to these models, support vector regression (SVR) [11,12] and artificial neural network (ANN) [8,13,14] are commonly used for this purpose. However, this approach is often associated with reduced reliability due to factors such as atmospheric and soil background interference during remote-sensing image acquisition. In light of these challenges, the second approach involves the integration of GMLS data. Many scholars have explored different spectral processing methods encompassing spectral pre-processing techniques such as Savitzky–Golay (SG) smoothing [15], SNV [16], multiplicative scatter correction (MSC) [17], first-order differential reflectance (FD) [18], and second-order differential reflectance (SD) [19]. Additionally, spectral selection methods such as principal component analysis (PCA) [20], competitive adaptive reweighted sampling (CARS) [21], UVE [22], and successive projections algorithm (SPA) [23] have been employed. These methods have significantly improved the accuracy of soil heavy metal inversion when referring to GMLS data in many studies. However, the variance in spectral resolution (i.e., the number of bands) between remote-sensing bands and GMLS data presents a challenge when applying these methods from GMLS data to remote-sensing images. Furthermore, the mixed-pixel effect complicates the direct application of models derived from ground spectra to remote-sensing images. Therefore, despite the successful models for soil heavy metal content estimation based on GMLS data, there are limited instances of their subsequent application to remote-sensing images.
In this study, Orbita hyperspectral (OHS) data from the Zhuhai-1 satellite, which has 32 spectral bands with a spatial resolution of 10 m, were used for regional soil heavy metal Cr content inversion combined with GMLS data in Shaoguan City, Guangdong Province, China. These rich spectral bands and high spatial resolution enabled us to integrate satellites with GMLS data for soil heavy metal inversion. The OHS data were calibrated using the direct standardization (DS) algorithm to ensure its alignment with the ground spectral data. Subsequently, by applying the optimal Cr inversion model derived from GMLS data and Cr content data, we achieve regional inversion of soil Cr content in agricultural land. The results can provide valuable technical support for the rapid and efficient monitoring of soil ecological conditions in agricultural land.

2. Materials and Methods

2.1. Study Area

The study area is located in a region highly impacted by Cr heavy metal pollution in Shaoguan City, Guangdong Province, China. Geographically, the center coordinate of the study area is 113.55° E longitude and 24.65° N latitude. Covering an area of 131.52 km2, the study area predominantly includes Zhangshi Town and its surrounding villages, characterized by a concentration of agricultural land parcels. The topography of the region primarily consists of mountains and basins, with higher elevations in the west and lower elevations in the east. Climatically, this area falls within the subtropical monsoon climate zone, marked by the prevalence of southwest and southeast monsoons during the summer and the influence of the northeast monsoon in winter. Since the 1950s, the establishment of heavy industries such as power plants, smelters, and steel factories has transformed this area, gradually making non-ferrous metal processing the dominant industry. Consequently, a substantial volume of industrial waste has been discharged into the soil, leading to serious heavy metal pollution in the agricultural land of this region [24,25]. In this study, an expansive area characterized by agricultural land and relatively flat terrain was deliberately selected. This choice aimed to minimize the potential impact of varying elevations and slopes on the experiment. Soil sampling was systematically conducted in areas with potential contamination, guided by the spatial distribution of the factories and the prevailing wind direction (Figure 1).

2.2. Workflow of Data Processing

This study consists of four main steps (Figure 2): (1) Data collection: The OHS images of the study area were acquired from the Orbit Zhuhai-1 remote sensing data service platform. We collected a total of 65 soil samples from various locations within the study area and then measured the concentration of the heavy metal Cr using graphite furnace atomic absorption spectrophotometry according to the GB/T 17141-1997 standard. These soil samples were then dispatched to the laboratory for analysis, where both the spectra and heavy metal content data of the samples were measured; (2) Model construction based on GMLS data: The spectral data and Cr content of 65 samples measured in the laboratory formed the foundation for model development. The dataset was partitioned into a training set and a validation set in a 7:3 ratio. The optimal combination of inversion models was determined through rigorous evaluation; (3) Hyperspectral image processing and calibration: Techniques, including geometry, radiation, and atmospheric calibration, as well as mixed-pixel decomposition methods, were employed to extract the bare land image from the OHS image. Additionally, the DS algorithm was applied to calibrate the OHS image with the GMLS data; (4) Regional mapping of Cr content: Building upon the optimal Cr inversion model derived from the previous step, we successfully achieved the regional soil heavy metal Cr content in the study area. The accuracy assessment was subsequently carried out through validation processes to verify the model’s performance.

2.3. Data Collection and Preprocessing

2.3.1. Soil Sample Collection and Processing

In accordance with the spatial distribution characteristics of the local agricultural land, this study collected a total of 65 soil samples in July 2021. These 65 sampling points were strategically arranged, adhering to the principles of uniformity and representativeness. The spatial distribution of these sampling points is visually presented in Figure 1. Upon collection, the collected soil samples underwent a systematic process. They were first air-dried and subsequently ground in the laboratory before being sieved through a 20-mesh sieve. Following this preparation, each soil sample was divided into two equal parts. One part was dedicated to the measurement of Cr content, while the other was reserved for spectral measurements.

2.3.2. Spectral Measurement and Preprocessing

The GMLS data were acquired using the ASD FieldSpec4 spectrometer (Analytical Spectral Devices Inc., Boulder, CO, USA). The spectral band range is 350–2500 nm, with a total of 2152 bands. For this study, we meticulously selected the 32 bands that corresponded to the Zhuhai-1 OHS data as the original soil spectral dataset. In the laboratory setting, the spectral measurements were conducted under controlled conditions. This involved employing a 1000 W halogen lamp as the light source. The observation angle between the light source and the vertical direction was set at 30°, with the distance between the light source and the sample consistently maintained at 30 cm. The probe was positioned vertically above the soil surface, maintaining a 5 cm distance from the soil’s surface. Each soil sample was measured in four directions, with rotations at 90° intervals each time. This process resulted in a total of 10 spectral curves being recorded for each direction. The actual spectral data utilized in this study was determined by averaging the 40 individual spectra obtained, ensuring the accuracy and reliability of the measurements.
During the process of acquiring soil spectra, various factors such as acquisition instruments, soil moisture, and external environment conditions can introduce interference into the spectral data. This interference can lead to a spectral curve containing unwanted noise, which will seriously affect the inversion accuracy. To mitigate this challenge, spectral preprocessing techniques were employed to eliminate the noise and enhance the spectral characteristics specific to soil heavy metal Cr. In this study, the first step involves the application of the Savitzky–Golay (SG) smoothing filter to eliminate the noise present in the original spectral data. Subsequently, six distinct spectral transformation methods were employed: SNV, MSC, FD, SD, discrete wavelet transform (DWT), and reciprocal logarithm (RL). These transformation methods served to highlight subtle differences within spectral curves and enhance the spectral features associated with soil heavy metal Cr [26,27]. Finally, the Pearson correlation coefficient (PCC) was utilized. PCC quantifies the strength of the linear correlation between variables X (the spectra) and Y (the soil heavy metal content). Its values range from −1 to 1, with larger absolute values indicating stronger correlations [28,29]. The method that demonstrates the highest correlation with the soil heavy metal content is selected, and feature bands are subsequently extracted from the transformed spectra using this chosen method. This approach ensures the effective enhancement of relevant spectral characteristics while minimizing noise, ultimately contributing to the accuracy of the study’s findings.

2.3.3. Image Data Acquisition and Preprocessing of Zhuhai-1

In this study, we employed Zhuhai-1 OHS data [30], which can be obtained at https://www.obtdata.com/#/index (accessed on 15 February 2023). The data comprises 32 spectral bands, characterized by a band interval of either 14 nm or 16 nm. The spectral resolution ranges from 400 nm to 1000 nm, and the spatial resolution is 10 m. Compared to widely used hyperspectral satellite data such as Hyperion, Modis, and HJ-1, the OHS image has a relatively higher spatial resolution (10 m). This increased spatial resolution plays a crucial role in reducing the influence of mixed-pixel effects on the results of heavy metal inversion from soil. Furthermore, it is available free of charge. The detailed parameters are shown in Table 1.
The OHS data underwent essential preprocessing steps, including geometric correction, radiometric calibration, and atmospheric correction. Geometric correction ensured that image errors were kept within half a pixel. Radiometric scaling converted a pixel’s digital number (DN) value into apparent reflectance data. Since the OHS images do not include atmospheric correction parameters, a swift atmospheric correction method was applied to mitigate the influence of factors such as water vapor, carbon dioxide, various atmospheric molecules, and aerosols. This correction process effectively retrieves surface reflectance data, enhancing the quality and reliability of the dataset for subsequent analysis.
In addition, the spatial resolution of the OHS images is 10 m, resulting in mixed pixels that encompass various ground features such as buildings and vegetation. This significantly impacts the accuracy of soil spectral information. To effectively tackle this challenge, a two-step process was undertaken. Firstly, through a combination of visual interpretation and on-site verification, distinct end members were identified on the images, representing pure soil, water, vegetation, clouds, and building land pixels. Subsequently, a fully constrained least squares mixed-pixel decomposition method was applied to these pixels. This approach yielded abundance maps for each of the land cover types [31,32]. The abundance within each pixel represents the proportion of each land cover type. The resulting soil abundance map is shown in Figure 3a, with pixels exhibiting soil abundance values surpassing 0.6 being earmarked as agricultural land for the regional inversion of soil heavy metals [33], as illustrated in Figure 3b.

2.4. Image Spectral Calibration Based on Direct Correction (DS) Algorithm

To mitigate the impact of uncertain environmental factors, the DS algorithm was employed to calibrate the OHS image spectrally. The DS algorithm is a common, simple, and effective method for removing environmental factors from spectral data. It calculates a transfer matrix based on the relationship between satellite spectra and laboratory spectra, allowing for the correction of satellite spectra to mitigate the influence of environmental factors like moisture, particle size, and temperature. Several studies have successfully employed the DS algorithm to correct satellite spectra, yielding favorable correction results [34,35,36]. The transfer matrix was determined by ArcGIS 10.3 software to extract the corresponding spectral data (XOHS) of the sampling point on the OHS bare land images and corresponding GMLS data (XLab). This matrix quantifies the deviation between OHS image spectral and GMLS data.
X Lab = X OHS B + E  
where B represents the transfer matrix between XLab and XOHS, and E is the residual matrix. If the parameters B and E of the DS algorithm are obtained, the entire OHS image spectral can be calibrated to spectra similar to the GMLS data through B and E.
X O H S = X O H S B + E
where X’OHS is the spectra of the OHS image spectral calibrated by the DS algorithm.
In addition, it is important to note that the representativeness and number of transfer samples can influence the performance of the DS algorithm, subsequently affecting the accuracy of the inversion model. To address this, the Kennard-Stone (KS) algorithm was employed to select a series of transfer samples from the pixels corresponding to the 26 bare soil sampling points. The number of transfer samples, which yields the best performance of the DS algorithm, was determined based on the inversion accuracy, assessed by the determination coefficient R2 [37].

2.5. Band Selection and Modeling

Previous research has underscored the importance of band selection and modeling methods in achieving precise hyperspectral inversion estimates of soil heavy metals. In this study, the KS algorithm was employed to partition the dataset, with 70% of the samples allocated to the training set for model development, and the remaining 30% earmarked as the verification set for assessing the model accuracy. In order to estimate the content of heavy metal Cr with high precision, four feature band selection algorithms, namely PCA [38], CARS [39], UVE [40], and SPA [41], were systematically combined with four modeling algorithms, including SVR [42], BPNN [43,44], PLSR [45,46] and RF [47,48], in various combinations (Figure 4). By comparing the accuracy of these diverse combinations, the optimal inversion model for heavy metal Cr was pinpointed.

2.6. Model Accuracy Evaluation

In this study, four metrics including coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and relative analysis error (RPD) [49] were used for evaluating the performance of the inversion model. The R2 reflects the fitting degree of the model, with values ranging from 0 to 1. A value closer to 1 indicates a higher degree of fit. RMSE represents the deviation between the predicted value and the true value. A smaller RMSE value signifies lower prediction error and higher model accuracy. MAE is the average value of the absolute value of all individual observations deviating from the arithmetic mean. A smaller MAE value indicates a superior predictive capacity of the model. The reliability of the model is assessed based on RPD. When RPD ≥ 2, the model is considered reliable. When 2 > RPD ≥ 1.5, the model is relatively reliable. If RPD < 1.5, the model is deemed unreliable [50,51].

3. Results

3.1. Descriptive Statistics of Cr Content

Descriptive statistics of the Cr content (mg/kg) measured from the in situ samples are shown in Table 2 below. The Cr content ranged from 8 to 139 mg/kg, with an average concentration of 47.28 mg/kg. When examining the spatial distribution, the coefficient of variation falls within the range of 0.5 to 0.75, indicating a moderate level of variability. This implies that the distribution of Cr in the soil is not uniform, and there exists significant spatial variability in its occurrence.

3.2. Optimal Inversion Model Combination

3.2.1. Optimal Spectral Transformation Method

PCC analysis was conducted on the Cr content and the original spectral data, as well as the spectral data after six transformations of SNV, MSC, FD, SD, DWT, and RL, using MATLAB. Figure 5 shows the results of the correlation analysis, which demonstrates that there is some improvement in the correlation after several spectral transformation processes. This indicates that the spectral transformation can effectively mitigate the impact of noise originating from factors such as instrument variations and external conditions. Among these spectral transformation methods, as shown in Table 3, SNV exhibited the highest maximum correlation coefficient value, achieving −0.804 at 922 nm, indicating an overall superior performance. Therefore, for this study, SNV transformation data were ultimately selected as the foundation dataset for feature band selection.

3.2.2. Optimal Inversion Model Combination

After applying the SNV transformation, four feature band selection methods—PCA, CARS, UVE, and SPA—were employed to extract the sensitive spectral response band associated with the Cr element. These methods yielded 6, 11, 7, and 10 feature bands, respectively, with most of them concentrated in the 866–940 nm wavelength range. Subsequently, the feature bands selection by these algorithms were utilized to estimate the Cr content using the SVR, BPNN, PLSR, and RF models respectively. The inversion results are shown in Table 4. Following a comparative analysis of model accuracy, the optimal model combination for Cr content inversion was determined to be SNV + UVE + SVR. The training model accuracy R2, MAE, and RMSE were 0.89, 7.55, and 11.60, respectively. Meanwhile, the verification model exhibited the following accuracy metrics: R2 of 0.97, RMSE of 5.87, MAE of 4.72, and RPD of 4.04. Notably, R2 = 0.97 > 0.66 and RPD = 4.04 > 2, indicating that the model combination has strong stability and reliability, and can be used for regional inversion of soil heavy metal Cr content.

3.3. DS Transfer Set Size Setting and Calibration

The performance of the DS algorithm is closely related to the number of samples in the transfer set. The determination coefficient R2 in the inversion accuracy of Cr content is used to determine the optimal DS transfer set size. The R2 values associated with different transfer set sizes are shown in Figure 6d. It can be seen from Figure 6d that the R2 value reaches its pinnacle when the transfer set size is 21. Consequently, this study has chosen to utilize 21 samples for calculating the transfer matrix.
Figure 6 presents a notable observation, where the spectral features of the 21 OHS image samples, calibrated by the DS algorithm (Figure 6b), closely resemble the corresponding GMLS data (Figure 6a). The results underscore the effectiveness of the DS algorithm in mitigating the inevitable environmental influences such as soil moisture content, soil particle size, and meteorological factors. The obtained transfer matrix can be applied to calibrate the entire OHS image spectra.

3.4. Inversion of Spatial Distribution of Heavy Metal Cr

Based on the optimal model combination (SNV + CARS + SVR) of the GMLS data and the calibrated entire OHS image spectral data, heavy metal Cr content inversion was carried out in the bare land area of the study area. The inversion result was categorized into five levels by using the natural discontinuity method, as shown in Figure 7. The result illustrates that the majority of Cr content falls within the range of 36.21–76.23 mg/kg. Higher concentrations of Cr elements were observed in the southeastern and southwestern part of the study area, ranging from 110.53 to 149.41 mg/kg, while the lower Cr content is primarily distributed in the northern region. This distribution may be closely related to the location of heavy industry enterprises in the area, the prevailing wind direction, and the flow direction of rivers. Enterprises are mainly situated in the northeastern part of the study area, with industrial emissions released through elevated chimneys. These emissions are influenced by the prevailing wind direction (northeast monsoon), and particles containing heavy metal elements are transferred by wind and rainfall, resulting in higher Cr element content in the southwestern part of the study area. Furthermore, enterprises are mainly situated in the upper reaches of the river, and industrial waste is transported by flowing water and adsorbed by sediment, leading to higher Cr element content in the lower reaches of the river, which is distributed in the southeast of the study area.
Twenty sets of inversion values and measured values were randomly selected to evaluate the accuracy of the regional inversion results for Cr content. The coefficient of determination (R2) and root mean square error (RMSE) were utilized for this analysis. The accuracy verification results (Figure 8) revealed an R2 value of 0.63 and an RMSE value of 20.20. These results indicate a reasonable level of reliability in the regional inversion results for soil heavy metal content in the study area [6].
According to the “Soil Environmental Quality Agricultural Land Soil Pollution Risk Control Standards (Trial)” (GB 15618-2018) of the People’s Republic of China, the pH values of the sampling points were examined. The majority of pH values in the study area were found to be below 6.5. Consequently, the corresponding risk screening value for Cr content was selected based on two pH ranges: PH ≤ 5.5 and 5.5 < PH ≤ 6.5. The risk screening value for both ranges is 250 mg/kg. It is worth noting that the highest inversion value obtained for Cr content in this study is 149.41 mg/kg, which falls below the risk screening value for paddy fields and other agricultural uses of land.

4. Discussion

4.1. The Impact of Mixed Pixels

When employing satellite imagery for the inversion of soil heavy metal content, the predominant challenge is effectively addressing the issue of mixed pixels. Field soil samples are typically collected from areas of bare soil, which are then combined with satellite image data from the same time period. As spatial resolution decreases, the likelihood of other interfering surface features coexisting within a single pixel increases. There must be a sufficient number of bare soil pixel samples available to meet the modeling requirements. This issue becomes particularly pronounced in regions with abundant water and favorable thermal conditions, coupled with dense vegetation cover. With a spatial resolution of 30 m to 1 km, traditional hyperspectral satellite data such as Hyperion, Modis, and HJ-1 are difficult to ensure that all image pixels corresponding to the sampling points represent pure bare soil, as they are frequently affected by surrounding factors like vegetation, built-up areas, and water bodies. The Zhuhai-1 OHS image has a relatively higher spatial resolution (10 m) which plays a crucial role in reducing the influence of mixed-pixel effects on the results of heavy metal inversion from soil. Nevertheless, even though our sampling locations were positioned within areas of bare soil during collection, a slight presence of vegetation cover persisted within the corresponding OHS data pixels captured in some of the samples. We anticipate a substantial improvement in this regard with the enhancement of spatial resolution in hyperspectral imagery.

4.2. Challenges of Model Generalization

In this study, many data processing methods, such as DS conversion of satellite images and combined transformation of spectral data were utilized for developing an optimal model. While these methods have improved the accuracy of the model, they also present challenges for the generalization of the model to other research cases. Establishing a robust inversion model necessitates a larger sample size to support the data. In this experiment, we observed a significant correlation between Cr content and the wavelength range of 866–940 nm in the near-infrared spectra. Interestingly, this band corresponds to the feature band position of typical substances found in soil, such as organic carbon, nitrogen, and clay minerals. Whether the spectral response of Cr in soil is attributed to the difference in the material composition of the adsorbed soil, and to what extent this difference affects the model’s generalizability, requires further experiments for exploration. Additionally, we did not incorporate factors like soil physical and chemical properties, such as pH value, particle size, and moisture content, into the modeling process. Nevertheless, these factors may exert a potential influence on the inversion model and warrant further comprehensive investigation. Future research endeavors could contemplate the inclusion of these soil physical and chemical properties in the modeling process to enhance the model’s applicability and transferability.

5. Conclusions

This study conducted satellite hyperspectral image acquisition and indoor soil sample collection in typical Cr-polluted areas in Shaoguan, China. The optimal inversion model combination was obtained by applying spectral preprocessing, feature band selection, and model construction techniques to the spectra and Cr content data of indoor soil samples. The result shows that the optimal model combination for Cr content inversion is SNV + UVE + SVR. The model exhibits high accuracy, with R2, RMSE, MAE, and RPD values of 0.97, 5.87, 4.72, and 4.04, respectively, indicating its stability and reliability. Furthermore, we employed the DS algorithm to eliminate the influence of environmental factors on the image spectra, and in conjunction with the OHS images, we successfully inverted the regional Cr content of the study area. The Cr content ranged from 36.21 to 149.41 mg/kg, and the accuracy of R2 and RMSE was 0.63 and 20.20, respectively, indicating a high level of reliability in the inversion results.
This study demonstrated the feasibility of using hyperspectral images to monitor soil heavy metal content on a regional scale by establishing an inversion model based on laboratory data. The workflow and methodology employed in this study provide valuable scientific support for such work.

Author Contributions

Conceptualization, H.G. and Y.C.; methodology, H.G., K.Y. and F.W.; investigation, K.Y. and F.W.; resources, H.G.; data curation, K.Y.; writing—original draft preparation, H.G., K.Y. and F.W.; writing—review and editing, Y.C. and J.S.; visualization, K.Y.; supervision, J.S.; project administration, Y.C.; funding acquisition, Y.C. All authors have read and agreed to the published version of the manuscript.

Funding

The research was supported by the International Research Centre of Big Data for Sustainable Development Goals (CBAS) (Grant Number: CBASYX0906), the Engineering Center of Yunnan Education Department for Health Geological Survey and Evaluation, and the National Natural Science Foundation of China (42271422).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the finding of this study are available from the corresponding author upon reasonable request. The data are not publicly available due to restrictions eg privacy.

Acknowledgments

Thanks to Zhuhai Orbita Aerospace Science and Technology Co., Ltd., for providing Orbita hyperspectral imagery.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Owlad, M.; Aroua, M.K.; Daud, W.A.W.; Baroutian, S. Removal of Hexavalent Chromium-Contaminated Water and Wastewater: A Review. Water Air Soil Pollut. 2009, 200, 59–77. [Google Scholar] [CrossRef]
  2. Ali, H.; Khan, E.; Ilahi, I. Environmental Chemistry and Ecotoxicology of Hazardous Heavy Metals: Environmental Persistence, Toxicity, and Bioaccumulation. J. Chem. 2019, 2019, 6730305. [Google Scholar] [CrossRef]
  3. Wang, F.; Gao, J.; Zha, Y. Hyperspectral sensing of heavy metals in soil and vegetation: Feasibility and challenges. ISPRS J. Photogramm. Remote Sens. 2018, 136, 73–84. [Google Scholar] [CrossRef]
  4. Zhang, B.; Wu, D.; Zhang, L.; Jiao, Q.; Li, Q. Application of hyperspectral remote sensing for environment monitoring in mining areas. Environ. Earth Sci. 2012, 65, 649–658. [Google Scholar] [CrossRef]
  5. Yanping, L.; Qing, L.; Hefa, C. Application and development of hyperspectral remote sensing technology to determine the heavy metal content in soil. J. Agro-Environ. Sci. 2020, 39, 2699–2709. [Google Scholar]
  6. Shi, T.; Chen, Y.; Liu, Y.; Wu, G. Visible and near-infrared reflectance spectroscopy-An alternative for monitoring soil contamination by heavy metals. J. Hazard. Mater. 2014, 265, 166–176. [Google Scholar] [CrossRef]
  7. Guo, B.; Guo, X.; Zhang, B.; Suo, L.; Bai, H.; Luo, P. Using a Two-Stage Scheme to Map Toxic Metal Distributions Based on GF-5 Satellite Hyperspectral Images at a Northern Chinese Opencast Coal Mine. Remote Sens. 2022, 14, 5804. [Google Scholar] [CrossRef]
  8. Lingyu, Y.; Xiaohong, G.; Wei, Z.; Feifei, S.; Linhua, H.; Wei, J. Estimating heavy metal concentrations in topsoil from vegetation reflectance spectra of Hyperion images: A case study of Yushu County, Qinghai, China. Chin. J. Appl. Ecol. 2016, 27, 1775–1784. [Google Scholar]
  9. Han, B.; Yun, Y.; Qinfang, C.; Peng, J.; Lixia, W. Retrieval of Heavy Metal Content in Soil Using GF-5 Satellite Images Based on GA-XGBoost Model. Laser Optoelectron. Prog. 2022, 59, 525–534. [Google Scholar]
  10. Wen, L.; Ling, H.; Ming, L.; Liangzhi, L. Inversion of Cd Content in Soil Around Mining Area Based on GF-5 Hyperspectral Band Selection. Laser Optoelectron. Prog. 2022, 17, 1728001. [Google Scholar]
  11. Parveen, N.; Zaidi, S.; Danish, M. Development of SVR-based model and comparative analysis with MLR and ANN models for predicting the sorption capacity of Cr(VI). Process Saf. Environ. Prot. 2017, 107, 428–437. [Google Scholar] [CrossRef]
  12. Su, Y.; Guo, B.; Lei, Y.; Zhang, D.; Guo, X.; Suo, L.; Zhao, Y.; Bian, Y. An Indirect Inversion Scheme for Retrieving Toxic Metal Concentrations Using Ground-Based Spectral Data in a Reclamation Coal Mine, China. Water 2022, 14, 2784. [Google Scholar] [CrossRef]
  13. Hamidian, A.H.; Esfandeh, S.; Zhang, Y.; Yang, M. Simulation and optimization of nanomaterials application for heavy metal removal from aqueous solutions. Inorg. Nano-Met. Chem. 2019, 49, 217–230. [Google Scholar] [CrossRef]
  14. Li, F.; Lu, A.; Wang, J.; You, T. Back-propagation neural network-based modelling for soil heavy metal. Int. J. Robot. Autom. 2021, 36, 1–7. [Google Scholar]
  15. Zhou, X.; Sun, J.; Tian, Y.; Chen, Q.; Wu, X.; Hang, Y. A deep learning based regression method on hyperspectral data for rapid prediction of cadmium residue in lettuce leaves. Chemom. Intell. Lab. 2020, 200, 103996. [Google Scholar]
  16. Liu, J.; Zhang, Y.; Wang, H.; Du, Y. Study on the prediction of soil heavy metal elements content based on visible near-infrared spectroscopy. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2018, 199, 43–49. [Google Scholar] [CrossRef]
  17. Liu, Z.; Nan, Z.; Zhao, C.; Yang, Y. Potato absorption and phytoavailability of Cd, Ni, Cu, Zn and Pb in sierozem soils amended with municipal sludge compost. J. Arid Land. 2018, 10, 638–652. [Google Scholar] [CrossRef]
  18. Tan, K.; Ye, Y.; Pei-jun, D.; Zhang, Q. Estimation of Heavy Metal Concentrations in Reclaimed Mining Soils Using Reflectance Spectroscopy. Spectrosc. Spectr. Anal. 2014, 34, 3317–3322. [Google Scholar]
  19. Liu, H.; Niu, T.; Yu, Q.; Su, K.; Yang, L.; Liu, W.; Wang, H. Inversion and Estimation of Heavy Metal Element Content in Peach Forest Soil in Pinggu District of Beijing. Spectrosc. Spectr. Anal. 2022, 42, 3552–3558. [Google Scholar]
  20. Tu, Y.; Zou, B.; Jiang, X.; Tao, C.; Tang, Y.; Feng, H. Hyperspectral Remote Sensing Based Modeling of Cu Content in Mining Soil. Spectrosc. Spectr. Anal. 2018, 38, 575–581. [Google Scholar]
  21. Tan, K.; Ma, W.; Chen, L.; Wang, H.; Du, Q.; Du, P.; Yan, B.; Liu, R.; Li, H. Estimating the distribution trend of soil heavy metals in mining area from HyMap airborne hyperspectral imagery based on ensemble learning. J. Hazard. Mater. 2021, 401, 123288. [Google Scholar] [CrossRef]
  22. Sun, T.; Wu, Y.; Liu, X.; Mo, X.; Liu, M. Detection of Chromium Content in Soybean Oil by Laser Induced Breakdown Spectroscopy and UVE Method. Spectrosc. Spectr. Anal. 2016, 36, 3341–3345. [Google Scholar]
  23. Tang, R.; Chen, X.; Li, C. Detection of Nitrogen Content in Rubber Leaves Using Near-Infrared (NIR) Spectroscopy with Correlation-Based Successive Projections Algorithm (SPA). Appl. Spectrosc. 2018, 72, 740–749. [Google Scholar] [CrossRef]
  24. Qifeng, W.; Fuhua, W.; Fangfang, S.; Xu, W.; Dian, W.; Xiangxiang, L.; Kai, Z. Fractionation and Bioavailability of Pb and Cd in Agricultural Soils Around Mining Area in Shaoguan Guangdong Province, China. J. Agro-Environ. Sci. 2012, 31, 1097–1103. [Google Scholar]
  25. Dong, X.; Li, C.; Li, J.; Wang, J.; Liu, S.; Ye, B. A novel approach for soil contamination assessment from heavy metal pollution: A linkage between discharge and adsorption. J. Hazard. Mater. 2010, 175, 1022–1030. [Google Scholar] [CrossRef] [PubMed]
  26. Wang, J.; Wang, S.; Bai, X.; Liu, F.; Lu, Q.; Tian, S.; Wang, M. Prediction Soil Heavy Metal Zinc Based on Spectral Reflectance in Karst Area. Spectrosc. Spectr. Anal. 2019, 39, 3873–3879. [Google Scholar]
  27. Zhang, S.; Shen, Q.; Nie, C.; Huang, Y.; Wang, J.; Hu, Q.; Ding, X.; Zhou, Y.; Chen, Y. Hyperspectral inversion of heavy metal content in reclaimed soil from a mining wasteland based on different spectral transformation and modeling methods. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2019, 211, 393–400. [Google Scholar] [CrossRef] [PubMed]
  28. Teng, J.; He, Z.; Ni, Z.; Zhao, Y.; Zhang, Z. Spectral Response and Inversion Models for Prediction of Total Copper Content in Soil of Xifanping Mining Area. Spectrosc. Spectr. Anal. 2016, 36, 3637–3642. [Google Scholar]
  29. Jia, Y.; Wang, L.; Qu, Z.; Yang, Z. Distribution, contamination and accumulation of heavy metals in water, sediments, and freshwater shellfish from Liuyang River, Southern China. Environ. Sci. Pollut. Res. 2018, 25, 7012–7020. [Google Scholar] [CrossRef]
  30. Jiang, Y.; Wang, J.; Zhang, L.; Zhang, G.; Li, X.; Wu, J. Geometric Processing and Accuracy Verification of Zhuhai-1 Hyperspectral Satellites. Remote Sens. 2019, 11, 996. [Google Scholar] [CrossRef]
  31. Ruan, O.; Liu, S.; Luo, J.; Hu, H. Rocky Desertification Information Extraction in Karst Terrain Complex Area Based on Endmember Variable. Spectrosc. Spectr. Anal. 2022, 42, 2269–2277. [Google Scholar]
  32. Miao, L.; Qi, H.; Szu, H. A maximum entropy approach to unsupervised mixed-pixel decomposition. IEEE Trans. Image Process. 2007, 16, 1008–1021. [Google Scholar] [CrossRef] [PubMed]
  33. Kailei, X.; Yuqing, W.; Xiaoguang, J.; Bin, H. Inversion Technology of Heavy Metal Pollution in Soil of Silong Town Based on OHS-D Data. Environ. Sci. Technol. 2021, 44, 101–106. [Google Scholar]
  34. Zhang, B.; Guo, B.; Zou, B.; Wei, W.; Lei, Y.; Li, T. Retrieving soil heavy metals concentrations based on GaoFen-5 hyperspectral satellite image at an opencast coal mine, Inner Mongolia, China. Environ. Pollut. 2022, 300, 118981. [Google Scholar] [CrossRef]
  35. Tian, J.; Chen, X.; Liang, Z.; Qi, W.; Zheng, X.; Lu, D.; Chen, B. Application of NIR Spectral Standardization Based on Principal Component Score Evaluation in Wheat Flour Crude Protein Model Sharing. J. Food Qual. 2022, 2022, 9009756. [Google Scholar] [CrossRef]
  36. Li, Q.; Zhang, G.; Xu, K.; Wang, Y. Application of DS algorithm to the calibration transfer in near-infrared spectroscopy. Spectrosc. Spectr. Anal. 2007, 27, 873–876. [Google Scholar]
  37. Xia, Z.; Songtao, D.; Yi, C.; Weichao, S.; Jinnian, W. Soil Heavy Metal Pb Content Estimation Method by Combining Field Spectra with Laboratory Spectra. Geomat. Inf. Sci. Wuhan Univ. 2022, 47, 1479–1485. [Google Scholar]
  38. Guo, F.; Xu, Z.; Ma, H.-H.; Liu, X.-J.; Yang, Z.; Tang, S. A Comparative Study of the Hyperspectral Inversion Models Based on the PCA for Retrieving the Cd Content in the Soil. Spectrosc. Spectr. Anal. 2021, 41, 1625–1630. [Google Scholar]
  39. Ran, Y.Z.; Fei, W.L.; Xi, Z.Y.; Ming, Y.; Ru, Y.X. Hyperspectral Inversion and Analysis of Heavy Metal Arsenic Content in Farmland Soil Based on Optimizing CARS Combined with PSO-SVM Algorithm. Spectrosc. Spectr. Anal. 2020, 40, 567–573. [Google Scholar]
  40. Su, L.; Shi, W.; Chen, X.; Meng, L.; Yuan, L.; Chen, X.; Huang, G. Simultaneously and quantitatively analyze the heavy metals in Sargassum fusiforme by laser-induced breakdown spectroscopy. Food Chem. 2021, 338, 127797. [Google Scholar] [CrossRef]
  41. Shi, T.; Liu, H.; Chen, Y.; Wang, J.; Wu, G. Estimation of arsenic in agricultural soils using hyperspectral vegetation indices of rice. J. Hazard. Mater. 2016, 308, 243–252. [Google Scholar] [CrossRef] [PubMed]
  42. Mousavi, S.; Esteki, M.; Mostafazadeh-Fard, B.; Dehghani, S.; Khorvash, M. Linear and Nonlinear Modeling for Predicting Nickel Removal from Aqueous Solutions. Environ. Eng. Sci. 2012, 29, 765–775. [Google Scholar] [CrossRef]
  43. Chen, T.; Chang, Q.; Clevers, J.G.P.W.; Kooistra, L. Rapid identification of soil cadmium pollution risk at regional scale based on visible and near-infrared spectroscopy. Environ. Pollut. 2015, 206, 217–226. [Google Scholar] [CrossRef] [PubMed]
  44. Chen, Y.; Li, L.; Whiting, M.; Chen, F.; Sun, Z.C.; Song, K.S.; Wang, Q.J. Convolutional neural network model for soil moisture prediction and its transferability analysis based on laboratory Vis-NIR spectral data. Int. J. Appl. Earth Obs. Geoinf. 2021, 104, 102550. [Google Scholar] [CrossRef]
  45. Wang, J.; Cui, L.; Gao, W.; Shi, T.; Chen, Y.; Gao, Y. Prediction of low heavy metal concentrations in agricultural soils using visible and near-infrared reflectance spectroscopy. Geoderma 2014, 216, 1–9. [Google Scholar] [CrossRef]
  46. Pandit, C.M.; Filippelli, G.M.; Li, L. Estimation of heavy-metal contamination in soil using reflectance spectroscopy and partial least-squares regression. Int. J. Remote Sens. 2010, 31, 4111–4123. [Google Scholar] [CrossRef]
  47. Wang, H.Z.; Yilihamu, Q.; Yuan, M.N.; Bai, H.T.; Xu, H.; Wu, J. Prediction models of soil heavy metal(loid)s concentration for agricultural land in Dongli: A comparison of regression and random forest. Ecol. Indic. 2020, 119, 106801. [Google Scholar] [CrossRef]
  48. Guo, B.; Zhang, B.; Su, Y.; Zhang, D.; Wang, Y.; Bian, Y.; Suo, L.; Guo, X.; Bai, H. Retrieving zinc concentrations in topsoil with reflectance spectroscopy at Opencast Coal Mine sites. Sci. Rep. 2021, 11, 19909. [Google Scholar] [CrossRef]
  49. Liu, J.; Han, J.; Xie, J.; Wang, H.; Tong, W.; Ba, Y. Assessing heavy metal concentrations in earth-cumulic-orthic-anthrosols soils using Vis-NIR spectroscopy transform coupled with chemometrics. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2020, 226, 117639. [Google Scholar] [CrossRef]
  50. Wu, X.; Zhang, J.; Xiao, C.; Zhao, X.; Li, K.; Pang, L.; Shi, Y.; Li, S. Study on Inversion Model of Soil Heavy Metal Content Based on NMF-PLS Water Content. Spectrosc. Spectr. Anal. 2021, 41, 271–277. [Google Scholar]
  51. Wang, F.; Li, C.; Wang, J.; Cao, W.; Wu, Q. Concentration estimation of heavy metal in soils from typical sewage irrigation area of Shandong Province, China using reflectance spectroscopy. Environ. Sci. Pollut. R. 2017, 24, 16883–16892. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Location of the study area and sampling distributions (the figure on the right displays an OHS band composite image from the Zhuhai-1 satellite with Bands 1, 2, and 3 assigned to the RGB channels, respectively).
Figure 1. Location of the study area and sampling distributions (the figure on the right displays an OHS band composite image from the Zhuhai-1 satellite with Bands 1, 2, and 3 assigned to the RGB channels, respectively).
Sensors 23 08756 g001
Figure 2. The flowchart of this study.
Figure 2. The flowchart of this study.
Sensors 23 08756 g002
Figure 3. Bare land abundance map before extraction (a) and abundance map after extraction (b).
Figure 3. Bare land abundance map before extraction (a) and abundance map after extraction (b).
Sensors 23 08756 g003
Figure 4. Inversion model combination.
Figure 4. Inversion model combination.
Sensors 23 08756 g004
Figure 5. Correlation curve between Cr content and different transformations of spectral data.
Figure 5. Correlation curve between Cr content and different transformations of spectral data.
Sensors 23 08756 g005
Figure 6. GMLS data curve (a), images spectral curve (b), corrected image spectra (c), and modeling accuracy corresponding to different numbers of transfer sets (d).
Figure 6. GMLS data curve (a), images spectral curve (b), corrected image spectra (c), and modeling accuracy corresponding to different numbers of transfer sets (d).
Sensors 23 08756 g006
Figure 7. Spatial distribution of heavy metal Cr content.
Figure 7. Spatial distribution of heavy metal Cr content.
Sensors 23 08756 g007
Figure 8. Accuracy verification result.
Figure 8. Accuracy verification result.
Sensors 23 08756 g008
Table 1. Zhuhai-1 satellite parameters.
Table 1. Zhuhai-1 satellite parameters.
ParameterValue
Center longitude and latitudeLon113.500775
Lat24.495786
SensorCMOS
Imaging time27 July 2021
Solar elevation angle62.59
Side swing angle3.40
Ground resolution10 m
Track height520 km
Cloud cover5%
Data levelL1B
Table 2. Descriptive statistics of Cr content in the study area.
Table 2. Descriptive statistics of Cr content in the study area.
MetalNumberMean
(mg/kg)
Standard Deviation
(mg/kg)
CV
(%)
Minimum (mg/kg)Maximum
(mg/kg)
SkewnessKurtosisTopsoil of Guangdong ProvinceTopsoil of China
Cr6547.2825.185381390.751.0854.9154
Table 3. Statistical analysis of correlation coefficient.
Table 3. Statistical analysis of correlation coefficient.
Spectral TransformationBand (nm)Maximum Correlation Coefficient
Original468−0.549
SNV922−0.804
MSC922−0.802
FD840−0.792
SD4740.688
DWT460−0.516
RL4680.573
Table 4. Results of different band selections and different inversion models.
Table 4. Results of different band selections and different inversion models.
PreprocessingBand SelectionModeling MethodTraining SetValidation Set
R2RMSEMAER2RMSEMAERPD
SNVPCASVR0.9010.867.740.9110.438.582.27
BPNN0.8912.169.440.9011.689.122.03
PLSR0.8413.7710.990.9410.708.372.21
RF0.8513.719.230.948.666.542.74
CARSSVR0.8114.9611.630.9211.078.412.14
BPNN0.8612.8610.070.9110.898.302.18
PLSR0.8214.6511.960.8512.019.131.85
RF0.8513.749.560.8911.889.791.99
UVESVR0.8911.607.550.975.874.724.04
BPNN0.7019.1713.440.8314.3212.741.66
PLSR0.8314.2111.680.8611.589.161.92
RF0.8413.759.720.957.906.063.00
SPASVR0.7815.8311.970.929.367.222.36
BPNN0.8911.559.170.938.606.732.76
PLSR0.8513.3011.380.8512.398.961.91
RF0.8513.379.200.958.046.362.95
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Guo, H.; Yang, K.; Wu, F.; Chen, Y.; Shen, J. Regional Inversion of Soil Heavy Metal Cr Content in Agricultural Land Using Zhuhai-1 Hyperspectral Images. Sensors 2023, 23, 8756. https://doi.org/10.3390/s23218756

AMA Style

Guo H, Yang K, Wu F, Chen Y, Shen J. Regional Inversion of Soil Heavy Metal Cr Content in Agricultural Land Using Zhuhai-1 Hyperspectral Images. Sensors. 2023; 23(21):8756. https://doi.org/10.3390/s23218756

Chicago/Turabian Style

Guo, Hongxu, Kai Yang, Fan Wu, Yu Chen, and Jinxiang Shen. 2023. "Regional Inversion of Soil Heavy Metal Cr Content in Agricultural Land Using Zhuhai-1 Hyperspectral Images" Sensors 23, no. 21: 8756. https://doi.org/10.3390/s23218756

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop