Next Article in Journal
A Proteinaceous Alpha-Amylase Inhibitor from Moringa Oleifera Leaf Extract: Purification, Characterization, and Insecticide Effects against C. maculates Insect Larvae
Next Article in Special Issue
DSC, FT-IR and NIR with Chemometric Assessment Using PCA and HCA for Estimation of the Chemical Stability of Oral Antidiabetic Drug Linagliptin in the Presence of Pharmaceutical Excipients
Previous Article in Journal
Structure of χ3-Borophene Studied by Total-Reflection High-Energy Positron Diffraction (TRHEPD)
Previous Article in Special Issue
Quantification of Salicylates and Flavonoids in Poplar Bark and Leaves Based on IR, NIR, and Raman Spectra
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Geochemical and Spatial Distribution of Topsoil HMs Coupled with Modeling of Cr Using Chemometrics Intelligent Techniques: Case Study from Dammam Area, Saudi Arabia

1
Interdisciplinary Research Center for Membrane and Water Security, King Fahd University of Petroleum and Minerals, Dhahran 31261, Saudi Arabia
2
College of Petroleum Engineering and Geosciences, King Fahad University of Petroleum and Minerals, Dhahran 31261, Saudi Arabia
3
Ministry of Environment, Water, and Agriculture, Riyadh 11195, Saudi Arabia
4
Centre for Environmental Management and Control, Enugu Campus, University of Nigeria, Nsukka 410001, Nigeria
5
Operational Research Centre in Healthcare, Near East University, TRNC, Mersin 10, Nicosia 99138, Cyprus
6
Department of Analytical Chemistry, Faculty of Pharmacy, Near East University, TRNC, Mersin 10, Nicosia 99138, Cyprus
*
Author to whom correspondence should be addressed.
Molecules 2022, 27(13), 4220; https://doi.org/10.3390/molecules27134220
Submission received: 5 June 2022 / Revised: 22 June 2022 / Accepted: 27 June 2022 / Published: 30 June 2022
(This article belongs to the Special Issue Chemometrics in Analytical Chemistry)

Abstract

:
Unconsolidated earthen surface materials can retain heavy metals originating from different sources. These metals are dangerous to humans as well as the immediate environment. This danger leads to the need to assess various geochemical conditions of the materials. In this study, the assessment of topsoil materials’ contamination with heavy metals (HMs) was conducted. The material’s representative spatial samples were taken from various sources: agricultural, industrial, and residential areas. The materials include topsoil, eolian deposits, and other unconsolidated earthen materials. The samples were analyzed using the ICP-OES. The obtained results based on the experimental procedure indicated that the average levels of the heavy metals were: As (1.21 ± 0.69 mg/kg), Ba (110.62 ± 262 mg/kg), Hg (0.08 ± 0.18 mg/kg), Pb (6.34 ± 14.55 mg/kg), Ni (8.95 ± 5.66 mg/kg), V (9.98 ± 6.08 mg/kg), Cd (1.18 ± 4.33 mg/kg), Cr (31.79 ± 37.9 mg/kg), Cu (6.76 ± 12.54 mg/kg), and Zn (23.44 ± 84.43 mg/kg). Subsequently, chemometrics modeling and a prediction of Cr concentration (mg/kg) were performed using three different modeling techniques, including two artificial intelligence (AI) techniques, namely, generalized neural network (GRNN) and Elman neural network (Elm NN) models, as well as a classical multivariate statistical technique (MST). The results indicated that the AI-based models have a superior ability in estimating the Cr concentration (mg/kg) than MST, whereby GRNN can enhance the performance of MST up to 94.6% in the validation step. The concentration levels of most metals were found to be within the acceptable range. The findings indicate that AI-based models are cost-effective and efficient tools for trace metal estimations from soil.

1. Introduction

Soil is one of the most important natural resources in the environment. It is continuously evolving by weathering rock materials, the accumulation of decaying plant matter, eolian deposits, fluvial/marine deposits, etc. Soil provides vital living space to humans and substrate for many microbes, plants, and animal species [1]. Therefore, proper monitoring, protection, and preservation methods of soil are among the primary environmental sustainability objectives. Trace metals are considered major pollutants in soil [2]. These materials are found naturally in the environment with various variable concentrations reported by different environmental scientists. Moreover, various artificial sources such as pollution have been reported [3]. The abnormal levels accumulated may directly impact human health and the ecosystem [4]. It is equally important to note that there is a high tendency for heavy metals to accumulate in the food chain, owing to their sophisticated nature. Human-induced metal accumulations come from various sources [5], ranging from vehicular emissions to industrial waste, fossil fuel combustion, and municipal waste residue disposal.
Although authorities require environmental impact assessment and monitoring reporting, only very few environmental soil studies based in Saudi Arabia have been reported. Reported studies were performed in the areas of Jizan City [6], Red Sea coastal line [7], Jeddah City [8], and Wadi Hanifah, Riyadh, Saudi Arabia [9]. Arif and Hashem [6] studied soil samples in five localities in Jizan City for metal contents and fungal flora. They observed that the soil contained high Pb, Cu, Ni, Mn, and Cd concentrations. Al-Hefner et al. [7] measured heavy element concentrations in the surface soil along the Red Sea coastline using the ICP-MS technique. They reported high Sr, Hf, and U concentrations among the 28 elements found in the soil. Kadi [8] investigated soil samples from roads in Jeddah City. Samples were analyzed for nine elements, and only Pb and Zn were found in higher concentrations. They reported that traffic conditions on the roads are the main reason for higher concentrations of these two elements. Al Yemeni et al. [9] studied Wadi Hanifah, which served as a natural wastewater drainage system for the City of Riyadh. They found high levels of Cd, Ni, Pb, and Zn in both the wastewater and soil sediments of the region. Due to these trace elements’ hazardous nature, it is, therefore, necessary for environmental scientists to identify the degree, concentrations, and primary sources of heavy metals from the agricultural, residential, and industrial soils.
The application of modeling techniques such as the artificial intelligence (AI)-based approach provides a cost-effective, powerful, and efficient strategy for estimating and predicting heavy metals, which can be employed to solve various challenges and issues. Using computational methods such as AI can minimize the financial stress related to earth science, environmental engineering, and spatial geology based on time, cost, space, and labor requirements. Computational techniques were developed recently to solve issues and challenges that cannot be solved using different mathematical models, classical approaches, and linear statistical methods [10,11,12,13]. The robust application of AI techniques is not only limited to the mechanisms involved in minimizing or removing heavy metals, but also can be used in system identification in different fields of science and technology [14,15,16,17,18,19,20].
The supremacy of computational techniques is related to various factors, such as the development of the models, the kind of learning network to be implemented, and the structure of the network [21,22,23,24]. Even though numerous research works on the implementation of AI-based techniques on heavy metals have been reported in the technical literature, several characteristics could be explored regarding the long-term applications of heavy metal modeling. Hence, to our current knowledge, no study in the technical literature reports the prediction of Cr concentration (mg/kg) using GRNN and Elm NN models in Saudi Arabia. Therefore, this study evaluated the level of heavy metals (HMs)/trace metals in the soil samples taken from different areas of Dammam region, Kingdom of Saudi Arabia. The literature search revealed no reliable reports on the distribution of trace metals in soils of Dammam metropolitan areas, especially using AI-based models. Therefore, this study aims to assess the spatial distribution of heavy metals around the Dammam area, east of Saudi Arabia. Moreover, the study equally employed the application of two AI-based models in the form of generalized neural network (GRNN) and Elman neural network (Elm NN) models, as well as a classical multivariate statistical technique (MST) as a linear method for the prediction of Cr concentration (mg/kg). Another approach to understanding the wide range of applications and connection strength of AI-based models and HMs is by using a bibliographic survey. A survey of the reported literature on the database Scopus yielded the finding that about 200 peer-reviewed papers from between Jan 1985 and June 2022 adopted over the feasibility of a wide interest in HMs and AI-based techniques (see, Figure 1). Figure 1 shows that 1000 keywords occurring between those studies, indicating the deep interest and implementation of this field. In addition, the popularity of this study topic can be investigated in different regions throughout the world, with the bulk of the countries producing the output related to AI and HMs using the same data.

2. Material and Methods

2.1. Study Location

The city of Dammam is located from latitude 26°20′18″ to 26°32′51″ and longitude 49°49′54″ to 50°09′17″. The city of Dammam is the capital city of the Eastern Province of Saudi Arabia. It is regarded as the largest oil region in the world. It houses the judiciary, administrative, and other government agencies and departments. A large percentage of the province’s population resides in this area. Based on its geographical location, its closeness with the Arab Gulf countries enables it to be a great tourist center. It equally experienced various forms of industrialization and urbanization activities recently.
The city of Dammam is also surrounded by many farms that produce dates and other fruits and vegetables. It also has two main industrial cities for small to medium-size industries. Dammam has recently experienced migration, sub-urbanization, and rapid industrialization. A geological map with agricultural (AG), industrial (ID), residential (R), and background (BG) areas was considered and is presented in Figure 2.
The classification of soil types is based on the grain size distribution and soil consistencies. It provides an avenue through which research related to soil can be addressed in a rigorously systematic manner. The unified soil classification system was adopted in classifying the soil. The soil ID is classified as coarse gravel if more than 50% of the soil is gravel, and coarse sand if more than 50% of the soil is sand. The following conditions were also used in classifying the soil: 1 < Cc < 3 and Cu ≥ 4 for gravel; 1 < Cc < 3 and Cu ≥ 6 for sand; P = poorly graded: <5% fine, Cu < 6, and 1 > Cc > 3; W = well-graded: <5% fine, Cu ≥ 6, and 1 ≤ Cc ≤ 3; M = silty: >12% fine; C = clayey: >12% fine.
Soil types in the selected sampling location are as follows:
  • For the AG samples, it is mainly poorly graded sand with silt to silty sand;
  • For the ID samples, it is mainly silty sand to poorly graded sand with silt and gravel;
  • For the R samples, it is mainly well-graded sand with silt and gravel to silty sand;
  • For the BG samples, it is mainly poorly graded sand with silt to silty sand.

2.2. Sampling Method

For sampling purposes, the Dammam region is in the southern part of Saudi Arabia, whereby the dataset was classified into agricultural, industrial, background, and residential areas. Thirty-three representatives from each of the classes were taken, with a total of one hundred thirty-two samples. The locations of these samples are presented in Figure 2. The procedure involves collecting different soil samples from other locations, recorded by miniaturized and portable positioning system instruments. The samples were then stored using a polythene bag safely in the laboratory. The topsoil samples were collected over a two-month period (i.e., February to March 2014). The procedure is considered as a powerful acid digesting process capable of dissolving all elements that are naturally widespread in the environment.

2.3. Geochemical Analysis of Soil Samples

The samples were prepared as per the USEPA method 3050B to digest soil, sediment, and sludge. The samples were then analyzed by inductively coupled plasma-optical emission spectrometer (ICP-OES) (SPECTRO Analytical Instruments GmbH & Co. KG, Kleve, Germany) as per USEPA method 200.7, revision 4.4 [25]. The chemical reagent employed complied with the standards of the ACSCAR (American Chemical Society’s Committee on Analytical Reagents). Distilled water (DI), concentrated nitric acid (HNO3), concentrated hydrochloric acid (HCl), and 30% hydrogen peroxide are examples of such reagents (H2O2). Because the digestion required the use of acid, it was carried out under a fume hood, under the supervision of an expert, and with the certified and recommended laboratory safety equipment. The equipment was calibrated using the multi-elemental standardization approach, which consists of six different working standards incorporated with a single blank used to identify the tools’ accuracy and suitability. Quality control measurements were equally used for each batched processing of the samples under consideration. Each batch was composed of 20 different samples.

2.4. Proposed Machine Learning Methodology

Basic data statistics were performed using Minitab version 16. Additionally, PCA and CA were equally conducted to differentiate various groups of the analytes with approximate geochemical behaviors and explore the correlations among the various elements. Before the simulation process, the raw data was analyzed and standardized into unit variance as well as zero means. Then, the eleven heavy metals were grouped based on CA. The data used in the modeling process of the current study were taken from our experimental research for trace elements present in the soil in the Dammam region of Saudi Arabia. Three different computational techniques were used in modeling and predicting the Cr concentration (mg/kg) as the dependent (target) variable, whereas the concentrations (mg/kg) of six other trace metals, Ba, Cu, Ni, Ti, V, and Zn, were chosen as the independent (input) variables. The spatial analysis for input–output variables used in this study are presented in Figure 3.
Three different data-driven approaches, including GRNN, Elm NN, and MST, were comparatively used in the current work to determine the best model that can be employed in estimating the concentration of Cr as a potentially toxic trace element to both humans and soil microorganisms. Before starting the estimation step, preliminary data processing techniques were carried out by employing both the AI-based models GRNN and Elm NN, and the linear approach MST (Figure 4). The cross-validation technique was equally utilized to ensure that both under- and overfitting were avoided in both the calibration and validation steps. The data was subsequently divided into 65% calibration and 35% validation. Furthermore, the current study employed the application of the Spearman–Pearson-based correlation technique for the determination of feature selection as well as for understanding the nature of the dataset for the agricultural area (see Figure 5). Based on the correlation performance of the input variables against the target Cr, the independent variables were classified into C1: Zn, Ni, and Ba, and C2: V, Cu, and Ti before the estimation stage.

2.5. Generalized Regression Neural Network (GRNN)

The GRNN was introduced by Specht [26] as the family of ANN models; the models used single-pass learning ANN. The GRNN consists of four layers (input, hidden, summation, output) unlike the typical neural network (see Figure 6a). As a traditional network, all the classifications of neural networks can serve as the multi-input multi-output model; this learning capability also was attributed to GRNN. To understand the learning paradigm of GRNN, the inputs’ player is similar to the classical neural network, where the weight is transposed and the training was associated with a Gaussian kernel’s functions. The major different between GRNN and ANN is that the former is a single-pass network, while the latter uses two passes. This makes the GRNN take less time for training. Secondly, the type of neurons differs for GRNN as well; in GRNN, the weight of each neuron is computed using a special parameter called a smoothing parameter (σ), which is also regarded as the hyper-parameter tuning. The σ is responsible for learning improvements of the models based on the accuracy evaluation [27].
Its general form is depicted below:
P i = e x p X X i T X X i 2 σ 2
where X is the input data of the testing dataset, X i is the ith input of the training dataset, and σ is the smoothing parameter.

2.6. Elman Neural Network (Elm NN)

As a subset of ANN, the Elman NN is a recurrent neural network (RNN) that consists of layers, such as those of traditional BPPNNs. The Elm NN has been applied in several fields of science and engineering to solve the problems of simulation, prediction, and estimation [28,29]. Basically, Elm NN consists of four layers, including inputs, hidden, context, and output layers. The interaction between the layers are considered as FFCNs (feed-forward connection networks). The context layers of Elm NN are used to contain the hidden layers’ output values [30]. Elman NN is, generally, regarded as a feedback NN with tap delay layers. As a black-box dynamic model, Elman NN can map both the single- and multiple-input–output relationship; such models can be applied to simplify several derivative calculations, and are highly valued for research purposes. It is evident that a significant amount of research is devoted to showing the advantage of introducing RNNs. System identification has been popular recently, and RNNs such as Elan NN are widely recommended within this field. It is reported that feed-pass loops have the greatest influence on the learning capability of RNNs [31]. In comparison with classical ANNs, RNNs are promising in terms of capturing complex system and complexity behavior (see Figure 6b).

2.7. Multivariate Statistical Techniques (MST)

Generally, we have different kinds of multivariate statistical techniques (MST), including multi-linear regression (MLR), stepwise-linear regression (SWLR), interaction-linear regression (ILR), etc. The MST technique is regarded as one of the conventional regression approaches used to understand and model the collinearity of various parameters as targets and independent parameters. Overall, it is described using Equation (4):
y = b 0 + b 1 x 1 + b 2 x 2 + b i x i
where y is the target parameter, x 1 is the value of the ith predictor, b 0 is the regression constant, and b i is the coefficient of the ith predictor.

2.8. Evaluation Metrics of the Models

The predictive values were compared with the measured values and then checked using different performance indices to determine various computation methods’ performances. For instance, the current study explores the application of three other statistical indices, namely, the Nash–Sutcliffe coefficient efficiency (NSE) and the Pearson coefficient (PC), whereby the root mean squared error (RMSE) is used in determining the error depicted by each model.
N S E = 1 j = 1 N Y o b s , j Y c o m , j 2 j = 1 N Y o b s , j Y ¯ o b s , j 2
P C = i = 1 N Y o b s Y ¯ o b s Y c o m Y ¯ c o m i = 1 N Y o b s Y ¯ o b s 2 i = 1 N Y c o m Y ¯ c o m 2                          
R M S E = i = 1 N Y o b s i Y c o m i 2 N

3. Results and Discussion

The American Association of State Highway and Transportation Official (AASHTO) sieve analysis procedure was adopted to assess the concentrations and spatial distributions of different trace elements at AG and ID. This provides a method for addressing soil research through a systematic and thorough methodology. The grain size distribution and soil consistency were used to classify soil characteristics. The topsoil was classified using a unified soil classification method. The geochemical map of the sample sites was created using AutoCAD and ArcMap software. During sample collection, a portable global positioning system (GPS) was used to record sample locations. The geochemical spatial distribution map of each element discovered in the different regions was then created using Surfer 8 software. After analysis, the trace metal content in each place was combined with the geographical coordinates of each sample location obtained during sample collection. The levels of metals are presented below in parentheses, with the mean levels followed by the maximum level detected in (mg/kg) in the sampled locations. Since the Kingdom of Saudi Arabia does not have well-defined guidelines regarding the limits of trace metals in soil, the Canadian Environmental Soil Quality Guidelines (CESQG) standards for the Protection of Environment and Human Health (PEHH) were adopted for comparison purposes.

3.1. Spatiotemporal Characteristics of Soil Heavy Metal Contents

Most of the metals followed a general trend in concentration with the highest means found in the sample from the industrial region, then agricultural, and subsequently, the residential locations. The lowest means were found in the background area. The results of the analysis of the topsoil samples are shown in Table 1, along with the allowable limits stated by CESQG for different heavy metals. The concentration of Arsenic (As) was highest in the industrial area (mean = 1.58 mg/kg, maximum = 4.56 mg/kg) followed by the agricultural region (mean = 1.516 mg/kg, max = 3.135 mg/kg), while the lowest was in the residential area (mean = 0.97 mg/kg, max = 2.22 mg/kg). However, none of the samples was tested above the 12 mg/kg threshold. The concentrations of barium (Ba) was highest in the industrial area (mean = 335.5 mg/kg, max = 1966.5 mg/kg), followed by the agricultural (mean = 34.46 mg/kg, max = 100.62 mg/kg) and residential areas (mean = 34.11 mg/kg, max = 98.55 mg/kg). Some of the samples from the industrial areas exceeded the allowable limit of 500 mg/kg. The concentration of cadmium (Cd) was highest in the industrial area (mean = 1.878 mg/kg, max = 28.69 mg/kg), followed by the residential area (mean = 1.878 mg/kg, max = 28.69 mg/kg) and agricultural area (mean = 0.07, max = 1.137). One sample from each industrial and agricultural area measured above the allowable 10 mg/kg limit.
Nevertheless, the amount of Cr was found to be more prevalent in the industrial sample (with mean = 51.77 mg/kg, max = 247.6 mg/kg) than the residential soil samples (mean = 29.64 mg/kg, max = 120.22 mg/kg) and agricultural area (mean = 24.4 mg/kg, max = 74.7 mg/kg). The relative averages of the total samples measured were above the permissible concentration. The concentration of copper (Cu) was highest in the industrial area (mean = 11.0 mg/kg, max = 95.75 mg/kg), followed by the agricultural (mean = 8.74, max = 31.64) and residential areas (mean = 4.38 mg/kg, max = 19.17 mg/kg). Two of the samples from the industrial area exceeded the allowable limit of 63 mg/kg. The concentration of mercury (Hg) was highest in the industrial area (mean = 0.11 mg/kg, max = 1.44 mg/kg), followed by the agricultural (mean = 0.087 mg/kg, max = 1.02 mg/kg) and residential areas (mean = 0.06 mg/kg, max = 0.59 mg/kg). None of the samples exceeded the allowable limit of 6.6 mg/kg. The concentration of nickel (Ni) was the highest in the industrial area (mean = 13.14 mg/kg, max = 45.2 mg/kg), followed by the agricultural (mean = 9.22 mg/kg, max = 16.25 mg/kg) and residential areas (mean = 6.44 mg/kg, max = 13.23 mg/kg). None of the samples exceeded the allowable limit of 50 mg/kg. The concentration of lead (Pb) was highest in the industrial area (mean = 11.42 mg/kg, max = 100.25 mg/kg), followed by the agricultural (mean = 6.49 mg/kg, max = 52.35 mg/kg) and residential areas (mean = 4.79 mg/kg, max = 25.6 mg/kg). None of the samples exceeded the allowable limit of 140 mg/kg. Similarly, the concentration of vanadium (V) was highest in the industrial area (mean = 13.11 mg/kg, max = 20.42 mg/kg), followed by the agricultural (mean = 11.52 mg/kg, max = 21.89 mg/kg) and residential areas (mean = 7.0 mg/kg, max = 17.73 mg/kg). None of the samples exceeded the allowable limit of 130 mg/kg. The concentration of zinc (Zn) was highest in the industrial area (mean = 65.44 mg/kg, max = 676.5 mg/kg), followed by the agricultural (mean = 12.43 mg/kg, max = 46.25 mg/kg) and residential areas (mean = 8.47 mg/kg, max = 39.39 mg/kg). None of the samples exceeded the allowable limit of 200 mg/kg.

3.2. Results of Multivariate Statistics

For the agricultural area, cluster analysis (CA) was performed to distinguish different groups of elements with approximate geochemical behaviors. The results reveal that, based on the close relations between heavy metals in the soil, As, Cd, Hg, Cu, V, and Pb could be sorted into a single group; Ni, Zn, Cr, and Ba in another group, and Ti as the third group To further explore the relationships among the eleven elements, PCA was performed. The PCA showed that ~74.4% of the data variance could be explained by the first three principal components (factors). These three principal components were extracted with an eigenvalue >1. In particular, the eigenvalues of component 1 (~44% of inertia), component 2 (16.8% of inertia), and component 3 (13.6% of inertia) were 4.8, 1.8, and 1.5, respectively. Component 1 was mainly related to V, Cr, and Cu, component 2 was associated mainly with Pb and hg, and component 3 was related to As.
In addition, the CA analysis for the industrial area reveals that As, Hg, Cu, and Pb could be sorted into a single group; V, Ni, Cr, Cd, and Zn in another group, and Ti and Ba as the third group Furthermore, the PCA analysis shows that ~77% of the data variance can be explained by the first three principal components (factors). Ni, Cr, and Cu are the predominant elements in the first component, and Cd, V, and Ti are the predominant elements in the second component, while the contribution to the third component was mainly due to As and Hg. These components exhibit eigenvalues more significant than one and have therefore been considered here. In particular, the eigenvalues of component 1 (~48.9% of inertia), component 2 (17.4% of inertia), and component 3 (10.7% of inertia), were 5.38, and 1.91, and 1.18, respectively.
The CA analysis for the residential area reveals that As, Pb, Cu, V, Hg, and Zn could be sorted into a single group; Ni, Cd, Ba, and Cr in another group, and Ti as the third group. Moreover, PCA analysis reflects that ~78% of the data variance can be explained by the first their principal components with the eigenvalues of component 1 (~52.7% of inertia), component 2 (15.4% of inertia), and component 3 (9.9% of inactivity) as 5.79, 1.69, and 1.08, respectively. V, Ti, Ni, and Ba are the predominant elements in the first component, Hg and Cr are the predominant elements in the second component, while the contribution to the third component was mainly due to As, Hg, and Pb. However, CA analysis for the background area reveals that As, Hg, Cu, Pb, Ni, V, and Zn could be sorted into one group; Ba, Cr, and Cd in another group, and Ti as the third group. PCA for the background area shows that ~84.1% of the data variance can be explained by the first two principal components (factors). Particularly, the eigenvalues of component 1 (~52.7% of inertia) and component 2 (15.4% of inertia) were 5.8 and 1.7, respectively. V, Cu, Ti, and Ni were the predominant elements in the first component, while the contribution to the second component was mainly due to As, Hg, and Cr.

3.3. Results for AI-Based Computational Models

Traditional and classical trivial regression approaches have been utilized for the extraction, analytical exploration, and estimation of trace metals, despite their numerous issues and limitations. Based on this, the AI-based technique was developed to enhance and improve the performance prediction of trace elements from the soil, water, and air. One of the primary motivations of this article is the implementation of two different AI techniques based on the most widely employed neural networks (NN). Different training networks inform GRNN, and Elm NN models integrated with the MST linear approach to predict Cr concentration (mg/kg) in the AG regions of Dammam, Saudi Arabia. The proper selection of hyper tuning parameters plays a crucial role in the modeling process and model accuracy. Therefore, the tuning parameters used for NN, for example, are 1000, 0.01, and 0.0001 for the maximum number of iterations, learning rate, and MSE, respectively. For the development of NN, formulating a proper number of hidden nodes is the most important aspect, as such hidden layers were identified using (2n1/2 + m) to (2n + 1), where n is the number of input neurons and m is the number of output nodes. The optimal structure was determined using the range of 2–10 hidden nodes, and 15–60 calibration epochs. The major advantage of GRNN over other types of neural networks is that there is only a single hyper-parameter, namely, the sigma. The random search strategy is to find a close-to-optimal value of the sigma by using various random numbers. Elman NN has been designed and trained with a conventional learning algorithm, with the values obtained in the initial iteration of the learning process as the initial set values for optimal tuning algorithms.
In the validation process, different types of validation approaches can be applied, including cross-validation, which is called k-fold cross-validation; others are holdout, leave one out, and so on. We applied k-fold cross-validation in this study. The sensitivity and importance of the sampling sites and the nature of the dataset were currently considered of global attention. Based on Yassen, [32], Cr is considered one of the most utilized heavy metals using different computational approaches due to its hazardous impact on both humans and the environment. Moreover, the AI-based technique as a cost-effective approach provides economical and efficient benefits to environmental policymakers, especially those dealing with trace metals. The performance of the two classes (C1 and C2) of the utilized models in the current study (GRNN, Elm NN, and MST) was determined using three different performance metrics, NSE, PC, and RMSE, as mentioned in the previous section.
The predicted outcomes of the models in both the calibration and validation steps are shown in Table 2. The calibration step and C2 class showed superior performance to the validation step and C1 class, respectively. The comparative performance of the models (GRNN, Elm NN, and MST) showed different predictability of the Cr concentration (mg/kg) due to the reactive nature of each model network towards the learning process the dataset. Furthermore, this indicates the need to utilize various metrics and visualizations for comparative analysis of the models’ performance. The quantitative interpretation of the models, as indicated in Table 2, shows the robustness of the AI-based models (GRNN and Elm NN) over the classical MST model, even though only the GRNN showed a performance with a minimum or higher NSE value of 0.8 in both the two input combinations (C1 and C2) and two steps (calibration and verification), respectively.
The Elm NN-C2 showed higher performance than Elm NN-C1 in the calibration and validation steps. At the same time, both MST-C1 and MST-C2 failed to estimate the Cr concentration (mg/kg) in both the calibration and validation phases with significant performance. Moreover, the comparative performance of the models indicated in Table 2 depicts that GRNN as an AI-based model can boost the performance of the classical linear MST model up to 27% and 94.6% in the calibration and validation phases, respectively, for the C2 input combination. The performance of the models can be better pictured using different visualizations to ease grasping of the performance results of the models. For instance, Figure 7 indicates the comparative error performance of the models based on their respective RMSE values in both the calibration and validation stages.
Moreover, the comparative performance of the models based on their goodness-of-fit in the form of PC and NSE can be visualized using the radar plot as shown in Figure 8. To better understand the data-driven techniques’ performance of each of the input combinations (C1 and C2), a visualization based on graphical illustration using time series and scatter plots was performed. These graphs can be utilized in evaluating the precision of the developed models. Based on the goodness-of-fit-values, the scatter plots illustration of the models is shown in Figure 9.
The scatter plot is one of the effective visualizations used in evaluating the predictability performance of various data-driven approaches, indicating the degree of deviation of the predicted values against the experimental values. Moreover, as shown in Figure 10, the time series depicts the trend of estimation of each data-driven approach (GRNN, Elm NN, and MST models) against the experimental Cr concentration (mg/kg). Based on the numerical and visualized performance of the models used in the current study for the estimation of Cr concentration (mg/kg), the models can be arranged based on their performance skills as follows: GRNN-C2 > GRNN-C1 > Elm NN-C2 > Elm NN-C1 > MST-C2 > MST-C1.

3.4. Discussion of Spatiotemporal Analysis

An analysis of topsoil can be used to estimate trace metal pollution in our environment [33]. Therefore, regular assessment of metal contamination in topsoil is necessary [34]. The USA’s standards for maximum contaminant levels are even higher than the Canadian allowable limits. The abundance of As in agricultural areas can be attributed to insecticides such as hydrogen arsenate on fruit trees [35]. Another valid reason for high levels of As in agricultural areas can also be attributed to the use of arsenic as a food additive in poultry to boost their weight, enhance their feeding, and preclude diseases [36], or the use of Ba titanate in a promising electro-ceramic procedure [37] and the use of mineral barite as an additive in oil well-drilling muds.
Further analysis indicated that the result of Cd is within the standard allowable limit for the three sample locations (background, residential, industrial). It is indicated in the industrial and residential zones that one sample has exceeded the allowable limit. This could be due to several industrial activities in the study locations. However, it can also be seen that some deposited material forms the illegal sources and other sources contained Cd. For Cr, the elevated levels might be associated with both natural, manmade, and atmospheric deposition of compound containing Cr. Samples 3, 5, 18, and 24 displayed high levels of Cr in the residential soil samples due to accidental release, wood additives, or atmospheric deposition of compounds. The Cr compound was also found to be at elevated levels for samples 13, 14, 15, 17, 18, and 30, which is due to the industrial application of Cr such as in the paint industry, tanning, roofing, dyes, etc. [38].
With regards to Cu compounds, the increasing levels in 2 out of 33 industries were linked to the substantial production of electrical cables, plumbing indifferent industries across the study area [39]. Other factors could also be attributed to atmospheric deposition or accidents containing Cu compounds. Meanwhile, in order to reduce human exposure to Cu compounds, extreme care needs to be exercised when working with Cu compounds [40]. The adopted guidelines for the level of Zn in the soil specify 200 mg/kg as the allowable limit, and all the analyzed samples fell within this range. Thus, no immediate threat is in sight from the tested trace metals in the studied areas of Dammam. However, proper monitoring is essential to keep these levels of metals in check. Rules must also be implemented to transport and handle chemicals that contain these trace metals. The predicted spatial analysis results using AI-based models are presented in Figure 11.

4. Conclusions and Recommendations

Research on the soil environment is necessary for sustainable development and improvement of the quality of life. As mentioned in the previous sections, there is very little information available regarding toxic metal contamination of topsoil in the Dammam area. So, the main motivation for this study was to assess the topsoil in the area to determine trace metal concentration, their “hot spot” (area of high concentrations of trace metals), and spatial distribution. These will help in assessing the health conditions of the topsoil in the region and may later be used for setting up remediation measures or land use allocations. This study outcome may also create further research in environmental assessment and monitoring. The current study analyzed various soil samples from different locations in the Dammam region of the Saudi Area, which equally determined the concentration of ten other heavy metals. The attention to these heavy metals almost follows the same trend, whereby they were all found at relatively higher amounts in the study locations. Most of the concentrations of the heavy metals were found to be within the accepted range, but some were found to exceed the accepted range. Conversely, based on their average concentrations, none of the heavy metals exceed their allowable threshold in the whole experiment. These findings, at first instance, provide a relief that the concentration of these heavy metals determined was found to be within the allowable range. Hence, there is no alarming threat to human life and the ecosystem.
Most of the metals followed a general trend in concentration with the highest means found in the sample from the industrial region, then agricultural, and subsequently, the residential locations. The lowest means were found in the background area. The amount of Cr was found to be more prevalent in the industrial samples than in the residential soil samples and agricultural areas. The relative average of the total samples measured was above the permissible concentration. The concentration of copper (Cu) was highest in the industrial area, followed by the agricultural and residential areas. Nevertheless, higher concentrations of these metals compared to the control and residential areas suggest careless handling of chemicals in these areas. It is, therefore, evident that if proper monitoring schemes are not devised to plug any existing loopholes, it can lead to greater exposure of these metals into the ecosystem. Eventually, various aquatic life, animals, and plants might be at risk, which is related to heavy metals exposure. Furthermore, the prediction approach showed the performance abilities of the AI-based models (GRNN and Elm NN) over the classical linear MST approach in predicting the concentration (mg/kg) of Cr trace elements. However, the results based on the numerical and visualized performance obtained at a particular stage indicate the need for employing more robust techniques such as the metaheuristic approaches and ensemble machine learning to improve the performance prediction. The research also indicated the smaller number of the data instances as the limitation of the research as the machine learning needs a huge amount of data in order to represent the real information of the scenario.

Author Contributions

Conceptualization, M.A.Y.; B.T. and S.I.A.; methodology, S.I.A., J.A.A.; M.A.Y.; and M.B.; validation, M.A.Y., B.T.; M.B. and S.I.A.; formal analysis. B.T., S.I.A. and J.A.A.; investigation, S.I.A., M.A.Y., J.A.A. and M.B.; resources, S.I.A., J.A.A. and M.A.Y.; data curation, J.A.A. and A.A.-S.; writing—original draft preparation, M.A.Y., B.T., S.I.A., M.B., J.A.A.; A.M.A.-A.; A.G.U.; and A.A.-S.; writing—review and editing, S.I.A.; visualization, M.A.Y.; A.M.A.-A. and S.I.A.; supervision, M.A.Y. and B.T.; project administration, B.T. and A.A.-S.; funding acquisition, M.A.Y.; A.A.-S. and B.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by King Fahd University of Petroleum and Minerals under Project Grant Number INMW2209.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data available upon request.

Acknowledgments

Authors would like to acknowledge the support provided by the Interdisciplinary Research Center for Membrane and Water Security, King Fahd University of Petroleum and Minerals under Project number INMW2209. The authors also would like to acknowledge College of Petroleum Engineering and Geosciences, King Fahd University of Petroleum and Minerals for all support and assistance provided for Field work and laboratory analysis.

Conflicts of Interest

The authors declare no conflict of interest.

Sample Availability

Samples of the compounds are available from the authors.

References

  1. Peña-Fernández, A.; González-Muñoz, M.; Lobo-Bedmar, M. Establishing the importance of human health risk assessment for metals and metalloids in urban environments. Environ. Int. 2014, 72, 176–185. [Google Scholar] [CrossRef] [PubMed]
  2. Abel, M.T.; Suedel, B.; Presley, S.; Rainwater, T.R.; Austin, G.P.; Cox, S.B.; McDaniel, L.N.; Rigdon, R.; Goebel, T.; Zartman, R.; et al. Spatial distribution of lead concentrations in urban surface soils of New Orleans, Louisiana USA. Environ. Geochem. Health 2010, 32, 379–389. [Google Scholar] [CrossRef] [PubMed]
  3. Suthar, S.; Nema, A.K.; Chabukdhara, M.; Gupta, S. Assessment of metals in water and sediments of Hindon River, India: Impact of industrial and urban discharges. J. Hazard. Mater. 2009, 171, 1088–1095. [Google Scholar] [CrossRef] [PubMed]
  4. Zhang, J.; Pu, L.; Peng, B.; Gao, Z. The impact of urban land expansion on soil quality in rapidly urbanizing regions in China: Kunshan as a case study. Environ. Geochem. Health 2010, 33, 125–135. [Google Scholar] [CrossRef]
  5. Rodríguez-Salazar, M.T.; Morton-Bermea, O.; Hernández-Álvarez, E.; Lozano, R.; Tapia-Cruz, V. The study of metal contamination in urban topsoils of Mexico City using GIS. Environ. Earth Sci. 2011, 62, 899–905. [Google Scholar] [CrossRef]
  6. Arif, I.; Hashem, A.R. Soil analysis and mycoflora of Jizan city, Saudi Arabia. Phyton Rev. Int. Bot. Exp. 1998, 62, 109–114. [Google Scholar]
  7. Al-Hefne, J.; Al-Dyel, O.; Chowdhury, D.A.; Al-Ajayan, T. Distribution and ICP-MS determination of heavy elements in the surfacial sand along the red sea coastline of Saudi Arabia. At. Spectrosc. 2005, 26, 51–58. [Google Scholar]
  8. Kadi, M.W. Soil Pollution Hazardous to Environment: A case study on the chemical composition and correlation to automobile traffic of the roadside soil of Jeddah city, Saudi Arabia. J. Hazard. Mater. 2009, 168, 1280–1283. [Google Scholar] [CrossRef]
  9. Alyemeni, M.N.; Wijaya, L.; Hayat, S. Physico-chemical and heavy metal status of wastewater and sediment of Wadi Hanifah (Riyadh, Saudi Arabia): A case study. Fresenius Environ. Bull. 2014, 23, 130–137. [Google Scholar]
  10. Alamrouni, A.; Aslanova, F.; Mati, S.; Maccido, H.S.; Jibril, A.A.; Usman, A.G.; Abba, S.I. Multi-Regional Modeling of Cumulative COVID-19 Cases Integrated with Environmental Forest Knowledge Estimation: A Deep Learning Ensemble Approach. Int. J. Environ. Res. Public Health 2022, 19, 738. [Google Scholar] [CrossRef]
  11. Hadi, S.J.; Abba, S.I.; Sammen, S.S.; Salih, S.Q.; Al-Ansari, N.; Yaseen, Z.M. Non-Linear Input Variable Selection Approach Integrated With Non-Tuned Data Intelligence Model for Streamflow Pattern Simulation. IEEE Access 2019, 7, 141533–141548. [Google Scholar] [CrossRef]
  12. Tung, T.M.; Yaseen, Z.M. A survey on river water quality modelling using artificial intelligence models: 2000–2020. J. Hydrol. 2020, 585, 124670. [Google Scholar] [CrossRef]
  13. Tao, H.; Salih, S.; Oudah, A.Y.; Abba, S.I.; Ameen, A.M.S.; Awadh, S.M.; Alawi, O.A.; Mostafa, R.R.; Surendran, U.P.; Yaseen, Z.M. Development of new computational machine learning models for longitudinal dispersion coefficient determination: Case study of natural streams, United States. Environ. Sci. Pollut. Res. 2022, 29, 35841–35861. [Google Scholar] [CrossRef]
  14. Malami, S.I.; Anwar, F.H.; Abdulrahman, S.; Haruna, S.; Ali, S.I.A.; Abba, S. Implementation of hybrid neuro-fuzzy and self-turning predictive model for the prediction of concrete carbonation depth: A soft computing technique. Results Eng. 2021, 10, 100228. [Google Scholar] [CrossRef]
  15. Elkiran, G.; Nourani, V.; Abba, S. Multi-step ahead modelling of river water quality parameters using ensemble artificial intelligence-based approach. J. Hydrol. 2019, 577, 123962. [Google Scholar] [CrossRef]
  16. Malami, S.I.; Musa, A.A.; Haruna, S.I.; Aliyu, U.U.; Usman, A.G.; Abdurrahman, M.I.; Bashir, A.; Abba, S.I. Implementation of soft-computing models for prediction of flexural strength of pervious concrete hybridized with rice husk ash and calcium carbide waste. Model. Earth Syst. Environ. 2021, 8, 1933–1947. [Google Scholar] [CrossRef]
  17. Haruna, S.I.; Malami, S.I.; Adamu, M.; Usman, A.G.; Farouk, A.; Ali, S.I.A.; Abba, S.I. Compressive Strength of Self-Compacting Concrete Modified with Rice Husk Ash and Calcium Carbide Waste Modeling: A Feasibility of Emerging Emotional Intelligent Model (EANN) Versus Traditional FFNN. Arab. J. Sci. Eng. 2021, 46, 11207–11222. [Google Scholar] [CrossRef]
  18. Musa, B.; Yimen, N.; Abba, S.; Adun, H.; Dagbasi, M. Multi-State Load Demand Forecasting Using Hybridized Support Vector Regression Integrated with Optimal Design of Off-Grid Energy Systems—A Metaheuristic Approach. Processes 2021, 9, 1166. [Google Scholar] [CrossRef]
  19. Mahmoud, K.; Bebiş, H.; Usman, A.G.; Salihu, A.N.; Gaya, M.S.; Dalhat, U.F.; Abdulkadir, R.A.; Jibril, M.B.; Abba, S.I. Prediction of the effects of environmental factors towards COVID-19 outbreak using AI-based models. IAES Int. J. Artif. Intell. 2021, 10, 35–42. [Google Scholar] [CrossRef]
  20. Abdulkadir, R.A.; Ali, S.I.A.; Abba, S.; Esmaili, P. Forecasting of daily rainfall at Ercan Airport Northern Cyprus: A comparison of linear and non-linear models. Desalination Water Treat. 2020, 177, 297–305. [Google Scholar] [CrossRef]
  21. Abba, S.I.; Abdulkadir, R.A.; Sammen, S.S.; Usman, A.G.; Meshram, S.G.; Malik, A.; Shahid, S. Comparative implementation between neuro-emotional genetic algorithm and novel ensemble computing techniques for modelling dissolved oxygen concentration. Hydrol. Sci. J. 2021, 66, 1584–1596. [Google Scholar] [CrossRef]
  22. Sammen, S.S.; Ehteram, M.; Abba, S.I.; Abdulkadir, R.A.; Ahmed, A.N.; El-Shafie, A. A new soft computing model for daily streamflow forecasting. Stoch. Hydrol. Hydraul. 2021, 35, 2479–2491. [Google Scholar] [CrossRef]
  23. Pham, Q.B.; Abba, S.I.; Usman, A.G.; Linh, N.T.T.; Gupta, V.; Malik, A.; Costache, R.; Vo, N.D.; Tri, D.Q. Potential of Hybrid Data-Intelligence Algorithms for Multi-Station Modelling of Rainfall. Water Resour. Manag. 2019, 33, 5067–5087. [Google Scholar] [CrossRef]
  24. Pham, Q.B.; Gaya, M.; Abba, S.; Abdulkadir, R.; Esmaili, P.; Linh, N.T.T.; Sharma, C.; Malik, A.; Khoi, D.N.; Dung, T.D.; et al. Modelling of Bunus regional sewage treatment plant using machine learning approaches. Desalination Water Treat. 2020, 203, 80–90. [Google Scholar] [CrossRef]
  25. Martin, T.; Brockhoff, C.; Creed, J. Method 200.7, Revision 4.4: Determination of Metals and Trace Elements in Water and Wastes by Inductively Coupled Plasma-Atomic Emission Spectrometry; US EPA: Cincinatti, OH, USA, 1994. [Google Scholar]
  26. Wiangkham, A.; Ariyarit, A.; Aengchuan, P. Prediction of the influence of loading rate and sugarcane leaves concentration on fracture toughness of sugarcane leaves and epoxy composite using artificial intelligence. Theor. Appl. Fract. Mech. 2021, 117, 117. [Google Scholar] [CrossRef]
  27. Ardejanii, F.D.; Rooki, R.; Shokri, B.J.; Kish, T.E.; Aryafar, A.; Tourani, P. Prediction of Rare Earth Elements in Neutral Alkaline Mine Drainage from Razi Coal Mine, Golestan Province, Northeast Iran, Using General Regression Neural Network. J. Environ. Eng. 2013, 139, 896–907. [Google Scholar] [CrossRef]
  28. Liu, H.; Sun, X.; Xu, L.; Gu, S.; Sun, F. Short-term load forecasting based on Elman Neural Network optimized by Firefly Algorithm. IEEE Innov. Smart Grid Technol. Asia 2019, 1425–1429. [Google Scholar] [CrossRef]
  29. Huang, W.; Wang, Z.; Che, F. Short-term Load Forecasting Based on the Improved BAS Optimized Elman Neural Network. In Proceedings of the 2019 IEEE Innovative Smart Grid Technologies—Asia (ISGT Asia), Chengdu, China, 21–24 May 2019; pp. 1–7. [Google Scholar] [CrossRef]
  30. Jia, H.; Taheri, B. Model identification of Solid Oxide Fuel Cell using hybrid Elman Neural Network/Quantum Pathfinder algorithm. Energy Rep. 2021, 7, 3328–3337. [Google Scholar] [CrossRef]
  31. Manzar, M.S.; Benaafi, M.; Costache, R.; Alagha, O.; Mu’Azu, N.D.; Zubair, M.; Abdullahi, J.; Abba, S. New generation neurocomputing learning coupled with a hybrid neuro-fuzzy model for quantifying water quality index variable: A case study from Saudi Arabia. Ecol. Inform. 2022, 70, 101696. [Google Scholar] [CrossRef]
  32. Yaseen, Z.M. An insight into machine learning models era in simulating soil, water bodies and adsorption heavy metals: Review, challenges and solutions. Chemosphere 2021, 277, 130126. [Google Scholar] [CrossRef]
  33. Zhao, Y.-F.; Shi, X.-Z.; Huang, B.; Yu, D.-S.; Wang, H.-J.; Sun, W.-X.; Öboern, I.; Blombäck, K. Spatial Distribution of Heavy Metals in Agricultural Soils of an Industry-Based Peri-Urban Area in Wuxi, China. Pedosphere 2007, 17, 44–51. [Google Scholar] [CrossRef]
  34. Han, B.-C.; Jeng, W.L.; Chen, R.Y.; Fang, G.T.; Hung, T.C.; Tseng, R.J. Estimation of Target Hazard Quotients and Potential Health Risks for Metals by Consumption of Seafood in Taiwan. Arch. Environ. Contam. Toxicol. 1998, 35, 711–720. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Peryea, F.J. Historical use of lead arsenate insecticides, resulting soil contamination and implications for soil remediation. In Proceedings of the 16th World Congress of Soil Science, Montpellier, France, 20–26 August 1998; pp. 20–26. [Google Scholar]
  36. Nachman, K.E.; Graham, J.; Price, L.B.; Silbergeld, E.K. Arsenic: A Roadblock to Potential Animal Waste Management Solutions. Environ. Health Perspect. 2005, 113, 1123–1124. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Wadhawan, V. Introduction to Ferroic Materials; CRC Press: Boca Raton, FL, USA, 2000. [Google Scholar] [CrossRef]
  38. Hunger, K. Industrial Dyes: Chemistry, Properties, Applications; John Wiley & Sons: Hoboken, NJ, USA, 2002; p. 20072. [Google Scholar] [CrossRef]
  39. Emsley, J. Nature’s Building Blocks: Everything You Need to Know about the Elements; Oxford University Press: Oxford, UK, 2011; p. 699. [Google Scholar]
  40. Gaetke, L.M. Copper toxicity, oxidative stress, and antioxidant nutrients. Toxicology 2003, 189, 147–163. [Google Scholar] [CrossRef]
Figure 1. Major keywords used in AI technique analysis on HMs.
Figure 1. Major keywords used in AI technique analysis on HMs.
Molecules 27 04220 g001
Figure 2. Sample locations used in the current study.
Figure 2. Sample locations used in the current study.
Molecules 27 04220 g002
Figure 3. Spatial variability of input–output variables used for modeling.
Figure 3. Spatial variability of input–output variables used for modeling.
Molecules 27 04220 g003
Figure 4. Proposed methodology adopted in this study.
Figure 4. Proposed methodology adopted in this study.
Molecules 27 04220 g004
Figure 5. Feature selection of the variables based on the Spearman–Pearson-based correlation technique.
Figure 5. Feature selection of the variables based on the Spearman–Pearson-based correlation technique.
Molecules 27 04220 g005
Figure 6. Proposed architecture for (a) generalized neural network (GRNN) and (b) Elman neural network (Elm NN).
Figure 6. Proposed architecture for (a) generalized neural network (GRNN) and (b) Elman neural network (Elm NN).
Molecules 27 04220 g006aMolecules 27 04220 g006b
Figure 7. Comparative RMSE performance of the GRNN, Elm NN, and MST techniques for the Cr concentration (mg/kg) estimation in both calibration and validation phases.
Figure 7. Comparative RMSE performance of the GRNN, Elm NN, and MST techniques for the Cr concentration (mg/kg) estimation in both calibration and validation phases.
Molecules 27 04220 g007
Figure 8. Comparative (a) PC and (b) NSE performance of the GRNN, Elm NN, and MST techniques for the Cr concentration (mg/kg) estimation in both calibration and validation.
Figure 8. Comparative (a) PC and (b) NSE performance of the GRNN, Elm NN, and MST techniques for the Cr concentration (mg/kg) estimation in both calibration and validation.
Molecules 27 04220 g008
Figure 9. Scatter plots of the GPRNN, Elm NN, and MST for both C1 and C2.
Figure 9. Scatter plots of the GPRNN, Elm NN, and MST for both C1 and C2.
Molecules 27 04220 g009
Figure 10. Trend of estimation of the GRNN, Elm NN, and MST models against the experimental Cr concentration (mg/kg).
Figure 10. Trend of estimation of the GRNN, Elm NN, and MST models against the experimental Cr concentration (mg/kg).
Molecules 27 04220 g010
Figure 11. Predicted spatial analysis for both (a) C1 and (b) C2 combination.
Figure 11. Predicted spatial analysis for both (a) C1 and (b) C2 combination.
Molecules 27 04220 g011
Table 1. Concentration of trace metals in the Dammam area.
Table 1. Concentration of trace metals in the Dammam area.
Agriculture Area
ElementAsBaCdCrCuHgNiPbVZn
Unitmg/kgmg/kgmg/kgmg/kgmg/kgmg/kgmg/kgmg/kgmg/kgmg/kg
Allowable limit125001064636.650140130200
MCLs127501.464636.65070130200
Min0.457.803.030.9704.60.91.6351.015
Max3.135100.621.13574.7 *31.641.02516.2552.3521.88546.25
Mean1.51634.460.0724.48.740.0879.226.4911.5212.43
No. of samples33333333333333333333
Industrial Area
ElementAsBaCdCrCuHgNiPbVZn
Unitmg/kgmg/kgmg/kgmg/kgmg/kgmg/kgmg/kgmg/kgmg/kgmg/kg
Allowable limit125001064636.650140130200
MCLs1220002287915050600130360
Min0.520.000.120.180.014.760.040.090.1
Max4.561966.5 *28.69 *247.695.75 *1.4445.2100.2520.42676.5 *
Mean1.58335.351.87851.7711.00.1113.1411.4213.1165.44
No. of samples33333333333333333333
Residential Area
ElementAsBaCdCrCuHgNiPbVZn
Unitmg/kgmg/kgmg/kgmg/kgmg/kgmg/kgmg/kgmg/kgmg/kgmg/kg
Allowable limit125001064636.650140130200
MCLs125001064636.650140130200
Min0.130.3300.070.040.02.250.080.020.01
Max2.2298.5523.01120.219.170.5913.2325.617.7339.39
Mean0.9734.111.8729.644.380.066.444.797.008.47
No. of samples33333333333333333333
* The value that exceeds the permissible concentration.
Table 2. Performance of the GRNN, Elm NN, and MST techniques.
Table 2. Performance of the GRNN, Elm NN, and MST techniques.
Calibration Phase
NSEPCRMSE
GRNN-C10.9980.9990.990
GRNN-C20.9990.9990.027
Elm NN-C10.5240.7233.677
Elm NN-C20.8980.9486.503
MST-C10.4810.69314.688
MST-C20.7310.85510.575
Validation Phase
GRNN-C10.9760.9880.829
GRNN-C20.9990.9990.158
Elm NN-C10.4070.63815.689
Elm NN-C20.8120.9012.315
MST-C10.1940.4414.785
MST-C20.530.2305.189
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Yassin, M.A.; Tawabini, B.; Al-Shaibani, A.; Adetoro, J.A.; Benaafi, M.; AL-Areeq, A.M.; Usman, A.G.; Abba, S.I. Geochemical and Spatial Distribution of Topsoil HMs Coupled with Modeling of Cr Using Chemometrics Intelligent Techniques: Case Study from Dammam Area, Saudi Arabia. Molecules 2022, 27, 4220. https://doi.org/10.3390/molecules27134220

AMA Style

Yassin MA, Tawabini B, Al-Shaibani A, Adetoro JA, Benaafi M, AL-Areeq AM, Usman AG, Abba SI. Geochemical and Spatial Distribution of Topsoil HMs Coupled with Modeling of Cr Using Chemometrics Intelligent Techniques: Case Study from Dammam Area, Saudi Arabia. Molecules. 2022; 27(13):4220. https://doi.org/10.3390/molecules27134220

Chicago/Turabian Style

Yassin, Mohamed A., Bassam Tawabini, Abdulaziz Al-Shaibani, John Adedapo Adetoro, Mohammed Benaafi, Ahmed M. AL-Areeq, A. G. Usman, and S. I. Abba. 2022. "Geochemical and Spatial Distribution of Topsoil HMs Coupled with Modeling of Cr Using Chemometrics Intelligent Techniques: Case Study from Dammam Area, Saudi Arabia" Molecules 27, no. 13: 4220. https://doi.org/10.3390/molecules27134220

Article Metrics

Back to TopTop