Next Article in Journal
Participatory Mapping of Demand for Ecosystem Services in Agricultural Landscapes
Next Article in Special Issue
A Four Stage Image Processing Algorithm for Detecting and Counting of Bagworm, Metisa plana Walker (Lepidoptera: Psychidae)
Previous Article in Journal
Using Channel and Network Layer Pruning Based on Deep Learning for Real-Time Detection of Ginger Images
Previous Article in Special Issue
Estimation of Soil Nutrient Content Using Hyperspectral Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modeling the Essential Oil and Trans-Anethole Yield of Fennel (Foeniculum vulgare Mill. var. vulgare) by Application Artificial Neural Network and Multiple Linear Regression Methods

by
Mohsen Sabzi-Nojadeh
1,*,
Gniewko Niedbała
2,*,
Mehdi Younessi-Hamzekhanlu
3,
Saeid Aharizad
4,
Mohammad Esmaeilpour
1,
Moslem Abdipour
5,
Sebastian Kujawa
2 and
Mohsen Niazian
6,*
1
Department of Natural Resources and Forestry, Faculty of Agriculture and Natural Resources, University of Tabriz, Ahar 5451645857, Iran
2
Department of Biosystems Engineering, Faculty of Environmental and Mechanical Engineering, Poznań University of Life Sciences, 60627 Poznań, Poland
3
Department of Horticulture Sciences, Faculty of Agriculture and Natural Resources, University of Tabriz, Ahar 5451645857, Iran
4
Department of Plant Breeding and Biotechnology, University of Tabriz, Tabriz 5166616471, Iran
5
Kohgiluyeh and Boyerahmad Agricultural and Natural Resources Research and Education Center, Agricultural Research Education and Extension Organization (AREEO), Yasouj 7589172050, Iran
6
Field and Horticultural Crops Research Department, Kurdistan Agricultural and Natural Resources Research and Education Center, Agricultural Research, Education and Extension Organization (AREEO), Sanandaj 6616936311, Iran
*
Authors to whom correspondence should be addressed.
Agriculture 2021, 11(12), 1191; https://doi.org/10.3390/agriculture11121191
Submission received: 24 September 2021 / Revised: 11 November 2021 / Accepted: 23 November 2021 / Published: 26 November 2021
(This article belongs to the Special Issue Digital Innovations in Agriculture)

Abstract

:
Foeniculum vulgare Mill. (commonly known as fennel) is used in the pharmaceutical, cosmetic, and food industries. Fennel widely used as a digestive, carminative, galactagogue and diuretic and in treating gastrointestinal and respiratory disorders. Improving low heritability traits such as essential oil yield (EOY%) and trans-anethole yield (TAY%) of fennel by direct selection does not result in rapid gains of EOY% and TAY%. Identification of high-heritable traits and using efficient modeling methods can be a beneficial approach to overcome this limitation and help breeders select the most advantageous traits in medicinal plant breeding programs. The present study aims to compare the performance of the artificial neural network (ANN) and multilinear regression (MLR) to predict the EOY% and TAY% of fennel populations. Stepwise regression (SWR) was used to assess the effect of various input variables. Based on SWR, nine traits—number of days to 50% flowering (NDF50%), number of days to maturity (NDM), final plant height (FPH), number of internodes (NI), number of umbels (NU), seed yield per square meter (SY/m2), number of seeds per plant (NS/P), number of seeds per umbel (NS/U) and 1000-seed weight (TSW)—were chosen as input variables. The network with Sigmoid Axon transfer function and two hidden layers was selected as the final ANN model for the prediction of EOY%, and the TanhAxon function with one hidden layer was used for the prediction of TAY%. The results revealed that the ANN method could predict the EOY% and TAY% with more accuracy and efficiency (R2 of EOY% = 0.929, R2 of TAY% = 0.777, RMSE of EOY% = 0.544, RMSE of TAY% = 0.264, MAE of EOY% = 0.385 and MAE of TAY% = 0.352) compared with the MLR model (R2 of EOY% = 0.553, R2 of TAY% = 0.467, RMSE of EOY% = 0.819, RMSE of TAY% = 0.448, MAE of EOY% = 0.624 and MAE of TAY% = 0.452). Based on the sensitivity analysis, SY/m2, NDF50% and NS/P were the most important traits to predict EOY% as well as SY/m2, NS/U and NDM to predict of TAY%. The results demonstrate the potential of ANNs as a promising tool to predict the EOY% and TAY% of fennel, and they can be used in future fennel breeding programs.

1. Introduction

Fennel (Foeniculum vulgare Mill. var. vulgare), which belongs to the Apiaceae family, is an open-pollinated plant, originating from the Mediterranean regions where it is possible to observe high genetic diversity. Fennel essential oil is widely used in pharmaceutical, food, and cosmetic industries. Trans-anethole is the main compound of the fennel essential oil, and the highest value of this compound is existing in seeds of fennel [1]. Trans-anethole is known as a flavoring agent in the food industry and in the production of perfume, as well as an anti-bloating compound in traditional medicine. It is an effective substance in the taste and smell of fennel [2,3,4].
Medicinal plants have a special place in traditional medicine folk and are used as treatment for many diseases [5,6]. Essential oils are naturally occurring in medicinal and aromatic plants which are rich in valuable biochemical compounds with high bioactivities such as antibacterial, antioxidant and phytotoxic activity [7,8,9,10].
Classical breeding methods, including the selection and release of elite cultivars, as well as the recognition of the agronomic traits as the suitable criteria to use in breeding programs, are being remembered as fast, easy and reliable methods to introduce superior fennel cultivars. Screening the best indicators to use in essential oil yield (EOY%) improvement plans and finding the correlation between EOY% and its components is the first priority [11,12,13]. Piccaglia and Marotti [14] reported that biomass weight and the number of umbels per plant are positively related to the essential oil content (R = 0.651 and 0.569, respectively) [14]. Cosge et al. [15] have elucidated that EOY% positively correlated with the 1000-grain weight trait [15]. The phenotypic expression process of quantitative and polygenic traits such as EOY% is non-linear, intricate, complex and time-variant. This intricacy is due to the high diversity within and between populations, also due to environmental impacts [16,17,18]. Therefore, because of the low heritability of EOY%, direct selection to improve this characteristic may lead to low genetic gain [19], whereas the indirect selection of EOY% through high-heritable and related characters with EOY% directly or indirectly affects the EOY% via positive or negative effects of other traits [20,21].
So far, some modeling studies have been used for EOY% prediction and researchers have studied the EOY% using physiological, phenological, morphological and phytochemical properties of plant by application of parametric analysis such as path analysis (PA), stepwise regression (SWR) and other techniques [11,14,15,22]. Previous studies have used linear procedures such as correlation analysis and multiple linear regressions (MLR), in which a linear correlation among variables is presumed. Nevertheless, linear methods were inadequate and could not really explain the interactions between variables and EOY% [23,24,25]. These complex relationships require non-linear methods such as adaptive neuro-fuzzy inference system (ANFIS), Bayesian classification (BC), artificial neural networks (ANNs) and genetic expression programming (GEP) to overcome the drawbacks of linear methods and find an accurate relationship among the studied traits [26,27,28,29,30]. Nonlinear nonparametric machine learning algorithms, such as ANN, have great potential in yield component analysis and indirect selection of highly complex quantitative traits of plants, which are strongly affected by several genes, the environment and their interaction (G × E) [31].
Some researchers have used the ANN method to predict the performance of some medicinal plants such as cumin (Cuminum cyminum L.) [32], ajowan (Trachyspermum ammi L.) [33] and sunflower (Helianthus annuus) [30].
The ANN topology is used to solve complex systems which it tries to imitate into numerical models [23]. ANN models are classified according to their structure, neurons type, etc. Furthermore, according to the training convergence in an ANN model, different algorithms can be used [34]. Multi-layer perceptron (MLP) is one of the most commonly used ANNs in biological studies [25,35,36,37,38,39]. An MLP is a feed-forward ANN model that contains an input layer, one or more hidden layers and an output layer. In each MLP, multiple layers of nodes in a directed graph are fully connected to the next one and each node (except for the input nodes) is a neuron with a nonlinear activation function [40]. According to the literature review, there is no report on forecasting fennel EOY% and trans-anethole yield (TAY%) using ANN methods, and predicting these traits based on the characteristics and parameters affecting them using an effective ANN method seems necessary to facilitate the process of breeding such complex traits. Therefore, the aims of the current project were to (1) model and predict the most important ingredient in the essential oil of fennel using an artificial neural network, (2) compare the predicted results with the results of conventional regression-based method and (3) determine the most important selection criteria for EOY% and TAY% in different fennel populations using sensitivity analysis. The developed model could be helpful to improve the TAY% and EOY% of different fennel subspecies/varieties.

2. Materials and Methods

2.1. Plant Material Source and Recorded Traits

In the present study, the seeds of 16 populations of fennel (Foeniculum vulgare Mill. var. vulgare) from Iran, 2 populations from Turkey and 2 populations from Germany (Table 1) were collected and cultured in the experimental field section at the Faculty of Agriculture, University of Tabriz, Iran (46°17′ N, 37°5′ E), with an altitude of 710 m during 2017–2018. Plant authentication was performed by using the voucher specimens (Table 1) available at the herbarium of the University of Tabriz, Tabriz, Iran. The soil texture was loamy clay with a pH value of 7.5 and less than 1% organic matter. The field was not under cultivation for any plant during the past year. The seeds of each population were sown in a plot (4 m × 0.5 m) as a randomized complete block design (RCBD) with three replications. The fertilizers used in this study were 70 kg N, 40 kg P and 25 kg K per hectare. Essential oil yield (EOY%), trans-anethole yield (TAY%), and different agro-morphological traits including number of days to germination (NDG), number of days to 50% flowering (NDF50%), number of days to 100% flowering (NDF100%), number of days to maturity (NDM), initial plant height (IPH) (plant height at the time of the first inflorescence emergence), final plant height (FPH) (plant height at harvest time), number of stems (NS), stem diameter (SD), number of internodes (NI), length of the first internode (LFI), length of the longest internode (LLOI), length of the last internode (LLAI), length of the peduncle (LP), number of umbels (NU), biomass per square meter (B/m2), 1000-seed weight (TSW), seed yield per plant (SY/P), seed yield per square meter (SY/m2), number of seeds per plant (NS/P), number of seeds per umbel (NS/U) and harvest index (HI) were randomly recorded from 15 plants per plot. An analysis of variance (ANOVA) was conducted to assess the significant statistical differences among evaluated fennel populations for the studied characteristics. A normality test was conducted with SAS software before the analysis of variance. The means of significant differences of traits (average of two years) were used for the statistical analysis (Table 2).

2.2. Isolation of Essential Oils and GC/MS Analysis

For the isolation of essential oils, 100 g of mature seeds of each population were subjected to hydro-distillation using a Clevenger-type apparatus for 3 h and the collected essential oil was dried over anhydrous sodium sulfate and kept at 4 °C until analysis. Some physical characteristics of mature seed including moisture content and average length, thickness and density of selected seeds were equal to 8%, 5.5 mm, 1.6 mm and 410 Kg/m3, respectively. Chemical compositions of essential oils were analyzed by an Agilent 7890A Network GC system pooled with Agilent 5975C Network with Triple-Axis mass detector. The GC analysis was carried out on the Agilent 7890A Network GC system equipped with a splitless model injector (with 1.0 µm volume and 250 °C temperature). The carrier gas was helium with a flow rate of 1.1 mL/min and the capillary column used was HP 5-MS (30 m × 0.25 mm, film thickness 0.25 µm). The column pressure was fixed to 56,054.38 Pa. The oven temperature was initially kept at 50 °C for 2 min after injection and then increased to 250 °C with a rate of 6 °C/min heating ramp and kept constant at 250 °C for 4 min. The ionization voltage and mass range were 70 eV and 34–500 m/z, respectively. The temperatures 280 °C and 250 °C were used as anion source and interface temperatures, respectively. Constituents of the essential oils were recognized based on their retention time and mass spectra pattern with related available data or with the Wiley library and literature. Percentages of each compound were calculated from the given GC peak area and these data were used for quantification purposes.

2.3. Data Processing and Statistical Analysis

2.3.1. Input Variables Selection

To ensure a realistic model, only a portion of independent variables was carefully chosen. At the beginning of this process, the relationship between the dependent and independent variables was found using Pearson’s correlation. The important input variables were selected based on the stepwise regression (SWR) analysis results. Pearson correlation and SWR analyses were conducted using SAS® software.

2.3.2. Multiple Linear Regression

Multiple linear regression (MLR) (stepwise method) was used and for each dependent variable (EOY% and TAY%), the desired models were fixed. All values for the independent variables X (Table 2) are related to the dependent value of the variable Y. The general equation is as follows (Equation (1)):
y i = β 0 + β 1 × 1 + β 2 × 2 + + β n x n + ε i
where yi is EOY% or TAY%, β0 + βn are coefficients of regression, x1xn are input variables and ε is an error associated with the ith observation. Stepwise regression was applied to estimate the MLR coefficients. The MLR analysis was carried out using SAS® software.

2.3.3. Artificial Neural Network

The computation of ANN was conducted using Neuro-Solutions software version 5.07 software package from NeuroDimension Inc. (http://www.Neurosolutions.com, accessed on 22 November 2021). The variables including EOY% and TAY% were used as the dependent and the other traits were defined as independent variables. Two different ANN models were established for each EOY% and TAY%. The training and testing of ANN and MLR were carried out based on the recorded traits from 15 samples of each of the 20 fennel populations over two years. Therefore, the field experiment dataset was based on the 15 samples of 20 fennel populations. All the data were randomly divided into three subsets: 65% for training, 20% for network test and 15% for validation. This classification was based on (i) the results of previous studies with the same number of data, and (ii) trial and error and comparing the modeling results with different ratios. Descriptive statistical analysis of the measured traits in two years is shown in Table 2.
For the efficient ANN analysis and to avoid bias estimation due to differences in units of input variables, all data were normalized and transferred into values between −1 and +1 for hyperbolic tangent and 0 and 1 for sigmoid transfer functions [25] using Equation (2).
x norm = [ ( ( x i x min x max x min ) ) × 0.8 ] + 1
where xi is the original data, xnorm is the normalized input or output values and xmax and xmin are the maximum and minimum values of the resultant variable, respectively.
The three main input, hidden and output layers are essential for building the topology of a neural network system. In this study, the output of the network is assumed by Equation (3).
y t = α 0 + j = 1 n α j f ( i = 1 m β i j y t 1 + β 0 j ) + ε t
where yt is the network output (essential oil), n and m are the number of hidden nodes and number of input nodes, respectively, and f shows the transfer function. β i j { i = 1, 2, …, m ; j = 0, 1, …, n } are the weights from the input to hidden nodes, α j {j = 0, 1, …, n } are the vectors of weights from the hidden to the output nodes and α 0 and β 0 j denote the weights of arcs leading from the bias terms, which always are equal to 1.
The feed-forward multilayer perceptron (MLP) architecture with three layers and a Back-Propagation (BP) training algorithm along with the Levenberg–Marquardt, Momentum and Conjugate Gradient learning algorithms were applied in the present study. The most appropriate topology in various numbers of hidden layers (1–4) and neurons related to each layer was determined by trial and error tests. Several activation functions including Tangent Hyperbolic Axon, Linear Tangent Hyperbolic Axon, Sigmoid Axon and Linear Sigmoid Axon were tested with the aim of finding the equation with high capability in both hidden and output layers.

2.4. Performance and Sensitivity Analysis

Three statistical quality parameters, mean absolute error (MAE), root mean square error (RMSE) and coefficient of determination (R2), were used to compare the performance of the developed ANN with different transfer functions and hidden layers and MLR models for estimating the desired output of EOY% and TAY% according to Equations (4)–(6), respectively.
MAE = 1 n i , j = 1 n | y i y j |
RMSE = i , j = 1 n ( y i y j ) 2 n
R 2 = i , j = 1 n ( y i y i ¯ ) ( y j y j ¯ ) i = 1 n ( y i y i ¯ ) 2 j = 1 n ( y j y j ¯ ) 2
where n is the number of data, y i is the observed values, y j represents the predicted values and the bars denote the mean of the variable. High values of R2 and low values of RMSE and MAE indicate the better performance of the ANN and MLR model.
After developing the final ANN model, a sensitivity test was applied for choosing the most influential input variables on the EOY% and TAY% as the outputs. For this, the dataset was run without any input variables (i.e., SY, NS, NDM, TSW, NU and NI), and the models’ performance was assessed using R2, RMSE and MAE. Neuro-Solutions software (version 5.0) was used for the ANN model developing, evaluating and sensitivity analysis.

3. Results and Discussions

3.1. Selection of Input Variables

Since the input variables have a significant effect on the weighted coefficients and the final architecture of the model, the selection of these variables is a very important step for the development of the model [35]. To this end, Pearson’s correlation coefficient was used to consider the relationship between dependent variables (EOY% and TAY%) and the other characteristics (as the independent variables) (Figure 1 and Figure 2). EOY% had the highest positive correlation with HI (R = 0.794), followed by SY/m2 (R = 0.756), NU (R = 0.754), SY/P (R = 0.732), NDM (R = 0.706), NS/P (R = 0.699), LFI (R = 0.676) and NS/U (R = 0.467). A negative significant correlation coefficient was observed between EOY% and NDF50% (R = −0.842), NDF100% (R = −0.659), NI (R = −0.719), LLOI (R = −0.713), FPH (R = −0.661), LP (R = −0.616) and SP (R = −0.518) (Figure 1). Rahimmalek et al. [41] also reported a negative significant correlation between essential oil yield with plant height and flowering date of Iranian fennel accessions. Overall, the results of the correlation analysis showed that HI, SY/m2, NU, SY/P and NS/P are the most important parameters to determine essential oil yield in fennel populations. As reported by Bahmani et al. [11,22] and Cosges [15], there is a significant correlation between the essential oil content of fennel and length of the peduncle, stem diameter, plant height, the weight of dry biomass, number of nodes, number of leaves, length of middle internodes, number of inflorescences and 1000-seed weight [11,15,22].
TAY% (as a second dependent variable) had the strongest positive correlation with HI (R = 0.823) followed by SY/P (R = 0.693), SY/m2 (R = 0.693), EOY% (R = 0.590), NS/P (R = 0.573), NU (R = 0.572) and LFI (R = 0.456), as well as a negative significant correlation with LLOI (Figure 2). The correlation between various traits can be positively or negatively affected by other variables and these low coefficients can significantly reduce the capability of the correlation analysis to select the input variables [42,43]. However, there are parameters than other morphological and yield components that affect the oil yield and trans-anethole content of fennel. The correlation of climatic data (temperature) with oil yield and trans-anethole content of Iranian fennel accessions was assessed and a negative correlation between oil yield and Tmax and a positive correlation between trans-anethole and Tmax (r = 0.459) were reported [41]. These results indicate the importance of environmental parameters in assessing the correlation analysis of fennel populations. Incorporating such data into the model can increase the decision-making power and accuracy of the predictive model. In addition to perform the correlation analysis, stepwise regression (SWR) analysis was employed in this study to optimize the number of input variables [43].
Based on the SWR (Table 3 and Table 4), nine traits out of eleven (SY/m2, NDF 50%, NS/P, NU, FPH, NS/U, NDM, TSW and NI) were entered into the models as the most suitable input variables (the dependent variables were EOY% and TAY%). The number of umbels is an important yield component characteristic that can affect grain yield and subsequently the essential oil yield of fennel populations. This characteristic affect both EOY% and TAY% of the evaluated fennel population of the present study. These results are consistent with the results of Sefidan et al. [44] and Kalleli et al. [45].
Although HI and LFI were significantly correlated with EOY% and TAY%, according to the SWR analysis, these parameters could not be recognized as appropriate input variables (Table 3 and Table 4). MLR results revealed the weakness of the correlation method, which can be due to the indirect positive and negative effects of other traits on the correlation between dependent traits (EOY% and TAY%) and other independent traits [43]. Bahmani et al. [11] also applied SWR to identify to most important characteristics affecting the essential oil content of Iranian fennel and reported low partial R2 values for inserted variables in the model (partial R2 = 0.32, 0.06, 0.03, 0.02 and 0.02 for number of leaves, length of peduncle, plant height and days to 50% flowering, respectively). The low estimated partial R2 values for all independent variables indicate the insufficient efficiency of the linear regression model in interpreting the relationships between independent and dependent variables. Therefore, a non-linear model is needed to better interpret these relationships.

3.2. Prediction of Dependent Variables Using MLP/ANN Model

Several factors as numbers of the hidden layer (s) and their neurons (nods) and determination of the transfer function are important for the selection of input variables. They can be determined using trial and error [46]. There are four different transfer functions to running supervised neural networks, namely Sigmoid Axon, Linear Sigmoid Axon, Tangent Hyperbolic Axon and Linear Tangent Hyperbolic Axon, to select the proper transfer function (Table 5).
As presented in Table 6, the lowest values of MAE and RMSE and the highest R2 values were obtained by the Sigmoid Axon function in both training and testing stages for the prediction of EOY%, as well as the TanhAxon transfer function to predict TAY%.
The results of the linear transfer functions are not presented to maintain the clarity of the useful and applied results. In the testing and training phases, linear transfer functions reduced the efficiency of the models. These functions apply a simple linear conversion to the processed input variables and transfer it to the output stage, whereas nonlinear and TanhAxon functions produce outputs in the ranges 0 to 1 and −1 to 1, respectively [25]. Similar to the present study, Sigmoid Axon and TanhAxon functions have been applied in previous studies [24,25,32,35]. This is probably due to the high capability of these functions to justify nonlinear changes compared to the other functions. Various ANN models were implemented by selected transfer functions, with 1–5 hidden layers and 1–20 nodes per layer. As shown in Table 6, for the prediction of EOY%, the ANN model with two hidden layers provided the best performances in the training phases (R2 = 0.953, RMSE = 0.522, and MAE = 0.375) and testing (R2 = 0.929, RMSE = 0.544 and MAE = 0.385). Therefore, the results of Table 5 and Table 6 reveal that the best EOY% (essential oil) predictive model consisted of an input layer with 11 input variables (NDF50%, NDF100%, NDM, FPH, NI, NU, SY/P, SY/m2, NS/P, NS/U, and TSW) and two hidden layers with nine and seven neurons in each layer, i.e., the 11-9-7-1 structure (Figure 3). The TanhAxon transfer function, Momentum learning algorithm and one hidden layer (with 11-10-1 structure) were the best parameters in the ANN model to predict TAY% of fennel (Table 6). This topology had the minimum amounts of RMSE and MAE and the highest coefficient of determination (Table 6). Levenberg–Marquardt back-propagation and Logsig and Tansig transfer functions for hidden and output layers algorithm and the number of 10 neurons in the hidden layer have been reported as best parameters of an ANN for the modeling and optimization of anethole ultrasound-assisted extraction from fennel seeds [47]. One of the main objectives of ANN modeling studies is to achieve a simple model with the least number of hidden layers and neurons and the highest performance values [30,32]. Niazian et al. [25] reported an ANN model with a 4-4-1 structure, for the prediction of grain yield in ajowan (Trachyspermum ammi L.) belonging to the Apiaceae family [25]. These results will be useful to fit an excellent model structure of ANN in future research on the Apiaceae family.
An insufficient number of epochs can decrease the ANN performance and too many epochs can increase the risk of network overtraining and subsequent memorization [25]. To minimize the over-training and memorization, a pretest using two hidden layers and various numbers of epochs (50–2000) was conducted and the convergence point between training and validation was considered as the completion of training time to avoid over-training (Figure 4).
The comparison of predicted and measured EOY% values is shown in Figure 5 and Figure 6 for both training and testing datasets in the form of scatter plots. As shown in the scatter plots, the measured data and the ANN model had the same distribution. The EOY% values predicted by the ANN model tended to follow the corresponding actual ones quite closely.
A scatter plot was also applied to compare observed and predicted values of TAY% from the ANN model in both training and testing datasets. The ability of the ANN model to predict TAY% in training (R2 = 0.794) and testing (R2 = 0.777) stages are shown in Figure 7 and Figure 8, respectively. According to the scatter plot, there was no significant difference between predicted data and measured data of TAY% in the ANN model in both training and testing datasets (Figure 7 and Figure 8).

3.3. Comparing MLR and ANN Models to Predict EOY% and TAY% of Fennel Populations

The MLR models, especially when there are linear relationships between the input and output variables are known as efficient modeling approaches [48]. In order to determine the strength of linear regression to predict EOY% and TAY%, two stepwise regression of MLR models were determined and the following equations (Equations (7) and (8)) were computed to predict dependent variables.
EOY% = 0.037 + 0.216 SY/m2 + 0.113 NDF 50% + 0.126 NS/P + 0.086 NU + 0.079 FPH
TAY% = 0.915 + 0.196 SY/m2 + 0.108 NS/U + 0.089 NDM + 0.073 TSW + 0.065 NU + 0.044 NI
According to Equation (7), the predicted value of EOY% is a linear transformation of seed yield per square meter, number of days to 50% flowering, number of seeds per plant, number of umbels and final plant height variables. On the other hand, seed yield per square meter, number of seeds per umbel, number of days to maturity, thousand seed weight, number of umbels and number of internodes were entered in the SWR model, when TAY% was considered as a dependent variable (Equation (8)). The depended variables are those that selected based on stepwise regression analysis (Table 3 and Table 4). As shown in Table 3 and Table 4, independent variables explained approximately 55% and 47% of variations in EOY% and TAY%, respectively. Therefore, the linear model was not strong enough to explain the variations of the dependent variables. The MLR constructions (Equations (7) and (8)) showed the importance and the effect of independent variables on dependent variables and these equations revealed that how the amounts of EOY% or TAY% in fennel can change by different amounts of independent variables. Bahmani et al. [11] used an MLR model to find the relationship between independent variables and grain yield of fennel and showed that 55.41%, 12.72%, 2.21% and 11.63% of total variance of grain yield was explained by weight of dry biomass, days to 50% flowering, number of inflorescent and days to 70% seed pasty, respectively [11]. Niazian et al. [25] studied the seed yield of ajowan using an MLR model and introduced shoot dry weight, number of umbellets in main inflorescence, number of branches and the biological yield as important independent variables [25].
The results of the ANN and MLR models’ development based on performance evaluation indices including R2, RMSE and MAE provide a set of the reasonable criteria for comparison between two modeling methods. Compared to the MLR model, the ANN models could predict EOY% and TAY% much better than the MLR model with 39.97% and 32.69% increases in R2, reductions of 0.30 and 0.20 in RMSE and reductions of 0.25 and 0.12 in MAE, respectively. According to the obtained results, the ANN model had higher predictive power than the MLR model and was more efficient than MLR in predicting EOY% and TAY% traits in fennel populations. The different performance of the two models to predict EOY% and TAY% shows the importance of choosing the more suitable model. The superiority of the ANN modeling methods compared to the MLR methods has been reported in other previous studies [24,25,32,35]. The supremacy of ANN modeling seems to be due to the high capability of this model to capture the highly nonlinear and complex relationship between EOY% or TAY% and the relevant traits [23]. There is a considerable variation among different populations of fennel in terms of seed yield, yield components, essential oil content and essential oil composition [11,49]. This variation along with high genotype × environment interaction create a difficult situation to improve the fennel population for the desired traits in a short period using conventional statistical methods and direct selection [49]. However, using non-linear predicting methods, breeders are able to estimate the desired values of their desired traits in a faster and more confident way. Therefore, an advanced computational method can play a complementary role to conventional statistical methods previously employed to improve the fennel populations [11,49].

3.4. Sensitivity Analysis

A sensitivity analysis is a method of studying the behavior of a model and assessing the significance of each input variable on the values of the output variable of the model. Sensitivity analysis provides insight into the usefulness of individual variables. With the help of this kind of analysis, it is possible to judge which inputs for modeling EOY% or TAY% parameters should be considered as the most and least significant ones in the ANN model [36]. For this purpose, the sensitivity tests for ANN and MLR models were performed without a specific input variable, i.e., SY/m2, NS/P, NDF50%, NS/U and NDM. The results of the sensitivity test for EOY% showed that the highest RMSE (0.608, 0.911) and MAE (0.439, 0.659) and the lowest R2 (75.45, 39.12) were achieved without seed yield per square meter in both ANN and MLR models (Table 7). Number of days to 50% flowering and number of seeds per plant were the other most effective characteristics on the EOY% of fennel populations. As shown in Table 7, the ANN and MLR models for TAY% without the seed yield per square have the lowest R2 (61.43 and 35.52) and highest RMSE (0.332 and 0.532) and MAE (0.487 and 0.471), respectively. As the results showed, SY/m2 is the most influential factor to predict EOY% and TAY% in both models.
Bahmani et al. [11] investigated the direct and indirect effects of some morphological traits on the essential oil content of fennel populations by using path analysis. They reported that the number of leaves, days to 50% flowering and plant height had direct effects on essential oil content in the first year of the experiment as well as days to 50% flowering, stem diameter and the number of seeds per largest inflorescence in the second year of the experiment [11]. Abdipour et al. [24] used the sensitivity analysis in both ANN and MLR models to find the importance of each input variable on the oil content of sesame and reported capsule number per plant as the most important input variable that can significantly affect RMSE, MAE and R2 of both ANN and MLR models [24]. In the other study, sensitivity tests were conducted in both MLR and ANN models and results showed that the highest RMSE and MAE and the lowest R2 were achieved in the MLR and ANN models without biological yield [25].
Since fennel is an indeterminate plant, having continuous growth of new leaves, flowers and seeds during the growing season, it can be said that indeterminate populations of fennel with a long period from flowering to maturity are the best in stable environmental conditions. This finding can be seen in the results of the sensitivity analysis (Table 7), indicating that the number of days to 50% flowering and number of days to maturity had significant effects on EOY% and TAY%, respectively. The findings of previous studies determined possible differences and similarities in essential oils and chemical composition of various plants at different phenological stages in the various medicinal plants [49,50,51]. The results showed that the EOY% and TAY% depend on different phenological stages. The effect of phenological stages on essential oil and its composition may be due to its effect on enzyme activity and the metabolism of essential oil production [52]. The contribution of each input trait to predict EOY% and TAY% are ranked from highest to lowest in both ANN and MLR models in Table 7.

4. Conclusions

Identifying high-heritable yield components and using efficient modeling methods can help breeders select the most advantageous traits in medicinal plant breeding programs. The results of the present study revealed that the ANN compared to the MLR was able to predict the EOY% and TAY% of fennel populations with more accuracy. The classical MLR model could not interpret the non-linear relationships between EOY% and TAY% and their corresponding independent variables. However, the ANN model showed more accuracy in interpreting complex relationships among EOY%, TAY% and other variables in the model, according to R2, RMSE and MAE indicators. These results showed that the selected ANN model could surely replace MLR to predict EOY% and TAY% of fennel populations. Based on the sensitivity analysis, SY/m2, NDF50%, and NS/P were the most important traits to predict EOY%, whereas SY/m2, NS/U and NDM were the most important traits to predict the TAY% of fennel populations. The findings of the present study can provide important information to improve the EOY% of the other medicinal plants of the Apiaceae family. Plant breeders can also use the optimized artificial neural network models to model other complicated polygenic traits of medicinal plants, such as the content of various secondary metabolites that are more valuable for the food and pharmaceutics industries.

Author Contributions

Conceptualization, M.S.-N., G.N., M.Y.-H., S.A. and M.E.; methodology, M.S.-N., G.N., M.Y.-H., S.A. and M.A.; software, M.A.; validation, G.N., M.A. and S.K.; formal analysis, M.S.-N., G.N., M.Y.-H., S.A. and M.A.; investigation, M.S.-N., M.Y.-H., S.A. and M.A.; resources, M.S.-N., M.Y.-H., S.A. and M.E.; data curation, M.S.-N., M.Y.-H., S.A. and M.E.; writing—original draft preparation, M.S.-N., G.N., M.Y.-H. and S.A.; writing—review and editing, G.N., M.Y.-H., S.K. and M.N.; visualization, M.S.-N., M.Y.-H. and S.A.; supervision, M.S.-N., G.N., M.Y.-H., S.A. and M.E.; project administration, M.S.-N., M.Y.-H. and S.A.; funding acquisition, M.S.-N., M.Y.-H., S.A. and M.E. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated during and/or analysed during the current study are available from the first corresponding author on reasonable request.

Acknowledgments

This paper was published as part of a research project (contract number 27.21) supported by the Research Affairs Office of the University of Tabriz.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations:

ANNArtificial neural network
EOYEssential oil yield
FPHFinal plant height
HIHarvest index
LFILength of the first internode
LLAILength of the last internode
LLOILength of the longest internode
LPLength of the peduncle
MAEMean absolute error
MLRMultilinear regression
NDF50%Number of days to 50% flowering
NDF100%Number of days to 100% flowering
NDGNumber of days to germination
NDMNumber of days to maturity
NINumber of internodes
NSNumber of stems
NS/PNumber of seeds per plant
NS/UNumber of seeds per umbel
NUNumber of umbels
RMSERoot mean square error
SDStem diameter
SWRStepwise regression
SYSeed yield
SY/m2Seed yield per square meter
SY/PSeed yield per plant
TAYTrans-anethole yield
TSW1000-seed weight

References

  1. Nojadeh, M.S.; Pouresmaeil, M.; Younessi-Hamzekhanlu, M.; Venditti, A. Phytochemical profile of fennel essential oils and possible applications for natural antioxidant and controlling Convolvulus arvensis L. Nat. Prod. Res. 2020, 1–5. [Google Scholar] [CrossRef]
  2. Yaylayan, V.A. Flavor Technology: Recent Trends and Future Perspectives. Can. Inst. Food Sci. Technol. J. 1991, 24, 2–5. [Google Scholar] [CrossRef]
  3. Guillén, M.D.; Manzanos, M.J. A study of several parts of the plant Foeniculum vulgate as a source of compounds with industrial interest. Food Res. Int. 1996, 29, 85–88. [Google Scholar] [CrossRef]
  4. He, W.; Huang, B. A review of chemistry and bioactivities of a medicinal spice: Foeniculum vulgare. J. Med. Plants Res. 2011, 5, 3595–3600. [Google Scholar]
  5. Younessi-Hamzekhanlu, M.; Abdipour, M.; Dejahang, A.; Sabzi-Nojadeh, M.; Amani, M. Herbals Used in Western Iran as Food and for Health Treatments. In Biodiversity, Conservation and Sustainability in Asia; Springer: Cham, Switzerland, 2021; pp. 547–599. [Google Scholar]
  6. Younessi-Hamzekhanlu, M.; Ozturk, M.; Altay, V.; Nojadeh, M.S.; Alakbarli, F. Ethnopharmacological study of medicinal plants from khoy city of west Azerbaijan-Iran. Indian J. Tradit. Knowl. 2020, 19, 251–267. [Google Scholar]
  7. Pouresmaeil, M.; Nojadeh, M.S.; Movafeghi, A.; Maggi, F. Exploring the bio-control efficacy of Artemisia fragrans essential oil on the perennial weed Convolvulus arvensis: Inhibitory effects on the photosynthetic machinery and induction of oxidative stress. Ind. Crops Prod. 2020, 155, 112785. [Google Scholar] [CrossRef]
  8. Aghbashd, B.N.; Pouresmaeil, M.; Dehghan, G.; Nojadeh, M.S.; Mobaiyen, H.; Maggi, F. Chemical Composition, Antibacterial and Radical Scavenging Activity of Essential Oils from Satureja macrantha C.A.Mey. At different growth stages. Foods 2020, 9, 494. [Google Scholar] [CrossRef] [Green Version]
  9. Aghbash, B.N.; Dehghan, G.; Movafeghi, A.; Talebpour, A.H.; Pouresmaeil, M.; Maggi, F.; Nojadeh, M.S. Chemical compositions and biological activity of essential oils from four populations of Satureja macrantha C.A.Mey. J. Essent. Oil Res. 2021, 33, 133–142. [Google Scholar] [CrossRef]
  10. Younessi-Hamzekhanlu, M.; Sanjari, S.; Dejahang, A.; Karkaj, E.S.; Nojadeh, M.S.; Gönenç, T.M.; Ozturk, M. Evaluation of Essential Oil From Different Artemisia fragrans Willd. Populations: Chemical Composition, Antioxidant, and Antibacterial Activity. J. Essent. Oil-Bear. Plants 2020, 23, 1218–1236. [Google Scholar] [CrossRef]
  11. Bahmani, K.; Darbandi, A.I.; Ramshini, H.A.; Moradi, N.; Akbari, A. Agro-morphological and phytochemical diversity of various Iranian fennel landraces. Ind. Crops Prod. 2015, 77, 282–294. [Google Scholar] [CrossRef]
  12. Devesh, P.; Moitra, P.K.; Shukla, R.S. Correlation and path coefficient analysis for yield, yield components and quality traits in wheat. Electron. J. Plant. Breed. 2021, 12, 388–395. [Google Scholar] [CrossRef]
  13. Togay, N.; Togay, Y.; Dogan, Y. Correlation and path coefficient analysis for yield and some yield components of wheat (Triticum aestivum l.). Oxid. Commun. 2017, 40, 946–951. [Google Scholar]
  14. Piccaglia, R.; Marotti, M. Characterization of some Italian types of wild fennel (Foeniculum vulgare mill.). J. Agric. Food Chem. 2001, 49, 239–244. [Google Scholar] [CrossRef] [PubMed]
  15. Coşge, B.; Ipek, A.; Gürbüz, B. Some phenotypic selection criteria to improve seed yield and essential oil percentage of sweet fennel (Foeniculum vulgare Mill. var. dulce). Tarim. Bilim. Derg. 2009, 15, 127–133. [Google Scholar] [CrossRef] [Green Version]
  16. Coruzzi, G.M.; Burga, A.R.; Katari, M.S.; Gutiérrez, R.A. Systems Biology: Principles and Applications in Plant Research. In Annual Plant Reviews Online; Wiley-Blackwell: London, UK, 2018; pp. 3–40. [Google Scholar]
  17. Gago, J.; Martínez-Núñez, L.; Landín, M.; Gallego, P.P. Artificial neural networks as an alternative to the traditional statistical methodology in plant research. J. Plant. Physiol. 2010, 167, 23–27. [Google Scholar] [CrossRef] [PubMed]
  18. Prasad, V.S.S.; Gupta, S.D. Applications And Potentials Of Artificial Neural Networks In Plant Tissue Culture. In Plan Tissue Culture Engineering; Springer: Dordrecht, The Netherlands, 2007; pp. 47–67. [Google Scholar]
  19. Gross, M.; Lewinsohn, E.; Tadmor, Y.; Bar, E.; Dudai, N.; Cohen, Y.; Friedman, J. The inheritance of volatile phenylpropenes in bitter fennel (Foeniculum vulgare Mill. var. vulgare, Apiaceae) chemotypes and their distribution within the plant. Biochem. Syst. Ecol. 2009, 37, 308–316. [Google Scholar] [CrossRef]
  20. Kiruthika, S.; Narayanan, S.L.; Parameshwari, C.; Arunachalam, P.; Mini, M.L. Studies on trait association and path co-efficient analysis of sesame (Sesamum sp.) for quantitative traits and oil quality parameters. Electron. J. Plant. Breed. 2020, 11, 18–24. [Google Scholar] [CrossRef] [Green Version]
  21. Ibhrahim, E.; Khidir, O. Genotypic correlation and path coefficient analysis of yield and some yield components in sesame (Sesamum indicum L.). Int. J. AgriScience 2012, 2, 664–670. [Google Scholar]
  22. Bahmani, K.; Izadi-Darbandi, A.; Noori, S.A.S.; Jafari, A.A.; Moradi, N. Determination of interrelationships among phenotypic traits of iranian fennel (foeniculum vulgare mill.) using correlation, stepwise regression and path analyses. J. Essent. Oil-Bear. Plants 2012, 15, 424–444. [Google Scholar] [CrossRef]
  23. Khairunniza Bejo, S.; Mustaffha, S.; Khairunniza-Bejo, S.; Ishak, W.; Ismail, W. Application of Artificial Neural Network in Predicting Crop Yield: A Review. J. Food Sci. Eng. 2014, 4, 1–9. [Google Scholar]
  24. Abdipour, M.; Ramazani, S.H.R.; Younessi-Hmazekhanlu, M.; Niazian, M. Modeling Oil Content of Sesame (Sesamum indicum L.) Using Artificial Neural Network and Multiple Linear Regression Approaches. JAOCS J. Am. Oil Chem. Soc. 2018, 95, 283–297. [Google Scholar] [CrossRef]
  25. Niazian, M.; Sadat-Noori, S.A.; Abdipour, M. Modeling the seed yield of Ajowan (Trachyspermum ammi L.) using artificial neural network and multiple linear regression models. Ind. Crops Prod. 2018, 117, 224–234. [Google Scholar] [CrossRef]
  26. Iquebal, M.A.; Ansari, M.S.; Sarika, S.; Dixit, S.P.; Verma, N.K.; Aggarwal, R.A.K.; Jayakumar, S.; Rai, A.; Kumar, D. Locus minimization in breed prediction using artificial neural network approach. Anim. Genet. 2014, 45, 898–902. [Google Scholar] [CrossRef]
  27. Khoshnevisan, B.; Rafiee, S.; Mousazadeh, H. Application of multi-layer adaptive neuro-fuzzy inference system for estimation of greenhouse strawberry yield. Meas. J. Int. Meas. Confed. 2014, 47, 903–910. [Google Scholar] [CrossRef]
  28. Silva, G.N.; Tomaz, R.S.; de Castro Sant’Anna, I.; Nascimento, M.; Bhering, L.L.; Cruz, C.D. Neural networks for predicting breeding values and genetic gains. Sci. Agric. 2014, 71, 494–498. [Google Scholar] [CrossRef] [Green Version]
  29. Goel, P.; Bapat, S.; Vyas, R.; Tambe, A.; Tambe, S.S. Genetic programming based quantitative structure-retention relationships for the prediction of Kovats retention indices. J. Chromatogr. A 2015, 1420, 98–109. [Google Scholar] [CrossRef]
  30. Zeng, W.; Xu, C.; Wu, J.; Huang, J. Sunflower seed yield estimation under the interaction of soil salinity and nitrogen application. Field Crops Res. 2016, 198, 1–15. [Google Scholar] [CrossRef]
  31. Niazian, M.; Niedbała, G. Machine learning for plant breeding and biotechnology. Agriculture 2020, 10, 436. [Google Scholar] [CrossRef]
  32. Mansouri, A.; Fadavi, A.; Mortazavian, S.M.M. An artificial intelligence approach for modeling volume and fresh weight of callus-A case study of cumin (Cuminum cyminum L.). J. Theor. Biol. 2016, 397, 199–205. [Google Scholar] [CrossRef]
  33. Niazian, M.; Sadat-Noori, S.A.; Abdipour, M. Artificial neural network and multiple regression analysis models to predict essential oil content of ajowan (Carum copticum L.). J. Appl. Res. Med. Aromat. Plants 2018, 9, 124–131. [Google Scholar] [CrossRef]
  34. Tanty, R.; Tanweer, S. Desmukh Application of Artificial Neural Network in Hydrology-A Review. Int. J. Eng. Res. 2015, 4, 184–188. [Google Scholar] [CrossRef]
  35. Emamgholizadeh, S.; Parsaeian, M.; Baradaran, M. Seed yield prediction of sesame using artificial neural network. Eur. J. Agron. 2015, 68, 89–96. [Google Scholar] [CrossRef]
  36. Naroui Rad, M.R.; Koohkan, S.; Fanaei, H.R.; Rad, M.R.P. Application of artificial neural networks to predict the final fruit weight and random forest to select important variables in native population of melon (Cucumis melo. Pahlavan). Sci. Hortic. 2015, 181, 108–112. [Google Scholar] [CrossRef]
  37. Safa, M.; Samarasinghe, S.; Nejat, M. Prediction of wheat production using artificial neural networks and investigating indirect factors affecting it: Case study in canterbury province, New Zealand. J. Agric. Sci. Technol. 2015, 17, 791–803. [Google Scholar]
  38. Hara, P.; Piekutowska, M.; Niedbała, G. Selection of independent variables for crop yield prediction using artificial neural network models with remote sensing data. Land 2021, 10, 609. [Google Scholar] [CrossRef]
  39. Niedbała, G.; Piekutowska, M.; Weres, J.; Korzeniewicz, R.; Witaszek, K.; Adamski, M.; Pilarski, K.; Czechowska-Kosacka, A.; Krysztofiak-Kaniewska, A. Application of artificial neural networks for yield modeling of winter rapeseed based on combined quantitative and qualitative data. Agronomy 2019, 9, 781. [Google Scholar] [CrossRef] [Green Version]
  40. White, B.W.; Rosenblatt, F. Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms. Am. J. Psychol. 1963, 76, 705. [Google Scholar] [CrossRef] [Green Version]
  41. Rahimmalek, M.; Maghsoudi, H.; Sabzalian, M.R.; Pirbalouti, A.G. Variability of essential oil content and composition of different Iranian fennel (Foeniculum vulgare Mill.) accessions in relation to some morphological and climatic factors. J. Agric. Sci. Technol. 2014, 16, 1365–1374. [Google Scholar]
  42. Li, C.C. The Concept of Path Coefficient and Its Impact on Population Genetics. Biometrics 1956, 12, 190. [Google Scholar] [CrossRef]
  43. Samarasinghe, S. Neural Networks for Nonlinear Pattern Recognition. In Neural Networks for Applied Sciences and Engineering; Auerbach: New York, NY, USA, 2006; pp. 69–112. [Google Scholar]
  44. Sefidan, A.Y.; Valizadeh, M.; Aharizad, S.; Sabzi, M. Path analysis of grain yield, some morphological traits and essential oil content in different fennel (Foeniculum vulgare Mill.) populations. J. Biodivers. Environ. Sci. 2014, 4, 10–15. [Google Scholar]
  45. Kalleli, F.; Rebey, I.B.; Razgallah, N.; Tounsi, M.S.; M’Hamdi, M. Comparative Analysis of Morphological and Chemical Variability in Different Fennel (Foeniculum Vulgare Mill.) Cultivars. Int. J. Agric. Environ. Biores. 2019, 4, 200–217. [Google Scholar]
  46. Baziar, M.H.; Ghorbani, A. Evaluation of lateral spreading using artificial neural networks. Soil. Dyn. Earthq. Eng. 2005, 25, 1–9. [Google Scholar] [CrossRef]
  47. Moradi, H.; Bahmanyar, H.; Azizpour, H.; Rezamandi, N.; Mirdehghan Ashkezari, S.M. Modeling and Optimization of Anethole Ultrasound-Assisted Extraction from Fennel Seeds using Artificial Neural Network. J. Chem. Pet. Eng. 2020, 54, 143–153. [Google Scholar]
  48. Stangierski, J.; Weiss, D.; Kaczmarek, A. Multiple regression models and Artificial Neural Network (ANN) as prediction tools of changes in overall quality during the storage of spreadable processed Gouda cheese. Eur. Food Res. Technol. 2019, 245, 2539–2547. [Google Scholar] [CrossRef] [Green Version]
  49. Moghaddam, M.; Miran, S.N.K.; Pirbalouti, A.G.; Mehdizadeh, L.; Ghaderi, Y. Variation in essential oil composition and antioxidant activity of cumin (Cuminum cyminum L.) fruits during stages of maturity. Ind. Crops Prod. 2015, 70, 163–169. [Google Scholar] [CrossRef]
  50. Moghaddam, M.; Omidbiagi, R.; Sefidkon, F. Changes in content and chemical composition of tagetes minuta oil at various harvest times. J. Essent. Oil Res. 2007, 19, 18–20. [Google Scholar] [CrossRef]
  51. Moghaddam, M.; Pirbalouti, A.G.; Mehdizadeh, L.; Pirmoradi, M.R. Changes in composition and essential oil yield of Ocimum ciliatum at different phenological stages. Eur. Food Res. Technol. 2015, 240, 199–204. [Google Scholar] [CrossRef]
  52. Sellami, I.H.; Maamouri, E.; Chahed, T.; Wannes, W.A.; Kchouk, M.E.; Marzouk, B. Effect of growth stage on the content and composition of the essential oil and phenolic fraction of sweet marjoram (Origanum majorana L.). Ind. Crops Prod. 2009, 30, 395–402. [Google Scholar] [CrossRef]
Figure 1. The Pearson correlation coefficient of input variables with EOY% of fennel populations in both ANN and MLR models. For detailed information on trait abbreviations, see Table 2. ns, * and **, non-significant, significant at the 0.05 and 0.01 probability level, respectively.
Figure 1. The Pearson correlation coefficient of input variables with EOY% of fennel populations in both ANN and MLR models. For detailed information on trait abbreviations, see Table 2. ns, * and **, non-significant, significant at the 0.05 and 0.01 probability level, respectively.
Agriculture 11 01191 g001
Figure 2. The Pearson correlation coefficient of input variables with TAY% of fennel populations in both ANN and MLR models. ns, * and **, non-significant, significant at the 0.05 and 0.01 probability level, respectively.
Figure 2. The Pearson correlation coefficient of input variables with TAY% of fennel populations in both ANN and MLR models. ns, * and **, non-significant, significant at the 0.05 and 0.01 probability level, respectively.
Agriculture 11 01191 g002
Figure 3. The topology of the ANN with 11 input parameters for prediction of essential oil yield.
Figure 3. The topology of the ANN with 11 input parameters for prediction of essential oil yield.
Agriculture 11 01191 g003
Figure 4. The convergence of the average MSE value during training and validation of the final 11-9-7-1 ANN structure to predict the essential oil yield of fennel populations.
Figure 4. The convergence of the average MSE value during training and validation of the final 11-9-7-1 ANN structure to predict the essential oil yield of fennel populations.
Agriculture 11 01191 g004
Figure 5. Scatter plot of measured and predicted essential oil yield of fennel populations in the training stage of ANN.
Figure 5. Scatter plot of measured and predicted essential oil yield of fennel populations in the training stage of ANN.
Agriculture 11 01191 g005
Figure 6. Scatter plot of measured and predicted essential oil yield of fennel populations in the testing stage of ANN.
Figure 6. Scatter plot of measured and predicted essential oil yield of fennel populations in the testing stage of ANN.
Agriculture 11 01191 g006
Figure 7. Scatter plot of measured and predicted trans-anethole yield of fennel populations in the training stage of the ANN.
Figure 7. Scatter plot of measured and predicted trans-anethole yield of fennel populations in the training stage of the ANN.
Agriculture 11 01191 g007
Figure 8. Scatter plot of measured and predicted trans-anethole yield of fennel populations in the testing stage of the ANN.
Figure 8. Scatter plot of measured and predicted trans-anethole yield of fennel populations in the testing stage of the ANN.
Agriculture 11 01191 g008
Table 1. Locality and average of essential oil yield and trans-anethole yield of studied fennel populations.
Table 1. Locality and average of essential oil yield and trans-anethole yield of studied fennel populations.
NoPopulationVarietyLocalityVoucher NumberLatitude (N)Longitude (E)Essential Oil Yield (%)trans-Anethole Yield (%)
1SalzlandVulgareGermanyAh12351°78′11°77′2.29 ± 0.851.46 ± 0.59
2GothaVulgareGermanyAh11551°07′10°87′2.14 ± 0.731.77 ± 0.57
3GazianetpVulgareTurkeyAh11437°05′37°37′2.67 ± 0.792.30 ± 0.68
4IzmirVulgareTurkeyAh11338°35′27°07′1.63 ± 0.411.17 ± 0.32
5BonabVulgareIranAh11137°35′46°03′2.89 ± 1.032.24 ± 0.91
6BirjandVulgareIranAh11032°84′59°18′0.73 ± 0.240.54 ± 0.16
7TatmajVulgareIranAh12633°69′51°62′1.88 ± 0.791.50 ± 0.54
8TorbatejamVulgareIranAh12735°23′60°66′2.90 ± 0.922.45 ± 0.76
9MeshkinshahrVulgareIranAh12038°37′47°69′2.30 ± 0.611.70 ± 0.62
10KhorobiabanakVulgareIranAh11833°89′54°87′0.99 ± 0.410.73 ± 0.51
11MoghanVulgareIranAh12139°62′47°87′4.12 ± 1.322.68 ± 0.68
12ZiarVulgareIranAh12932°50′51°94′1.66 ± 0.691.14 ± 0.42
13ShirvanVulgareIranAh12437°39′57°96′2.42 ± 0.861.77 ± 0.68
14KarajVulgareIranAh11635°77′51°06′1.54 ± 0.641.07 ± 0.44
15KermanVulgareIranAh11730°30′57°13′0.82 ± 0.330.55 ± 0.19
16KhorramabadVulgareIranAh11933°48′48°44′2.89 ± 0.672.10 ± 0.71
17NeishabourVulgareIranAh12236°19′58°83′2.02 ± 0.770.33 ± 0.15
18VaraminVulgareIranAh12835°34′51°62′3.77 ± 0.943.12 ± 0.67
19HamedanVulgareIranAh11234°81′48°48′2.92 ± 0.912.16 ± 0.56
20TabrizVulgareIranAh12538°07′46°08′2.50 ± 0.760.50 ± 0.12
Table 2. Descriptive statistics of morphological, phonological and yield-related characteristics in the fennel populations.
Table 2. Descriptive statistics of morphological, phonological and yield-related characteristics in the fennel populations.
CharacteristicAbbreviationMinMaxMeanStandard Deviation
Number of days to germinationNDG71812.454.21
Number of days to 50% floweringNDF50%5910279.1816.56
Number of days to 100% floweringNDF100%8211492.3624.98
Number of days to maturityNDM126180145.6836.15
Initial plant height (cm)IPH39.4782.8958.2917.14
Final plant height (cm)FPH68.56198107.2536.84
Number of stemsNS142.581.25
Stem diameter (cm)SD2.7515.858.543.62
Number of internodesNI6149.353.48
Length of the first internode (cm)LFI3.119.326.132.04
Length of the longest internode (cm)LLOI5.4819.1414.595.74
Length of the last internode (cm)LLAI2.3611.717.641.91
Length of the peduncle (cm)LP4.8514.299.254.89
Number of umbelsNU125836.2515.70
Biomass (g/m2)B/m2654.251457.83124.9162.25
Thousand seed weight (g)TSW2.857.655.163.11
Seed yield per plant (g)SY/P12.3586.5432.6710.29
Seed yield (g/m2)SY/m2115.12542.28315.2182.27
Number of seeds per plantNS/P985915378592141
Number of seeds per umbelNS/U112276192.5178.29
Harvest index (%)HI12.1146.8237.2616.28
Table 3. Stepwise regression analysis for essential oil yield as dependent variable.
Table 3. Stepwise regression analysis for essential oil yield as dependent variable.
StepEntered Variables in ModelPartial R2Model R2
1SY/m20.16420.1642
2SY/m2, NDF 50%0.14150.3057
3SY/m2, NDF 50%, NS/P0.12760.4333
4SY/m2, NDF 50%, NS/P, NU0.07810.5114
5SY/m2, NDF 50%, NS/P, NU, FPH0.07420.5856
Adjusted R2 = 0.5533.
Table 4. Stepwise regression analysis for trans anethole yield as the dependent variable.
Table 4. Stepwise regression analysis for trans anethole yield as the dependent variable.
StepEntered Variables in ModelPartial R2Model R2
1SY/m20.1230.123
2SY/m2, NS/U0.1140.237
3SY/m2, NS/U, NDM0.10560.3426
4SY/m2, NS/U, NDM, TSW0.05610.3987
5SY/m2, NS/U, NDM, TSW, NU0.0520.4507
6SY/m2, NS/U, NDM, TSW, NU, NI0.04410.4948
Adjusted R2 = 0.4672.
Table 5. Summary of the components of the neural networks used to predict essential oil and trans-anethole yield of fennel populations.
Table 5. Summary of the components of the neural networks used to predict essential oil and trans-anethole yield of fennel populations.
ANN MethodNumber of Hidden LayersNumber of Neurons in Each Hidden LayerTransfer FunctionLearning AlgorithmNumber of Epochs
Multi-layer perceptron (MLP)1–51–20Sigmoid AxonLevenberg–50–2000
Linear Sigmoid AxonMarquardt
TanhAxonMomentum
Liner TanhAxonConjugate Gradient
Table 6. The performance of the best artificial neural network models to predict outputs.
Table 6. The performance of the best artificial neural network models to predict outputs.
OutputNetwork StructureTransfer FunctionLearning AlgorithmTrainingTestingCross Validation
R2 aRMSE bMAE cR2 aRMSE bMAE cR2 aRMSE bMAE c
Essential oil yield11-9-7-1Sigmoid AxonLevenberg–Marquardt (LM)0.9530.5220.3750.9290.5440.3850.9040.5520.389
Trans-anethole yield11-10-1TanhAxonMomentum0.7940.2460.3340.7770.2640.3520.7640.2580.359
a: Determination coefficient; b: root mean square error; c: mean absolute error.
Table 7. Sensitivity analysis and selecting three of the most influential inputs on the essential oil and trans-anethole yield of fennel populations.
Table 7. Sensitivity analysis and selecting three of the most influential inputs on the essential oil and trans-anethole yield of fennel populations.
OutputMethodANNMLR
R2 a (%)RMSE bMAE cR2 a (%)RMSE bMAE c
Essential oil yieldThe best ANN (with all input)95.300.5220.37555.330.8190.624
ANN without SY/m275.450.6080.43939.120.9110.659
ANN without NDF50%84.700.5850.42142.180.7470.571
ANN without NS/P85.980.5780.41644.250.8120.583
trans-anethole yieldThe best ANN (with all input)79.410.2460.33446.720.4480.452
ANN without SY/m261.430.3320.48735.520.5320.471
ANN without NS/U66.400.3160.45937.140.4310.384
ANN without NDM68.400.3080.44538.760.4160.335
a: Determination coefficient; b: root mean square error; c: mean absolute error.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Sabzi-Nojadeh, M.; Niedbała, G.; Younessi-Hamzekhanlu, M.; Aharizad, S.; Esmaeilpour, M.; Abdipour, M.; Kujawa, S.; Niazian, M. Modeling the Essential Oil and Trans-Anethole Yield of Fennel (Foeniculum vulgare Mill. var. vulgare) by Application Artificial Neural Network and Multiple Linear Regression Methods. Agriculture 2021, 11, 1191. https://doi.org/10.3390/agriculture11121191

AMA Style

Sabzi-Nojadeh M, Niedbała G, Younessi-Hamzekhanlu M, Aharizad S, Esmaeilpour M, Abdipour M, Kujawa S, Niazian M. Modeling the Essential Oil and Trans-Anethole Yield of Fennel (Foeniculum vulgare Mill. var. vulgare) by Application Artificial Neural Network and Multiple Linear Regression Methods. Agriculture. 2021; 11(12):1191. https://doi.org/10.3390/agriculture11121191

Chicago/Turabian Style

Sabzi-Nojadeh, Mohsen, Gniewko Niedbała, Mehdi Younessi-Hamzekhanlu, Saeid Aharizad, Mohammad Esmaeilpour, Moslem Abdipour, Sebastian Kujawa, and Mohsen Niazian. 2021. "Modeling the Essential Oil and Trans-Anethole Yield of Fennel (Foeniculum vulgare Mill. var. vulgare) by Application Artificial Neural Network and Multiple Linear Regression Methods" Agriculture 11, no. 12: 1191. https://doi.org/10.3390/agriculture11121191

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop