3.1. M5P and MLNR Derived Models
The relationship between the input and output parameters in the current study is not always linear, since the distribution of a wide range of values follows a nonlinear trend, such as a power law. When calculating the torsional capacity, the M5P model can be utilized to produce concise, transparent rules that are simple to implement. However, it could suggest that the input and output parameters are linearly related. Log (inputs) and log (outputs) were used to create a model to bypass this limitation (output). Consequently, and based on the results from studies [
1,
2], it was hypothesized that
is a power function that has the following form:
where the constants (
,
,
, and
) have different values under various circumstances and all other terms were previously defined.
Figure 3 displays the established model tree created using the M5P technique.
Table 2 provides the coefficients for Equation (4) predicted from the M5P algorithm.
The following example illustrates the methodology of using M5P to predict the torsional capacity of RC beams. For example, let us consider a reference beam from the testing dataset with the attributes summarized in
Table 3.
As can be seen from
Table 3 and
Figure 3, the beam sample belongs to the M5P LM1 prediction equation, since the value of
is ≤ 0.1164 and > 0.0625. Therefore, using the coefficients related to LM1, the predicted value for the torsional capacity of the reference beam in
Table 3 is 37.29 kN.m, which agrees very well with the experimental torsional capacity (37.48 kNm).
The newly developed equation to predict the torsional resistance of RC beams in this research using an MLNR (Multiple NonLinear Regression) model can be written as follow:
3.2. Performance Analysis
In the ML modeling process, the number of the entire dataset used to build a new regression model is a critical task. Frank and Todeschini [
52] recommended a minimal proportion of three between the full dataset utilized and the number of included variables for constructing a credible model based on data mining approaches. A value of five is considered a more conservative and safe option. As can be seen in this study, the ratio between experimental data points to the predictors is equal to 202/3 = 67.33.
In order to compare the M5P model with MLNR, performance metrics were used. The Root Mean Square Error (RMSE), Coefficient of Determination (
R2), and Mean Estimation Error (MAE) were used to evaluate the predictive performance of M5P and MLNR on a regular basis [
53,
54]. The three equations below are used to calculate these metrics.
where
denotes the overall number of data points used to calculate the bias,
and
are the observed values for the torsional strength and the predicted torsional strength of the RC beams for the
ith observation, respectively, and
is the average of all observed data.
As shown in
Figure 4a,b, for both training and testing datasets, the M5P and MNLR models’ results reveal a little scatter around the ideal line between the observed and predicted values of the torsional strength. Model performance measurements are provided in
Table 4 for the training and testing datasets to further check the validity of the obtained models.
There is a substantial correlation between the predicted and experimental values when |R| > 0.8, according to Smith [
55]. The matching between the observed and predicted values might not be achieved, even if the
R value is near 1. In other words, those two variables just vary similarly. It is possible to overcome this constraint by utilizing the coefficient of determination
R2. When the model’s
R2 value is close to one, and the model’s RMSE and MAE values are close to zero, the model can be trusted to produce precise results [
56]. In terms of accuracy, the models’ performance measures in
Table 4 indicate that M5P outperformed the MNLR in both the training and testing sets. For example, and for
R2, RMSE and MAE values for the training set of the M5P model are 0.983, 13.288, and 8.279, respectively, while the RMSE and MAE values of
R2 for the testing set are 0.981, 13.432, and 8.224, respectively. It should be noted that the performance of MNLR is also acceptable.
A comparison of the forecasting accuracy of the M5P and MLNR models from this study, with standards and simplified predicting models for the torsional strength of RC beams using the testing dataset, is performed, and the findings are summarized in
Table 5. The standard equations from SNiP18 [
9], ACI 318-19 [
10], MC10 [
12], EC2 [
13], CSA14 [
11], and the simplified equation developed by Rahal in 2013 [
2] are among the available torsional strength models investigated in this study. In order to do so, three statistical error measurements, including
R2, RMSE, and MAE, besides model uncertainty average value
and coefficient of variation
COV for the calculated ratios
, were utilized for the comparison between the previously developed models and the other referred ones. The lowest RMSE and MAE values with the highest
R2 are achieved with the M5P model, as shown in
Table 5. The mean value of the model uncertainty variable for the M5P is 1.0419, which is close to the optimal value (equal to 1). Moreover, with a
COV of 11.6419, the M5P model has the lowest dispersion of all the studied models. The most prominent conservative bias was obtained from the SNiP18 equations, with a mean value of model uncertainty equal to 1.3584, while the broadest dispersion is achieved with the MC10 equation, with the value 44.78%.
The model uncertainty variable between the predicted and observed torsional strength for all the RC beams in the testing dataset was plotted against the predictors used to build each of the models in this study, as shown in
Figure 5. These plots help to acquire a better understanding of the standards’ errors. In this regard, the results of the standards were compared to those of the MLNR and M5P models. An adequate model’s errors should be less sensitive or independent of changes in the input parameters involved in the phenomena. Otherwise, it is possible to conclude that such input parameters are either incorrectly included or should be included in the model [
55]. As demonstrated in
Figure 5, the errors of some of the design codes are sensitive to changes in the concrete compressive strength (
) or concrete area (
) values.
For example, the MC10 standard tends to overpredict the torsional strength as the value of or increases, while the ACI 318-19 standard tends to behave in the opposite direction as the value of the same variables increases. Therefore, it is possible to state that those two parameters might not have been correctly included in the standards equations, and more studies should be performed on this. On the other hand, the M5P and MLNR models’ errors are entirely unaffected by these variables. The EC2 standard is less sensitive to the changes in or values when compared to the other standards.
It can also be noticed that some trends in the behavior of the standards equations exist when it comes to relate the prediction of the torsional strength versus reinforcement factor. For example, the status of the equations for some of the standards changes from overprediction to underprediction as the reinforcement factor increases, while the status of the equations from other standards (namely MC10, EC2, and SNiP18) shows the opposite. Consequently, all input variables were better incorporated into the newly suggested models in this study, and the models’ errors were entirely independent of them.