Next Article in Journal
Behavior of Scaled Infilled Masonry, Confined Masonry & Reinforced Concrete Structures under Dynamic Excitations
Previous Article in Journal
The Unsustainable Direction of Green Building Codes: A Critical Look at the Future of Green Architecture
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Machine Learning Algorithm for Shear Strength Prediction of Short Links for Steel Buildings

1
Department of Civil Engineering, Faculty of Engineering, The Hashemite University, Zarqa 13133, Jordan
2
Department of Civil Engineering, Hijjawi Faculty for Engineering Technology, Yarmouk University, Shafiq Irshidatst, Irbid 21163, Jordan
3
Department of Industrial Engineering, King Khalid University, King Fahad St., Guraiger, Abha 62529, Saudi Arabia
*
Author to whom correspondence should be addressed.
Buildings 2022, 12(6), 775; https://doi.org/10.3390/buildings12060775
Submission received: 9 May 2022 / Revised: 1 June 2022 / Accepted: 2 June 2022 / Published: 6 June 2022
(This article belongs to the Section Construction Management, and Computers & Digitization)

Abstract

:
The rapid growth of using the short links in steel buildings due to their high shear strength and rotational capacity attracts the attention of structural engineers to investigate the performance of short links. However, insignificant attention has been oriented to efficiently developing a comprehensive model to forecast the shear strength of short links, which is expected to enhance the steel structures’ constructability. As machine learning algorithms was successfully used in various fields of structural engineering, the current study fills the gap in estimating the shear strength of short links using sophisticated machine learning algorithms. The deriving factors such as web and flange slenderness ratios, the flange-to-web area ratio, the forces in web and flange, and the link length ratio were investigated in this study, which is imperative to formulate an integrated prediction model. Consequently, the aim of this study utilizes advanced machine learning (ML) models (i.e., Extreme Gradient Boosting ( X G B O O S T ), Light Gradient Boosting Machine ( L i g h t G B M ), and Artificial Neural Network ( A N N ) to produce accurate forecasting for the shear strength. In this study, publicly available datasets were used for the training, testing, and validation. Different evaluation metrics were employed to evaluate the prediction’s performance of the used models, such as Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and Coefficient of Determination (R2). The prediction result displays that the X G B O O S T and L i g h t G B M provided better, and more reliable results compared to A N N and the AISC code. The X G B O O S T and L i g h t G B M models yielded higher values of R2, lower (RMSE), (MAE), and (MAPE) values and have shown to perform more accurate. Therefore, the overall outcomes showed that the L i g h t G B M outperformed the X G B O O S T model. Moreover, the overstrength ratio predicted by the L i g h t G B M showed an excellent performance compared to the Gene Expression and Finite Element-based models. The developed models are vital for practitioners to predict the shear strength accurately, which pave the road towards wider application for automation in the steel buildings.

1. Introduction

Short links are W s h a p e steel sections that are either constructed or rolled with link length ratio, e/(M/V), less than 1.6 (AISC, 2016) [1]; where e represents the link length, M and V represent the plastic moments, and shear capacity, respectively. Short links are widely employed in steel bridges, Eccentric Braced Frames ( E B F s ) and coupled walls. The short links have several advantages, such as exceptional plastic rotational capacity and plastic shear capacity [2]. However, several studies observed that the AISC formula underestimates the predicted short links’ shear strength [3,4,5,6,7]. The AISC, 2016 [2] Equation (F3–2) estimates the plastic short links’ shear strength using V p = 0.6 × F y   ( d     2 t f ) × t w ; where Vp represents the plastic shear strength (N), Fy represents the measured steel yield strength of the web (MPa), d is the link depth (mm), tf and tw are the flange and web thicknesses (mm), respectively. Several investigations revealed the major factors that control the shear link strength, such as flange contribution [3,5], cyclic hardening [3], web slenderness [4], and link length ratio [4,6,7].
The testing program of McDaniel et al., 2003 [5], which included two full-scale built-up short links, revealed that in terms of degrading the shear strength of tested links, the cutting-edge factor is a brittle fracture on the linked web. As a result, the tested short links exhibited overstrength factors of 1.83 and 1.94. In addition, Dusicka et al., 2010 [8] investigated the effect of steel yield stress 100, 225, integrated steel strength of 100 and 440, 345, 485 MPa for five plate steel shear links. The obtained results illustrate that the steel links with a low grade attained a plastic rotation of 0.2 rad while the conventional links reached 0.12 rad, and the low-grade steel links achieved an overstrength factor considerably higher than conventional links. Moreover, the average overstrength factor for the tested 12 short links (length ratio ranges from 0.58 to 0.97) was 1.9, Ji et al., 2015 [3]. Furthermore, the very short links reached a plastic rotation of 0.14 rad greater than the 8% limit of AISC 341-10 [9,10]. In addition, Ji et al., 2016 [11] found that the average shear strength of four built-up short links reached 2.0.
Similarly, Liu et al., 2017 [4] noticed that the short links’ shear strength was significantly impacted by the web slenderness and link length ratio, and the overstrength ratio for the 12 built-up short links was between 1.35 to 1.5. The link length was critical in the steel links, Okazaki, T. 2004 [7]. The experimental program included 16 link-to-column connections with different link length ratios, where the overstrength ratio varied between 1.05 and 1.47. Bozkurt and Topkaya 2017 [12] discovered that plastic rotation and the overstrength ratio are negatively associated with the link length ratio. The test program included seven short links considering several features (i.e., loading protocol, the link length ratio and stiffeners spacing). In addition, an overstrength ratio of 1.87 to 2.3 was achieved in Bozkurt et al., 2019 [13], where six specimens with a link length of 600 to 800 mm were tested.
To explore the link’s ultimate rotational capacity, shear capacity, buckling of flanges, and web, the analysis of complex finite element of shear links was implemented [3,14,15,16,17,18,19,20]. However, finite element simulation is considered time consuming, especially in the modeling process and validation of the performance of the predicting model. Moreover, FEA requires special experts in the mechanics of materials and computer aided-software engineering. Recently, the huge availability of databases in the wide range of engineering applications paved the way to extensively and successfully use the machine learning tools to help engineers save the cost, time, and efforts. A leaping use of machine learning has been witnessed in the various civil engineering fields over the last decade. While machine learning has been successfully utilized in the civil engineering applications [21,22,23,24,25,26,27,28,29,30,31,32,33], limited studies used machine learning tools to address the shear strength of short links; where a few studies have employed actual experimental databases to validate M L algorithms. Moreover, the dataset applied for forecasting experiments has restricted factors, such as mechanical properties and geometrical dimensions of steel links. The prediction problem for the short links’ shear strength was mostly investigated through conventional M L algorithms with limited variables such as FEM and multiple linear regression. To address the mentioned research gaps, the current study proposed to develop the prediction model of the short links’ shear strength using innovative systematic and understandable ML algorithms (i.e., X G B O O S T ,   L i g h t G B M ,   A N N ). Moreover, data preprocessing has generated a more precise forecasting model. Furthermore, statistical performance measurements such as ( R M S E ), ( M A E ), ( M A P E ), and ( R 2 ) were employed for comparative analysis. The prediction of the short links’ shear strength was formulated in terms of different influence features such as ( b f / t f ), ( d / t w ), ( A f / A w ), ( A f f y f l a n g e ), ( A w f y w e b ), and e / ( M / V ) ) to overcome the knowledge gap of the available models. Additionally, a comprehensive variables importance analysis has been conducted to investigate the relative impact of these variables on the proposed model.

2. Literature Review

In the related literature on forecasting models, the short links’ shear strength was categorized into analytical and M L -based models.

2.1. Analytical Models

The literature includes three analytical models to assess the shear strength of shear links (i.e., AISC 2016 [1], Corte et al., 2013 [15], and G. Almasabha 2022 [34]). The following discussion summarizes the available models.

2.1.1. AISC 2016

The AISC 2016 [1] adopted Equation (1) for the assessment of shear strength of links. It is worth mentioning that Equation (1) does not take into consideration the role of link length proportion, the contribution of flanges, and the slenderness ratio of web or flanges.
V p = 0.6 × F y × ( d 2 t f ) × t w

2.1.2. Corte et al., 2013

A finite element-based algorithm [15] has been proposed to estimate the overstrength ratio (V0.08/Vy) of wide flange shear links without axial restraint, where A v = ( d t f ) t w and V y = ( F y / 3 ) ( d t f ) t w . It is worth mentioning that the authors derived Equation (2) for the hot rolled steel link. However, the experimental database of the current study includes both hot rolled and built-up steel links.
V 0.08 V y = 1 + 1.35 ( A f A v ) ( d e )

2.1.3. G. Almasabha 2022

This study used the gene expression model to build a mathematical equation for the shear link strength (VGEP) [34]. Various parameters were considered in this equation, such as bf/tf, d/tw, Af/Aw, Af fyflange, Awfyweb, and e/(M/V).
V G E P = ( 1.047   ( e M / V ) 0.416   ) × ( A w F y w e b ) 0.017 × ( A f A w ) 0.12 × ( A w F y w e b ( d t w ) 0.6 + ( e M / V ) 0.2 + b f t f )

2.2. ML Models

Although innovative ML algorithms outperform traditional models in most research disciplines [35,36], ML models are still humbly utilized to forecast the short links’ shear strength such as [15,16,17,18,37,38,39], also limited variables with humble datasets have been applied in these studies. On the other hand, these studies’ executed ML models seem somewhat traditional. Therefore, the need to use sophisticated M L models for shear strength prediction of short links to reduce prediction error with better accuracy is becoming required. The X G B O O S T technique was recently developed using a tree-based ensemble, a complex gradient boosting with higher processing abilities, and an excellent tool to deal with over-fitting concerns [40]. The learning method uses a boosting framework-based decision tree called L i g h t G B M , released by Microsoft in 2017. It is quicker, uses less memory, and is more accurate than X G B O O S T [41]. L i g h t G B M also provided decision rules for category features, which transform factors into one-time multidimensional functionality, saving time and memory [42].

3. Methodology

The aim of this research is to promote an algorithm for predicting the short links’ shear strength accurately. In addition, various statistical testing (e.g., data cleaning, normalization, and standardization) were also employed for the purpose of assessing the model’s validity and reasonableness. Moreover, data collection and feature definition, data preprocessing, M L algorithm, and model performance evaluation are the four primary aspects of the suggested methodology. The methodology flowchart is shown in Figure 1.

3.1. Data collection and Feature Definition

The experimental data of 110 samples were gathered from 1983 to 2019 [3,5,6,7,8,11,12,13,43,44,45,46] to evaluate the features impacting the short links’ shear strength under natural settings. Several significant features have been considered in this study, such as ( b f / t f ), ( d / t w ), ( A f / A w ), ( A f f y f l a n g e ), ( A w f y w e b ), and e / ( M / V ) ) . Table 1 illustrates the model variables along with their description.

3.2. Data Preprocessing

Data preprocessing is a key factor in managing the dataset before employing M L algorithms. The data preparation procedure also increases the model’s prediction performance. Data noise, removing outliers, normalization, and standardization are among the approaches’ processes. All the related variables had numerical values, as shown in Table 1. The initial stage in data preparation is to eliminate outliers from the collected data set. There would be uncertainty and inaccuracy in the data set because of the outlier, which might reduce the efficacy of the linear regression technique. Interquartile ranges were performed to analyze extreme and outlier results in this research. Outliers were removed using graphical techniques such as Boxplots. To characterize and eliminate residuals from the acquired data, null indicators have been employed. Reliability discomfiture might happen when some data are lost from the initial database. It could be concluded that the missing data (values represented by the “Null” or “-” indicators) were taken into account. Preprocessing, which includes eliminating outliers and normalizing the dataset, is required once the necessary variables have been identified. Data standardization was implemented in the current study, where no outliers in the dataset need to be removed.

3.3. M L Algorithm

To forecast the short links’ shear strength, three ML algorithms have been utilized (i.e., A N N ,   X G B O O S T ,     a n d   L i g h t G B M ). Training and testing sets are extracted from the database to ensure the proposed ML algorithm’s effectiveness. The data was divided into training and testing with a proportion of 80 % and 20 % , respectively. The suggested prediction models were tested for robustness and effectiveness using 10 -fold cross-validation. The model characteristics and implementation procedure are depicted in the next section.

3.3.1. Artificial Neural Network

A N N is a neural network based on an organism’s nervous system model. The work principle of a neural network is to create neurons that save, handle data, and connect them with artificial synapses [47]. A plethora of layers of nodes are composed together and communicate with one another to build up a neural network, where the A N N consists of, mainly, an input layer, a hidden layer, and output layers. There is a direct relation between the number of nodes in the input layer and the number of variables that illustrate the assessed qualities; on the other hand, the number of neurons in the output layer corresponds to the number of classes. The process is in sequence where the output of the first layer is taken as input to the next layer. The output layer’s nodes may provide the desired result when this technique is repeated. The nature of the task and the size of the data assign the number of neurons and hidden layers. In the hidden and output layers, each neuron is connected to all nodes in the previous layer via a numerical weight. The A N N model has produced the best results, with a max depth of four and learning rate of 0.08. The Neural structure is built up of seven layers: the initial layer is called the input layer and contains the inputs parameters of an A N N with six neurons, the final layer is termed the output layer and contains a single neuron, while the five hidden layers contain eight neurons each. The input, hidden, and output layers of the A N N are shown in Figure 2. The A N N mathematical formulation is provided in the equations below [48].
n k h = j = 1 X w k j h t j + b k h ,                 k = 1   t o   z
where X represents the number of associated features, z denotes the number of hidden neurons, t denotes each input feature, b denotes the hidden layer’s bias, and w denotes the weight. The activation function takes the weight of each computed component as an input. The total of the weighted values determines the output.

3.3.2. Extreme Gradient Boosting

X G B O O S T is a newly introduced M L algorithm that has widespread use in several disciplines. This technique is set to be unique in gradient boosting machine and chiefly for regression and classification trees [49]. The X G B O O S T is created based on the “boosting” concept, which combines the forecast of weak learners with additive training methodologies to build a strong learner. It aids in the avoidance of over-fitting and enhances computing ability. Figure 3 illustrates the X G B O O S T schematic, which simplifies the goal functions, allowing the prediction and regularization terms to be combined while maintaining the fastest feasible processing performance. With a number of trees of 1200, a learning rate of 0.05, and a maximum depth of 15 in X G B O O S T , the best results were obtained. The following equations at step p used to determine the common function of the forecast [40].
f i ( p ) = k = 1 p f k ( x i ) = f i ( p 1 ) + f p ( x i )
where f p ( x i ) represents the learner at step p , f i ( p ) represents the prediction at p , f i ( p 1 ) represents the prediction at p 1 , and x i represents the input features.
To overcome the issue of over-fitting concerns on the cost of the model’s computing speed, the X G B O O S T produces analytical formula shown below to assess the “goodness” of the model from the authentic function.
O b j e c t i v e ( p ) = k = 1 n l ( y ¯ i ,   y i ) + k = 1 p σ ( f i )
where l represents the loss function, n represents the number of data utilized, and σ is the regularization term, which is described in equation below:
σ ( f ) = ϒ T + 0.5 λ ω 2
where ω denotes vector scores in leaves, ϒ denotes the minimal loss necessary to divide the leaf node further, and λ denotes the regularization parameters.

3.3.3. Light Gradient Boosting Machine ( L i g h t G B M )

Microsoft Research introduced L i g h t G B M , a decision tree with gradient-boosting based on a decision-tree method [50,51]. L i g h t G B M is a powerful technique for resolving regression and classification issues. It utilizes low computational memory with a higher accuracy of prediction in comparable to X G B O O S T . According to the histogram method and tree leaf-wise growth approach, L i g h t G B M enhances the training process and minimizes memory consumption. Figure 4 depicts the histogram decision tree-based algorithm. Figure 4 also depicts the level-wise and leaf-wise development techniques. According to the level-wise growth approach, the leaves on the same layer are divided simultaneously. To control the complexity of the model, it is desirable to optimize using several threads. Furthermore, leaves on the same layer are handled uniformly, despite absorbing different amounts of information. L i g h t G B M . produced the best results with a tree count of 1100, a learning rate of 0.06, a needed leaf count of 18, and a maximum depth of 13. The information gain (IG) depicts the predicted reduction in entropy caused by dividing nodes into qualities that can be computed in the equations below [40].
I G ( C , V ) = F n ( C ) V ϵ V a l u e s ( v ) | C v | C F n ( C v ) F n ( C ) = b = 1 B p b l o g 2   p b
where Fn(C) represents the input entropy of the group C, pb represents the proportion of C associated with part b, B represents the number of parts, v represents the features value V, and Cv represents the C’s subset for the features having value v. The process might be affected by several factors, resulting in it being an insignificant process, such as specific leaves with reasonably minimal information gain are discarded, gaining additional memory storage capacity.
Figure 4. Illustration of L i g h t G B M algorithm.
Figure 4. Illustration of L i g h t G B M algorithm.
Buildings 12 00775 g004
The leaf-wise technique has a better chance of succeeding since it splits the leaf with the largest information gain on a similar layer, as shown in Figure 4. Furthermore, trees with a high depth can be generated using this approach, resulting in an introduction to the maximum depth restriction through tree growth [52].

3.4. Stratified K-Fold Cross-Validation

The K-fold cross-validation approach can enhance the model’s accuracy by evaluating how the ML algorithm performs on a new data set. After the datasets have been divided into training and testing subsets, the modeling process is employed for the ML prediction algorithm. The training dataset is divided into several ‘k’ smaller portions throughout this process. As a result, the moniker ‘k’-fold was coined. K-fold is used for testing, and k-1 is used for training based on a random data set. The efficiency of the M L model is examined using a stratified 10-fold cross-validation approach. The data set is divided into ten folds at random using this method. As a result, each of the folds is used just once as a validation set. Finally, the error or accuracy measure for each fold may be compared, and if they are similar, the diversity of the model is high. The 10-fold cross-validation technique is depicted in Figure 5.

3.5. Prediction Accuracy Measurement

The evaluation measures have been employed to assess the proposed model’s suitability. After testing the major model assumptions, it is critical to assess the suggested model’s effectiveness and predictive potential. Four statistical indices (i.e., R M S E , M A E , M A P E , and R 2 ) were utilized to examine the effectiveness of the suggested algorithm quantitatively, as shown in the equations below:
M S E = 1 m i = 1 m ( Y i Y i ¯ ) 2
M A P E = 1 m i = 1 m | Y i Y i ¯ Y i | × 100
R 2 = 1 i = 1 m ( Y i Y i ¯ ) 2 i = 1 m ( Y i Y ¯ ) 2
where Y i represents the actual (measured) values of the overstrength ratio of short links, Y i ¯ represents the forecasted outcome, Y ¯ represents the mean of the Y i , and m represents the number of the datasets utilized. The model accuracy and performance will be enhanced if R 2 value approaches one as well as R M S E , M A E , and M A P E measures approach zero.

4. Result and Discussion

4.1. Descriptive Statistics

Table 2 summarizes the surveyed experimental results of tested 110 short links. In addition, Table 3 and Figure 6 show the statistical information of collected databases. Table 3 includes several features such as bf/tf from 10 to 20.71 with an average of 13.51, d/tw from 11.33 to 57.5 with an average of 36.66, e/(M/V), from 0.33 to 1.69, Af/Aw from 0.41 to 2.27 with an average of 1.86, Af fyflange from 260 to 9882 kN with an average of 879 kN, Awfyweb from 219.7 to 8524.3 kN with an average of 891.67 kN.

4.2. Correlation Matrix Analysis

Pearson’s correlation among and in between selected features and the short links’ shear strength was applied for the evaluation of the impact of these features, as shown in Figure 7. The relation’s sign determines the trend of correlation between terms to investigate the effect of each item against every other item. The correlation factor ranges from +1 (strong positive relation) to -1 (strong negative relation); the correlation near zero indicates a weak relation. Figure 7 shows the correlation between flange force, web force, and link length ratio with the shear link strengths are 0.943, 0.965, and −0.904, respectively. The results indicate that a strong relationship exists between these variables with the shear link strength. On the other hand, the correlation factors for web slenderness ratio, flange slenderness ratio, and flange to web area ratio are close to zero, which implies a minor effect of these parameters on the shear link strength.

4.3. Performance of ML Algorithms

The goal of this research is to assess the efficiency of current M L methods (i.e., A N N ,     X G B O O S T ,     a n d   L i g h t G B M ) to predict the overstrength ratio of short links. Table 4 shows the five-evaluation comparison. The short links’ shear strength predictions for various algorithms are displayed in Figure 8.
The comparison outcomes present that L i g h t G B M and X G B O O S T have higher measures of R 2   value and lower M A E , R M S E , M A P E values than the traditional models (i.e., A N N ,   A I S C   c o d e ) in the short links’ shear strength prediction. According to these metrics, the L i g h t G B M and X G B O O S T models surpassed other models. On the other hand, the results also indicate that L i g h t G B M had outstanding prediction ability compared to XGBOOST. As a result, it generates a precise forecast in comparison to other proposed methodologies. The L i g h t G B M was graded superior to the X G B O O S T , as its assessment measures for M A E and R M S E were 104.5 and 151.5 points lower than the X G B O O S T , respectively. Consequently, the L i g h t G B M has been the most accurate model, as it had the minimum M A E and RMSE assessment scores. The M A P E was implemented to ensure that the L i g h t G B M achieves the best overall accuracy when compared to other approaches. For MAPE measurement, L i g h t G B M ,   X G B O O S T ,   A N N , and A I S C code achieved precisions of 11.7, 24.1, 35.8, and 39.2, respectively, as shown in Table 4. This means that the X G B O O S T , A N N , and A I S C code value of this measurement is much higher than L i g h t G B M .
The result of R 2 is considered another realistic evidence that L i g h t G B M is superior to all other approaches for determining the short links’ shear strength, where, R 2 of L i g h t G B M is greater than R 2 of X G B O O S T ,   A N N   and   A I S C subspecialty 3, 9, 24%. In comparison to other approaches, the L i g h t G B M exhibits excellent forecasting accuracy. Moreover, the L i g h t G B M model ran 2 s quicker than the X G B O O S T model and 5 s quicker than the A N N model.
The forecasted results of the proposed illustrate that the L i g h t G B M prediction values are very close to the experimental strength measures. Therefore, the better fit and slight deviation to the experimental values is the L i g h t G B M model. The plots compare the predicted performance of the proposed models. As a result, the L i g h t G B M is the most efficient and proficient model in the short links’ shear strength prediction.

4.4. Features Importance Analysis

A better view of the model’s features helps structural engineers effectively judge trends. Therefore, feature importance assessment has been performed using L i g h t G B M , X G B O O S T , and A N N models to figure out how important each variable is in predicting the overstrength ratio of short links. Thus, the feature score plot has been conducted to provide a relative score for each variable. Accordingly, the features’ significance was in descending order: Web force, Link length ratio, Flange force, Web slenderness ratio, Flange slenderness ratio, and Flange to web area ratio, as shown in Figure 9.
The features’ importance results are compatible with the literature where Liu et al., 2017 [4] noticed a significant impact of web slenderness and link length ratio on the short links’ shear strength. The link length was critical in the steel links, Okazaki, T. 2004 [7]. Bozkurt and Topkaya 2017 [12] discovered that the overstrength ratio negatively relates to the link length ratio. Seven short links were tested, with numerous aspects such as link length ratio, stiffener spacing, and loading technique taken into account.
The overstrength ratio of experimental-to-LightGBM projected shear strength (Vu/VLightGBM) and the experimental-to-AISC projected shear strength (Vu/VP) are illustrated in Figure 10 and Figure 11. Likewise, the AISC based overstrength ratio, the L i g h t G B M demonstrated an excellent performance in the prediction of the shear link strength, where it is cruel to bf/tf, d/tw, Af/Aw, Af fyflange, Awfyweb, and e/(M/V). The L i g h t G B M predictions are flat and close to 1.0, which indicates that the L i g h t G B M model is a comprehensive and competent algorithm for predicting the short links’ shear strength.
Moreover, a comparison between the predicted overstrength ratio using L i g h t G B M (Vu/VLightGBM), AISC 2016 (Vu/Vp), Gene expression model (Vu/VGEP), and FEM-based model (V0.08/Vy) are presented in Figure 12. The average of the predicted overstrength ratio is 0.97, 1.11, 1.73, and 1.74 for the Vu/VLightGBM, Vu/VGEP, V0.08/Vy, and Vu/Vp, respectively. The results revealed that the LightGBM is an excellent model in order to evaluate the short links’ shear strength, while the AISC code equation is deficient to accurately estimate the shear link strength due to the fact the AISC code equation only considers the strength of web. This study revealed the significant effect of other variables such as the properties of flange and the link length ratio. The machine learning algorithm ( L i g h t G B M ) has successfully traced the contribution of elements other than the web on the short link shear strength.
Technology is increasingly being used by policymakers to modify and construct policies. The existing research creates a strong M L framework for forecasting short links’ shear strength that can be used as a universal model to mimic all relevant properties. This lays a solid platform for examining how various feature interconnections can help predict short links’ shear strength. Furthermore, as modern M L methods improve, more advanced forecast models become available, allowing for the development of more effective and precise models for short-link shear strength prediction, which many construction industry stakeholders can employ. The researchers are optimistic in the suggested models’ capabilities to supply decision-makers with more accurate projections to match available datasets as a beneficial prior acquaintance to power M L -based models, which is coherent with what is presently arising in the building business and construction projects.
In addition, as compared to other available models, created models were discovered to be convincing decision support aids in numerous sectors of the construction industry. The proposal could be a comprehensive, general, useful, and reliable prediction tool. Thus, the proposed methodology will play an important role in decreasing conflict among stakeholders in the steel construction sector, especially when decision-makers countenance massive problems and barriers indicating satisfactory short link shear strength that all contracting stakeholders agree on.

5. Conclusions

Although the machine learning algorithms were successfully implemented in the civil engineering applications, limited studies in the literature exploited the powerful capabilities of machine learning in estimating the shear link strength. The current study seeks to fill this gap with the help of the abundant short links experimental database in the literature to train, validate, and test sophisticated machine learning algorithms. Particularly, this research work examined the applications of the A N N , X G B o o s t , and L i g h t G B M algorithms to enhance the prediction accuracy for the shear strength of short links using related features such as ( b f / t f ), ( d / t w ), ( A f / A w ), ( A f f yflange ), ( A w f yweb ), and e / ( M / V ) . The adopted forecasting process was based on 80% training datasets and 20% testing datasets to confirm the precision of the developed ML models. Performance metrics of the MAE, RMSE, MAPE, and R2 under the 10-fold cross-validation process have been implemented to enhance the robustness and effectiveness of such models. These measures reveal that the performance of the ML models set side by side to A I S C   c o d e was arranged as follows: L i g h t G B M > X G B O O S T > A N N > A I S C   code . According to the importance of the features extracted from ML algorithms, Web force and Link length ratio were the most prominent variables in the prediction results of the overstrength ratio of short links. In addition, the predicted overstrength ratio using the L i g h t G B M was compared to the available models in the literature, where the proficiency of developed models was reasonable. The analysis disclosed that the L i g h t G B M has the least average predicted overstrength ratio compared to the GEP, FEM, or AISC-based models. For future research, a larger database can be adopted to demonstrate the adequacy of these models to predict the overstrength ratio of short links. The impact of other variables on the prediction accuracy needs to be adopted. Moreover, modern algorithms are required to improve results accuracy.

Author Contributions

Conceptualization, O.A. and G.A.; methodology, A.S.; software, A.S.; validation, G.A., O.A. and A.S.A.; formal analysis, G.A.; investigation, O.A.; resources, O.A. and A.S.A.; data curation, G.A.; writing—original draft preparation, A.S.; writing—review and editing, O.A.; visualization, A.S.; supervision, O.A.; project administration, G.A. and A.S.; funding acquisition, A.S.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work through Large Groups Project under grant number (RGP. 2/178/43).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. AISC. Seismic Provisions for Structural Steel Building; ANSI/AISC 341-16; AISC: Chicago, IL, USA, 2016. [Google Scholar]
  2. Engelhardt, M.D.; Popov, E.P. Experimental Performance of Long Links in Eccentrically Braced Frames. J. Struct. Eng. 1992, 118, 3067–3088. [Google Scholar] [CrossRef]
  3. Ji, X.; Wang, Y.; Ma, Q.; Okazaki, T. Cyclic Behavior of Very Short Steel Shear Links. J. Struct. Eng. 2015, 142, 04015114. [Google Scholar] [CrossRef]
  4. Liu, X.-G.; Fan, J.-S.; Liu, Y.-F.; Yue, Q.-R.; Nie, J.-G. Experimental research of replaceable Q345GJ steel shear links considering cyclic buckling and plastic overstrength. J. Constr. Steel Res. 2017, 134, 160–179. [Google Scholar] [CrossRef]
  5. McDaniel, C.C.; Uang, C.-M.; Seible, F. Cyclic Testing of Built-Up Steel Shear Links for the New Bay Bridge. J. Struct. Eng. 2003, 129, 801–809. [Google Scholar] [CrossRef]
  6. Okazaki, T.; Arce, G.; Ryu, H.-C.; Engelhardt, M.D. Experimental Study of Local Buckling, Overstrength, and Fracture of Links in Eccentrically Braced Frames. J. Struct. Eng. 2005, 131, 1526–1535. [Google Scholar] [CrossRef]
  7. Okazaki, T. Seismic Performance of Link-To Column Connections in Steel Eccentrically Braced Frames; The University of Texas at Austin: Austin, TX, USA, 2004. [Google Scholar]
  8. Dusicka, P.; Itani, A.M.; Buckle, I.G. Cyclic Behavior of Shear Links of Various Grades of Plate Steel. J. Struct. Eng. 2010, 136, 370–378. [Google Scholar] [CrossRef]
  9. AISC. Seismic Provisions for Structural Steel Buildings; ANSI/AISC 341-02; AISC: Chicago, IL, USA, 2002. [Google Scholar]
  10. AISC. Specification for Structural Steel Buildings; ANSI/AISC 360-10; AISC: Chicago, IL, USA, 2010. [Google Scholar]
  11. Ji, X.; Wang, Y.; Ma, Q.; Okazaki, T. Cyclic Behavior of Replaceable Steel Coupling Beams. J. Struct. Eng. 2016, 143, 04016169. [Google Scholar] [CrossRef]
  12. Bozkurt, M.B.; Topkaya, C. Replaceable links with direct brace attachments for eccentrically braced frames: Replaceable Links with Direct Brace Attachments for EBF. Earthq. Eng. Struct. Dyn. 2017, 46, 2121–2139. [Google Scholar] [CrossRef]
  13. Bozkurt, M.B.; Kazemzadeh Azad, S.; Topkaya, C. Development of detachable replaceable links for eccentrically braced frames. Earthq. Eng. Struct. Dyn. 2019, 48, 1134–1155. [Google Scholar] [CrossRef]
  14. Chao, S.-H.; Khandelwal, K.; El-Tawil, S. Ductile Web Fracture Initiation in Steel Shear Links. J. Struct. Eng. 2006, 132, 1192–1200. [Google Scholar] [CrossRef]
  15. Della Corte, G.; D’Aniello, M.; Landolfo, R. Analytical and numerical study of plastic overstrength of shear links. J. Constr. Steel Res. 2013, 82, 19–32. [Google Scholar] [CrossRef]
  16. Hong, J.-K.; Uang, C.-M.; Okazaki, T.; Engelhardt, M.D. Link-to-Column Connection with Supplemental Web Doublers in Eccentrically Braced Frames. J. Struct. Eng. 2015, 141, 04014200. [Google Scholar] [CrossRef]
  17. Hu, S.; Xiong, J.; Zhou, Q.; Lin, Z. Analytical and Numerical Investigation of Overstrength Factors for Very Short Shear Links in EBFs. KSCE J. Civ. Eng. 2018, 22, 4473–4482. [Google Scholar] [CrossRef]
  18. Liu, X.-G.; Fan, J.-S.; Liu, Y.-F.; Zheng, M.-Z.; Nie, J.-G. Theoretical research into cyclic web buckling and plastic overstrength of shear links. Thin Walled Struct. 2020, 152, 106644. [Google Scholar] [CrossRef]
  19. Ohsaki, M.; Nakajima, T. Optimization of link member of eccentrically braced frames for maximum energy dissipation. J. Constr. Steel Res. 2012, 75, 38–44. [Google Scholar] [CrossRef] [Green Version]
  20. Yin, W.-H.; Sun, F.-F.; Jin, H.-J.; Hu, D.-Z. Experimental and analytical study on plastic overstrength of shear links covering the full range of length ratio. Eng. Struct. 2020, 220, 110961. [Google Scholar] [CrossRef]
  21. Song, H.; Ahmad, A.; Farooq, F.; Ostrowski, K.A.; Maslak, M.; Czarnecki, S.; Aslam, F. Predicting the compressive strength of concrete with fly ash admixture using machine learning algorithms. Constr. Build. Mater. 2021, 308, 125021. [Google Scholar] [CrossRef]
  22. Tarawneh, A.; Almasabha, G.; Murad, Y. ColumnsNet: Neural Network Model for Constructing Interaction Diagrams and Slenderness Limit for FRP-RC Columns. J. Struct. Eng. 2022, 148, 04022089. [Google Scholar] [CrossRef]
  23. Saleh, E.; Tarawneh, A.; Naser, M.; Abedi, M.; Almasabha, G. You only design once (YODO): Gaussian Process-Batch Bayesian optimization framework for mixture design of ultra high performance concrete. Constr. Build. Mater. 2022, 330, 127270. [Google Scholar] [CrossRef]
  24. Almasabha, G.; Tarawneh, A.; Saleh, E.; Alajarmeh, O. Data-Driven Flexural Stiffness Model of FRP-Reinforced Concrete Slender Columns. J. Compos. Constr. 2022, 26, 04022024. [Google Scholar] [CrossRef]
  25. Tarawneh, A.; Almasabha, G.; Alawadi, R.; Tarawneh, M. Innovative and Reliable Model for Shear Strength of Steel Fibers Reinforced Concrete Beams. Structures 2021, 32, 1015–1025. [Google Scholar] [CrossRef]
  26. Alshboul, O.; Alzubaidi, M.A.; Mamlook, R.E.A.; Almasabha, G.; Almuflih, A.S.; Shehadeh, A. Forecasting Liquidated Damages via Machine Learning-Based Modified Regression Models for Highway Construction Projects. Sustainability 2022, 14, 5835. [Google Scholar] [CrossRef]
  27. Alshboul, O.; Shehadeh, A.; Tatari, O.; Almasabha, G.; Saleh, E. Multiobjective and multivariable optimization for earthmoving equipment. J. Facil. Manag. 2022. [Google Scholar] [CrossRef]
  28. Shehadeh, A.; Alshboul, O.; Tatari, O.; Alzubaidi, M.A.; Hamed El-Sayed Salama, A. Selection of heavy machinery for earthwork activities: A multi-objective optimization approach using a genetic algorithm. Alex. Eng. J. 2022, 61, 7555–7569. [Google Scholar] [CrossRef]
  29. Alshboul, O.; Shehadeh, A.; Hamedat, O. Development of integrated asset management model for highway facilities based on risk evaluation. Int. J. Constr. Manag. 2021, 1–10. [Google Scholar] [CrossRef]
  30. Shehadeh, A.; Alshboul, O.; Hamedat, O. A Gaussian mixture model evaluation of construction companies’ business acceptance capabilities in performing construction and maintenance activities during COVID-19 pandemic. Int. J. Manag. Sci. Eng. Manag. 2022, 17, 112–122. [Google Scholar] [CrossRef]
  31. Alshboul, O.; Shehadeh, A.; Hamedat, O. Governmental Investment Impacts on the Construction Sector Considering the Liquidity Trap. J. Manag. Eng. 2022, 38, 04021099. [Google Scholar] [CrossRef]
  32. Shehadeh, A.; Alshboul, O.; Hamedat, O. Risk Assessment Model for Optimal Gain-Pain Share Ratio in Target Cost Contract for Construction Projects. J. Constr. Eng. Manag. 2022, 148, 04021197. [Google Scholar] [CrossRef]
  33. Alshboul, O.; Shehadeh, A.; Almasabha, G.; Almuflih, A.S. Extreme Gradient Boosting-Based Machine Learning Approach for Green Building Cost Prediction. Sustainability 2022, 14, 6651. [Google Scholar] [CrossRef]
  34. Almasabha, G. Gene expression model to estimate the overstrength ratio of short links. Structures 2022, 37, 528–535. [Google Scholar] [CrossRef]
  35. Alshboul, O.; Shehadeh, A.; Al-Kasasbeh, M.; Al Mamlook, R.E.; Halalsheh, N.; Alkasasbeh, M. Deep and machine learning approaches for forecasting the residual value of heavy construction equipment: A management decision support model. Eng. Constr. Archit. Manag. 2021. [Google Scholar] [CrossRef]
  36. Shehadeh, A.; Alshboul, O.; Al Mamlook, R.E.; Hamedat, O. Machine learning models for predicting the residual value of heavy construction equipment: An evaluation of modified decision tree, LightGBM, and XGBoost regression. Autom. Constr. 2021, 129, 103827. [Google Scholar] [CrossRef]
  37. Cevik, A. Genetic programming based formulation of rotation capacity of wide flange beams. J. Constr. Steel Res. 2007, 63, 884–893. [Google Scholar] [CrossRef]
  38. Fonseca, E.T.; da Vellasco, P.C.G.S.; de Andrade, S.A.L.; Vellasco, M.M.B.R. Neural network evaluation of steel beam patch load capacity. Adv. Eng. Softw. 2003, 34, 763–772. [Google Scholar] [CrossRef]
  39. Güneyisi, E.M.; D’Aniello, M.; Landolfo, R.; Mermerdaş, K. A novel formulation of the flexural overstrength factor for steel beams. J. Constr. Steel Res. 2013, 90, 60–71. [Google Scholar] [CrossRef]
  40. Fan, J.; Wang, X.; Wu, L.; Zhou, H.; Zhang, F.; Yu, X.; Lu, X.; Xiang, Y. Comparison of Support Vector Machine and Extreme Gradient Boosting for predicting daily global solar radiation using temperature and precipitation in humid subtropical climates: A case study in China. Energy Convers. Manag. 2018, 164, 102–111. [Google Scholar] [CrossRef]
  41. Zhang, M.; Xiang, F.; Liu, Z. Short-term traffic flow prediction based on combination model of XGBoost-LightGBM. In Proceedings of the 2018 International Conference on Sensor Networks and Signal Processing (SNSP), Xi’an, China, 28–31 October 2018; pp. 322–327. [Google Scholar]
  42. Pathy, A.S.; Meher, B.P. Predicting algal biochar yield using eXtreme Gradient Boosting (XGB) algorithm of machine learning methods. Algal Res. 2020, 50, 102006. [Google Scholar] [CrossRef]
  43. Price, B. Investigation on Innovative Shear Link Configurations and Optimal Design for Earthquake Resistant Steel Eccentrically Braced Frames. Master’s Thesis, University of Texas at Arlington, Arlington, TX, USA, 2015. [Google Scholar]
  44. Dubina, D.; Stratan, A.; Dinu, F. Dual high-strength steel eccentrically braced frames with removable links. Earthq. Eng. Struct. Dyn. 2008, 37, 1703–1720. [Google Scholar] [CrossRef]
  45. Hjelmstad, K.D.; Popov, E.P. Cyclic Behavior and Design of Link Beams. J. Struct. Eng. 1983, 109, 2387–2403. [Google Scholar] [CrossRef]
  46. Volynkin, D.; Dusicka, P.; Clifton, G.C. Intermediate Web Stiffener Spacing Evaluation for Shear Links. J. Struct. Eng. 2018, 145, 04018257. [Google Scholar] [CrossRef]
  47. Yang, X. Artificial neural networks. In Handbook of Research on Geoinformatics; IGI Global: Hershey, PA, USA, 2009; pp. 122–128. [Google Scholar]
  48. Kim, M.; Jung, S.; Kang, J. Artificial Neural Network-Based Residential Energy Consumption Prediction Models Considering Residential Building Information and User Features in South Korea. Sustainability 2020, 12, 109. [Google Scholar] [CrossRef] [Green Version]
  49. Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar] [CrossRef] [Green Version]
  50. Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Statist. 2001, 29, 11891232. [Google Scholar] [CrossRef]
  51. Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.-Y. LightGBM: A highly efficient gradient boosting decision tree. Adv. Neur. Infor. Process. Sys. 2017, 30, 3146–3154. [Google Scholar]
  52. Zeng, H.; Yang, C.; Zhang, H.; Wu, Z.; Zhang, J.; Dai, G.; Babiloni, F.; Kong, W. A lightGBM-based EEG analysis method for driver mental states classification. Comput. Intell. Neurosci. 2019, 2019, 3761203. [Google Scholar] [CrossRef]
  53. Tong, L.; Zhang, Y.; Zhang, L.; Liu, H.; Zhang, Z.; Li, R. Ductility and energy dissipation behavior of G20Mn5QT cast steel shear link beams under cyclic loading. J. Constr. Steel Res. 2018, 149, 64–77. [Google Scholar] [CrossRef]
  54. Mahmoudi, F.; Dolatshahi, K.M.; Mahsuli, M.; Nikoukalam, M.T.; Shahmohammadi, A. Experimental study of steel moment resisting frames with shear link. J. Constr. Steel Res. 2018, 154, 197–208. [Google Scholar] [CrossRef]
Figure 1. Processing structure for prediction of the short links’ shear strength.
Figure 1. Processing structure for prediction of the short links’ shear strength.
Buildings 12 00775 g001
Figure 2. The architecture of the A N N to estimate the short links’ shear strength.
Figure 2. The architecture of the A N N to estimate the short links’ shear strength.
Buildings 12 00775 g002
Figure 3. X G B O O S T   mechanism.
Figure 3. X G B O O S T   mechanism.
Buildings 12 00775 g003
Figure 5. The 10-fold cross-validation technique exemplification.
Figure 5. The 10-fold cross-validation technique exemplification.
Buildings 12 00775 g005
Figure 6. Database statistical description.
Figure 6. Database statistical description.
Buildings 12 00775 g006
Figure 7. Correlation matrix analysis.
Figure 7. Correlation matrix analysis.
Buildings 12 00775 g007
Figure 8. Model’s prediction comparison with experimental strength.
Figure 8. Model’s prediction comparison with experimental strength.
Buildings 12 00775 g008
Figure 9. Representation of features importance.
Figure 9. Representation of features importance.
Buildings 12 00775 g009
Figure 10. Distribution of overstrength ratio for the built-up sections.
Figure 10. Distribution of overstrength ratio for the built-up sections.
Buildings 12 00775 g010
Figure 11. Distribution of overstrength ratio for the built-up sections.
Figure 11. Distribution of overstrength ratio for the built-up sections.
Buildings 12 00775 g011
Figure 12. Performance of overstrength ratio vs. link length ratio for various models.
Figure 12. Performance of overstrength ratio vs. link length ratio for various models.
Buildings 12 00775 g012
Table 1. Description of model variables.
Table 1. Description of model variables.
FeatureDefinitionData Type
( b f / t f )Flange slenderness ratioNumeric
( d / t w )Web slenderness ratioNumeric
( A f / A w )Flange to web area ratioNumeric
( A f f y f l a n g e )Flange forceNumeric
( A w f y w e b )Web forceNumeric
e / ( M / V ) Link length ratioNumeric
Table 2. Database description of short links.
Table 2. Database description of short links.
ReferenceNo. of Testsbf/tfd/twe/(M/V)fyflange, MPafyweb, MPaVtest (kN)
Ji et al., 2015 [3]1212.9400.58–0.97319228; 273869–1130
Ji et al., 2016 [11]210.6; 14.2350.7–0.76378; 396228838–926
McDaniel et al., 2003 [5]210.6–13.333.90.59; 0.823663549363–9919
Volynkin et al., 2018 [46]512–12.821.7–44.20.76–1.02364; 455364; 374783–1034
Dusicka et al., 2010 [8]511.8; 13.622–33.90.8; 0.82223–503242–5031845–4348
Liu et al., 2017 [4]1110–1321–351.12–1.6366354–362373–668
Okazaki et al., 2005 [6]1111.5–18.322.1–56.81.04–1.49319–362382–404585–1280
Okazaki, T. 2004 [7]612.257.51.11351.63931007–1140
Bokurt and Topaya 2017 [12]818–20.722.4–22.81.04–1.59268–281275–299275–591
Bokurt and et al., 2019 [13]618–2022.2–291.26–1.59272–357276–343288–573
Tong et al., 2018 [53]41217.91.25461.2463.4720–1013
Mahmoudi et al., 2018 [54]110340.78301301478
Hjelmstad et al., 1983 [45]811.5; 15.643.4; 571.27–1.57241.3; 285.4711–914600–1067
Dubina et al., 2008 [44]2412.2538.70.65–1.3221–315221–315270–420
Price, B. 2015 [43]511.5; 16.523.8; 56.81.11; 1.23353.7; 398.5360; 403433–1298
Total110
Table 3. Statistical description analysis of features.
Table 3. Statistical description analysis of features.
Stander StatisticsFeatures
( b f / t f ) ( d / t w ) ( A f / A w ) ( A f f yflange ) ( A w f yweb ) e / ( M / V )
Mean13.5136.661.01879.08891.671.09
Standard Error0.241.160.04115.7107.910.03
Median12.2438.710.86608.74664.321.1
Mode12.2438.710.86803.88550.240.87
Standard Deviation2.5312.180.431213.481131.790.28
Sample Variance6.42148.370.181,472,5371,280,9550.08
Kurtosis0.56−0.760.3336.8437.1−0.65
Skewness1.330.311.085.655.74−0.15
Range10.7146.151.869622.048304.591.36
Minimum1011.330.41259.96219.730.33
Maximum20.7157.482.2798828524.321.69
Sum1486.024032.6110.6196698.7198083.4119.9
Count110110110110110110
Table 4. Performance comparison for prediction of the ML models and AISC code.
Table 4. Performance comparison for prediction of the ML models and AISC code.
Performance ComparisonPrediction Models
L i g h t G B M X G B O O S T A N N A I S C   C o d e
M A E 92.0196.5378.0397.9
R M S E 132.5284.0507.9804.2
M A P E 11.724.135.839.2
R 2 0.990.960.900.75
Training   Tim 7 s9 s14 s
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Almasabha, G.; Alshboul, O.; Shehadeh, A.; Almuflih, A.S. Machine Learning Algorithm for Shear Strength Prediction of Short Links for Steel Buildings. Buildings 2022, 12, 775. https://doi.org/10.3390/buildings12060775

AMA Style

Almasabha G, Alshboul O, Shehadeh A, Almuflih AS. Machine Learning Algorithm for Shear Strength Prediction of Short Links for Steel Buildings. Buildings. 2022; 12(6):775. https://doi.org/10.3390/buildings12060775

Chicago/Turabian Style

Almasabha, Ghassan, Odey Alshboul, Ali Shehadeh, and Ali Saeed Almuflih. 2022. "Machine Learning Algorithm for Shear Strength Prediction of Short Links for Steel Buildings" Buildings 12, no. 6: 775. https://doi.org/10.3390/buildings12060775

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop