A Comparative Study between Regression and Neural Networks for Modeling Al6082-T6 Alloy Drilling

Karkalos, Nikolaos E.; Efkolidis, Nikolaos; Kyratsis, Panagiotis; Markopoulos, Angelos P.

doi:10.3390/machines7010013

Open AccessArticle

A Comparative Study between Regression and Neural Networks for Modeling Al6082-T6 Alloy Drilling

by

Nikolaos E. Karkalos

¹,

Nikolaos Efkolidis

²,

Panagiotis Kyratsis

^2,*

and

Angelos P. Markopoulos

¹

School of Mechanical Engineering, Section of Manufacturing Technology, National Technical University of Athens, Heroon Polytechniou 9, 15780 Athens, Greece

²

Department of Mechanical Engineering & Industrial Design, Western Macedonia University of Applied Sciences, GR 50100 Kila Kozani, Greece

^*

Author to whom correspondence should be addressed.

Machines 2019, 7(1), 13; https://doi.org/10.3390/machines7010013

Submission received: 31 December 2018 / Revised: 28 January 2019 / Accepted: 29 January 2019 / Published: 2 February 2019

(This article belongs to the Special Issue Advances in CAD/CAM/CAE Technologies)

Download

Browse Figures

Versions Notes

Abstract

:

Apart from experimental research, the development of accurate and efficient models is considerably important in the field of manufacturing processes. Initially, regression models were significantly popular for this purpose, but later, the soft computing models were proven as a viable alternative to the established models. However, the effectiveness of soft computing models can be often dependent on the size of the experimental dataset, and it can be lower compared to that of the regression models for a small-sized dataset. In the present study, it is intended to conduct a comparison of the performance of various neural network models, such as the Multi-layer Perceptron (MLP), the Radial Basis Function Neural Network (RBF-NN), and the Adaptive Neuro-Fuzzy Inference System (ANFIS) models with the performance of a multiple regression model. For the development of the models, data from drilling experiments on an Al6082-T6 workpiece for various process conditions are employed, and the performance of models related to thrust force (F_z) and cutting torque (M_z) is assessed based on several criteria. From the analysis, it was found that the MLP models were superior to the other neural networks model and the regression model, as they were able to achieve a relatively lower prediction error for both models of F_z and M_z.

Keywords:

drilling; Al6082-T6; multiple regression; multi-layer perceptron; radial basis function neural network; adaptive neuro-fuzzy inference system; thrust force; cutting torque

1. Introduction

Aluminum is among the most widely-used metallic elements. Due to its beneficial properties such as being lightweight and having high corrosion resistance for a considerable range of factors, it is preferred for the fabrication of various parts in the automotive and aerospace industry. Aluminum is often found in the form of an alloy with various elements such as copper, magnesium, manganese, and silicon, among others, and can be subjected to special treatment such as quenching or precipitation hardening in order to alter its properties, such as strength. Various aluminum alloys exhibit considerably high strength-to-weight ratios, and most of the alloys have excellent machinability.

As aluminum is used for a wide variety of applications, a large amount of experimental work has already been conducted regarding various machining processes. Especially in the case of the drilling process, several parameters regarding the influence of process parameters on the forces, surface quality, or tool wear have been conducted, providing useful details for the appropriate processing of this material. Nouari et al. [1] investigated the effect of drilling process parameters and tool coating on tool wear during dry drilling of AA2024 aluminum alloy. They noted that at a certain cutting speed, a transition in tool wear mechanisms occurs, leading in higher tool wear, and thus, they investigated the shift in wear mechanism for various tool coating materials. Their results indicated that, at high cutting speeds, larger deviations from the nominal hole diameters are obtained mainly due to the higher temperature generated on the rake face of the tool, leading to increased tool damage. While abrasion or adhesion wear is observed for lower speeds in the form of built-up edge or built-up layer, the increase of temperature for high cutting speeds resulted in diffusion wear and the increase of the material adhered to the tool. Furthermore, although the uncoated tool produced more accurate hole dimensions at low cutting speeds, this trend was reversed in the case of higher cutting speeds, and the use of the uncoated HSS tool was found to produce the least accurate holes. Girot et al. [2] also conducted experiments regarding tool wear during dry drilling of aluminum alloys. In their work, they aimed at the reduction of the built-up layer in the cutting tool by altering the process parameters and tool coating and geometry. They employed an uncoated carbide drill, a nano-crystalline diamond-coated drill, a nano-crystalline Balinit-coated drill, and a Mo diamond-coated drill and conducted experiments with variable cutting speed. After they established the relationship between the cutting parameters, thrust force, and burr height, they found that diamond coatings were superior due to their low friction coefficient and adequate adhesion of the coating on the tool, and the nano-structured diamond coated tools were the best performing in particular.

Farid et al. [3] conducted a study on chip morphology during high speed drilling of Al-Si alloy. More specifically, they investigated the effect of cutting speed and feed rate on chip morphology by conducting observations at various surfaces of the chip using microscopy techniques. For the drilling experiments, uncoated WC drills were employed to produce 15 mm-diameter holes under various cutting speeds and feed values, in the range of 240–320 m/min and 0.16–0.24 mm/rev, respectively. It was found that, on the free surface of the chip, lamella structures were created and that an increase of speed and feed rate led to a decrease of the frequency of the appearance of these lamella structures. Regarding chip morphology, the increase of cutting speed and feed led to the increase of chip pitch and height and a decrease of the chip compression ratio. Finally, thrust force was found to increase with a decrease of cutting speed and an increase of the feed rate, so it was suggested that high cutting speeds are essential in order to minimize thrust force in the case of Al-Si alloys’ drilling.

Qiu et al. [4] investigated the use of high-performance drills during drilling of aluminum and titanium alloys with a view toward minimizing cutting force and torque. For their experiments, they employed two conventional and two modified types of drills, which had larger point, primary relief, and helix angles and a lower second clearance angle. For the aluminum alloy, the cutting speed was varied from 100–200 m/min and the feed rate from 0.1–0.2 mm/rev. The experiments indicated that the tools with the modified geometry were able to reduce thrust force and torque in the majority of cases, with the average reduction of force being 9.78% and torque 21.08%. Dasch et al. [5] conducted a thorough comparison regarding various categories of coated cutting tools for the drilling process of aluminum. For their study, they used five different types of coating materials, namely carbon, graphitic, hydrogenated and hydrogen-free diamond-like carbon, and diamond on two types of drills, namely HSS and carbide drills. In the tests, they measured tool wear, temperature, and spindle power. After conducting tests on various cases of cutting speed and feed combinations, they concluded that the most important is to coat the flutes of the tool rather than the cutting edge, that coated tools were able to drill up to 100-times more holes than the uncoated, and that hydrogenated carbon coatings are superior to the other types of coating.

Regarding modeling of the drilling process, various studies have been conducted due to its popularity. Kurt, Bagci, and Kaynak [6] determined the optimum cutting parameters for high surface quality and hole accuracy using the Taguchi method. In their work, they conducted experiments at various cutting speeds, feed rates, and depths of drilling and tested the performance of both uncoated and coated HSS drills. After the experiments were conducted by the Taguchi method, regression models for Ra, and hole diameter accuracy were derived, and the optimum parameters were identified by analyzing the S/N ratio results. Kilickap [7] used the Taguchi method and Response Surface Methodology (RSM) to predict burr height during drilling of aluminum alloys and to determine the optimum drilling parameters. He conducted experiments using various cutting speeds, feed rates, and point angles and observed the burr height and surface roughness. Analysis using the Taguchi method and Analysis Of Variance (ANOVA) revealed the optimum parameters for reducing the burr height and surface roughness, and regression models created using RSM were proven to be adequate to describe the correlation between process parameters and process outcome. Sreenivasulu and Rao [8], Efkolidis et al [9], and Kyratsis et al [10] also employed the Taguchi method to determine the optimum levels of the process parameters for the minimization of thrust force and torque during drilling of aluminum alloys. After they performed experiments at various cutting speeds, feed rates, and various drill diameters and geometries, they determined the most significant parameters of the process and their optimum values.

Apart from regression models, several studies based on neural network models have been reported in the relevant literature. Singh et al. [11] employed an Artificial Neural Network (ANN) model to predict tool wear during drilling of copper workpieces. As inputs for the neural network, feed rate, spindle speed, and drill diameter were selected, and thrust force, torque, and maximum wear were the outputs. After the optimum number of hidden layers and neurons, as well as the optimum learning rate and momentum coefficient were determined, it was shown that the best neural network predicted tool wear with a maximum error of 7.5%. Umesh Gowda et al. [12] presented an ANN model for the prediction of circularity, cylindricity, and surface roughness when drilling aluminum-based composites. After the experiments, tool wear, surface roughness, circularity, and cylindricity were obtained as a function of machining time, and NN models were used to correlate process parameters with these quantities.

Besides the usual MLP models, other similar approaches such as RBF-NN or hybrid approaches combining MLP with fuzzy logic or genetic algorithms have also been presented. Neto et al. [13] also presented MLP and ANFIS models for the prediction of hole diameter during drilling of various alloys. For the neural network models, inputs from various sensors such as acoustic emission, electric power, force, and vibration were employed. Thorough analysis of the predicted results’ errors revealed that MLP models had superior performance to ANFIS models. Ferreiro et al. [14] conducted a comprehensive study in developing an AI-based burr detection system for the drilling process of Al7075-T6. For that reason, they employed data-mining techniques and tested various approaches with a view toward reducing the classification errors. All developed models performed better than the existing mathematical model for burr prediction, and in particular, it was shown that a model based on the naive Bayes method exhibited the maximum accuracy.

Lo [15] employed an ANFIS model for the prediction of surface roughness in end milling. In their models, they employed both a triangular and trapezoidal membership function with 48 sets of data for training and 24 for testing. The error was found to be minimum using the triangular membership function. Zuperl et al. [16] used an ANFIS model for the estimation of flank wear during milling. The model was trained with 140 sets of data (75 for training and 65 for testing) and included 32 fuzzy rules. It was shown that a model with a triangular membership function was again capable of exhibiting the lowest error. Song and Baseri [17] applied the ANFIS model for the selection of drilling parameters in order to reduce burr size and improve surface quality. In this work, a different model was built for each of the four outputs, namely burr height, thickness, type, and hole overcut. One hundred and fifty datasets were used, 92 for the training, 25 for checking, and 25 for testing of the model. For the first output, the best model had four membership functions of a triangular type; for the second one, the best model had a three-membership function of a Gaussian type; and the last two three- and four-membership functions of a generalized bell type, respectively. Fang et al. [18] used an RBF model for surface roughness during machining of aluminum alloys. Their model included eight inputs, related to process parameters, cutting forces, and vibration components, and five outputs, related to surface roughness. The RBF model was proven to be faster than an MLP model, but it was inferior in terms of accuracy. Elmounayri et al. [19] employed an RBF model for the prediction of cutting forces during ball-end milling. The model had four inputs, related to the process parameters and four outputs, namely the maximum, minimum, mean, and standard deviation of instantaneous cutting force. In this case, it was noted that the RBF network was more efficient and accurate and faster than the classical MLP.

Although comparisons between regression models and neural network model for cases of machining processes have already been conducted, comparison between several soft computing methods has rarely been conducted, and only a few relevant studies exist. Tsai and Wang [20] conducted a thorough comparison between various neural network models such as different variants of MLP, RBF-NN, and ANFIS for the cases of electrical discharge machining. They concluded that the best performing method was ANFIS, as it exhibited the best accuracy among the tested methods. Nalbant et al. [21] conducted a comparison between regression and artificial neural network models for CNC turning cases and concluded that although the neural network model was performing slightly better, both models were appropriate for modeling the experimental results. Jurkovic et al. [22] compared support vector regression, polynomial regression, and artificial neural networks in the case of high-speed turning. Their models included feed, cutting speed, and depth of cut as inputs, and separate models were created for three output quantities, namely cutting forces, surface roughness, and cutting tool life. In the cases of cutting force and surface roughness, they found that the polynomial regression model was the best, whereas in the case of cutting tool life, ANN performed better than the other methods.

In the present study, a comparison of various neural network models, as well as multiple regression models is conducted with a view toward determining the best performing model for the case of drilling Al6082-T6. As in this case, a relatively small dataset is used, it is expected that the results of regression models and neural network models will be comparable, and so, it is interesting to find out whether there can be a clear difference in the performance of these methods in the present case. The various neural network models are trained with different parameters, and the comparison between them is conducted by criteria relevant to the prediction error, both for training and testing datasets.

2. Artificial Neural Networks

Artificial neural networks are among the most important soft computing methods, widely used for a great range of applications spanning across various scientific fields. This method is able to successfully predict the outcome of a process by using pairs of input and output data in a learning procedure. In fact, this method imitates the function of biological neurons, which receive inputs, process them, and generate a suitable output. The most common ANN model, namely MLP, has a structure containing various interconnected layers of neurons, starting from an input layer and ending at an output layer. The middle layers are called hidden layers, and their number is variable, depending on the size and the characteristics of each problem. Each neuron is connected by artificial synapses to the neurons of the following layer, and a weighting coefficient, or weight, is assigned to each synapse. The inputs for each neuron are multiplied by the weighting coefficient of the respective synapse, and then, they are summed; after that, the output from each neuron is generated by using an activation function, which usually is a sigmoid function, such as the hyperbolic tangent.

The ANN model is created by a learning process, containing three different stages, i.e., training, validation, and testing. For each stage, a part of the total input/output data is reserved, most commonly 70%, 15%, and 15%, respectively. The learning process consists of the determination of suitable weight values according to real input/output data pairs and is an iterative process. This process is called error backpropagation, as the difference of the predicted and actual output is first computed in the output layer, and then, the error is propagated through the network in the opposite way, from the output to the input layer; after various iterations, or epochs, the network weights are properly adjusted so that the error is minimized. The training and validation stage both involve adjustment of the weights, and it is possible to stop the learning process if the error is not decreasing after some epochs, a technique that is named the early stopping technique. Finally, during the testing stage, the trained model is checked for its generalization capabilities by providing it with unknown data from the testing data sample. Usually, the Mean Squared Error (MSE) is used for the determination of model performance:

M S E = \frac{1}{n} \sum_{i = 1}^{n} {(Y_{i} - A_{i})}^{2}

(1)

where n is the number of data samples, Y_i are the predicted values and A_i the actual values.

Furthermore, the Mean Absolute Percentage Error (MAPE) can be also employed to evaluate the results of neural network models:

M A P E = \frac{100 %}{n} \sum_{i = 1}^{n} | \frac{Y_{i} - A_{i}}{A_{i}} |

(2)

Finally, RBF-NN is constructed by using an appropriate radial function as the activation function for a neural network model with a single layer. Most commonly, a Gaussian-type function is used as the activation function:

φ (r) = e^{- ε r^{2}}

(3)

where ε is a small number and r =

‖ x - x_{i} ‖

, with x being an input and x_i the center of the radial basis function. Apart from the widely-used MLP model and RBF-NN models, various other types of neural networks exist, as well as hybrid models, combining ANN with other soft computing methods such as fuzzy logic. In addition to the learning and optimization ability of the neural networks, fuzzy logic systems are able to provide a means for reasoning to the system with the use of IF-THEN rules. ANFIS is a soft computing method with characteristics related both to neural networks and fuzzy logic. Generally, the creation of an ANFIS model has several similarities to the creation of a MLP model, but there exist also some basic differences. This model has a structure similar to the MLP structure, but there are some additional layers, related to membership functions of the Fuzzy Inference System (FIS). One of the most common fuzzy models used in ANFIS models is the Sugeno fuzzy model. By the training process, fuzzy rules are formed, and an FIS is created. At first, the input data are converted into fuzzy sets through a fuzzification layer, and then, membership functions are generated. Membership functions can have various shapes such as triangular, trapezoid, or Gaussian, among others. In the second layer, the number of nodes is equal to the number of fuzzy rules of the system; the determination of the number of fuzzy rules is conducted using a suitable method such as the subtractive clustering method. The output of the second layer nodes is the product of the input signals, which they receive from the previous nodes. Then, in the third layer, the ratio of each rule’s firing strength to the sum of the firing strengths of all rules is calculated, and it is used in the defuzzification layer to determine the output values from each node. Finally, the total output is calculated by summation of the outputs of the fourth layer. The FIS parameters are updated by means of a learning algorithm in order to reduce the prediction error, such as in the case of MLP model.

3. Methods and Materials

3.1. Technics of Experiments

In the present work, it is intended to conduct a comparison of various methods, which can be employed for modeling of the drilling process of Al6082-T6. The datasets for the models were obtained by conducting experiments on an Al6082-T6 workpiece. This work material used for the present investigation was a 140 mm × 100 mm × 30 mm plate with the chemical composition and main mechanical properties as summarized in Table 1.

Drilling experiments were carried out on a HAAS VF1 CNC machining center using solid carbide drill tools. Two components of drilling loads (thrust force F_z and torque M_z values) were recorded with a Kistler dynamometer Type 9123 connected to a three-channel charge amplifier with a data acquisition system. Dynoware software (Type 2825D-02) was used in order to monitor and record the experimental values of the cutting forces, as shown in Figure 1.

3.2. Methodology of Results Analysis

The experiments were conducted according to a full factorial strategy. The three cutting parameters selected for the present investigation were the cutting speed (V), the feed rate (f), and the cutting tool diameter (D). Cutting velocity values of 50 m/min, 100 m/min, and 150 m/min together with feed rates of 0.15 mm/rev, 0.2 mm/rev, and 0.25 mm/rev were used in combination with three cutting tool diameters (D) of 8 mm, 10 mm, and 12 mm. The constant depth of the holes drilled was 30 mm. The machining parameters used and their levels are shown in Table 2.

During the experimental process, 27 different solid carbide drilling tools (one tool for each hole) were clamped to a Weldon clamping device with a high rigidity, while a mechanical vise was used for clamping the workpiece. As the type of cutting tools (KENNAMETAL B041A/KC7325) is non-through coolant solid carbide, the cutting fluid used (KOOLRite 2270) was provided by the delivery system near them. Figure 2 demonstrates all the dimensional details of the main shape of the cutting tools. All the possible combinations of the manufacturing parameters were used (cutting speed, feed rate, tool diameter).

As shown in Figure 3, apart from multiple regression method, three soft computing methods, related to artificial neural networks, namely MLP, RBF-NN, and ANFIS methods, were used.

For the development of the model, MATLAB^TM software was employed. For each method, only one characteristic, mainly related to the architecture of the network, was varied, and the other settings were assumed to be equal to the default settings of the software. For the MLP and RBF-NN, the number of hidden neurons was varied, and for the ANFIS, where the number of inner nodes cannot be altered in the same way as in the other two types of networks, the number of clusters used for the data, using the fuzzy c-means clustering method, which directly affects the number of parameters of the system, was varied. In all cases, the lower and upper boundaries for the variable parameters were chosen in a way that it did not lead to extreme overfitting; the analysis conducted was able to determine exactly the limit values of the variable parameters, which can lead to a properly-functioning model.

The choice of the number of hidden neurons is crucial in the case of MLP and RBF-NN, as it can increase the accuracy of the network up to a point, and then, it causes problems of overfitting. In the relevant literature, no certain methodology for the determination of the exact number of hidden neurons exists; there are several empirical rules, which usually relate the appropriate number of hidden neurons with the number of input and output variables. However, the number of available pairs of input/output data also affects the choice of the number of hidden neurons, as it is not appropriate to use more network parameters, e.g., weights and biases, than the number of available input/output data pairs. This is especially important for the present study, as the size of dataset was relatively small, namely 27 input/output data pairs. For the determination of the lower and upper value of the number of hidden neurons, in order not to conduct an exhaustive search, the recommendations from some empirical relationships were taken into consideration. In [23], it was proposed that the number of hidden neurons could be approximated as follows:

n = \sqrt{N_{i n p} + N_{o u t}} + a

(4)

where N_inp is the number of input neurons, N_out the number of output neurons, and α is a number between zero and 10. In the present case, N_inp equals to three, i.e., drill diameter, cutting speed, and feed, and N_out is one, i.e., thrust force or cutting torque. Thus, according to this formula, the lower and upper boundaries for the number of hidden neurons in the present study was between two and 12. However, in the present study, an investigation of the optimum value of hidden neurons was conducted by taking into consideration the performance of the models regarding not only MSE, but also MAPE, both for the training and testing dataset, as it is required to achieve a low value of error for the training data, but also to prevent the network from overfitting, something that is indicated by the error in the testing dataset. For the MLP and RBF-NN, the number of hidden neurons is varied between one and eight, a choice that is comparable to the suggestions offered by Equation (4). Each network was trained 50 times with the same settings in order to overcome the effect of the random initialization of weights. For the ANFIS models, the parameter that was varied was the number of clusters, in the range of 2–4. As with the other two types of networks, the training was repeated 50 times, and the results regarding MSE and MAPE for training and testing data are presented.

For all models, the data pairs reserved for the testing stage of the network assessment were selected randomly, but were the same in order to conduct a more appropriate comparison between these models. Models were developed separately for thrust force F_z and torque M_z. In Figure 4, a flowchart presenting graphically the procedure followed in the present study is depicted.

4. Results and Discussion

4.1. Result of Experiment

Figure 5 shows the experimental values derived from the whole process. It is clear that there was a relationship between cutting parameters and cutting forces. When the tool diameter increased, both cutting forces increased. As the feed rate values increased, the cutting forces increased respectively. The cutting speed did not noticeably affect either cutting force. A full factorial strategy was applied, and twenty seven (27) drilling experiments were performed, while both F_z and M_z were modelled separately using polynomial mathematical models.

4.2. Multiple Regression Models

The RSM is a tool based on the collection of statistical and mathematical techniques used for process optimization. Optimization can be managed by creating an empirical relationship between independent variables and the response of the system using the quantitative data collected from experiment [5,6]. The experiments were designed and executed by employing second-order nonlinear mathematical models in order to predict the thrust force and the cutting torque, which are of the following form:

Y = b_{0} + b_{1} X_{1} + b_{2} X_{2} + b_{3} X_{3} + b_{4} X_{1}^{2} + b_{5} X_{2}^{2} + b_{6} X_{3}^{2} + b_{7} X_{1} X_{2} + b_{8} X_{1} X_{3} + b_{9} X_{2} X_{3}

(5)

where Y is the response, X_i stands for the coded values, and bi stand for the models’ regression coefficients.

When the level of significance is very small, less than 0.05, this indicates that the results were high and probably not randomly scattered. The significant terms were found by Analysis Of Variance (ANOVA) for each response (Table 3 and Table 4).

The following regression equations for the thrust force and the cutting torque as a function of three input process variables were developed:

\begin{array}{l} F_{z} = - 222 + 57.6 D + 1.96 V + 1292 f - 0.76 D * D - 0.0117 V * V - 667 f * f \\ + 0.0560 D * V + 103 D * f - 1.20 V * f \end{array}

(6)

and:

\begin{array}{l} M_{z} = 12.9 - 2.67 D + 0.0137 V - 15.6 f + 0.140 D * D - 0.000055 V * V + 4.7 f * f \\ - 0.000122 D * V + 2.63 D * f - 0.0233 V * f \end{array}

(7)

where D is the tool diameter (mm), f is the feed rate (mm/rev), and V is the cutting speed (m/min).

The p-values are used as a basic tool to check the importance of each of the coefficients. The smaller the amount of P, the more significant is the corresponding coefficient. In our case, for F_z, these factors are: D (p-value = 0.026), V (p-value = 0.002), V*V (p-value = 0.000), and D*f (p-value = 0.005), while for M_z, the significant terms are: D (p-value = 0.000), D*D (p-value = 0.000), V*V (p-value =0.046), and D*f (p-value = 0.000). Moreover, from Table 3 and Table 4, it is evaluated that the coefficient of multiple determination was very close to unity for both cases (R² = 99.6% for F_z and R² = 98.9% for M_z), and the adjusted coefficient (R² adj) was 99.4% for F_z and 98.3% for M_z. All these statistical estimators indicate an appropriate RSM model with the degree of freedom and optimal architecture that can be used for predictive simulations of the reactive extraction. Of course, for both cases, only the significant terms can be included in order to present a set of simplified equations. The agreement between experimental and RSM predicted data is shown in Figure 6. Residual analysis was performed to test the models’ accuracy; in both cases, all points were positioned near to a straight line, indicating that RSM predicts the experimental data for the considered valid region well.

4.3. Multi-Layer Perceptron Models

As was described in the Methods and Materials section, in the case of MLP models, an investigation of the optimum number of hidden neurons was conducted based on various criteria. After the MLP models were developed, the results obtained were as presented in Table 5.

As can be seen from Table 5 and Figure 7, in the case of the thrust force model, the MSE for the training dataset was generally decreasing with an increase of the number of hidden neurons. Especially, for models with over five neurons, the MSE became extremely small, indicating that the training dataset can be reproduced with a high level of accuracy by this model. However, the MSE for the testing dataset, which is an indication of overfitting, became the minimum for the model with four hidden neurons and then increased again. A similar situation was observed in the case of MAPE, where MAPE for the training dataset became extremely small for networks with more than four hidden neurons, but the minimum MAPE for the testing dataset was obtained for the model with five hidden neurons. Thus, the optimum number of hidden neurons was four or five, and the final choice was conducted in terms of total error, as the error values for these cases were very close. Finally, the case with four neurons was chosen as the total error was very close to the total error of the case with five neurons, and this network was less complex than the other.

As can be seen from Table 6 and Figure 8, in the case of the torque model, the MSE for training was decreasing with an increasing number of hidden neurons, mainly for models with more than six hidden neurons. However, the minimum value of MSE for the testing dataset was obtained for the model with four hidden neurons. Regarding MAPE, the minimum value for the training dataset was obtained for the model with seven hidden neurons, whereas the minimum value for the testing dataset was obtained for the model with six hidden neurons. Thus, the best model was determined again by checking the total error, and it was determined that the network with six hidden neurons was the best performing one. Generally, it was observed that the error values for the M_z model were considerably larger than those of the F_z model. Nevertheless, the results confirmed that the optimum number of neurons was dependent on the size of the dataset; as for the relatively small dataset of 27 samples, a model with six hidden neurons was sufficient.

4.4. Radial Basis Function Neural Network Models

As was previously mentioned, in the case of RBF-NN models, it is intended to determine the optimum number of hidden neurons. In Table 3, the results from the RBF-NN models are presented.

For the RBF-NN models in the case of thrust force, it can be observed from Table 7 and Figure 9 that MSE for the training set was decreasing with an increasing number of hidden neurons, but there was not such a large decrease as in the case of MLP models. For the testing dataset, MSE was the minimum for the model with seven hidden neurons, and the same was noted for MAPE for the training dataset. Thus, the model with seven hidden neurons was the optimum in this case. It is to be noted that the value of MAPE for both the training and testing dataset was considerably reduced for models with over three neurons, something that was not observed to a similar extent in the case of MLP models, at least for the testing dataset.

As for the torque (M_z) cases, it was observed from Table 8 and Figure 10 that the optimum model was the one with eight hidden neurons, as it exhibited both a lower MSE and MAPE for the testing dataset. Thus, it can be stated that for RBF-NN models, a slightly larger number of hidden neurons was required compared to MLP models.

4.5. Adaptive Neuro-Fuzzy Inference System Models

In the case of ANFIS models, results from cases with a different number of clusters are presented in Table 9.

In the case of the ANFIS models for thrust force, it can be seen from Table 9 and Figure 11 that the MSE for both training and testing datasets was reduced with an increasing number of clusters. Furthermore, the same can be observed for the MAPE for both datasets; thus, the ANFIS model with four clusters was selected as the best performing network. In the case of torque models, whose results are presented in Table 10 and Figure 12, the optimum value of MSE for the testing dataset was obtained for the model with two clusters, whereas the optimum value of MAPE for the testing dataset was obtained for the model with four clusters. Therefore, the best model was selected according to the total error values, which were lower for the model with four clusters.

4.6. Comparison of Regression and Neural Network Models

The thrust force and cutting torque predictive models were developed by RSM and ANN by implementing a full factorial experimentation design depending on the effects of the selected cutting parameters such as cutting speed, feed rate, and tool diameter. Figure 13 and Figure 14 illustrate the comparison between the experimental values of F_z and M_z and their values of the ANN and RSM models, respectively.

After the results regarding all developed models were presented, a comparison between these models was conducted. For the comparison, the best model from each method was selected, and its performance was evaluated using the same criteria, as can be seen in Table 11. As for the regression model, the MSE was not calculated, and also, no separate dataset for training and testing exists, so the total MAPE for each case was chosen as the criterion for the comparison between all models.

Among the three neural network models, it can be seen that the MLP model had the lowest MAPE values in all cases, followed by the ANFIS model and the RBF model. The MLP and ANFIS models were clearly superior to the RBF model and also better than the regression model. Thus, it can be concluded that the MLP model was more appropriate for the modeling of thrust force and torque in the presented case. Finally, given the relatively small size of the neural networks model, there was no significant difference in terms of computational time between these approaches and the regression model, and so, it was shown that these models can perform efficiently for the presented case.

5. Conclusions

In the present paper, various methods for the modeling of the drilling process, such as regression and neural network models, were compared in order to determine which is the most accurate and efficient one. These models were applied to cases of drilling Al 6082-T6 alloy under various process conditions. From this study, several conclusions can be drawn.

From the statistical analysis and derivation of the multiple regression models, it can be deduced that the developed models are sufficiently accurate, as they exhibited high correlation coefficient values, namely 99.6% and 98.9% for the F_z and M_z models, respectively, and also low MAPE values, namely 0.86305% and 3.0495% for the F_z and M_z models, respectively.

As for the neural network models, it was determined that an MLP model with 4 hidden neurons was more suitable for the prediction of thrust force, and an MLP model with six hidden neurons was more appropriate for the prediction of cutting torque. Furthermore, for the RBF-NN model, a higher number of hidden neurons was shown to lead to better accuracy, namely seven and eight in the cases of thrust force and torque, respectively. Finally, for the ANFIS models with fuzzy c-means clustering, the use of four clusters was the best choice in both the F_z and M_z cases.

From the comparison of all models, it was concluded that the MLP model performed better in all cases, followed by ANFIS, multiple regression, and RBF-NN. Thus, it was shown that MLP can be competitive with multiple regression models even for relatively small datasets and that ANFIS and RBF-NN are less suitable for cases with small datasets.

Author Contributions

Conceptualization, N.E.K., N.E., P.K., and A.P.M.; methodology, N.E.K., N.E., P.K., and A.P.M.; software, N.E.K. and N.E.; validation, N.E. and P.K.; formal analysis, N.E.K. and N.E.; investigation, N.E.K., N.E., P.K., and A.P.M.; resources, P.K. and A.P.M.; writing, original draft preparation, N.E.K. and N.E.; writing, review and editing, P.K. and A.P.M.; supervision, P.K. and A.P.M.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

Nouari, M.; List, G.; Girot, F.; Gehin, D. Effect of machining parameters and coating on wear mechanisms in dry drilling of aluminum alloys. Int. J. Mach. Tools Manuf. 2005, 45, 1436–1442. [Google Scholar] [CrossRef]
Girot, F.; Gutierre-Orrantia, M.E.; Calamaz, M.; Coupard, D. Modeling and adhesion tool wear in dry drilling of aluminum alloys. AIP Conf. Proc. 2011, 1315, 1639–1644. [Google Scholar]
Farid, A.A.; Sharif, S.; Idris, M.H. Chip morphology study in high speed drilling of Al-Si alloy. Int. J. Adv. Manuf. Technol. 2011, 57, 555–564. [Google Scholar] [CrossRef]
Qiu, K.; Qin, S.; Ge, C.; Chen, M. A study of high-performance drills in the drilling of aluminum alloy and titanium alloy. Key Eng. Mater. 2014, 589–590, 163–167. [Google Scholar] [CrossRef]
Dasch, J.M.; Ang, C.C.; Wong, C.A.; Cheng, Y.T.; Weiner, A.M.; Lev, L.C.; Konca, E. A comparison of five categories of carbon-based tool coatings for dry drilling of aluminum. Surf. Coat. Technol. 2006, 200, 2970–2977. [Google Scholar] [CrossRef]
Kurt, M.; Kaynak, Y.; Bagci, E. Evaluation of drilled hole quality in Al2024 alloy. Int. J. Adv. Manuf. Technol. 2008, 37, 1051–1060. [Google Scholar] [CrossRef]
Kilickap, E. Modeling and optimization of burr height in drilling of Al-7075 using Taguchi method and response surface methodology. Int. J. Adv. Manuf. Technol. 2010, 49, 911–923. [Google Scholar] [CrossRef]
Sreenivasulu, R.; Rao, C.S. Effect of drilling parameters on thrust force and torque during drilling of aluminum 6061 alloy-based on Taguchi design of experiments. J. Mech. Eng. 2016, 46, 41–48. [Google Scholar] [CrossRef]
Efkolidis, N.; Garcia-Hernandez, C.; Huertas-Talon, J.L.; Kyratsis, P. Modelling and prediction of thrust force and torque in drilling operations of Al7075 using ANN and RSM Methodologies. Strojinski Vestn. J. Mech. Eng. 2018, 64, 351–361. [Google Scholar]
Kyratsis, P.; Markopoulos, A.; Efkolidis, N.; Maliagkas, V.; Kakoulis, K. Prediction of thrust force and cutting torque in drilling based on the response surface methodology. Machines 2018, 6, 24. [Google Scholar] [CrossRef]
Singh, A.K.; Panda, S.S.; Pal, S.K.; Chakraborty, D. Predicting drill wear using an artificial neural network. Int. J. Adv. Manuf. Technol. 2006, 28, 456–462. [Google Scholar] [CrossRef]
Umesh Gowda, B.M.; Ravindra, H.V.; Ullas, M.; Naveen Prakash, G.V.; Ugrasen, G. Estimation of circularity, cylindricity and surface roughness in drilling Al-Si3N4 metal matrix composites using artificial neural network. Procedia Mater. Sci. 2014, 6, 1780–1787. [Google Scholar] [CrossRef]
Neto, F.C.; Geronimo, T.M.; Cruz, C.E.D.; Aguiar, P.R.; Bianchi, E.E.C. Neural models for predicting hole diameters in drilling processes. Procedia CIRP 2013, 12, 49–54. [Google Scholar] [CrossRef]
Ferreiro, S.; Sierra, B.; Irigoien, I.; Gorritxategi, E. Data mining for quality control: Burr detection in the drilling process. Comput. Ind. Eng. 2011, 60, 801–810. [Google Scholar] [CrossRef]
Lo, S.P. An adaptive-network based fuzzy inference system for prediction of workpiece surface roughness in end milling. J. Mater. Process. Technol. 2003, 142, 665–675. [Google Scholar] [CrossRef]
Zuperl, U.; Cus, F.; Kiker, E. Adaptive network based inference system for estimation of flank wear in end-milling. J. Mater. Process. Technol. 2009, 209, 1504–1511. [Google Scholar]
Azarrang, S.; Baseri, H. Selection of dry drilling parameters for minimal burr size and desired drilling quality. Proc. Inst. Mech. Eng. E 2017, 231, 480–489. [Google Scholar] [CrossRef]
Fang, N.; Srinivasa Pai, P.; Edwards, N. Neural network modeling and prediction of surface roughness in machining aluminum alloys. J. Comput. Commun. 2016, 4, 66460. [Google Scholar] [CrossRef]
El-Mounayri, H.; Briceno, J.F.; Gadallah, M. A new artificial neural network approach to modeling ball-end milling. Int. J. Adv. Manuf. Technol. 2010, 47, 527–534. [Google Scholar] [CrossRef]
Tsai, K.M.; Wang, P.J. Comparisons of neural network models on material removal rate in electrical discharge machining. J. Mater. Process. Technol. 2001, 117, 111–124. [Google Scholar] [CrossRef]
Nalbant, M.; Gokkaya, H.; Toktas, I. Comparison of regression and artificial neural network models for surface roughness prediction with the cutting parameters in CNC turning. Model. Simul. Eng. 2007, 2007, 92717. [Google Scholar] [CrossRef]
Jurkovic, Z.; Cukor, G.; Brezocnik, M.; Brajkovic, T. A comparison of machine learning methods for cutting parameters prediction in high speed turning process. J. Intell. Manuf. 2018, 29, 1683–1693. [Google Scholar] [CrossRef]
Zhang, Z.H.; Yan, D.N.; Ju, J.T.; Han, Y. Prediction of the flow stress of a high alloyed austenitic stainless steel using artificial neural network. Mater. Sci. Forum 2012, 724, 351–354. [Google Scholar] [CrossRef]

Figure 1. The workflow used for the research.

Figure 2. Cutting tool geometry details.

Figure 3. Development of the predicted models.

Figure 4. Flowchart of the procedure followed in the present research.

Figure 5. Experimental values derived from Kistler 9123.

Figure 6. Residuals analyses for F_z and M_z.

Figure 7. Results regarding the MLP model for F_z: (a) MSE for training and testing datasets, (b) MAPE for training and testing datasets.

Figure 8. Results regarding the MLP model for M_z: (a) MSE for the training and testing datasets and (b) MAPE for the training and testing datasets.

Figure 9. Results regarding the RBF-NN model for F_z: (a) MSE for training and testing datasets and (b) MAPE for training and testing datasets.

Figure 10. Results regarding the RBF model for M_z: (a) MSE for training and testing datasets and (b) MAPE for training and testing datasets.

Figure 11. Results regarding the ANFIS model for F_z: (a) MSE for training and testing datasets and (b) MAPE for training and testing datasets.

Figure 12. Results regarding the ANFIS model for M_z: (a) MSE for training and testing datasets and (b) MAPE for training and testing datasets.

Figure 13. Comparison of the experimental values with the predicted values for F_z.

Figure 14. Comparison of the experimental values with the predicted values for M_z.

Table 1. Chemical composition and mechanical properties of Al6082.

Chemical Composition
Elements	Mg	Cu	Cr	Fe	Si	Mn	Others	Al
Percentage	0.8	0.1	0.25	0.5	0.9	0.5	0.3	Balance
Mechanical Properties
Alloy	Temper	Thermal Conductivity		Coefficient of Thermal Expansion	Melting Point		Elastic Modulus	Electrical Resistivity
		W/m·K		K⁻¹	Min Max		GPa	Ω·m
Al6082	T6	180		24 × 10⁻⁶	555 650		70	0.038 × 10⁻⁶

Table 2. Machining factors and their levels.

Factors	Notation	Levels
		I	II	III
Cutting speed (m/min)	V	50	100	150
Feed rate (mm/rev)	f	0.15	0.2	0.25
Tool diameters (mm)	D	8	10	12

Table 3. ANOVA table for F_z (thrust force).

Source	Degree of Freedom	Sum of Squares	Mean Square	f-Value	p-Value
Regression	9	514.673	57.186	471.95	0.000
Residual Error	17	2060	121
Total	26	516.733
R-Sq(adj) = 99.4%
Predictor	Parameter Estimate Coefficient	Standard Error Coefficient		t-Value	p-Value
Constant	−221.9	151.4		−1.47	0.161
D	57.59	23.60		2.44	0.026
V	1.9550	0.5455		3.58	0.002
f	1291.8	798.0		1.62	0.124
D*D	−0.762	1.123		−0.68	0.506
V*V	−0.011660	0.001798		−6.49	0.000
f*f	−667	1798		−0.37	0.715
D*V	0.05600	0.03178		1.76	0.096
D*f	103.25	31.78		3.25	0.005
V*f	−1.203	1.271		−0.95	0.357

Table 4. ANOVA table for M_z (torque).

Source	Degree of Freedom	Sum of Squares	Mean Square	f-Value	p-Value
Regression	9	37.8929	4.2103	171.77	0.000
Residual Error	17	0.0924	0.0245
Total	26	38.3096
R-Sq(adj) = 98.3%
Predictor	Parameter Estimate Coefficient	Standard Error Coefficient		t-Value	p-Value
Constant	12.941	2.154		6.01	0.000
D	−2.6736	0.3357		−7.96	0.000
V	0.013678	0.007758		1.76	0.096
f	−15.58	11.35		−1.37	0.188
D*D	0.14025	0.01598		8.78	0.000
V*V	−0.00005493	0.00002557		−2.15	0.046
f*f	4.67	25.57		0.18	0.857
D*V	−0.0001225	0.0004520		−0.27	0.790
D*f	2.6250	0.4520		5.81	0.000
V*f	−0.02330	0.01808		−1.29	0.215

Table 5. Results regarding the MLP models for thrust force (F_z).

No. of Hidden Neurons	MSEtrain	MSEtest	MAPEtrain (%)	MAPEtest (%)
1	2.35 × 10⁻⁴	6.36 × 10⁻⁴	1.46 × 10⁰	2.93 × 10⁰
2	8.42 × 10⁻⁵	1.05 × 10⁻⁴	9.84 × 10⁻¹	1.14 × 10⁰
3	2.72 × 10⁻⁷	1.53 × 10⁻⁴	5.32 × 10⁻²	1.36 × 10⁰
4	2.20 × 10⁻⁶	6.10 × 10⁻⁵	1.36 × 10⁻¹	8.23 × 10⁻¹
5	1.78 × 10⁻¹⁵	1.31 × 10⁻⁴	2.50 × 10⁻⁶	9.45 × 10⁻¹
6	3.40 × 10⁻²²	4.42 × 10⁻⁴	2.04 × 10⁻⁹	2.22 × 10⁰
7	4.98 × 10⁻²⁴	2.59 × 10⁻⁴	2.12 × 10⁻¹⁰	2.02 × 10⁰
8	5.03 × 10⁻²³	2.60 × 10⁻⁴	6.41 × 10⁻¹⁰	2.06 × 10⁰

Table 6. Results regarding the MLP models for torque (M_z).

No. of Hidden Neurons	MSEtrain	MSEtest	MAPEtrain (%)	MAPEtest (%)
1	7.39 × 10⁻⁴	1.57 × 10⁻³	4.75 × 10⁰	6.86 × 10⁰
2	4.11 × 10⁻⁴	1.08 × 10⁻³	3.53 × 10⁰	3.48 × 10⁰
3	1.14 × 10⁻⁴	9.34 × 10⁻⁴	1.65 × 10⁰	4.15 × 10⁰
4	6.65 × 10⁻⁶	6.66 × 10⁻⁴	3.32 × 10⁻¹	3.20 × 10⁰
5	5.36 × 10⁻⁶	1.14 × 10⁻³	3.59 × 10⁻¹	4.10 × 10⁰
6	5.53 × 10⁻⁷	1.86 × 10⁻⁴	1.30 × 10⁻¹	2.19 × 10⁰
7	6.60 × 10⁻²³	7.81 × 10⁻⁴	6.79 × 10⁻¹⁰	4.36 × 10⁰
8	8.68 × 10⁻²³	1.45 × 10⁻³	1.36 × 10⁻⁹	6.01 × 10⁰

Table 7. Results regarding the RBF-NN models for thrust force (F_z).

No. of Hidden Neurons	MSEtrain	MSEtest	MAPEtrain (%)	MAPEtest (%)
1	9.92 × 10⁻³	2.01 × 10⁻²	1.08 × 10⁻¹	1.94 × 10¹
2	1.20 × 10⁻³	6.62 × 10⁻⁴	4.09 × 10⁰	2.70 × 10⁰
3	1.59 × 10⁻⁴	2.78 × 10⁻⁴	1.34 × 10⁰	1.98 × 10⁰
4	1.30 × 10⁻⁴	1.39 × 10⁻⁴	1.21 × 10⁰	1.51 × 10⁰
5	9.99 × 10⁻⁵	1.60 × 10⁻⁴	1.12 × 10⁰	1.52 × 10⁰
6	9.07 × 10⁻⁵	1.65 × 10⁻⁴	1.02 × 10⁰	1.45 × 10⁰
7	8.19 × 10⁻⁵	1.14 × 10⁻⁴	9.42 × 10⁻¹	1.13 × 10⁰
8	7.67 × 10⁻⁵	1.32 × 10⁻⁴	9.29 × 10⁻¹	1.35 × 10⁰

Table 8. Results regarding the RBF-NN models for torque (M_z).

No of Hidden Neurons	MSEtrain	MSEtest	MAPEtrain (%)	MAPEtest (%)
1	3.31 × 10⁻²	5.94 × 10⁻²	3.03 × 10¹	3.60 × 10¹
2	5.60 × 10⁻³	1.26 × 10⁻²	1.37 × 10¹	2.13 × 10¹
3	4.20 × 10⁻³	1.05 × 10⁻²	1.14 × 10¹	1.98 × 10¹
4	2.74 × 10⁻³	1.05 × 10⁻²	9.51 × 10⁰	2.04 × 10¹
5	2.65 × 10⁻³	1.23 × 10⁻²	9.52 × 10⁰	2.38 × 10¹
6	2.37 × 10⁻³	7.83 × 10⁻³	9.20 × 10⁰	1.83 × 10¹
7	6.49 × 10⁻⁴	1.24 × 10⁻³	4.63 × 10⁰	6.27 × 10⁰
8	4.82 × 10⁻⁴	8.73 × 10⁻⁴	3.75 × 10⁰	5.81 × 10⁰

Table 9. Results regarding the ANFIS models for thrust force (F_z).

No of Clusters	MSEtrain	MSEtest	MAPEtrain (%)	MAPEtest (%)
2	4.94 × 10⁻⁵	3.68 × 10⁻⁴	8.03 × 10⁻¹	2.49 × 10⁰
3	1.37 × 10⁻⁵	4.47 × 10⁻⁴	3.60 × 10⁻¹	2.74 × 10⁰
4	2.80 × 10⁻⁶	1.20 × 10⁻⁴	1.40 × 10⁻¹	1.07 × 10⁰

Table 10. Results regarding the ANFIS models for torque (M_z).

No of Clusters	MSEtrain	MSEtest	MAPEtrain (%)	MAPEtest (%)
2	5.39 × 10⁻⁴	1.49 × 10⁻³	3.66 × 10⁰	8.38 × 10⁰
3	2.75 × 10⁻⁵	2.52 × 10⁻³	8.22 × 10⁻¹	1.03 × 10¹
4	5.90 × 10⁻⁶	2.63 × 10⁻³	3.16 × 10⁻¹	7.78 × 10⁰

Table 11. Comparison of error values for all models.

Type of model	MAPEtotal (F_z)	MAPEtotal (M_z)
Multiple Regression	0.86305%	3.0495%
MLP	0.2979%	0.9667%
RBF-NN	0.978%	4.129%
ANFIS	0.312%	1.698%

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Karkalos, N.E.; Efkolidis, N.; Kyratsis, P.; Markopoulos, A.P. A Comparative Study between Regression and Neural Networks for Modeling Al6082-T6 Alloy Drilling. Machines 2019, 7, 13. https://doi.org/10.3390/machines7010013

AMA Style

Karkalos NE, Efkolidis N, Kyratsis P, Markopoulos AP. A Comparative Study between Regression and Neural Networks for Modeling Al6082-T6 Alloy Drilling. Machines. 2019; 7(1):13. https://doi.org/10.3390/machines7010013

Chicago/Turabian Style

Karkalos, Nikolaos E., Nikolaos Efkolidis, Panagiotis Kyratsis, and Angelos P. Markopoulos. 2019. "A Comparative Study between Regression and Neural Networks for Modeling Al6082-T6 Alloy Drilling" Machines 7, no. 1: 13. https://doi.org/10.3390/machines7010013

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Comparative Study between Regression and Neural Networks for Modeling Al6082-T6 Alloy Drilling

Abstract

1. Introduction

2. Artificial Neural Networks

3. Methods and Materials

3.1. Technics of Experiments

3.2. Methodology of Results Analysis

4. Results and Discussion

4.1. Result of Experiment

4.2. Multiple Regression Models

4.3. Multi-Layer Perceptron Models

4.4. Radial Basis Function Neural Network Models

4.5. Adaptive Neuro-Fuzzy Inference System Models

4.6. Comparison of Regression and Neural Network Models

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI