Next Article in Journal
Special Issue: “Polymer-Based Biomaterials and Tissue Engineering”
Previous Article in Journal
Optimising Two-Stage Vacuum Heat Treatment for a High-Strength Micro-Alloyed Steel in Railway Spring Clip Application: Impact on Microstructure and Mechanical Performance
Previous Article in Special Issue
Highly Robust Ti Adhesion Layer during Terminal Reaction in Micro-Bumps
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Coefficient Extraction of SAC305 Solder Constitutive Equations Using Equation-Informed Neural Networks

1
Department of Mechanical and Computer-Aided Engineering, Feng Chia University, Taichung 40724, Taiwan
2
Department of Power Mechanical Engineering, National Tsing Hua University, Hsinchu City 30013, Taiwan
3
College of Semiconductor Research, National Tsing Hua University, Hsinchu City 30013, Taiwan
*
Author to whom correspondence should be addressed.
Materials 2023, 16(14), 4922; https://doi.org/10.3390/ma16144922
Submission received: 28 May 2023 / Revised: 30 June 2023 / Accepted: 5 July 2023 / Published: 10 July 2023
(This article belongs to the Special Issue Simulation and Reliability Assessment of Advanced Packaging)

Abstract

:
Equation-Informed Neural Networks (EINNs) are developed as an efficient method for extracting the coefficients of constitutive equations. Subsequently, numerical Bayesian Inference (BI) iterations were applied to estimate the distribution of these coefficients, thereby further refining them. We could generate coefficients optimally aligned with the targeted application scenario by carefully adjusting pre-processing mapping parameters and identifying dataset preferences. Leveraging graphical representation techniques, the EINNs formulation is implemented in temperature- and strain-rate-dependent hyperbolic Garofalo, Anand, and Chaboche constitutive models to extract the corresponding coefficients for lead-free SAC305 solder material. The performance of the EINNs-based extracted coefficients, obtained from experimental results of SAC305 solder material, is comparable to existing studies. The methodology offers the dual advantage of providing the coefficients’ value and distribution against the training dataset.

1. Introduction

Proper material constitutive models and related coefficients are fundamental for reliable finite element predictions, encompassing the performance prediction model [1], the manufacturing process [2,3], and the reliability prediction models. Non-linear material properties, based on the temperature- and strain-rate-dependent material models, are often necessary for modeling critical sections of electronic packaging [4,5] and further influence the accuracy and predictability of the surrogate AI models [6,7,8].
Solder, a key component in electronic packaging, is often associated with potential fatigue failures. Wilde et al. conducted a study on the rate-dependent constitutive relationship of Pb-rich material [9], resulting in extracting Anand-based coefficients and identifying kinematic hardening, also known as the Bauschinger effect. To gain a better understanding of the creep characteristics of Pb-free solders, Xiao and Armstrong [10] performed tensile tests on both eutectic PbSn and Sn3.9Ag0.6Cu solder. Their findings revealed substantial microstructural alterations in the Sn3.9Ag0.6Cu with significantly lower absolute creep rates than the PbSn eutectic. The creep measurement data were successfully fitted into the Garofalo model [11], and the corresponding Garofalo coefficient was extracted.
Furthermore, Motalab et al. [12,13] conducted creep tests under meticulous control of the microstructure of the SAC305 solder without an oxidized surface, yielding a set of nine parameters for the Anand model. Basit et al. [14] utilized the Anand constitutive model with the extracted coefficients for solder joint lifetime prediction. The Chaboche material model [15], which considers the Bauschinger effect, was applied by Xie and Chen [16], Deshpande et al. [17], Wang et al. [5], and Yan et al. [18] for life prediction using Manson–Coffin type equations.
Ma and Suhling reviewed the constitutive equation and the corresponding coefficients of lead-free solder joint [19], and significant coefficient discrepancies have been reported. On the other hand, finite element engineers frequently face difficulties in selecting an appropriate material model and its parameters, as the measurement conditions may differ from those in practical applications. Kuczynska et al. [20] performed mechanical/dynamic tests against the solder joint to verify the ability of these material models and their coefficients to map the lifetime differences depending on the temperature rate under field and testing conditions, as well as on the mean operating temperature.
Considering the many application scenarios, which may range from low to high temperatures and strain rates, an emerging trend encourages users to obtain their own material coefficients [5]. This approach emphasizes the importance of tailoring the coefficients to the specific conditions encountered in each unique application. The least squares method and its derivatives are frequently employed in extracting coefficients. Although this approach is well established, matrix multiplication and inversion may diminish computational efficiency when handling extensive datasets. Moreover, the obtained coefficients based on the least square method are sensitive to the outliers, and this method does not apply to the censored data.
Historically, machine learning has harbored a certain resistance to rule-based inference. However, the efficacy of neural networks in symbolic computation is gaining recognition by integrating symbolic reasoning with continuous representations. Pioneers such as Zaremba et al. [21] and Allamanis [22] have explored the application of neural networks in handling mathematical objects. A significant advancement came from Lample and Charton [23] when they proposed a representation for mathematical expressions. Sharma et al., Chhabra et al., and Yadav et al. have applied neural network method for material optimization studies [24,25,26]. As a result, the theoretical basis for representing mathematical and symbolic equations using neural networks is well-established.
In this research, we have developed the Equation-Informed Neural Networks (EINNs) method, synergistically incorporating the Bayesian Inference (BI) iteration technique to extract the coefficients of constitutive equations from measurement data. As visualized in Figure 1, the foundational concept of EINNs begins with constructing an artificial neural network to embody the constitutive equation f , where coefficient α k is designated as their respective weights. Subsequently, EINNs deploy a pre-processing mapping technique on the input/output data pairs which are obtained from the experiments, a strategy rooted in neural network learning theory, and enables exploring coefficient fitting across various domains in-depth.
This neural network can be incrementally trained using input and output data pairs, facilitating the simultaneous approximation of coefficient α k . Theoretically, the steepest descent algorithm of the neural network backpropagation bolsters the computation efficiency and fosters the selective learning of data pairs. The final coefficients are obtained by the post-processing conversion. Utilizing the coefficients obtained by EINNs as initial values, Bayesian Inference (BI) is applied to obtain the distribution of the coefficients against the training datasets and further enhance the accuracy of coefficient extraction.
This paper is organized as follows: the “Theory” section provides an introduction to the framework of Equation-Informed Neural Networks (EINNs) and the numerical Bayesian Inference (BI) method. The subsequent section, “EINN Formulation”, presents the conversion process of constitutive equations from their conventional mathematical forms to their EINN equivalents, complete with pre-processing mapping and post-processing functions. In the “Applications” section, we apply the EINN formulation to the coefficient extraction of the material constitutive equation pertinent to Pb-free SAC305 solder joints. Detailed discussions and numerical results pertaining to the EINN formulations of the Chaboche, hyperbolic Garofalo, and Anand material models are also included. The paper concludes with a concise summary of our findings.

2. Theory

2.1. The Framework of Equation-Informed Neural Networks (EINNs)

Assume a constitutive equation is given by the function:
y i = f ( x j ; α k ) ,
where y i , x j , and α k are vectors in real space with dimensions i , j , and k , respectively. The x j and y i represent the input and output of the functions, while α k refers to the coefficients. Design pre-processing mapping functions:
M x x j = X j   and   M y y i = Y i ,
which serve to effectively modify the domains of x j and y i to optimize the precision of coefficient extraction. Consequently, a new function can be formulated as Y i = F ( X j , A k ) . Meanwhile, the corresponding neural network representations of Y i are formulated, and the coefficient A k is assigned as the weighting.
The learning process of the neural network involves continuous adjustment of these weights or coefficients. These adjustments can be computed for each known data pair using steepest-descent-based backpropagation as A k n e w = A k n e w η Δ y y i A k . Since these updates are independent of each data pair, the computationally expensive matrix multiplication and inversion inherent in the least squares-based approaches can be avoided. Furthermore, incorporating ratios into the adjustments allows for user emphasis on specific data pairs. This can be implemented as A k n e w = A k n e w η   l r l · Δ y y i A k ( l )   , where l is the coefficient adjustment from each data pair and l r l = 1 .
Following several learning iterations with satisfactory accuracy, the coefficient A k of the constitutive equation can be obtained. However, due to the pre-processing mapping function (2) being applied, counteractions are required to reverse its effect. Therefore, we define the post-processing conversion functions as follows.
a k = g k x j , y i , A k .
Through the combined application of pre-processing mapping functions and post-processing conversion of coefficients, the EINN framework gains an additional degree of freedom, bolstering the accuracy of coefficient extraction. Additionally, the steepest descent method offers a unique opportunity to prioritize specific data pairs while maintaining high computational efficiency.

2.2. The Numerical Bayesian Inference (BI) Iteration

We define the mean square error (MSE) function of Equation (1) with respect to the coefficients α k , as
ϵ α k = l y t , i ( l ) y i x j ( l ) ; α k 2 ,
where x j l and y t , i l denote the input and ground truth of the l -th datapair, respectively.
Assume that the distribution of the data pairs y t , i l and x j l are normal, and so is the error function ϵ α k , denoted as ϵ α k ~ N ( μ , τ ) . Because the parameter τ cannot be negative, we assume it follows the gamma distribution, so that τ ~ G ( a 0 , b 0 ) , where a 0 and b 0 are the gamma distribution parameters of τ . Moreover, assume that all the coefficients follow the normal distribution, say α k ~ N μ k , τ k , and μ k and τ k are the average and precision, respectively. The posterior distribution after the BI remains normal distribution. In practice, we set μ k equal to α k .
Consequently, the probabilities of the coefficient τ and α k can be derived as P τ = b 0 a 0 τ a 0 1 e b 0 τ Γ a 0 and P α k = 2 π 1 2 τ k 1 2 e 1 2 τ k α k μ k 2 , respectively. The likelihood with respect to coefficient τ and α k is L α k , τ P d a t a α k , τ = l = 1 L 1 2 π τ n 2 e y t , i l y i l 2 2 σ 2 = 2 π n 2 τ n 2 e τ 2 ϵ α k [27].
The posterior of the τ distribution can be updated by the gamma–gamma conjugate:
a 0 n e w = a 0 o l d + n 2   and   b 0 n e w = b 0 o l d + 1 2 ϵ ,
As Equation (1) is not always a linear function, the posterior of coefficient α k cannot always be computed by conjugate. Therefore, under the assumption that the value of Δ α k is relatively small, a numerical integration approach is applied:
0 L ( α k ) · P α k d α k   ~ n = 1 n = N L ( α k 0 + n · Δ α k ) · P α k 0 + n · Δ α k · Δ α k ,
where α k 0 is the minimal value of α k and n is the number of the equal split between the assigned maximum and minimum α k with a total of N splits. The posterior can then be obtained using normal distribution approximation.
We employ the Markov Chain Monte Carlo (MCMC) method to compute large hierarchical models requiring integration over many parameters. By applying the Gibbs sampling, the τ distribution parameters a 0 and b 0 are first updated through the conjugate (Equation (5)), and a new τ value will be sampled from the gamma distribution. Each α k will be updated sequenently, and the new value will be accepted. Following thousands of iterations, every α k exhibits a normal distribution. The mean value of this distribution is computed and assigned as the updated value for α k .

3. EINN Formulation

This section outlines the development of Equation-Informed Neural Network (EINN) formulations for the hyperbolic Garofalo, nine-parameter Anand, and Chaboche models, including pre-processing mapping and post-processing coefficient functions.

3.1. Hyperbolic Garofalo Model

The conventional hyperbolic Garofalo constitutive equation can be written as:
ε p ˙ = C 1 · sinh C 2 σ C 3 · e Q R T ,
where ε p ˙ , σ , Q , R , and T represent the plastic strain rate, stress, activation energy, gas constant, and temperature, respectively. C 1 , C 2 , and C 3 are the coefficients that need to be extracted from the experimental data.
We introduce e = ε p ˙ · e Q R T and accumulate the data pairs of { e } and { σ } from the experimental results. In order to proportional convert the original data to the [ a , b + a ] domain, the pre-processing matching functions are defined as follows:
M x σ = σ σ m Δ σ b + a = x   and M y e = e e m Δ e b + a = y ,
where σ m and Δ σ represent the minimal and maximum different values of set { σ } , and e and Δ e correspond to set { e } . Parameters a and b are parts of pre-processing mapping, and a = 0.001 and b = 1 are assigned for this case. The values after the pre-processing are defined as x and y , respectively. Subsequently, a new function can be derived as:
y = C sinh A x n ,
The corresponding neural network can be defined in Figure 2. The definition of the neurons is given in Table 1.
Accordingly, the post-processing conversion of the coefficients can be approximated as C 1 = C Δ e · r 2 n , C 2 = A · b / Δ σ , and C 3 = n , where r 2 = A b Δ σ A a .

3.2. Anand Model

Anand et al. [28] proposed a set of viscoplastic constitutive equations for the rate-dependent deformation of metals. Recently, the Anand model has been extensively applied to microelectronic solders exhibiting large viscoplastic deformations. In addition to the activation energy, there are eight coefficients in the Anand model. A two-step approach is commonly employed to extract these eight coefficients [9,12,13].
The governing equation for the first step of the Anand model, including the ultimate tensile stress ( σ * ), plastic strain rate ( ε ˙ p ), activation energy ( Q ), and temperature ( T ), is expressed in Equation (10). s ^ , ξ , A , n , and m are the coefficients that need to be extracted.
σ * = s ^ ξ ε ˙ p A · e Q R T n sinh 1 ε ˙ p A e Q R T m ,
Utilizing the same method as in the previous section, we assume e 0 = ε ˙ p · e Q R T . Since the value of the strain rate is relatively small compared to other input parameters, a scaling factor R is applied, such that e = e 0 R . For consistency within this paper, the same activation function as in the previous section is assumed. The data pair of { e } and { σ * } is collected from the Motalab et al. [12,13]. An additional y and x are introduced to represent the output and input parameters, and the pre-processing mapping functions are defined as
x = e e m Δ e b e + a e   and   y = σ * σ m * Δ σ * b σ + a σ ,
where e m and Δ e are the minimal and maximum difference among set {e}, and so are σ m * and Δ σ * in { σ * } . a e , b e , a σ , and b σ are the mapping coefficients. By defining β = s ^ ξ , the new function can be written as
y = β * x A * n * sinh 1 x A * m * ,
Based on Equation (12), the EINN representation can be formulated as Figure 3. This network’s definitions are listed in Table 2.
By defining r = σ m * a σ Δ σ * b σ , the post-processing of the coefficients can be written as follows:
1 A = 1 A * b b e Δ e + a e a v g y · 1 R ,   n = n * ,   m = m *   and   β = Δ σ b σ · β * + r β * · a v g x = s ^ ξ ,
where a v g ( x ) and a v g ( y ) are the averges of { e } and { σ * } .
The governing equation of the second step of the Anand model is listed in (14), and s 0 , a , and h 0 are the three remaining coefficients. The parameter c is defined in (15), and ξ is defined as the smallest positive real number to keep c < 1 .
σ = σ * σ * c s 0 1 a + a 1 c h 0 σ * a ε p 1 / ( 1 a ) ,
c = 1 ξ sinh 1 ε p ˙ A e Q R T m   ,
We assume that x = sinh 1 ε ˙ A e Q R T m , y = ( σ * σ ) , l = σ * , and z = ε p , and the pre-processing mapping functions are defined as
y = y y m Δ y b y + a y ,   l = l l m Δ l b l + a l ,   x = x x m Δ x b x + a x and   z = z z m Δ z b z + a z ,
By assuming 1 a = a , the new function can be written as
y = l ¯ + s 0 ξ * · x a * a * h 0 ξ * · y ¯ l a * 1 · z ¯ 1 / a * ,
Based on Equation (17), the EINN representation can be formulated as Figure 4. This network’s definitions are listed in Table 3. Moreover, the post-processing of coefficients can be derived as
a * = a ,   s 0 = r y r l · ( s 0 ) ,   h 0 = r y r l · r z ( h 0 ) ,
where r l = b l Δ l , r x = b x Δ x , r y = b y Δ y , and r z = b z Δ z .

3.3. Chaboche Model

The Chaboche model [15,29] is often applied for presenting the metallic material with the Bauschinger effect under cyclic loading. The original function can be written as
α = C γ 1 e γ · ε p + σ 0
where α and ε p are the back tensile stress and the plastic strain. σ 0 is the initial yielding stress, and C and γ are the fitting coefficients. To simplify the equation, we substitute and C / γ as β . Let x = ε p , y = α , as the parameters, with the pre-processing mapping functions:
x = ε p ε p , m Δ ε p   and   y = α α m Δ α ,
where ε p , m and Δ ε p are the minimal and maximum differences among set { ε p }, and so are α m and Δ α in { α } , and s = σ 0 . Hence, the new function can be re-written as
y = β * 1 e γ * · x + s ,
with the EINN formulation shown in Figure 5 and the neuron definition listed in Table 4.
Furthermore, the post-processing of coefficients can be derived as
σ 0 = x m + s * · Δ α ,   γ = γ * Δ ε p ,   and   C = Δ α Δ ε p · ( β * · γ * )

4. Applications

Building on the EINN formulation and Bayesian Inference (BI) iteration described in the preceding section, this chapter discusses the extraction of coefficients from the hyperbolic Garofalo, nine-parameter Anand, and Chaboche models for the SAC305 solder material.

4.1. Hyperbolic Garofalo Model

The experimental dataset is drawn from Xiao and Armstrong [10]. To determine the coefficient C in Equation (9), we employ a grid search combined with a bisection optimization technique, whereas the EINN structure for coefficients A and n is addressed using standard backpropagation. To emphasize coefficient extraction for low temperatures (both 318 and 353 K) and low strain rates, ratios are assigned to the datapairs, as shown in Table 5. Table 5 also lists the input (plastic strain) and output (stress) of the EINN learning. Utilizing the post-processing conversion formula, the EINN coefficients, C 1 , C 2 , and C 3 , are obtained and presented in the middle column of Table 6. The hyperbolic model, when compared to the experimental data, is depicted in Figure 6. The data at 388 K exhibits a more significant difference than the others, primarily due to the ratio setting outlined in Table 5.
A total of 1000 Bayesian Inference interactions were performed to obtain the distribution of the extracted coefficients. The distributions are displayed in Figure 7, represented as the ratio of each coefficient value to the average, and are expanded by the precision τ of the error function. As denoted by the dashed lines in Figure 7, which signify a 5 % difference, stable distributions of coefficients C 2 and C 3 are observed, while the large variation in C 1 is attributed to the ratio setting, which induces a higher discrepancy among the 388 K data.
The coefficient extraction of the hyperbolic Garofalo constitutive equation highlights the flexibility of the EINN framework, as it allows for assigning ratios to data pairs to prioritize specific data. The fitting accuracy of the EINN results demonstrates a significant improvement compared to the original reports [10] as indicated by the mean square error (MSE) of Table 6, followed by Equation (4). Although the distribution of the C 1 coefficients demonstrate a small fraction of the outliers from BI integration as Figure 7, both C 2 and C 3 show statistical difference within ± 5 % difference. Over 1000 iterations, only 58 instances of C 1 shows more than ± 5 % difference of the average value. Consequently, a robust set of coefficients for the hyperbolic Garofaolo constitutive model is achieved.

4.2. Anand Model

In this section, the Anand constitutive model coefficients extraction is implemented for the lead-free SAC305 solder. The same activation energy as in the previous section is applied for the sake of research consistency. To extract the remaining eight coefficients of the Anand constitutive model, the first step involves utilizing temperature and strain rate-dependent ultimate tensile stresses to determine the initial four coefficients. Subsequently, the second step defines the remaining parameters based on temperature and strain rate-dependent stress and plastic strain.
The experimental data are sourced from Motalab et al. [12]. The EINN formulation, following Equation (12), is applied with the pre-processing mapping coefficients a e , b e , a σ , and b σ (Equation (13)) which are 0.8, 0.15, 0.9, and 0.1. It is vital to note that the selection of these mapping coefficients depends on the numerical characteristics of the dataset, and it is essential for preventing numerical errors during the backpropagation-based machine learning of the EINN formulation.
During the learning phase of the EINN formulation, the coefficients n * , m * , A * , and β of Equation (12) and Figure 3 are constrained to be positive. A grid search technique is employed to identify optimal initial values concerning the experimental data.
Furthermore, learning ratios are implemented to emphasize the learning preference for low strain rates and temperatures close to the working temperature of electronic components. After hundreds of iterations, the EINN coefficients are reported in Table 7. The MSE values indicate that the coefficients obtained from the EINN formulation exhibit similar accuracy to those obtained using conventional methods. The obtained step 1 Anand model is plotted in Figure 8.
The EINN formulation coefficients serve as initial inputs for Bayesian Inference (BI) to analyze the statistical distribution of the coefficients. Figure 9 illustrates the distribution of the coefficients, with dashed lines indicating differences within ± 5 % . Due to its low value, the coefficient n was not examined. Both coefficients A and β exhibit distribution within ± 5 % difference. Out of 1500 values, only 61 cases of coefficient m exceed ± 5 % difference, which can be attributed to the preference settings during the EINN learning process. The average coefficients obtained from BI are presented in the last column of Table 7 and are utilized for the subsequent coefficient extraction step in the Anand model.
The temperature and strain rate dependent stress–strain curves are obtained from Motalab [12]. The EINN formulation of the step 2 Anand model, as indicated in Equation (17) and Figure 4, is applied with the pre-processing mapping parameters shown in Table 8, based on Equation (16), while in the EINN learning procedure, the values of s 0 and h 0 are forced to be positive. A grid search technique is applied to define the optimal initial coefficients. The learning ratios are implemented to emphasize the learning preference for low strain rates and temperatures close to the working temperature of electronic components, following the coefficient extraction strategy of Motalab et al. [12]. With Equation (18), the optimized coefficients can be obtained, as listed in Table 9, and the stress–strain curves at different strain rates from the Anand model are plotted against the experiment [12], as shown in Figure 10.
The dataset with high preference is applied to the BI iteration to mitigate the large coefficient shifting. Figure 11 plots the MSEs of EINNs and EINNs with BI against the Anand coefficient obtained by Motalab et al. [12], under different temperatures and strain rates. By adjusting the ratio of EINN network learning, the coefficient extraction can be fine-tuned to perform better in the room to the working temperature at a low strain rate, as indicated in Figure 11.

4.3. Chaboche Model

To study the lifetime of the ball-grid-array-type of advanced electronic packaging, the Chaboche material model is often applied [5,8]. The Chaboche model and its coefficients can be extracted from the temperature-dependent stress–strain curves by a given strain rate. Unlike the previous sectors, this section investigates the extraction of Chaboche coefficients from the Anand model.
The Anand coefficients from Table 6 and Table 7, adjusted via Bayesian Inference (BI), are utilized to generate inputs for the Chaboche model. A strain rate of 10 5 (1/s) is maintained, given that the Anand coefficients have been optimized for lower strain rates, as demonstrated in the previous section. Stress–strain curves can be generated by the Anand model (as Equations (14) and (15)) for each temperature point, including −40 °C, −20 °C, 40 °C, 80 °C, and 122 °C.
The temperature-dependent stress–strain data serve as the training datasets. With the pre-processing mapping established by Equation (20), we apply the EINN formulation for the Chaboche model as Equation (21). Following this, the steepest-descent coefficient optimization is applied to the EINN formulation (as illustrated in Figure 5) with the neural definitions outlined in Table 4. The post-processing of the coefficients Equation (22) allows for the acquisition of Chaboche coefficients at various temperatures. The resultant data are documented in Table 10, with the mean square errors (MSE) compared to the input dataset.
The coefficients derived from the EINN formulation are subsequently incorporated into Bayesian Inference (BI) iterations for the temperature-dependent Chaboche model. Figure 12 delineates the distribution of coefficients σ 0 , C , and γ across different temperatures, magnified by the precision τ of the error function. The vertical axes in this figure represent the ratio of the coefficient value obtained at each BI iteration to the averaged value. Table 11 contains the averaged coefficient post-BI.
While variations in all coefficients lie within a ± 5 % difference, a larger variety, coupled with a lower MSE, as listed in Table 10 and Table 11, is evident at higher temperatures. This suggests a reduced coefficient sensitivity at these elevated temperatures. By introducing Young’s modulus obtained by linear extrapolation from the experiment [12], the temperature-dependent stress–strain curves are plotted in Figure 13.

5. Conclusions

In this study, we developed the concept of Equation-Informed Neural Networks (EINNs) as an efficient method for extracting the coefficients of constitutive equations. Subsequently, the MCMC with numerical Bayesian Inference (BI) iterations was applied to estimate the distribution of these coefficients, thereby further refining them.
The EINN formulation was derived by leveraging graphical representation techniques to convert the mathematical form of constitutive equations into an equivalent EINN format. By carefully adjusting pre-processing mapping parameters and identifying dataset preferences, we could generate coefficients optimally aligned with the targeted application scenario.
The EINN formulation has been successfully applied to the hyperbolic Garofalo, Anand, and Chaboche constitutive models. This paper details the EINN formulation with its neural network format, the definition of each neuron, the appropriate pre-processing techniques, and the post-processing of the coefficients.
The extraction of coefficients for the hyperbolic Garofalo and Anand models was conducted using experimental results from lead-free SAC305 solder material studies by Xiao and Armstrong [10] and Motalab et al. [12,13]. Our report includes the employed pre-processing mapping techniques and parameters. With the dataset preference, the constitutive equations with extracted coefficients performed better in the interested zone.
Comparisons with coefficients of the constitutive equations from the aforementioned studies demonstrated that those extracted from the EINN formulation were alike. Importantly, the mean square error (MSE) of the EINN formulation learning was comparable to those from the literature [10,12,13]. The performance of the MSE depends on many factors, such as the prescription capability of the material model and experimental measurement accuracy. In this research, the MES is applied as a comparison of how the coefficients extracted by the EINNs perform to the ones obtained by the original methods.
Moreover, the MCMC with numerical Bayesian Inference (BI) iteration technique was employed to analyze the robustness of the extracted coefficients against the experiment data, as shown in Figure 7, Figure 9 and Figure 12. A slightly higher variation was observed when the dataset preference was applied to the EINN learning. Nevertheless, the coefficients derived from EINNs remained within a ±5% confidence interval.
In conclusion, the combined use of EINNs with BI provides a powerful tool for extracting coefficients from temperature- and strain-rate-dependent constitutive equations with dataset preference. This is under the assumption that the SAC305 solder material characteristics can be described by the material model and that the experimental measurement is accurate enough. This approach provides the coefficients’ value and the distribution of coefficients against the training dataset.
This study’s potential limitations may include the dataset preference assumption, which may not universally apply across all scenarios. Additionally, the applicability of the EINN formulation to all forms of constitutive equations remains to be fully determined, necessitating further exploration of potential limitations. Moreover, advanced neural network backpropagation methods, such as Levenberg–Marquardt (LM) algorithm, will be applied to EINN frameworks.

Author Contributions

Conceptualization, methodology, software, formal analysis, investigation, data curation, and writing—original draft preparation, C.Y.; Formula checking, writing—review and editing, and discussion during research, Q.S.; Problem identification, concept initialization, writing—review and editing, discussion during research, and supervision, K.-N.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research is partially supported by the project of National Tsing Hua University, sponsored by the Ministry of Science and Technology, under the grand no. MOST109-2221-E-007-001-MY3, the “Research on the system reliability modeling of the high power LED packaging based by the online machine learning methods” project of Feng Chia University, sponsored by the National Science and Technology Council under the grand no. MOST 111-2221-E-035-043, and the “AI-Assisted Design-on-Simulation Technology for Advanced Packaging” project of National Tsing Hua University, sponsored by the Semiconductor Research Cooperation (SRC) under contract no. 2023-PK-3180.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this research study are available in this article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Liu, D.S.; Chao, Y.C.; Wang, C.H. Study of wire bonding looping formation in the electronic packaging process using the three-dimensional finite element method. Finite Elem. Anal. Des. 2004, 40, 263–286. [Google Scholar] [CrossRef]
  2. Yuan, C.C.A.; Chang, H.M.; Chiang, K.N. Investigation of the mechanical characteristics of the Cu/low-k BEOL under wire bonding process loading. J. Mech. 2022, 38, 539–551. [Google Scholar] [CrossRef]
  3. Cheng, H.-C.; Wu, Z.-D.; Liu, Y.-C. Viscoelastic Warpage Modeling of Fan-out Wafer-Level Packaging during Wafer-Level Mold Cure Process. IEEE Trans. Compon. Packag. Manuf. Technol. 2020, 10, 1240–1250. [Google Scholar] [CrossRef]
  4. Qiu, B.; Xiong, J.; Wang, H.; Zhou, S.; Yang, X.; Lin, Z.; Liu, M.; Cai, N. Survey on Fatigue Life Prediction of BGA Solder Joints. Electronics 2022, 11, 542. [Google Scholar] [CrossRef]
  5. Wang, P.H.; Lee, Y.C.; Lee, C.K.; Chang, H.H.; Chiang, K.N. Solder Joint Reliability Assessment and Pad Size Studies of FO-WLP with Glass Substrate. IEEE Trans. Device Mater. Reliab. 2021, 21, 96–101. [Google Scholar] [CrossRef]
  6. Tauscher, M.; Merk, T.; Adsule, A.; Linnemann, A.; Wilde, J. Surrogate Modeling for Creep Strain-Based Fatigue Prediction of a Ball Grid Array Component. J. Electron. Packag. 2024, 146, 011003. [Google Scholar] [CrossRef]
  7. Yuan, C.C.A.; Fan, J.; Fan, X. Deep machine learning of the spectral power distribution of the LED system with multiple degradation mechanisms. J. Mech. 2021, 37, 172–183. [Google Scholar] [CrossRef]
  8. Hsiao, H.Y.; Chiang, K.N. AI-assisted reliability life prediction model for wafer-level packaging using the random forest method. J. Mech. 2020, 37, 28–36. [Google Scholar] [CrossRef]
  9. Wilde, J.; Becker, K.; Thoben, M.; Blum, W.; Jupitz, T.; Wang, G.; Cheng, Z.N. Rate dependent constitutive relations based on Anand model for 92.5 Pb5Sn2. 5Ag solder. IEEE Trans. Adv. Packag. 2000, 23, 404–414. [Google Scholar] [CrossRef]
  10. Xiao, Q.; Armstrong, W.D. Tensile creep and microstructural characterization of bulk Sn3.9Ag0.6Cu lead-free solder. J. Electron. Mater. 2005, 34, 196–211. [Google Scholar] [CrossRef]
  11. Garofalo, F.A. Fundamentals of Creep and Creep-Rupture in Metals; MacMillian: New York, NY, USA, 1965. [Google Scholar]
  12. Motalab, M.; Cai, Z.; Suhling, J.C.; Lall, P. Determination of Anand constants for SAC solders using stress-strain or creep data. In Proceedings of the 13th InterSociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems (ITHERM), San Diego, CA, USA, 30 May–1 June 2012. [Google Scholar]
  13. Motalab, M.; Mustafa, M.; Suhling, J.C.; Zhang, J.; Evans, J.; Bozack, M.J.; Lall, P. Thermal Cycling Reliability Predictions for PBGA Assemblies that Include Aging Effects. In Proceedings of the ASME 2013 International Technical Conference and Exhibition on Packaging and Integration of Electronic and Photonic Microsystems (InterPACK2013), Burlingame, CA, USA, 16–18 July 2013. [Google Scholar]
  14. Basit, M.M.; Motalab, M.; Suhling, J.C.; Hai, Z.; Evans, J.; Bozack, M.J.; Lall, P. Thermal cycling reliability of aged PBGA assemblies—Comparison of Weibull failure data and finite element model predictions. In Proceedings of the 2015 IEEE 65th Electronic Components and Technology Conference (ECTC), San Diego, CA, USA, 26–29 May 2015; pp. 106–117. [Google Scholar]
  15. Chaboche, J.L. Constitutive equations for cyclic plasticity and cyclic viscoplasticity. Int. J. Plast. 1989, 5, 247–302. [Google Scholar] [CrossRef]
  16. Xie, M.; Chen, G. A developed crystal plasticity model for viscoplastic mechanical behavior of SAC305 solder under thermomechanical coupled cyclic loading. Int. J. Plast. 2022, 159, 103465. [Google Scholar] [CrossRef]
  17. Deshpande, A.; Jiang, Q.; Dasgupta, A.; Becker, U. Role of Nominal Stress State on Cyclic Fatigue Durability of SAC305 Grain-Scale Solder Joints. J. Electron. Packag. 2022, 144, 031006. [Google Scholar] [CrossRef]
  18. Yan, L.; Yao, J.; Dai, Y.; Zhang, S.; Bai, W.; Gao, K.; Yang, H.; Wang, Y. Study of Thermal Stress Fluctuations at the Die-Attach Solder Interface Using the Finite Element Method. Electronics 2021, 11, 62. [Google Scholar] [CrossRef]
  19. Ma, H.; Suhling, J.C. A review of mechanical properties of lead-free solders for electronic packaging. J. Mater. Sci. 2009, 44, 1141–1158. [Google Scholar] [CrossRef]
  20. Kuczynska, M.; Schafet, N.; Becker, U.; Metasch, R.; Roellig, M.; Kabakchiev, A.; Weihe, S. Validation of different SAC305 material models calibrated on isothermal tests using in-situ TMF measurement of thermally induced shear load. Microelectron. Reliab. 2018, 91, 67–85. [Google Scholar] [CrossRef]
  21. Zaremba, W.; Kurach, K.; Fergus, R. Learning to Discover Efficient Mathematical Identities. In Proceedings of the the 27th International Conference on Neural Information Processing Systems, Montréal, QC, Canada, 8–13 December 2014; MIT Press: Cambridge, MA, USA, 2014; pp. 1278–1286. [Google Scholar]
  22. Allamanis, M.; Chanthirasegaran, P.; Kohli, P.; Sutton, C. Learning Continuous Semantic Representations of Symbolic Expressions. In Proceedings of the 34th International Conference on Machine Learning, Proceedings of Machine Learning Research, Sydney, Australia, 6–11 August 2017; pp. 80–88. [Google Scholar]
  23. Lample, G.; Charton, F. Deep Learning for Symbolic Mathematics. arXiv 2019, arXiv:1912.01412. [Google Scholar]
  24. Sharma, A.; Chhabra, D.; Sahdev, R.; Kaushik, A.; Punia, U. Investigation of wear rate of FDM printed TPU, ASA and multi-material parts using heuristic GANN tool. Mater. Today Proc. 2022, 63, 559–565. [Google Scholar] [CrossRef]
  25. Chhabra, D.; Deswal, S.; Kaushik, A.; Garg, R.K.; Kovács, A.; Khargotra, R.; Singh, T. Analysis of fused filament fabrication parameters for sliding wear performance of carbon reinforced polyamide composite material fabricated parts using a hybrid heuristic tool. Polym. Test. 2023, 118, 107910. [Google Scholar] [CrossRef]
  26. Yadav, M.; Kaushik, A.; Garg, R.K.; Yadav, M.; Chhabra, D.; Rohilla, S.; Sharma, H. Enhancing dimensional accuracy of small parts through modelling and parametric optimization of the FDM 3D printing process using GA-ANN. In Proceedings of the 2022 International Conference on Computational Modelling, Simulation and Optimization (ICCMSO), Bangkok, Thailand, 23–25 December 2022; pp. 89–94. [Google Scholar]
  27. Watanabe, S. Mathematical Theory of Bayesian Statistics; Chapman & Hall: London, UK, 2020. [Google Scholar]
  28. Anand, L. Constitutive equations for hot-working of metals. Int. J. Plast. 1985, 1, 213–231. [Google Scholar] [CrossRef]
  29. Chaboche, J.L. On some modifications of kinematic hardening to improve the description of ratchetting effects. Int. J. Plast. 1991, 7, 661–678. [Google Scholar] [CrossRef]
Figure 1. The concept of Equation-Informed Neural Networks (EINNs).
Figure 1. The concept of Equation-Informed Neural Networks (EINNs).
Materials 16 04922 g001
Figure 2. The EINNs for hyperbolic Garofalo model.
Figure 2. The EINNs for hyperbolic Garofalo model.
Materials 16 04922 g002
Figure 3. The EINNs for step 1 Anand equation.
Figure 3. The EINNs for step 1 Anand equation.
Materials 16 04922 g003
Figure 4. The EINNs for the step 2 Anand equation.
Figure 4. The EINNs for the step 2 Anand equation.
Materials 16 04922 g004
Figure 5. The EINN formulation for the Chaboche model.
Figure 5. The EINN formulation for the Chaboche model.
Materials 16 04922 g005
Figure 6. The obtained hyperbolic Garofalo curves for different temperatures. The experimental data are from Xiao and Armstrong [10].
Figure 6. The obtained hyperbolic Garofalo curves for different temperatures. The experimental data are from Xiao and Armstrong [10].
Materials 16 04922 g006
Figure 7. The distribution of the coefficients from the BI iteration. (ac) are coefficients C 1 , C 2 , and C 3 , respectively.
Figure 7. The distribution of the coefficients from the BI iteration. (ac) are coefficients C 1 , C 2 , and C 3 , respectively.
Materials 16 04922 g007
Figure 8. Step 1 Anand model at different temperatures. The experimental data are based on Motalab et al. [12].
Figure 8. Step 1 Anand model at different temperatures. The experimental data are based on Motalab et al. [12].
Materials 16 04922 g008
Figure 9. The distribution of the coefficients from the BI iteration. (ac) are coefficients A , m , and β , respectively.
Figure 9. The distribution of the coefficients from the BI iteration. (ac) are coefficients A , m , and β , respectively.
Materials 16 04922 g009
Figure 10. The obtained step 2 Anand model curves for different temperatures. (ac) are the obtained Anand model with strain rates of 10 3 , 10 4 , and 10 5 (1/s). The experimental data are based on Motalab et al. [12].
Figure 10. The obtained step 2 Anand model curves for different temperatures. (ac) are the obtained Anand model with strain rates of 10 3 , 10 4 , and 10 5 (1/s). The experimental data are based on Motalab et al. [12].
Materials 16 04922 g010
Figure 11. The MSE plot of step 2 Anand model coefficient extraction results Motalab et al. [12].
Figure 11. The MSE plot of step 2 Anand model coefficient extraction results Motalab et al. [12].
Materials 16 04922 g011
Figure 12. The distributions of the temperature-dependent Chaboche coefficients obtained by BI. (ae) represent the distributions of −40, −20, 40, 80, and 122 °C, respectively.
Figure 12. The distributions of the temperature-dependent Chaboche coefficients obtained by BI. (ae) represent the distributions of −40, −20, 40, 80, and 122 °C, respectively.
Materials 16 04922 g012
Figure 13. The temperature-dependent stress–stress curves from Chaboche model using the coefficients in Table 11.
Figure 13. The temperature-dependent stress–stress curves from Chaboche model using the coefficients in Table 11.
Materials 16 04922 g013
Table 1. The neuron definition of the EINN representation of the hyperbolic Garofalo equation.
Table 1. The neuron definition of the EINN representation of the hyperbolic Garofalo equation.
NeuronNet ValueActivation
M 1 M 1 , n e t = A · x M 1 = l n ( sinh M 1 , n e t )
M 2 M 2 , n e t = n · M 1 M 2 = e M 2 , n e t
y y n e t = C · M 2 y = y n e t
Table 2. The neuron definition of the EINN representation of the step 1 Anand equation.
Table 2. The neuron definition of the EINN representation of the step 1 Anand equation.
NeuronNet ValueActivation
M 1 M 1 , n e t = 1 / A * · x M 1 = l n ( M 1 , n e t )
M 2 M 2 , n e t = m * · M 1 M 2 = ln sinh 1 e M 2 , n e t
y y n e t = n * · M 1 + M 2 + β y = e y n e t
Table 3. The neuron definition of the EINN representation of the step 2 Anand equation.
Table 3. The neuron definition of the EINN representation of the step 2 Anand equation.
NeuronNet ValueActivation
M 1 M 1 , n e t = l + s 0 ξ * x M 1 = l n ( M 1 , n e t )
M 2 M 2 , n e t = h 0 ξ * · M 1 M 2 = l n ( M 2 , n e t )
M 3 M 3 , n e t = z M 3 = l n ( M 3 , n e t )
M 4 M 4 , n e t = l M 4 = l n ( M 4 , n e t )
M 5 M 5 , n e t = a * · M 1 M 5 = e M 5 , n e t
M 6 M 6 , n e t = M 2 + M 3 + ( a * 1 ) · M 4 M 6 = e M 6 , n e t
M 7 M 7 , n e t = M 5 a * · M 6 M 7 = ln ( M 7 , n e t )
y y n e t = 1 a * · M 2 y = e y n e t
Table 4. The neuron definition of the EINN representation of the Chaboche equation.
Table 4. The neuron definition of the EINN representation of the Chaboche equation.
NeuronNet ValueActivation
M 1 M 1 , n e t = ( γ * ) · x M 1 = 1 e M 1 , n e t
y y n e t = β * · M 1 + s * y = y n e t
Table 5. The ratios applied to the data pair to emphasize the preference.
Table 5. The ratios applied to the data pair to emphasize the preference.
Data IDTemperature (°C) Plastic   Strain   Rate   ( ε p ˙ , 1 0 9 1 / s ) Stress (MPa)Ratio
1451.410.5418.0
2454.012.306.0
34513.614.255.0
44543.915.920.5
5806.610.4318.0
68013.111.596.0
78021.212.405.0
88057.813.803.0
98095.014.461.0
1011513.48.7315.0
1111519.89.3512.0
1211534.210.076.0
1311550.510.616.0
14115166.012.411.0
1515090.17.511.0
16150165.08.631.0
17150315.09.821.0
1815045410.540.5
1915081011.520.5
Table 6. The comparison of the extracted coefficients of Hyperbolic Garofalo Model.
Table 6. The comparison of the extracted coefficients of Hyperbolic Garofalo Model.
Xiao and Armstrong [10]EINNsEINNs + BI
Q (kJ/mol)62,00065,00065,000
C 1 0.1840.5390.443
C 2 0.2210.4730.482
C 3 2.891.0551.073
MSE *37,188.511,794.910,967.4
*: defined by Equation (4).
Table 7. The comparison of the extracted coefficients of step 1 Anand model.
Table 7. The comparison of the extracted coefficients of step 1 Anand model.
Motalab et al. [12]EINNsEINNs + BI
A 350116501649
n 1.00 × 1 0 2 1.54 × 1 0 4 1.64 × 1 0 4
m 0.250.540.53
β = s ^ / ξ 7.554.114.16
MSE *17.0315.9615.78
*: defined by Equation (4).
Table 8. The pre-processing mapping parameters.
Table 8. The pre-processing mapping parameters.
y l x z
a 10.80.10.8
b 00.40.050.1
Table 9. The comparison of the extracted coefficients of step 2 Anand model.
Table 9. The comparison of the extracted coefficients of step 2 Anand model.
Motalab et al. [12]EINNsEINNs + BI
ξ 417.6617.66
s 0 2155.9657.45
a 1.782.302.26
h 0 18,000828,822828,822
Table 10. Temperature-dependent Chaboche coefficients of EINNs.
Table 10. Temperature-dependent Chaboche coefficients of EINNs.
Temperature σ 0 C γ MSE *
−40 °C39.329174.11004.71.11
−20 °C33.807535.7964.00.84
40 °C21.354216.0840.00.50
80 °C 15.432988.0824.30.28
122 °C10.501886.5759.10.18
*: defined by Equation (4).
Table 11. Temperature-dependent Chaboche coefficients of EINNs and BI.
Table 11. Temperature-dependent Chaboche coefficients of EINNs and BI.
Temperature σ 0 C γ MSE *
−40 °C39.309174.11004.71.11
−20 °C33.787535.7964.00.84
40 °C21.394216.0840.00.49
80 °C 15.452988.0824.30.27
122 °C10.531886.5759.10.17
*: defined by Equation (4).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yuan, C.; Su, Q.; Chiang, K.-N. Coefficient Extraction of SAC305 Solder Constitutive Equations Using Equation-Informed Neural Networks. Materials 2023, 16, 4922. https://doi.org/10.3390/ma16144922

AMA Style

Yuan C, Su Q, Chiang K-N. Coefficient Extraction of SAC305 Solder Constitutive Equations Using Equation-Informed Neural Networks. Materials. 2023; 16(14):4922. https://doi.org/10.3390/ma16144922

Chicago/Turabian Style

Yuan, Cadmus, Qinghua Su, and Kuo-Ning Chiang. 2023. "Coefficient Extraction of SAC305 Solder Constitutive Equations Using Equation-Informed Neural Networks" Materials 16, no. 14: 4922. https://doi.org/10.3390/ma16144922

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop