Next Article in Journal
Electroviscous Effect of Water-Base Nanofluid Flow between Two Parallel Disks with Suction/Injection Effect
Next Article in Special Issue
Efficiency Evaluation of Software Faults Correction Based on Queuing Simulation
Previous Article in Journal
Reductions and Conservation Laws of a Generalized Third-Order PDE via Multi-Reduction Method
Previous Article in Special Issue
Construct Six Sigma DMAIC Improvement Model for Manufacturing Process Quality of Multi-Characteristic Products
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bias Correction Method for Log-Power-Normal Distribution

1
Department of Statistics, Tamkang University, Tamsui District, New Taipei City 251301, Taiwan
2
Department of Mathematical Sciences, University of South Dakota, Vermillion, SD 57069, USA
3
Department of Information Management, Tamkang University, Tamsui District, New Taipei City 251301, Taiwan
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(6), 955; https://doi.org/10.3390/math10060955
Submission received: 18 February 2022 / Revised: 12 March 2022 / Accepted: 16 March 2022 / Published: 17 March 2022

Abstract

:
The log-power-normal distribution is a generalized version of the log-normal distribution. The maximum likelihood estimation method is the most popular method to obtain the estimates of the log-power-normal distribution parameters. In this article, we investigate the performance of the maximum likelihood estimation method for point and interval inferences. Moreover, a simple method that has less impact from the subjective selection of the initial solutions to the model parameters is proposed. The bootstrap bias correction method is used to enhance the estimation performance of the maximum likelihood estimation method. The proposed bias correction method is simple for use. Monte Carlo simulations are conducted to check the quality of the proposed bias correction method. The simulation results indicate that the proposed bias correction method can improve the performance of the maximum likelihood estimation method with a smaller bias and provide a coverage probability close to the nominal confidence coefficient. Two real examples about the air pollution and cement’s concrete strength are used for illustration.

1. Introduction

Log-normal (shortened by LN) distribution has been extensively used to characterize lifetime data in the applications of reliability analysis and quality control. Typical applications of the LN distribution can be found in modeling the fatigue failure and failure rate. In order to study composite materials subject to fatigue loading, Ratnaparkhi et al. [1] considered Yang’s differential equation for residual strength degradation to obtain the LN model for characterizing the fatigue life of a composite material. Considering the fatigue strength distribution as a function of the number of failure cycles, Murty et al. [2] derived this fatigue strength distribution via the S–N curve representing fatigue as follows. The number of cycles to failure at two levels of stress is used to obtain the S–N curve, and the numbers of cycles to failure are in turn assigned to follow a LN distribution. The LN distribution has a close relationship with the normal distribution. Let random variable, Y, follow a LN distribution with location parameter ξ and scale parameter σ , then log ( Y ) follows a normal distribution with mean ξ and standard deviation σ .
Similar to the relationship between the LN distribution and the normal distribution, a close relationship can be found between the log-power-normal (LPN) distribution and power-normal (PN) distribution. The PN distribution is a generalized version of the normal distribution. Goto and Inoue [3] introduced the PN family of distributions, investigated its statistical properties and presented the moment function in terms of infinite series in 1980. Later, Goto et al. [4] discussed some applications through using PN distribution and [5] studied the maximum likelihood estimation method for the normalized PN distribution. Hereafter, the “MLE method” is used to denote the maximum likelihood estimation method for simplicity. Freeman and Modarres [6] used the inverse Box–Cox power transformation to obtain another expression for the PN distribution. They also obtained the mathematical forms of the mean and variance for the PN distribution. Castillo et al. [7] studied the statistical properties of the truncated PN by truncating the PN distribution at zero. They also studied the probabilistic properties based on the MLE and moments estimation methods. Maruo and Goto [8] used the delta method to propose several estimation methods to obtain the confidence interval of the percentile of the PN distribution. Gupta and Gupta [9] investigated the closeness of the PN and skew-normal distributions and then proposed estimation procedures to model skewed data via using the PN distribution. Maruo et al. [10] used simulation methods to evaluate the asymptotic impact of the truncation on the parameter estimation for the PN distribution. Assume that random variable Y follows a LPN distribution, then the logarithm transformation X = log ( Y ) follows a PN distribution. Martiner-Florez et al. [11] studied the MLE method for the LPN distribution and used the LPN distribution to characterize air contamination data. Zhu et al. [12] used the PN distribution for quality control applications.
In the real world, the lifetime distribution often does not have a symmetric shape. A generalized version of the LN distribution can be helpful for reliability analysis and quality control applications. The log-skew-normal distribution could be an option because the logarithm transformation of random variable inside the skew-normal distribution can follow the shape of the skew-normal distribution. More information can be seen from [13]. The skew-normal distribution is not the best model to capture high degrees of kurtosis in data. Moreover, the quality of the estimation results based on the skew-normal distribution could not be reliable if the skewness parameter is close to zero. The PN distribution could be a better model for the higher kurtosis data, which have a narrow range of asymmetry, than the normal and skew-normal distributions. More information about the aforementioned distributions can be seen in [14,15]. In this study, we emphasize on suggesting a bias correction method for the LPN distribution.

2. Motivation and Organization

Let X follow the PN distribution with parameter Θ = ( θ 1 , θ 2 , θ 3 ) = ( ξ , σ , γ ) . The probability density function (PDF) and cumulative density function (CDF) of the PN distribution are defined, respectively, by
f X ( x | Θ ) = γ σ Φ x ξ σ γ 1 ϕ x ξ σ , x , ξ R , γ , σ > 0
and
F X ( x | Θ ) = Φ x ξ σ γ , x , ξ R , γ , σ > 0 ,
where ξ , σ and γ are the location, scale and shape parameters, respectively. Let X = log ( Y ) , Martiner-Florez et al. [11] presented the PDF and CDF of Y by
f ( y | Θ ) = γ y σ Φ log ( y ) ξ σ γ 1 ϕ log ( y ) ξ σ , y > 0
and
F ( y | Θ ) = Φ log ( y ) ξ σ γ , y > 0 ,
respectively. The distribution of Y, named LPN, is defined by Equations (3) and (4). Symbolically, it is denoted by Y L P N ( Θ ) . The LPN distribution reduces to the LN distribution when γ = 1 . The survival function and hazard function at y 0 > 0 can be obtained, respectively, by
S ( y 0 | Θ ) = 1 F ( y 0 | Θ ) = 1 Φ log ( y 0 ) ξ σ γ
and
h ( y 0 | Θ ) f ( y 0 | Θ ) S ( y 0 | Θ ) .
The pth quantile function of L P N ( Θ ) can be presented by
q p q p ( p | Θ ) = exp { ξ + σ Φ 1 ( p 1 γ ) } , 0 < p < 1 .
Figure 1 displays the PDFs of the standard LPN distribution with various shape parameters. Using the MLE method and Fisher information matrix to obtain reliable point and interval estimates has been the most popular procedure for the parametric modeling in the real world. Martiner-Florez et al. [11] investigated the MLE method for the LPN distribution. Zhu et al. [12] studied the evaluation of the process performance index via using PN distribution. They mentioned that using the Fisher information matrix to obtain the confidence interval of the process performance index could be problematic and recommended using bootstrap methods instead. Even if the parametric bootstrap method could be used to obtain the confidence interval of the model parameter, using reliable MLEs to generate high-quality bootstrap samples is the key factor for guaranteeing the estimation performance quality. However, the MLE method is sensitive to the initial solutions of the model parameters. Moreover, the log-likelihood function with several model parameters could not be convex. This fact makes the MLE method more sensitive to the initial solution when the dimension of the model parameter over two. We are motivated to propose methods to improve the quality of MLEs with the two aspects:
  • For reducing the influence of subjective assigning the initial solutions of ξ , σ and γ in the LPN distribution, we propose a simple least square (LS) estimation method to obtain reliable initial solutions of ξ , σ and γ for MLE estimation.
  • Propose a bias-correction MLE method to enhance the estimation performance of interval inference.
In this study, we focus on studying the performance for estimating the survival function at different quantiles. The survival function is an important metric to measure the lifetime quality of products in reliability analysis and quality control applications. For simplicity, we use BcLS-MLE method here and after to denote the proposed bias-correction MLE method with using the LS estimates as initial solutions of ξ , σ and γ .
The rest of this article is organized as follows. In Section 3, we investigate the MLE method, exact and observed Fisher information matrices. The confidence interval of a general differentiable function of Θ in the LPN distribution is constructed via using the observed Fisher information matrix and delta method. A regression method based on using the quantile function of LPN is used to obtain reasonable initial solutions for the LPN distribution parameters. The bootstrap bias-correction method is proposed to improve the inference performance of the MLE-based methods. Monte Carlo simulations are conducted in Section 4 to evaluate the performance of the proposed methods in terms of the measures of relative bias (rbias) and relative square root mean squared error (rsqMSE). We discuss the superiority of the proposed methods over the typical MLE method as well. In Section 5, two examples about the air pollution and cement’s concrete strength are used to illustrate the applications of the proposed estimation procedures. Finally, some concluding remarks are given in Section 6.

3. Parameter Estimation Methods

In this section, we investigate the typical MLE method and propose a bias-correction MLE method. Moreover, we use the quantile function of L P N ( Θ ) to propose a simple method to obtain a reasonable initial solution of Θ .

3.1. Maximum Likelihood Estimation Method

Let Y = ( Y 1 , Y 2 , , Y n ) be a random sample taken from L P N ( Θ ) . The likelihood and log-likelihood functions can be presented, respectively, by
L ( Θ | Y ) = i = 1 n f ( y i | Θ ) = γ σ n i = 1 n 1 y i Φ log ( y i ) ξ σ γ 1 ϕ log ( y ) ξ σ
and
( Θ | Y ) = n ( log ( γ ) log ( σ ) ) i = 1 n log ( y i ) + ( γ 1 ) i = 1 n log ( Φ ( z i ) ) + i = 1 n log ( ϕ ( z i ) ) ,
where z i = log ( y i ) ξ σ follows the standard PN distribution, i = 1 , 2 , , n . Martiner-Florez et al. [11] suggested the process to obtain the MLE of Θ and the observed Fisher information matrix. The first derivative of the log-likelihood function can be expressed by
θ 1 = ξ = n σ z ¯ ( γ 1 ) w ¯ , θ 2 = σ = n σ 1 i = 1 n z i 2 n + ( γ 1 ) i = 1 n z i w i n , θ 3 = γ = n 1 γ + v ¯
where w i = ϕ ( z i ) Φ ( z i ) , v i = log ( Φ ( z i ) ) , i = 1 , 2 , , n . Denote the MLE of Θ by Θ ^ = ( θ ^ 1 , θ ^ 2 , θ ^ 3 ) = ( ξ ^ , σ ^ , γ ^ ) . Θ ^ is the solution of the likelihood functions of θ i = 0 , i = 1 , 2 , 3 . The second order derivatives of the log-likelihood function can be expressed by
2 θ 1 2 = 2 ξ 2 = 1 σ 2 n + ( γ 1 ) i = 1 n w i 2 + i = 1 n z i w i ,
2 θ 1 θ 2 = 2 θ 2 θ 1 = 2 ξ σ = 1 σ 2 i = 1 n z i + ( γ 1 ) i = 1 n z i w i 2 + i = 1 n z i 2 w i i = 1 n w i ,
2 θ 1 θ 3 = 2 θ 3 θ 1 = 2 ξ γ = 1 σ i = 1 n w i ,
2 θ 2 2 = 2 σ 2 = 1 σ 2 3 i = 1 n z i 2 n ( γ 1 ) 2 i = 1 n z i w i i = 1 n z i 3 w i i = 1 n z i 2 w i 2 ,
2 θ 2 θ 3 = 2 θ 3 θ 2 = 2 σ γ = 1 σ i = 1 n z i w i ,
and
2 θ 3 2 = 2 γ 2 = n γ 2 .
Let η i j = E 2 θ i θ j , for i = 1 , 2 , 3 and j = 1 , 2 , 3 and a s t = E ( Z s W t ) , for s = 0 , 1 , 2 , 3 and t = 1 , 2 . We can show that
a s t = z s ϕ ( z ) Φ ( z ) t γ ϕ ( z ) [ Φ ( z ) ] γ 1 d z = γ z s ( ϕ ( z ) ) t + 1 ( Φ ( z ) ) γ t 1 d z
for s = 0 , 1 , 2 , 3 and t = 1 , 2 . The value of a s t can be evaluated via using numerical computation methods. Martiner-Florez et al. [11] obtained η i j . We express η i j as a function of a s t by
η 11 = n σ 2 1 + ( γ 1 ) ( a 02 + a 11 ) , η 12 = η 21 = n σ 2 a 10 + ( γ 1 ) ( a 12 + a 21 a 01 ) , η 13 = η 31 = n a 01 σ , η 22 = n σ 2 3 a 20 1 ( γ 1 ) ( 2 a 11 a 31 a 22 ) , η 23 = η 32 = n a 11 σ , η 33 = n γ 2 .
The Fisher information can be defined by
I ( Θ ) = [ η i j ] .
Because not all entries in I ( Θ ) have an analytic form, numerical integral methods are needed to obtain the I ( Θ ) based on data. A simple method is to approximate I ( Θ ) by the observed Fisher information matrix,
i ( Θ ) = 2 θ i θ j | Θ = Θ ^ .
Let g : R 3 R be a differentiable function and I 1 ( Θ ) = [ η i j ( Θ ) ] , where η i j ( Θ ) is the ( i , j ) th element of the inverse Fisher information matrix. The ( 1 α ) × 100 % confidence interval of g ( Θ ) can be obtained by
g ( Θ ^ ) ± z α / 2 g ( Θ ^ ) I 1 ( Θ ^ ) g ( Θ ^ ) T ,
where g ( Θ ^ ) = g θ 1 , g θ 2 , g θ 3 Θ = Θ ^ T is the gradient of g ( Θ ) at Θ = Θ ^ .
Please note that the terms of a i j could not be a finite number. If γ t 1 < 0 , a s t is infinite as z . Therefore, it could be difficult to use Fisher information matrix to obtain an approximate confidence interval of g ( Θ ) . The observed Fisher information matrix, defined in Equation (18), is easy to be evaluated. The quality of the confidence interval of g ( Θ ) based on using the observed Fisher information matrix needs to be verified. In this study, we focus on studying the performance for estimating the survival function at different quantiles. We will conduct intensive simulations to evaluate the performance of the proposed interval inference method and typical MLE method in Section 4.

3.2. The Least Square Method for Initial Solutions

There are three parameters in L P N ( Θ ) , it is not easy to simultaneously set up the initial solutions of ξ , σ and γ to implement the MLE method. A high-quality initial solution Θ 0 is a key factor to obtain a reliable MLE of Θ . We suggest a simple process to obtain a proper initial solution of Θ . From the quantile Equation of (7), we obtain the linear equation
log ( q p ) = ξ + σ Φ 1 p 1 γ .
Let γ = γ 0 , the order statistics of the random sample Y be Y ( 1 ) Y ( 2 ) , , Y ( n ) and l q i = log ( q p i ) , where q p i Y ( i ) , p i = i 0.5 n for i = 1 , 2 , , n . For each γ 0 , the initial solutions of ξ and σ can be lease square (LS) estimates of the intercept and slope of the regression line,
l q ^ i = ξ ˙ + σ ˙ Φ 1 p 1 γ , i = 1 , 2 , , n .
The LS initial solutions of ξ , σ and γ can be denoted by ξ ˙ , σ ˙ and γ 0 , respectively; that is, we only need to set up the initial value of γ to be γ 0 , then use Equation (21) and γ 0 to obtain the LS initial solutions of ξ and σ . We denote the obtained MLEs via using the LS initial solutions ξ ˙ , σ ˙ and γ o by Θ ˜ = ( ξ ˜ , σ ˜ , γ ˜ ) .
Table 1 reports the MLEs for L P N ( Θ ) with ( ξ , σ ) = ( 5 , 0.6 ) and γ = 1.5 , 3 , 5 , 7 , 9 . We use the measures of rbias and rsqMSE to evaluate the performance of the MLE method. Let the estimate of parameter θ is θ ^ , the rbias is defined by
rbias = Bias θ ,
where Bias = 1 B i = 1 n θ ^ i θ and B is the number of repetitions in simulations. The rsqMSE is defined by
rsqMSE = MSE θ ,
where MSE = 1 B i = 1 n ( θ ^ i θ ) 2 . The measures of rbias and rsqMSE are free of scale.
In practical applications, we do not have enough knowledge to set up all three initial solutions in Θ . We consider using ξ 0 = 1 , σ 0 = 1 and γ 0 = 1 as initial solutions to obtain the MLEs of ξ , σ and γ , respectively. The rbias and rsqMSE of each MLE are evaluated using 1000 repetitions. In view of Table 1, we find the rbias of γ is significantly larger than the rbias of ξ and σ . We note that the initial solution of Θ 0 = ( ξ 0 , σ 0 , γ 0 ) = ( 1 , 1 , 1 ) is a proper initial solution for the simulation cases with γ = 1.5 , 3 and 5. When the value of γ increases, the initial solution of Θ 0 = ( 1 , 1 , 1 ) becomes an improper initial solution. We find unreasonable results that the rbias and rsqMSE increase as the sample size increases from 50 to 100 for γ = 5 , 7 and 9. These results indicate that the MLE method for L P N ( Θ ) is very sensitive to the initial solution of Θ when the sample size is large. Moreover, we also find that the reqMSE of γ is significantly larger than the reqMSEs of ξ and σ . We will use Monte Carlo simulation method to study the impact of sample size on the performance of MLE method in Section 4. Based on our findings from the MLE method, a bias correction method is provided in this study to improve the estimation performance of γ .

3.3. The Bias Correction MLE Method

MLE often is a biased estimate. The Firth method, which was studied in [16], and the bootstrap method are two widely used methods to reduce the bias of MLE. Because we cannot guarantee the existence of the expected values of the second and third derivatives of the log-likelihood function for L P N ( Θ ) , the Firth method is unable to be used in this study. The bootstrap bias correction method can be a potential method to reduce the bias of MLEs. Let the bias-correction estimate of γ be γ ˜ B . The bootstrap bias correction method can be implemented as the following steps to obtain γ ˜ B .

Bootstrap Bias Correction Method

Initial Step A: 
Obtain a working sample y = ( y 1 , y 2 , , y n ) .
Step A1: 
Using LS estimate Θ ˙ = ( ξ ˙ , σ ˙ , γ 0 ) as the initial solution of Θ to obtain the MLE, Θ ˜ = ( ξ ˜ , σ ˜ , γ ˜ ) , based on the working random sample y .
Step A2: 
Generate a bootstrap random sample y * from L P N ( Θ ˜ ) and transform y * into a standardized PN sample z * = ( z 1 * , z 2 * , , z n * ) , where z i * = log ( y i * ) ξ ˜ σ ˜ , i = 1 , 2 , , n . We can treat z * as a random sample that is taken from P N ( γ ˜ ) , where P N ( γ ˜ ) is the standardized PN distribution with parameter γ = γ ˜ . Obtain the MLE of γ based on z * and P N ( γ ˜ ) and denote the obtained MLE by γ ˜ * .
Step A3: 
Repeat Step A2 B c times, where B c is a huge positive integer. Let γ ˜ j * denote the MLE of γ based on jth bootstrap sample for j = 1 , 2 , , B c . Efron and Tibshirani [17] showed that the estimated bias of γ ˜ can be obtained by
γ ˜ B i a s * = 1 B c j = 1 B c γ ˜ j * γ ˜ .
The bootstrap bias correction MLE of γ can be presented by
γ ˜ B = γ ˜ γ ˜ B i a s * = 2 γ ˜ 1 B c j = 1 B c γ ˜ j * .

4. Monte Carlo Simulations

4.1. Performance Comparison

In this section, we use Monte Carlo simulation method to verify the performance of the MLE method with the initial solution Θ 0 = ( ξ 0 , σ 0 , γ 0 ) = ( 1 , 1 , 1 ) and LS initial solutions with using γ 0 = 1 . Because the moment functions of L P N ( Θ ) are complicated, it is difficult to use the first three moment functions to obtain the moment estimates as the initial solutions of ξ , σ and γ . Setting each parameter by 1 can be an optional initial solution to obtain the MLEs of parameters if the user does not have good knowledge to set up a quality initial solution for each model parameter.
Parameter combinations of ξ = 5 , σ = 0.6 , γ = 1.5 , 3 , 5 , 7 , 9 ; ξ = 8 , 10 , σ = 0.6 , γ = 3 ; and ξ = 5 , σ = 1, 3, γ = 3 are considered for the Monte Carlo simulations. The measures of rbias and rsqMSE are used to evaluate the performance of the MLE and BcLS-MLE methods based on B = 5000 repetitions. When γ = 1.5 , 3 , 5 , the initial solution Θ 0 = ( ξ 0 , σ 0 , γ 0 ) = ( 1 , 1 , 1 ) is a good initial solution for Θ . Θ 0 becomes an improper initial solution for γ as the true value of γ increases. Please note that the initial solution ξ 0 = 1 and σ 0 = 1 are still good initial solutions. We will show that the MLE method is very sensitive to the initial values even only one parameter without using good initial solution to obtain the MLE of Θ . We use B c = 1000 bootstrap samples for bias correction in the simulation study. The survival function is an important metric in the applications of the reliability analysis and quality control. Let
g ( Θ | y 0 ) = S ( y 0 | Θ ) = 1 Φ log ( y 0 ) ξ σ γ ,
where y 0 can be determined by q p in Equation (7). The coverage probability (CP) for estimating g ( Θ | y 0 ) ( = S ( y 0 | Θ ) ) is obtained based on 5000 repetitions for using the MLE and BcLS-MLE methods. In this study, we consider the quantile with p = 0.1 , 0.3 and 0.5 to determine the value of y 0 .
Replacing the Fisher information matrix by the observed Fisher information matrix, the approximate ( 1 α ) × 100 % confidence interval of g ( Θ ) can be presented by
g ( Θ ^ | y 0 ) ± z α / 2 g ( Θ ^ | y 0 ) i 1 ( Θ ^ ) g ( Θ ^ | y 0 ) T
Based on Equation (26), we can show that g ( Θ | y 0 ) θ i = δ θ i , where δ = Φ log ( y 0 ) θ 1 θ 2 θ 3 . Taking logarithm transformation to the both sides of δ = Φ log ( y 0 ) θ 1 θ 2 θ 3 , we obtain log ( δ ) = θ 3 log Φ log ( y 0 ) θ 1 θ 2 . It can be shown that δ θ i = δ × θ i θ 3 log Φ log ( y 0 ) θ 1 θ 2 . We can obtain the following results:
( i ) δ ξ = δ θ 1 = θ 3 θ 2 × [ Φ ( z 0 ) ] θ 3 1 ϕ ( z 0 ) , ( i i ) δ σ = δ θ 2 = z 0 θ 3 θ 2 × [ Φ ( z 0 ) ] θ 3 1 ϕ ( z 0 ) , ( i i i ) δ γ = δ θ 3 = [ Φ ( z 0 ) ] θ 3 log [ Φ ( z 0 ) ] ,
where z 0 = log ( y 0 ) θ 1 θ 2 . Based on the results of (i), (ii), (iii) and the equation of g ( Θ | y 0 ) θ i = δ θ i , i = 1 , 2 , 3 , we obtain
g ( Θ | y 0 ) ξ = g ( Θ | y 0 ) θ 1 = θ 3 θ 2 × [ Φ ( z 0 ) ] θ 3 1 ϕ ( z 0 ) = θ 3 θ 2 × [ Φ ( z 0 ) ] θ 3 × w 0 , g ( Θ | y 0 ) σ = g ( Θ | y 0 ) θ 2 = z 0 θ 3 θ 2 × [ Φ ( z 0 ) ] θ 3 1 ϕ ( z 0 ) = z 0 θ 3 θ 2 × [ Φ ( z 0 ) ] θ 3 × w 0
and
g ( Θ | y 0 ) γ = g ( Θ | y 0 ) θ 3 = [ Φ ( z 0 ) ] θ 3 log [ Φ ( z 0 ) ] = [ Φ ( z 0 ) ] θ 3 v 0 ,
where w 0 = ϕ ( z 0 ) Φ ( z 0 ) and v 0 = log [ Φ ( z 0 ) ] . The gradient of g at Θ = Θ ^ can be resented by
g ( Θ ^ ) = [ Φ ( z 0 ) ] θ 3 w 0 θ 3 θ 2 , z 0 w 0 θ 3 θ 2 , v 0 Θ = Θ ^ T .
The ( 1 α ) × 100 % approximate confidence interval of g ( Θ ) can be obtained using Equations (27) and (28).
Each parameter combination was run with a L P N ( Θ ) random sample that has a size n = 30 , 50 , 100 , 150 and 200 in the simulation study. All simulation results based on the MLE method with the initial solution of Θ 0 = ( ξ 0 , σ 0 , γ 0 ) = ( 1 , 1 , 1 ) and the BcLS-MLE method with γ 0 = 1 , and the CP of S ( y 0 | Θ ) with y 0 = q 0.1 , q 0.3 and q 0.5 are reported in Table 2, Table 3, Table 4, Table 5, Table 6, Table 7, Table 8, Table 9 and Table 10. From these five tables, we find the following results.
  • In most cells of Table 2, Table 3, Table 4, Table 5, Table 6, Table 7, Table 8, Table 9 and Table 10, the BcLS-MLE method can reduce the rbias of MLEs.
  • Note that reducing the rbias of an estimate could inflate the rsqMSE. Some cells in Table 2, Table 3, Table 4, Table 5 and Table 6 show that the rsqMSE of BcLS-MLE is larger than the rsqMSE of MLE when the rbias of MLE is reduced. For example, the MLE method in Table 4 slightly outperforms the BcLS-MLE method in many cells in terms of the measures of rbias and rsqMSE for γ = 5 . Carefully checking the values of rbias and rsqMSE in Table 4, two estimation methods are competitive for γ = 5 . For Table 2, Table 3, Table 4, Table 5 and Table 6, the proposed BcLS-MLE significantly outperforms the MLE method in terms of the measures of rbias and rsqMSE in most cells.
  • Θ 0 = ( 1 , 1 , 1 ) is a proper initial solution to obtain the MLEs of Θ for γ = 1.5 , 3 and 5. When the value of γ increases, Θ 0 = ( 1 , 1 , 1 ) or the LS estimates with γ 0 = 1 becomes improper initial solutions. From Table 5 and Table 6, we find that the proposed BcLS-MLE method is more reliable than the MLE method if the initial solution is improper. Even if the rbias and rsqMSEs of the BcLS-MLE and MLE in Table 5 and Table 6 are closed, The CP based on the proposed BcLS-MLE method is more closed to the nominal value 95% than the value of CP based on the MLE method. The CP of the MLE method could drop down under 70%, which seriously underestimates the nominal confidence coefficient.
  • Based on the CPs from Table 2, Table 3, Table 4, Table 5, Table 6, Table 7, Table 8, Table 9 and Table 10, the proposed BcLS-MLE method can improve the interval estimation performance of the MLE method and provide a CP more closed to the nominal confidence coefficient.
  • In practice, we can try different values of γ to obtain the LS estimates ξ ˜ and σ ˜ . Then, select the vector of ( ξ ˜ , σ ˜ , γ 0 ) which has the maximal log-likelihood as the initial values of ξ , σ and γ to obtain better estimation performance. In the simulation design, we only consider γ 0 = 1 for all parameter combinations. This consideration is based on the performance comparison between the proposed BcLS-MLE and MLE methods for the cases with proper and improper initial solutions.
  • From Table 7, Table 8, Table 9 and Table 10, we find that the MLE method performs bad when the values of ξ or σ change from that in Table 2, Table 3, Table 4, Table 5 and Table 6. Many cells of bias and rsqMSEs in these four tables are unacceptably large. Moreover, some CPs in Table 8 and Table 10 are close to or equal to 0 and seriously underestimate the nominal confidence coefficient of 0.95. These findings indicate that the MLE method is unreliable. The proposed BcLS-MLE method outperform the MLE method in terms of the metrics of bias, rsqMSE and CP.
  • Selecting a proper initial solution of Θ to implement the MLE method is important if the working sample is large.

4.2. Discussions

In our simulation experience for LPN or PN, the MLE method often fails to be implemented due to improper initial solutions were used. Even if the MLE method can be implemented to obtain a set of MLEs, the obtained MLEs could be improper MLEs. This is because the log-likelihood function easily falls into a local optimum with an improper initial solution. The practitioner struggles to simultaneously assign good initial solutions for ξ , σ and γ to implement the MLE method. The proposed LS initial solution method is feasible as an optional method because we only need to assign the initial solution of γ .
Carefully checking the structure of the log-likelihood function, we find that it is difficult to estimate the shape parameter γ when the sample size is small. The terms contain γ in the log-likelihood function can be expressed as a summation by
n log ( γ ) + ( γ 1 ) i = 1 n log ( Φ ( z i ) ) = n log ( γ ) + ( γ 1 ) i = 1 n v i .
There are two situations that the MLE of γ could overestimate or underestimate its true value if the sample size is small.
Situation 1: 
When the log-likelihood function chooses γ > 1 as a candidate, the term “ n log ( γ ) ” is positive and the term “ ( γ 1 ) i = 1 n v i ” is negative due to ( γ 1 ) > 0 and i = 1 n v i < 0 . For small size sample, the summation term i = 1 n v i often is small. Therefore, the log-likelihood function could favor a large γ in (29) as the MLE to maximize the log-likelihood function. Therefore, γ is easy to be overestimated in this situation.
Situation 2: 
When the log-likelihood function chooses γ < 1 as a candidate, the term “ n log ( γ ) ” is negative and the term “ ( γ 1 ) i = 1 n v i ” is positive due to ( γ 1 ) < 0 and i = 1 n v i < 0 . Because of the summation term i = 1 n v i often is small for small size sample, the log-likelihood function could favor a small γ toward 1 in (29) as the MLE to maximize the log-likelihood function. In most applications of reliability analysis and quality control, right-skewed data are more often than left-skewed data. Therefore, it is easy to underestimate γ in this situation if the working data follow a right-skewed distribution.
On the basis of Situation 1 and Situation 2 for small size sample, we do not suggest using a small or big value as the initial value for γ with a small size sample to protect the log-likelihood function falls into the trap of local maximum. Luckily, the shape parameter is at the power of the standardized normal distribution Φ log ( y ) ξ σ , the true value of γ should not be too large for most real data. A good idea is to search the initial values of γ in a range of positive domain, and pick up the best one from all LS initial solutions to result in the maximum log-likelihood among all considered competitors. For example, we can figure out the domain of γ of { 2 < γ < 10 } into a discrete set of { γ | γ = γ i , i = 1 , 2 , , m } , where γ 1 = 2 , γ i = γ i 1 + 0.01 for 2 i ( m 1 ) and γ m = 10 as candidates. Then, use the proposed LS initial solution method to find the LS initial solutions of ξ and σ corresponding to γ i . Denote the initial solution set by { ( ξ ˙ i , σ ˙ i , γ i ) , i = 1 , 2 , , m } . Then, select the one with the maximal log-likelihood based on the working sample as initial solutions to find the MLEs of ξ , σ and γ .
We use Monte Carlo simulation to show the strength of the proposed BcLS-MLE method over the MLE method. Let ξ = σ = 5 and γ = 1.5 , 5, 7, 9, 12. Consider the sample size n = 50 , 100 and 200 for the Monte Carlo simulation. The MLE method has a high failure rate if we use ξ 0 = 1 , σ = 1 and γ 0 as initial solutions, where γ 0 is randomly generated from the uniform distribution, U ( 2 , 10 ) . Instead of the MLE method, we can implement the proposed BcLS-MLE method to find the BcLS-MLEs of ξ ˜ , σ ˜ and γ ˜ B . The initial solutions of ξ ˙ and σ ˙ can be obtained via doing global searching for γ over the domain of { γ | γ = γ i , i = 1 , 2 , , m } , where γ 1 = 2 , γ i = γ i 1 + 0.01 for 2 i ( m 1 ) and γ m = 10 . Then, select the set ( ξ ˙ i * , σ ˙ i * , γ i * ) which has the maximal log-likelihood among m competitors as the initial solution of ( ξ , σ , γ ) . If more than one set reach same maximal log-likelihood, select the set with the smallest index of i as initial solution set to implement the proposed BcLS-MLE method. All simulation results are reported in Table 11. The rbias and rsqMSE are evaluated based on B = 5000 repetitions and B c = 1000 bootstrap samples are used for bias correction.
Table 11 shows that we can obtain reliable MLEs via using the proposed BcLS-MLE method even if we do not have enough knowledge to assign the initial values of ξ , σ and γ . The LS initial solution method is feasible to assign proper initial solutions of the model parameters through searching an given interval of γ . Because the confidence interval using observed Fisher information matrix is conservative, the CP of the survival function slightly underestimates the nominal confidence coefficient if p is small. When p increases, the CP could be underestimated over the expectation. From the last column of Table 2 to Table 11, we find that the CP could drop below 90% from the nominal value of 95% when the survival function at the median is evaluated. The evaluation of low pth quantile is more important than the evaluation of upper pth quantile in lifetime data analysis. Therefore, the proposed BcLS-MLE method can be used to obtain a reliable MLE of g ( Θ ) and provide an approximate confidence interval for g ( Θ ) through using the observed Fisher information matrix.

5. Examples

5.1. Example 1

The first example is related to air pollution in the city of New York, USA. In this dataset, the daily measurements of ozone concentration (in ppb D ppm×1000) in the atmosphere of New York city during May–September 1973. All measurements were from the New York State Department of Conservation. This dataset was previously analyzed by Martiner-Florez et al. [11], Nadarajah [18] and Leiva et al. [19]. It is usually assumed that the air pollutant data are uncorrelated and independent. For easy reference, we report this data set with 116 measurements in Appendix A.
Nadarajah [18] has shown that the LPN distribution is a best model, which outperforms over the LN, log-skew-normal, Birnbaum–Saunders distribution(BS) distributions. Martiner-Florez et al. [11] reported the MLEs of the LPN distribution parameters as ξ ^ = 4.986 , σ ^ = 0.146 and γ ^ = 0.012 with the log-likelihood −540.266. We got BcLS-MLEs to be ξ ^ = 4.9354 , σ ^ = 0.1789 and γ ^ = 0.0196 with the log-likelihood −540.297. The improvement of the BcLS-MLEs over the MLEs is insignificant for this data set and the values of log-likelihood based on the MLE and the proposed BcLS-MLE method are closed. In this example, the MLE method and the proposed BcLS-MLE method can perform equally well.

5.2. Example 2

A dataset of cement’s concrete strength (CS) is used to illustrate the proposed BcLS-MLE and interval inference methods. The quality of concrete is mostly evaluated on the basis of its strength. CS is used to measure the ability of concrete to withstand loading. A cement dataset which has 1030 observations, each observation has 9 features, can be obtained from the link of https://www.kaggle.com/prathamtripathi/regression-with-neural-networking (accessed on 17 February 2022). In this dataset, the target feature is the CS per kiloNewton. The CS with the feature age at 28 are selected and used for illustrating the proposed methods. The feature of age in the dataset stands for the time before it needs repairing. Subsequently, total 425 CS observations at age 28 are taken from the original data set and reported in Appendix A. The histogram of the logarithm transformation of CS and its density curve are displayed in Figure 2.
In view of Figure 2, we find that the data set of log(CS) is left-skewed. Moreover, we find that the sample skewness and kurtosis coefficients are −0.493 and 0.275, respectively. We use L P N ( ξ , σ , γ ) with the initial solution ( ξ 0 , σ 0 , γ 0 ) = ( 1 , 1 , 1 ) and γ 0 = 1 to implement the MLE method and the proposed BcLS-MLE method, respectively. We obtain the MLEs ξ ^ = 6.442 , σ ^ = 0.223 and γ ^ = 0.422 , and the BcLS-MLEs ξ ˜ = 4.191 , σ ˜ = 0.143 and γ ˜ = 0.056 . The log-likelihood based on the obtained MLEs and BcLS-MLEs are −2109.504 and −1726.389, respectively. As per our expectation, we favor the BcLS-MLE method on the basis of the maximal log-likelihood.
The quantile-to-quantile plot of using L P N ( ξ ˜ , σ ˜ , γ ˜ ) as the probability model of the CS data set is displayed in Figure 3. The p-value of Kolmogorov–Smirnov test for the CS dataset based on the L P N ( ξ ˜ , σ ˜ , γ ˜ ) is 0.8404. We can find a good straight fit in Figure 3. Moreover, the p-value of the Kolmogorov–Smirnov test also supports that the LPN distribution can be a good candidate for fitting the CS dataset. The BS distribution, Weibull distribution and LN distribution are used to compare with the LPN distribution for fitting the cement’s CS dataset. Table 12 shows that the LPN distribution is the best model for fitting the cement’s CS dataset based on the values of maximal log-likelihood, minimal Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) among all competitors under considered.
We use the observed Fisher information and delta method to find the confidence interval of the survival function at y 0 = q p for p = 0.1 , 0.3 and 0.5. After algebra computation, the 95% confidence interval of the survival function is ( 0.878 , 0.922 ) , ( 0.666 , 0.734 ) and ( 0.467 , 0.533 ) for the survival function at p = 0.1 , 0.3 and 0.5, respectively. The resulting confidence interval covers the true survival function of 0.9, 0.7 and 0.5, respectively. Moreover, the lengths of these three confidence intervals are 0.044, 0.068 and 0.066, respectively. All the lengths are short.

6. Conclusions

In this article, we propose a parametric bootstrap bias correction MLE method, called the BcLS-MLE method, to improve the performance of the typical MLE method. Moreover, a LS method based on the quantile function is used to obtain proper initial solutions for the LPN distribution parameters. Monte Carlo simulation method is used to evaluate the performance of the proposed bootstrap bias correction MLE method. From simulation results, we find that the proposed bootstrap bias correction MLE method is easy to use to obtain reliable MLEs for the LPN distribution parameters.
Compared with the typical MLE method, the merit of the proposed BcLS-MLE method is that practitioners only need to set up the initial value of the shape parameter and then use the proposed LS method to obtain reliable initial solutions for the LPN distribution parameters to implement the proposed BcLS-MLE method. Based on the proposed BcLS-MLE method, the delta method based on the observed Fisher information matrix can be used to obtain an approximate ( 1 α ) × 100 % confidence interval of the survival function at pth quantile. The delta method can be used for any differentiable function other than the survival function. A real data set about cement’s CS is used to illustrate the proposed methods.
We find that the CP of the survival function at the quantile of median could be less satisfied because of underestimation. Other parameter estimation methods could be helpful to fill this gap. This topic can be studied in the future.

Author Contributions

Conceptualization, T.-R.T. and Y.L.; methodology, T.-R.T. and Y.L.; software, Y.-Y.F. and C.-P.C.; validation, T.-R.T., Y.L. and Y.-Y.F.; formal analysis, T.-R.T., Y.L. and C.-P.C.; investigation, T.-R.T. and Y.L.; resources, T.-R.T.; data curation, T.-R.T. and Y.-Y.F.; writing—original draft preparation, T.-R.T.; writing—review and editing, T.-R.T. and Y.L.; visualization, T.-R.T.; supervision, T.-R.T. and Y.L.; project administration, T.-R.T.; funding acquisition, T.-R.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Ministry of Science and Technology, Taiwan, grant number MOST 110-2221-E-032-034-MY2.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

One-hundred-and-sixteen measurements of air pollution in the city of New York, US, during May–September 1973, were from the New York State Department of Conservation as well as reported in Appendix A and Appendix B. The complete cement dataset with 1030 observations and 9 features can be obtained at https://www.kaggle.com/prathamtripathi/regression-with-neural-networking (accessed on 17 February 2022), in the Public Domain. The dataset with 425 CS observations at Age 28 is reported in Appendix A.

Acknowledgments

This study is supported by Ministry of Science and Technology, Taiwan MOST 110-2221-E-032-034-MY2. The authors would also like to thank the editors and reviewers for their valuable remarks and suggestions that improve this paper significantly.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. The Air Pollution Data with 116 Measures

4136121828231987161114
181434630111114322345
11537297139232137201213135
49326440779797851027748
3561796316801082052825064
5939916783566122891104428
652259233144219451687376
1188485967873914732202321
24442128913461813241613

Appendix B. The Data Set with 425 CS Observations at Age 28

79.9961.8936.4545.8539.2928.0247.8128.2437.4330.0833.0240.86
71.9961.0959.860.2961.856.768.366.960.2950.756.460.29
55.568.571.374.752.271.367.771.36674.571.349.9
24.8922.8425.7226.424.924.4828.4721.5424.2445.7140.2324.54
30.2329.2231.6437.438.533.7329.6532.6627.7731.2731.1234.74
48.2839.9433.9430.8550.652.551.3347.440.1545.9463.1455.51
52.230.2231.9767.5757.2381.7564.0278.841.3760.2856.8351.02
55.5544.1339.3855.6547.2844.3352.349.2541.3729.1639.439.3
67.8758.5253.585976.2469.8426.246.3939.1641.233.6938.2
41.4137.8124.8527.2244.6437.2733.2757.0344.4251.0253.3950.24
46.6846.6844.0355.4555.5557.9262.0572.121.9743.7362.9459.49
23.8539.643.9418.7531.9725.5727.9432.6339.4425.7533.0821.07
32.0522.4925.2239.738.717.5818.222.6325.7337.8133.0426.91
31.919.7737.4417.631.3832.0418.4225.9727.5312.2517.3430.57
24.526.8538.2143.720.8734.2929.8924.2933.9520.5933.6627.87
39.3243.3838.83317.2824.2824.0517.9632.8822.3541.6830.44
36.9627.8331.7433.6117.5425.1843.8936.9422.4444.0967.3127.34
31.3537.4222.4431.8425.1817.5421.6531.4532.7219.6931.6539.58
37.4219.6927.9255.9425.1821.7539.0974.9937.1733.7616.519.99
36.3533.6915.4233.4239.0527.6826.8645.330.1215.5744.6153.52
57.2165.9152.8233.418.0337.3632.8442.6440.0641.9461.2340.87
33.352.4215.0938.4637.2635.2342.1331.8741.5439.4537.9144.28
31.1823.6932.7632.428.6336.818.2833.0631.4231.0344.3912.18
25.5636.4432.9623.8426.2317.9540.6819.0133.728.5413.4632.24
23.5229.7249.7752.4440.9344.8613.237.4329.8756.6112.4623.79
13.2939.4246.2344.5223.7426.1415.5243.5735.8641.0528.9946.24
26.9210.5425.129.079.7433.839.8426.9727.2330.6533.0524.58
21.9130.8815.3424.3423.8922.9329.4128.6336.818.2932.7231.42
28.9440.9312.1825.5636.4432.9623.8426.2317.9638.6319.0133.72
8.5413.4632.2523.5229.7349.7752.4540.9344.8713.237.4329.87
56.6212.4623.7913.2939.4246.2344.5223.7426.1515.5343.5835.87
41.0528.9946.2526.9210.5425.129.079.7433.837.1733.7616.5
19.9936.3538.2215.4233.4239.0627.6826.8645.330.1215.5744.61
53.5257.2265.9152.8333.418.0337.3635.3142.6440.0643.861.24
40.8733.3152.4315.0938.4637.2735.2342.1431.8841.5439.4637.92
44.2831.1823.7032.7732.40

References

  1. Ratnaparkhi, M.V.; Park, W.J. Lognormal distribution-model for fatigue life and residual strength of composite materials. IEEE Trans. Reliab. 1986, 35, 312–315. [Google Scholar] [CrossRef]
  2. Murty, A.S.R.; Gupta, U.C.; Radha, A. A new approach to fatigue strength distribution for fatigue reliability evaluation. Int. J. Fatigue 1995, 17, 85–89. [Google Scholar] [CrossRef]
  3. Goto, M.; Inoue, T. Some properties of power-normal distribution. Jpn. J. Biom. 1980, 1, 28–54. [Google Scholar] [CrossRef]
  4. Goto, M.; Matsubara, Y.; Tsuchiya, Y. Power-normal distribution and its applications. Rep. Stat. Appl. Res. 1983, 30, 8–28. [Google Scholar]
  5. Goto, M.; Inoue, T.; Tsuchiya, Y. On estimation of parameters in power-normal distribution. Bull. Inform. Cybern. 1984, 21, 41–53. [Google Scholar] [CrossRef]
  6. Freeman, J.; Modarres, R. Inverse Box-Cox: The power-normal distribution. Stat. Probab. Lett. 2006, 76, 764–772. [Google Scholar] [CrossRef]
  7. Castillo, N.O.; Gallardo, D.I.; Bolfarine, H.; Gómez, H. Truncated power-normal distribution with application to non-negative measurements. Entropy 2018, 20, 433. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Maruo, K.; Goto, M. Percentile estimation based on the power-normal distribution. Comput. Stat. 2013, 28, 241–356. [Google Scholar] [CrossRef]
  9. Gupta, R.D.; Gupta, R.C. Analyzing skewed data by power normal model. Test 2008, 17, 197–210. [Google Scholar] [CrossRef]
  10. Maruo, K.; Shirahata, S.; Goto, M. Underlying assumptions of the power-normal distribution. Behaviormetrika 2011, 38, 85–95. [Google Scholar] [CrossRef]
  11. Martiner-Flórez, G.; Bolfarine, H.; Gómez, H.W. The log-power-normal distribution application to air pollution. Environmetric 2014, 25, 44–56. [Google Scholar] [CrossRef]
  12. Zhu, J.; Xin, H.; Zheng, C.; Tsai, T.-R. Inference for the process performance index of products on the basis of power-normal distribution. Mathematics 2022, 10, 35. [Google Scholar] [CrossRef]
  13. Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
  14. Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions, 2nd ed.; Wiley: New York, NY, USA, 1995. [Google Scholar]
  15. Papoulis, A.; Pillai, S.U. Probability, Random Variables and Stochastic Processes; McGraw-Hill: New York, NY, USA, 2002. [Google Scholar]
  16. Firth, D. Bias reduction of maximum likelihood estimates. Biometrika 1993, 80, 27–38. [Google Scholar] [CrossRef]
  17. Efron, B.; Tibshirani, R.J. An Introduction to the Bootstrap; Chapman & Hall: New York, NY, USA, 1993. [Google Scholar]
  18. Nadarajah, S. A truncated inverted beta distribution with application to air pollution data. Stoch. Environ. Res. Risk Assess. 2008, 22, 285–289. [Google Scholar] [CrossRef]
  19. Leiva, V.; Vilca-Labra, F.; Balakrishnan, N.; Sanhueza, A. A skewed sinh-normal distribution and its properties and application to air pollution. Commun. Stat. Methods 2010, 39, 426–443. [Google Scholar] [CrossRef]
Figure 1. The plot of PDF of the standard LPN distribution for γ = 1 , 2, 5, 10 (mode increasing).
Figure 1. The plot of PDF of the standard LPN distribution for γ = 1 , 2, 5, 10 (mode increasing).
Mathematics 10 00955 g001
Figure 2. The histogram of the logarithm transformation of CS and its density curve.
Figure 2. The histogram of the logarithm transformation of CS and its density curve.
Mathematics 10 00955 g002
Figure 3. The quantile-to-quantile plot of using L P N ( ξ ˜ , σ ˜ , γ ˜ ) as the probability model of the CS data set.
Figure 3. The quantile-to-quantile plot of using L P N ( ξ ˜ , σ ˜ , γ ˜ ) as the probability model of the CS data set.
Mathematics 10 00955 g003
Table 1. The rbias and rsqMSE of the MLEs of ξ , σ and σ based on 1000 repetitions.
Table 1. The rbias and rsqMSE of the MLEs of ξ , σ and σ based on 1000 repetitions.
rbiasrsqMSE
γ n ξ σ γ ξ σ γ
1.530−0.08350.09215.49270.23910.566610.9186
50−0.03420.00473.09180.19470.47297.7778
100−0.03820.0442.18810.16520.39365.7547
200−0.02260.03791.43810.14460.32594.5906
330−0.05030.01683.38060.1960.46356.2849
50−0.0182−0.03152.1540.17470.41984.7815
100−0.03590.04051.98950.1580.35284.5396
200−0.03980.0661.51860.130.28513.6875
530−0.03850.00952.26040.17190.4123.9232
50−0.0037−0.05041.48510.1570.37233.4087
100−0.11690.2064.31070.21120.4856.6299
200−0.03670.05571.26710.1170.24162.6537
730−0.0218−0.01731.56530.15010.3562.8169
500.0001−0.05071.16760.14450.34012.6136
100−0.2050.50694.28350.76882.5956.1334
200−0.03680.05681.15450.10620.2132.2886
930−0.0052−0.04521.06620.13880.3442.0683
50−0.005−0.03611.06840.13520.32232.1908
100−0.09670.1942.47190.3841.19173.8185
200−0.02510.03370.84840.09610.19031.754
Table 2. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 0.6 , γ = 1.5 , and 1 α = 0.95 .
Table 2. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 0.6 , γ = 1.5 , and 1 α = 0.95 .
rbiasrsqMSECP
n Method ξ σ γ ξ σ γ q 0 . 1 q 0 . 3 q 0 . 5
30MLE−0.0850.0885.6480.2390.55811.0020.8930.9170.917
30BcLS-MLE−0.0520.0015.1430.2320.5511.5950.9060.9170.903
50MLE−0.0460.0263.4690.2010.488.1960.9080.9270.920
50BcLS-MLE−0.0420.0153.4900.2010.4798.6840.9170.9300.916
100MLE−0.0400.0432.3160.1680.3986.2190.9260.9270.923
100BcLS-MLE−0.0280.0171.9270.1570.3775.9750.9330.9290.921
150MLE−0.0320.0431.6290.1450.3424.9120.9310.9250.919
150BcLS-MLE−0.0200.0141.1550.3120.3124.1870.9410.9280.924
200MLE−0.0220.0361.4400.1460.3274.6040.9190.9200.900
200BcLS-MLE−0.0140.0100.7300.1070.2652.8060.9460.9320.919
Table 3. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 0.6 , γ = 3 , and 1 α = 0.95 .
Table 3. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 0.6 , γ = 3 , and 1 α = 0.95 .
rbiasrsqMSECP
n Method ξ σ γ ξ σ γ q 0 . 1 q 0 . 3 q 0 . 5
30MLE−0.0550.0303.4930.1980.4646.3370.8740.9120.886
30BcLS-MLE−0.022−0.0533.1240.2020.4866.5760.9110.9240.924
50MLE−0.024−0.0182.2800.1740.4145.0120.9170.9310.926
50BcLS-MLE−0.021−0.0262.4540.1790.4225.6920.9270.9340.936
100MLE−0.0390.0432.1760.1620.3594.8860.9350.9330.939
100BcLS-MLE−0.0220.0031.8640.1510.3425.0670.9360.9380.935
150MLE−0.0320.0471.6130.1580.3323.9630.9390.9340.940
150BcLS-MLE−0.0180.0091.2720.1270.2853.9100.9440.9380.940
200MLE−0.0430.0711.6190.1340.2893.6770.9410.9400.954
200BcLS-MLE−0.0160.0110.9290.1120.2512.9750.9460.9360.942
Table 4. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 0.6 , γ = 5 , and 1 α = 0.95 .
Table 4. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 0.6 , γ = 5 , and 1 α = 0.95 .
rbiasrsqMSECP
n Method ξ σ γ ξ σ γ q 0 . 1 q 0 . 3 q 0 . 5
30MLE−0.032−0.0062.1640.1690.4073.9450.8220.8980.769
30BcLS-MLE0.007−0.1031.8070.1800.4474.1050.9160.9200.885
50MLE−0.006−0.0461.5130.1550.3683.4000.8740.9110.833
50BcLS-MLE−0.006−0.0521.8080.3830.384.1830.9180.9230.900
100MLE−0.1190.2094.5330.2210.5266.9520.8750.9160.856
100BcLS-MLE−0.014−0.0101.5800.1410.3074.0270.9380.9270.926
150MLE−0.0370.0581.3250.1260.2512.6870.9200.9350.925
150BcLS-MLE−0.0140.0011.1900.1230.2623.2710.9480.9420.941
200MLE−0.0370.0561.3080.1170.2422.7530.9380.9310.932
200BcLS-MLE−0.0140.0070.9530.1100.2322.7900.9540.9410.950
Table 5. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 0.6 , γ = 7 , and 1 α = 0.95 .
Table 5. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 0.6 , γ = 7 , and 1 α = 0.95 .
rbiasrsqMSECP
n Method ξ σ γ ξ σ γ q 0 . 1 q 0 . 3 q 0 . 5
30MLE−0.024−0.0141.5880.1500.3612.8250.7740.8780.681
30BcLS-MLE0.014−0.1091.3610.1700.4213.0930.8940.9120.853
50MLE0.003−0.0581.1100.1450.3422.5820.8270.9020.753
50BcLS-MLE0.001−0.0581.4760.1560.3603.3930.8900.9180.841
100MLE−0.2190.5604.1110.8902.9626.0350.7490.8740.695
100BcLS-MLE−0.010−0.0151.3250.1350.293.2630.8960.9160.842
150MLE−0.0330.5480.8990.23710.0231.4640.8150.9060.786
150BcLS-MLE−0.010−0.0051.0240.1160.2442.7550.8840.9130.847
200MLE−0.0280.0411.0050.1050.2122.1270.8170.8920.774
200BcLS-MLE−0.0120.0040.8970.1060.2192.5360.8890.9150.861
Table 6. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 0.6 , γ = 9 , and 1 α = 0.95 .
Table 6. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 0.6 , γ = 9 , and 1 α = 0.95 .
rbiasrsqMSECP
n Method ξ σ γ ξ σ γ q 0 . 1 q 0 . 3 q 0 . 5
30MLE−0.024−0.0141.5880.1500.3292.8250.7740.8780.681
30BcLS-MLE0.014−0.1091.3610.1700.4013.0930.8940.9120.853
50MLE0.003−0.0581.1100.1450.3192.5820.8270.9020.753
50BcLS-MLE0.001−0.0581.4760.1560.3483.3930.8900.9180.841
100MLE−0.2190.5604.1110.8901.8376.0350.7490.8740.695
100BcLS-MLE−0.010−0.0151.3250.1270.2713.2630.8960.9160.842
150MLE−0.0330.5480.8990.2375.8431.4640.8150.9060.786
150BcLS-MLE−0.010−0.0051.0240.1110.2292.7550.8840.9130.847
200MLE−0.0280.0411.0050.1050.212.1270.8170.8920.774
200BcLS-MLE−0.0120.0040.8970.2040.222.5360.8890.9150.861
Table 7. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 8 , σ = 0.6 , γ = 3 , and 1 α = 0.95 .
Table 7. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 8 , σ = 0.6 , γ = 3 , and 1 α = 0.95 .
rbiasrsqMSECP
30MLE−0.2400.75919.2730.3171.45320.0390.8300.8970.861
30BcLS-MLE−0.010−0.0682.9000.1240.4846.3630.9150.9260.912
50MLE−0.5692.50728.4061.2657.08228.5610.7580.8290.863
50BcLS-MLE−0.011−0.0342.3150.1100.4175.4790.9230.9290.928
100MLE−22.560244.39.79622.722275.915.1940.0010.2460.347
100BcLS-MLE−0.012−0.0031.7850.0930.3384.9120.9360.9360.933
150MLE−0.1440.4489.1090.1680.53011.6040.9070.9110.936
150BcLS-MLE−0.0100.0061.1810.0780.2813.6920.9380.9320.940
200MLE−0.1720.53912.3890.2020.67715.2300.9030.8930.913
200BcLS-MLE−0.0090.0080.8610.0680.2472.8440.9440.9330.942
Table 8. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 10 , σ = 0.6 , γ = 3 , and 1 α = 0.95 .
Table 8. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 10 , σ = 0.6 , γ = 3 , and 1 α = 0.95 .
rbiasrsqMSECP
30MLE−0.6934.37929.921.62113.5830.060.7240.8220.841
30BcLS-MLE−0.009−0.0572.9090.0990.4806.3030.9160.9240.915
50MLE0.3201108.6−0.5110.3671108.90.511<0.0011.0000.000
50BcLS-MLE−0.009−0.0302.2530.0880.4155.3630.9350.9320.925
100MLE−11.601559.510.0424.471887.118.03<0.0010.4120.293
100BcLS-MLE−0.009−0.0021.6160.0720.3304.5920.9400.9330.940
150MLE−0.3912.07725.120.7755.09426.290.7890.8000.813
150BcLS-MLE−0.0080.0061.1210.0610.2783.5910.9420.9360.939
200MLE−3.22521.4931.914.19828.3831.970.1450.2310.400
200BcLS-MLE−0.0060.0070.8270.0540.2452.7620.9470.9410.938
Table 9. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 1 , γ = 3 , and 1 α = 0.95 .
Table 9. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 1 , γ = 3 , and 1 α = 0.95 .
rbiasrsqMSECP
30MLE−0.1360.0963.9260.3360.4646.3840.8660.9110.904
30BcLS-MLE−0.024−0.0672.6190.320.4765.7480.8950.9170.894
50MLE−0.1390.1183.7680.3260.4486.3450.8870.9350.934
50BcLS-MLE−0.043−0.0162.720.3040.4256.2440.9160.9230.904
100MLE−0.2360.2915.7180.6781.2210.0460.8940.9030.88
100BcLS-MLE−0.0390.011.690.2440.3324.4070.9220.9310.91
150MLE−0.20.3382.5510.263.1323.3310.9050.9260.918
150BcLS-MLE−0.0330.0161.1840.2070.2823.4180.9310.930.911
200MLE−0.0810.0851.5320.2130.2773.2390.9220.9240.914
200BcLS-MLE−0.030.0180.9240.1830.2472.8820.9350.9290.916
Table 10. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 3 , γ = 3 , and 1 α = 0.95 .
Table 10. The rbias and rsqMSEs of the BcLS-MLE and MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 3 , γ = 3 , and 1 α = 0.95 .
rbiasrsqMSECP
30MLE−1.4730.54913.511.5850.63115.130.8320.9110.867
30BcLS-MLE−0.076−0.0672.8540.9780.4766.2560.9010.9120.869
50MLE−2.0590.79623.4342.1980.96324.010.8410.8980.842
50BcLS-MLE−0.113−0.0232.560.8950.426.010.9180.9250.881
100MLE−8.2233.88.47214.9639.6316.660.0350.7050.253
100BcLS-MLE−0.1010.0031.7040.7320.3324.7420.9230.9280.89
150MLE−0.5350.1924.5851.0040.4137.8470.9010.9260.89
150BcLS-MLE−0.0840.0071.1910.6260.2823.6090.9260.9310.892
200MLE−0.9320.3568.2861.2780.50511.920.8970.910.854
200BcLS-MLE−0.0630.0060.8050.5370.2442.5960.9340.9290.889
Table 11. The rbias and rsqMSE of BcLS-MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 5 , and 1 α = 0.95 based on the LS initial solutions by searching γ in [2,10].
Table 11. The rbias and rsqMSE of BcLS-MLE, and the CP of S ( y 0 | Θ ) for L P N ( Θ ) with ξ = 5 , σ = 5 , and 1 α = 0.95 based on the LS initial solutions by searching γ in [2,10].
rbiasrsqMSECP
γ n ξ σ γ ξ σ γ q 0 . 1 q 0 . 3 q 0 . 5
1.550−0.281−0.0022.7221.5800.4646.5820.8650.9180.819
100−0.2040.0091.7691.2780.3705.6830.8820.9230.828
200−0.1220.0090.7330.8970.2662.7150.9040.9280.838
550−0.056−0.0491.8011.3570.3814.1530.9130.9250.886
100−0.107−0.0111.3851.1410.3033.5050.9260.9290.893
200−0.08−0.0010.7920.8770.2282.4010.9360.9260.893
7500.042−0.0691.3561.2830.3603.1990.920.930.881
100−0.040−0.0271.1361.0810.2822.8830.920.9330.884
200−0.056−0.0070.7480.8570.2162.1750.9310.9320.892
9500.114−0.08081.0341.22180.3422.5720.9220.9280.885
100−0.009−0.0320.9271.020.2652.4040.9310.9280.892
200−0.03−0.0120.6750.8320.2061.9650.9320.9320.886
12500.238−0.1040.6371.1770.3311.9040.9240.9290.884
1000.010−0.0510.6150.970.2521.8430.9270.9250.886
2000.022−0.0210.5030.7770.191.5980.9260.9270.889
Table 12. Maximum likelihood estimates using cement concrete strength data set.
Table 12. Maximum likelihood estimates using cement concrete strength data set.
Distribution ξ σ γ α β Log-LikelihoodAICBIC
BS0.43733.544−1736.553477.093485.20
Weibull 2.67041.390−1734.263472.513480.62
LN3.520.425−1735.363474.723482.82
LPN4.190.1430.056−1726.393458.783470.93
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Tsai, T.-R.; Lio, Y.; Fan, Y.-Y.; Cheng, C.-P. Bias Correction Method for Log-Power-Normal Distribution. Mathematics 2022, 10, 955. https://doi.org/10.3390/math10060955

AMA Style

Tsai T-R, Lio Y, Fan Y-Y, Cheng C-P. Bias Correction Method for Log-Power-Normal Distribution. Mathematics. 2022; 10(6):955. https://doi.org/10.3390/math10060955

Chicago/Turabian Style

Tsai, Tzong-Ru, Yuhlong Lio, Ya-Yen Fan, and Che-Pin Cheng. 2022. "Bias Correction Method for Log-Power-Normal Distribution" Mathematics 10, no. 6: 955. https://doi.org/10.3390/math10060955

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop