Next Article in Journal
Fixed-Point Theorems in Fuzzy Normed Linear Spaces for Contractive Mappings with Applications to Dynamic-Programming
Previous Article in Journal
New Hermite–Hadamard Type Inequalities in Connection with Interval-Valued Generalized Harmonically (h1,h2)-Godunova–Levin Functions
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Generalization of Two-Sided Length Biased Inverse Gaussian Distributions and Applications

by
Teerawat Simmachan
1,2 and
Wikanda Phaphan
3,*
1
Department of Mathematics and Statistics, Faculty of Science and Technology, Thammasat University, Pathum Thani 10120, Thailand
2
Thammasat University Research Unit in Data Learning, Thammasat University, Pathum Thani 12120, Thailand
3
Department of Applied Statistics, Faculty of Applied Science, King Mongkut’s University of Technology North Bangkok, Bangkok 10800, Thailand
*
Author to whom correspondence should be addressed.
Symmetry 2022, 14(10), 1965; https://doi.org/10.3390/sym14101965
Submission received: 26 July 2022 / Revised: 11 September 2022 / Accepted: 16 September 2022 / Published: 20 September 2022
(This article belongs to the Special Issue Symmetry in Computational Statistics)

Abstract

:
The notion of length-biased distribution can be used to develop adequate models. Length-biased distribution was known as a special case of weighted distribution. In this work, a new class of length-biased distribution, namely the two-sided length-biased inverse Gaussian distribution (TS-LBIG), was introduced. The physical phenomenon of this scenario was described in a case of cracks developing from two sides. Since the probability density function of the original TS-LBIG distribution cannot be written in a closed-form expression, its generalization form was further introduced. Important properties such as the moment-generating function and survival function cannot be provided. We offered a different approach to solving this problem. Some distributional properties were investigated. The parameters were estimated by the method of the moment. Monte Carlo simulation studies were carried out to appraise the performance of the suggested estimators using bias, variance, and mean square error. An application of a real dataset was presented for illustration. The results showed that the suggested estimators performed better than the original study. The proposed distribution provided a more appropriate model than other candidate distributions for fitting based on Akaike information criterion.

1. Introduction

Recorded observations may not have original distributions when practitioners collect natural observations according to certain stochastic models. Each observation is taken with unequal probabilities of recording. Weighted distributions can be adopted in this situation for selecting appropriate models [1]. One of the most widely known for special cases of weighted distributions is length-biased distributions. Precisely, let X denote a non-negative random variable with a probability density function shortly called PDF or f X ( x ) . The weighted version of X denoted by X w has a PDF defined as
f X w ( x ) = w ( x ) f X ( x ) E [ w ( x ) ] , x > 0 ,
where w ( x ) is the weighted function and 0 < E [ w ( x ) ] < . Different weighted models are formulated depending on choices of the weight function w ( x ) . In cases of w ( x ) = x , the resulting distribution is called length-biased whose PDF is defined by
f ( x ) = x f X ( x ) E [ X ] , x > 0 ,
Several versions of length-biased distributions are employed in various applications. For example, length-biased Birnbaum–Saunders distribution with an application in water quality was proposed by Leiva et al. [2]. Length-biased weighted Weibull distribution introduced by Das and Roy [3] was utilized in rainfall data. A generalization of length-biased Nakagami distribution offered by Abdullahi and Phaphan [4] was applied in heart attack data. Further, length-biased distributions can be used in percolation theory. Since percolation models are formulated from different weights and the distribution of a weight is a non-negative random variable, length-biased distributions can be employed as an alternative distribution. Some examples in this area were given in [5,6].
The length-biased inverse Gaussian (LBIG), one of special cases of the length-biased weighted distributions, is frequently used as a lifetime distribution. The LBIG distribution has been studied by many authors. In the early state, Khattree [7] presented a description of the inverse Gaussian (IG) and gamma distributions via their length-biased versions. Akman and Gupta [8] proposed a comparison of several estimators of the mean for IG and LBIG distributions. Akman and Gupta [9] offered statistical properties of the mixture of the IG and LBIG distributions. Recently, Naik [10] introduced a convoluted form of length-biased inverse Gaussian and gamma distributions. Budsaba and Phaphan [11] provided maximum likelihood estimation for re-parameterized LBIG distribution. The LBIG distribution has been utilized as a component of mixed distributions. For instance, it was used for constructing a mixture inverse Gaussian distribution [12], new parametrization of mixture inverse Gaussian distribution [13], weighted inverse Gaussian distribution [14], Birnbaum–Saunders distribution [15], re-parametrization of Birnbaum–Saunders distribution [16], three-parameter crack distribution [17], and two-parameter crack distribution [18].
In a reliability framework, a two-sided model can be described in a situation in which fatigue cracks evolve from two sides of the studied object. Lisawadi [19] early introduced two distributions using the parametrization suggested by Ahmed et al. [16], namely the two-sided Birnbaum–Saunders (TS-BS) and two-sided inverse Gaussian (TS-IG) distributions. Subsequently, Simmachan et al. [20] presented an alternative distribution applying the approach of Lisawadi [19] called two-sided length-biased inverse Gaussian (TS-LBIG) distribution. However, all of the two-sided versions have no closed-form PDFs. Important distributional properties such as a moment-generating function (MGF) and a survival function cannot be presented.
This study aims to re-introduce the TS-LBIG distribution originally proposed by Simmachan et al. [20] in closed-form expression. The reciprocal property is employed for derivation of the MGF. The resulting MGF is compared to a known MGF. By uniqueness property, the PDF of the TS-LBIG distribution can be obtained.
The rest of the article is organized as follows: a review of IG and LBIG distributions is presented in Section 2 and Section 3, respectively. The TS-LBIG random variable is described in Section 4. Reciprocal properties are provided in Section 5. In this section, four propositions are given. The MGF of TS-LBIG distribution is derived in Section 6. The PDF of TS-LBIG in closed-form expression is introduced in Section 7. Other distributional properties are established in Section 8, Section 9, Section 10 and Section 11. Parameter estimation by the method of moment is provided in Section 12. Numerical results consisting of a simulation study and real data application are shown in Section 13. Finally, conclusions and discussion are reported in Section 14.

2. Inverse Gaussian Distribution

Chikara and Folk [21] studied the variables of the two-parameter inverse Gaussian distribution which is the continuous probability distribution ( 0 , ) . Suppose X is a random variable with an inverse Gaussian distribution. Consequently, a PDF can be written in this formula:
f I G ( x ; μ , β ) = β 2 π x 3 2 e x p β x μ 2 2 μ 2 x ; x > 0 0 ; o t h e r w i s e ,
where μ > 0 is a location parameter or a mean, and β > 0 is a shape or scale parameter. The two parameters are called classical parameters. However, this research pays attention for studying the re-parameterized version of IG distribution. The parametrization was originally presented by Ahmed et al. [16] in the form of the Birnbaum–Saunders distribution (BS). The BS distribution was combined from IG and LBIG distributions. Precisely,
f B S ( x ; λ , θ ) = 1 2 f I G ( x ; λ , θ ) + 1 2 f L B I G ( x ; λ , θ ) ,
where f B S , f I G and f L B I G are the PDFs of the Birnbaum–Saunders, inverse Gaussian and length-biased inverse Gaussian distributions, respectively. The new form of the distribution parameters ( λ and θ ) is called non-classical parameters, where λ > 0 and θ > 0 represent the thickness of the machine element and nominal treatment pressure on the machine element, respectively. The interrelations between ( μ , β ) and ( λ , θ ) are as follows.
λ = β μ , θ = μ 2 β , μ = λ θ , and β = λ 2 θ .
From Equations (3) and (5), the PDF of non-classical IG distribution, denoted as f I G ( λ , θ ) , can be written in this form:
f I G ( x ; λ , θ ) = λ θ 2 π θ x 3 2 e x p 1 2 x θ λ θ x 2 ; x > 0 0 ; o t h e r w i s e .

3. Length-Biased Inverse Gaussian Distribution

By the definition of a length-biased distribution defined in Equation (2), the length-biased inverse Gaussian distribution can be explained as follows. Let X be an inverse Gaussian random variable with parameters λ and θ or X I G ( λ , θ ) . The PDF of X is denoted as f L B I G ( λ , θ ) . As the relations between the classical parameters and non-classical parameters, the expected value or the first moment of X is E [ X ] = μ = λ θ . Therefore, the length-biased version of X can be expressed as
f L B I G ( x ; λ , θ ) = 1 θ 2 π θ x 1 2 e x p 1 2 x θ λ θ x 2 ; x > 0 0 ; o t h e r w i s e .

4. TS-LBIG Random Variable

In this section, the TS-LBIG random variable ( τ ) introduced by Simmachan et al. [20] is described. Let X be a non-negative continuous random variable and let F ( x ) = F L B I G ( x , λ , θ ) denote the distribution function of the breakdown time moment τ for one-sided loading. The parameters λ and θ were previously defined. Let Y = k / τ be the random variable denoted as a crack speed. Under the object consideration, a crack expands from two sides with the same distribution function of the time to approach the length k. The random variables from both sides, τ 1 and τ 2 , are supposed to be independent and identically distributed. The crack speed for the two-sided situation is definded as
Y 1 + Y 2 = k τ 1 + k τ 2 = k 1 τ 1 + 1 τ 2 = X .
The breakdown moment of the interested object is defined as the following random variable
τ = k Y 1 + Y 2 = k k 1 τ 1 + 1 τ 2 = 1 τ 1 + 1 τ 2 1 = 1 X .

5. Reciprocal Properties

Proposition 1.
If the random variable τ > 0 has the probability density function f τ ( x ) , then the reciprocal random variable 1 / τ has the probability density function f 1 / τ ( x ) = x 2 f τ ( 1 / x ) .
Proof of Proposition 1.
For the reciprocal random variable 1 / τ , the distribution function is given by
F 1 / τ ( x ) = P 1 τ x = P τ 1 x = 1 P τ 1 x = 1 F τ 1 x ,
and applying the chain rule, the density function is
f 1 / τ ( x ) = F 1 / τ ( x ) = F τ 1 x = f τ 1 x · 1 x 2 = x 2 f τ 1 x .
Proposition 2.
If random variable τ > 0 has L B I G ( λ , θ ) distribution, then the reciprocal random variable 1 / τ is I G [ λ , 1 / ( λ 2 θ ) ] distributed.
Proof of Proposition  2.
By Proposition 1,
f 1 / τ ( x ) = x 2 f L B I G 1 x ; λ , θ = x 2 θ 1 / 2 x 1 / 2 2 π e x p 1 2 λ θ 1 / 2 x 1 / 2 θ 1 / 2 x 1 / 2 2 = x 3 / 2 2 π θ e x p 1 2 λ θ x 1 θ x 2 = x 3 / 2 2 π θ e x p 1 2 λ 1 / ( λ 2 θ ) x x 1 / ( λ 2 θ ) 2 = λ ( 1 / λ 2 θ ) 1 / 2 2 π x 3 / 2 e x p 1 2 λ 1 / ( λ 2 θ ) x x 1 / ( λ 2 θ ) 2 = f I G [ x ; λ , 1 / ( λ 2 θ ) ] .
Proposition 3.
If the random variable τ > 0 has I G ( λ , θ ) distribution, then the reciprocal random variable 1 / τ is L B I G [ λ , 1 / ( λ 2 θ ) ] distributed.
Proof of Proposition 3.
By Proposition 1,
f 1 / τ ( x ) = x 2 f I G 1 x ; λ , θ = λ θ 2 x 1 / 2 2 π e x p 1 2 λ θ 1 / 2 x 1 / 2 θ 1 / 2 x 1 / 2 2 = 1 2 π ( 1 / λ 2 θ ) 1 / 2 x 1 / 2 e x p 1 2 λ 1 / ( λ 2 θ ) x x 1 / ( λ 2 θ ) 2 = f L B I G [ x ; λ , 1 / ( λ 2 θ ) ] .
Proposition 4.
If the random variable τ > 0 has I G [ 2 λ , 1 / ( λ 2 θ ) ] distribution, then the reciprocal random variable 1 / τ is L B I G [ 2 λ , θ / 4 ) ] distributed.
Proof of Proposition 4.
Applying Proposition 1,
f 1 / τ ( x ) = x 2 f I G 1 x ; 2 λ , 1 / ( λ 2 θ ) = 2 λ 2 π ( λ 2 θ ) 1 / 2 x 1 / 2 e x p 1 2 2 λ ( 1 / λ 2 θ ) 1 / 2 x 1 / 2 ( 1 / λ 2 θ ) 1 / 2 x 1 / 2 2 = 2 λ 2 π ( λ 2 θ ) 1 / 2 x 1 / 2 e x p 1 2 2 λ x λ 2 θ λ 2 θ x 2 = ( θ / 4 ) 1 / 2 2 π x 1 / 2 e x p 1 2 x θ / 4 2 λ θ / 4 x 2 = f L B I G [ x ; 2 λ , θ / 4 ] .

6. Moment-Generating Function for TS-LBIG Distribution

Theorem 1.
If the random variables τ 1 , τ 2 L B I G ( λ , θ ) , then the moment-generating function of X = τ 1 1 + τ 2 1 is given as
M X ( t ) = exp 2 λ 1 1 2 t λ 2 θ ,
where 0 < t < λ 2 θ 2 .
Proof of Theorem 1.
We know that a moment-generating function (MGF) of I G ( λ , θ ) of a random variable τ is defined as
ψ I G ( t ) = M τ ( t ) = exp λ 1 1 2 θ t ·
Now, we have two independent LBIG random variables τ 1 and τ 2 . That is,
τ 1 L B I G ( λ , θ ) and τ 2 L B I G ( λ , θ ) .
Initially, we find the MGF of the random variable X = τ 1 1 + τ 2 1 . According to Proposition 2, if the random variable τ > 0 has L B I G ( λ , θ ) distribution, then the reciprocal random variable 1 / τ is I G [ λ , 1 / ( λ 2 θ ) ] distributed. Therefore,
M 1 τ 1 ( t ) = M 1 τ 2 ( t ) = M I G ( t ) = exp λ 1 1 2 1 λ 2 θ t ·
Hence,
M X ( t ) = E e t X = E e t 1 τ 1 + 1 τ 2 = E e t τ 1 E e t τ 2 = M 1 τ 1 ( t ) × M 1 τ 2 ( t ) = exp λ 1 1 2 t λ 2 θ × exp λ 1 1 2 t λ 2 θ = exp 2 λ 1 1 2 t λ 2 θ ·
By uniqueness property, it is implied that X = τ 1 1 + τ 2 1 I G [ 2 λ , 1 / ( λ 2 θ ) ] .
Theorem 2.
If a random variable Y T S L B I G ( λ , θ ) , the moment generating function of Y is given as
M Y ( t ) = 1 2 θ 4 t 1 / 2 exp 2 λ 1 1 2 θ 4 t ·
Proof of Theorem  2.
By Theorem 1, it is known that X I G [ 2 λ , 1 / ( λ 2 θ ) ] . To find the MGF of the TS-LBIG random variable Y, the reciprocal of X is considered. Let Y = 1 / X = [ τ 1 1 + τ 2 1 ] 1 . We know that a MGF of L B I G [ λ , θ ] distribution is defined as
M L B I G ( t ; λ , θ ) = ( 1 2 θ t ) 1 / 2 exp λ [ 1 ( 1 2 θ t ) 1 / 2 ] .
According to Proposition 4, Y L B I G [ 2 λ , θ / 4 ] . Therefore, the moment-generating function of Y is given by
M Y ( t ) = 1 2 θ 4 t 1 / 2 exp 2 λ 1 1 2 θ 4 t ·
Most importantly, by uniqueness property, it is indicated that Y T S - L B I G ( λ , θ ) L B I G [ 2 λ , θ / 4 ] .

7. The Probability Density Function of TS-LBIG Distribution

Theorem 3.
Define X as a random variable of the TS-LBIG distribution. Then, the corresponding probability density function (PDF) of X is given by
f T S - L B I G ( x ; λ , θ ) = 2 θ 2 π θ x 1 2 exp 1 2 λ θ x 2 x θ 2 .
Proof of Theorem 3.
By Theorem 2, we know that T S - L B I G ( λ , θ ) L B I G [ 2 λ , θ / 4 ] . We start with Equation (7), which is the original form of LBIG distribution, f L B I G ( x ; λ , θ ) . Substituting Equation (7) by the parameter λ to be 2 λ and θ to be θ / 4 , the probability density function of the TS-LBIG distribution is
f T S - L B I G ( x ; λ , θ ) = 1 θ 4 2 π θ 4 x 1 2 exp 1 2 2 λ θ 4 x x θ 4 2 = 4 θ 2 π θ 4 x 1 2 exp 1 2 2 λ θ 4 x 4 x θ 2 = 2 θ 2 π θ x 1 2 exp 1 2 λ θ x 2 x θ 2 . ·
Several shapes of the PDF for the TS-LBIG distribution are illustrated in Figure 1 and Figure 2 for various parameter values. The different shapes indicate that the TS-LBIG distribution is right-skewed and unimodal. Moreover, this distribution is a family of asymmetric distributions which are useful for skewed data analysis.

8. The Cumulative Density Function of TS-LBIG Distribution

Theorem 4.
Let X be a random variable of the TS-LBIG distribution. The cumulative density function (CDF) of X is given by
F T S - L B I G ( x ) = Φ 2 x θ λ θ x exp ( 4 λ ) Φ 2 x θ + λ θ x ,
where Φ ( x ) is the standard normal distribution function.
Proof of Theorem 4.
The CDF of LBIG distribution is
F L B I G ( x ) = Φ x θ λ θ x exp ( 2 λ ) Φ x θ + λ θ x .
Hence, the CDF of TS-LBIG is
F T S - L B I G ( x ) = Φ x θ 4 2 λ θ 4 x exp ( 4 λ ) Φ x θ 4 + 2 λ θ 4 x , = Φ 2 x θ λ θ x exp ( 4 λ ) Φ 2 x θ + λ θ x ,
where Φ ( x ) is the standard normal distribution function. □

9. The Survival Function of TS-LBIG Distribution

Theorem 5.
Let X be a random variable of the TS-LBIG distribution with parameters λ and θ. The survival function of X is obtained as:
S T S - L B I G ( x ) = 1 Φ 2 x θ λ θ x + exp ( 4 λ ) Φ 2 x θ + λ θ x ,
where Φ ( x ) is the standard normal distribution function.
Proof of Theorem 5.
Let X be a continuous random variable with a cumulative density function F ( x ) on the interval [ 0 , ) . The survival function of X can be written in this form:
S ( x ) = x f ( t ) d t = 1 F ( x )
Inserting Equation (13) into Equation (17) leads to the survival function of TS-LBIG distribution in equation:
S T S - L B I G ( x ) = 1 Φ 2 x θ λ θ x + exp ( 4 λ ) Φ 2 x θ + λ θ x ,
where Φ ( x ) is the standard normal distribution function. □

10. The Hazard Rate Function of TS-LBIG Distribution

Theorem 6.
Let X be a random variable of the TS-LBIG distribution with parameters λ and θ. The hazard rate function of X is given by
h T S - L B I G ( x ) = 2 θ 2 π θ x 1 2 exp 1 2 λ θ x 2 x θ 2 1 Φ 2 x θ λ θ x + exp ( 4 λ ) Φ 2 x θ + λ θ x ,
where Φ ( x ) is the standard normal distribution function.
Proof of Theorem 6.
Let X be an absolutely continuous non-negative random variable with the probability density function f ( x ) and the survival function S ( x ) ; then, the hazard rate function of X can be defined as:
h ( x ) = f ( x ) S ( x ) , = 2 θ 2 π θ x 1 2 exp 1 2 λ θ x 2 x θ 2 1 Φ 2 x θ λ θ x + exp ( 4 λ ) Φ 2 x θ + λ θ x ,
where Φ ( x ) is the standard normal distribution function. □

11. The Mean and the Variance of TS-LBIG Distribution

From [22], let Y I G ( x ; λ , θ ) and Z L B ( x ; λ , θ ) ; then, the rth moment of Z for r = 1 , 2 , 3 , is given by
E [ Z r ] = 1 λ θ E [ Z r + 1 ] .
Hence, the first four raw moments of the LBIG distribution are
E [ Z ] = θ ( λ + 1 ) E [ Z 2 ] = θ 2 ( λ 2 + 3 λ + 3 ) E [ Z 3 ] = θ 3 ( λ 3 + 6 λ 2 + 15 λ + 15 ) E [ Z 4 ] = θ 4 ( λ 4 + 10 λ 3 + 45 λ 2 + 105 λ + 105 ) .
Let X T S - L B I G . Therefore, the first four raw moments of TS-LBIG distribution are
E [ X ] = θ 4 ( 2 λ + 1 ) E [ X 2 ] = θ 4 2 ( 4 λ 2 + 6 λ + 3 ) E [ X 3 ] = θ 4 3 ( 8 λ 3 + 24 λ 2 + 30 λ + 5 ) E [ X 4 ] = θ 4 4 ( 16 λ 4 + 80 λ 3 + 180 λ 2 + 210 λ + 105 ) .
Therefore, the mean of TS-LBIG distribution is
E [ X ] = θ 4 ( 2 λ + 1 ) ,
and the variance of TS-LBIG distribution is
V a r ( X ) = E ( X 2 ) [ E ( X ) ] 2 = θ 2 16 ( 4 λ 2 + 6 λ + 3 ) θ 2 16 ( 2 λ + 1 ) 2 = 4 λ 2 θ 2 + 6 λ θ 2 + 3 θ 2 16 4 λ 2 θ 2 + 4 λ θ 2 + θ 2 16 = 2 λ θ 2 + 2 θ 2 16 = λ θ 2 + θ 2 8 .

12. Parameter Estimation by the Method of Moments for the TS-LBIG Distribution

Recall that the rth raw population moment is equal to the rth raw sample moment.
θ 4 ( 2 λ + 1 ) = 1 n i = 1 n x i
θ 4 2 ( 4 λ 2 + 6 λ + 3 ) = 1 n i = 1 n x i 2
From Equation (26) and letting x ¯ = 1 n i = 1 n x i , we obtain
θ 4 ( 2 λ + 1 ) = x ¯ 2 λ θ + θ = 4 x ¯ 2 λ θ = 4 x ¯ θ λ ^ = 4 x ¯ θ 2 θ .
Substituting Equation (28) into Equation (27) and letting T = 1 n i = 1 n x i 2 , we obtain
4 4 x ¯ θ 2 θ 2 θ 2 + 6 4 x ¯ θ 2 θ 2 θ 2 + 3 θ 2 16 = T ( 4 x ¯ θ ) 2 + 3 θ ( 4 x ¯ θ ) + 3 θ 2 16 = T θ 2 + 16 x ¯ 2 + 4 θ x ¯ = 16 T θ 2 + 4 x ¯ θ + 16 x ¯ 2 16 T = 0 .
From a x 2 + b x + c = 0
x = b ± b 2 4 a c 2 a ,
where a = 1 , b = 4 x ¯ , c = 16 x ¯ 2 16 T = 16 ( x ¯ 2 T ) .
θ ^ = 4 x ¯ ± 16 x ¯ 2 64 ( x ¯ 2 T ) 2 .
Since θ ^ is positive parameter, hence
θ ^ = 4 x ¯ + 16 x ¯ 2 64 ( x ¯ 2 T ) 2 = 4 x ¯ + 4 x ¯ 2 4 x ¯ 2 + 4 T 2 = 2 x ¯ + 2 3 x ¯ 2 + 4 T = 2 4 T 3 x ¯ 2 x ¯ .
Next, substituting Equation (32) into Equation (28)
λ ^ = 4 x ¯ 2 4 T 3 x ¯ 2 x ¯ 4 4 T 3 x ¯ 2 x ¯ .

13. Numerical Results

13.1. Simulation Study

In this section, the Monte Carlo simulation to test the performance of the suggested estimators of the TS-LBIG distribution parameters is presented. Different values of the true parameters are considered. All 60 scenarios are the combination of sample size (n) = 10 , 50 and 100, λ = 1 , 3 , 5 and 10, and θ = 0.5 , 1 , 3 , 5 and 10. The proposed estimators, λ ^ and θ ^ , are compared to the estimators presented by Simmachan et al. [20], λ ˜ and θ ˜ , via bias, MSE and variance. The random numbers of the TS-LBIG distribution are generated via the composition method using the “twoCrack” package [23] in R [24], and the replications are repeated 1000 times in each scenario. The parameter estimates with their bias, MSE and variance are reported in Table 1, Table 2, Table 3, Table 4, Table 5 and Table 6. For easier consideration, bar charts are created and presented in Figure 3 and Figure 4. The blue and yellow bars represent the proposed method and the method of Simmachan et al. [20], respectively. It reveals that the bias, MSE and variance become smaller as the sample size increases and the estimates become closer to the true value of parameters. For bias consideration, the estimators of λ give over-estimates for both methods. The bias of the proposed estimator is slightly smaller than that of the original estimator. On the other hand, the estimators of θ provide mostly under-estimates for both methods. The bias of the proposed estimator is much smaller than that of the original estimator. MSE and variance indicators have similar behavior. For parameter λ , the MSE and variance of the proposed estimator are slightly smaller than those of the original estimator. For parameter θ , however, the MSE and variance of the proposed estimator are much smaller than those of the original estimator. Interestingly, the bias, MSE and variance of the proposed estimators are superior to those of the estimators from the previous study.

13.2. Illustrative Examples

In this section, the suggested distribution is implemented via a real dataset. The following data are collected from the BackBlaze data center [25], and they present the lifetime of the hard drives (in days) containing only the model ST8000DM002 in December 2017: 490, 497, 521, 394, 489, 323, 376, 319, 431, 484, 547, 383, 534 and 316. This dataset was analyzed by Chananet and Phaphan [26], and the result indicated that the lifetime of the hard drives follow the right skewed distribution. Consequently, five right skewed distributions—two-parameter crack [18], Birnbaum–Saunders [16], inverse Gaussian [17], length-biased inverse Gaussian [17], and the proposed two-sided length-biased inverse Gaussian—are selected for goodness of fit comparison. The parameters of the TS-LBIG distribution are estimated by the suggested estimators. The parameters of other candidate distributions are estimated via the maximum likelihood estimation. The “nlminb” function in R [24] is employed for maximizing their likelihood functions. The Akaike information criterion (AIC) is used as an assessment criterion; hence, the best model is the one that provides the minimum AIC. As the result in Table 7 indicates, the TS-LBIG distribution gives the minimum AIC. This indicates that the proposed distribution is the best of the candidate distributions by considering at the value of AIC. Hence, by Equations (24) and (25), the average and standard deviation of the lifetime of the hard drives are 436 and 80.15877 days, respectively.

14. Conclusions and Discussion

In this article, a new form of the TS-LBIG distribution is introduced, since the original version offered by Simmachan et al. [20] does not present a closed-form PDF. This distribution is a right-skewed distribution. Some distributional properties of this distribution were studied, and its two parameters were estimated using the method of moment. Sixty combination scenarios are used to construct the simulation study in assessing the performance of the proposed method. An application of the TS-LBIG distribution was implemented in the lifetime of the hard drives. Results show that the proposed estimators are more efficient than the Simmachan et al. [20] estimators. The original study dealing with the indirect method of parameter estimation affects the parameter estimates far from the true values, especially the parameter θ . This is different from the proposed estimators that dealt with the direct method. The TS-LBIG distribution gives a better fit than the other candidate distributions in terms of AIC. Our contribution provides an alternative right-skewed distribution that can be applied in other aspects such as survival analysis and forestry.
For the future directions of this work, other methods of parameter estimation could be considered. Confidence intervals of the parameters could be also examined. Additionally, the concept of the two-sided model could be extended to generate a new distribution. Moreover, other applications of the TS-LBIG distribution should be applied.

Author Contributions

Conceptualization, T.S.; methodology, T.S. and W.P.; validation, T.S. and W.P.; formal analysis, T.S. and W.P.; investigation, T.S. and W.P.; writing—original draft preparation, T.S. and W.P.; writing—review and editing, T.S. and W.P.; visualization, T.S. and W.P.; funding acquisition, T.S. and W.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by King Mongkut’s University of Technology North Bangkok, Thailand. Contract no. KMUTNB-66-BASIC-04.

Data Availability Statement

The real-world dataset used in this work is available in title of site: Backblaze Hard Drive Data and Stats. Available online: https://www.backblaze.com/b2/hard-drive-test-data.html (accessed on 1 March 2021).

Acknowledgments

The authors wish to thank the reviewers for their helpful comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Modi, K.; Gill, V. Length-biased Weighted Maxwell distribution. Pak. J. Stat. Oper. Res. 2015, 11, 465–472. [Google Scholar] [CrossRef]
  2. Leiva, V.; Sanhueza, A.; Angulo, J.M. A length-biased version of the Birnbaum-Saunders distribution with application in water quality. Stoch. Env. Res. Risk Assess. 2009, 23, 299–307. [Google Scholar] [CrossRef]
  3. Das, K.K.; Roy, T.D. On Some Length-Biased Weighted Weibull Distribution. Adv. Appl. Sci. Res. 2011, 2, 465–475. [Google Scholar]
  4. Abdullahi, I.; Phaphan, W. A Generalization of Length-biased Nakagami Distribution. Int. J. Math. Comput. Sci. 2022, 17, 21–31. [Google Scholar]
  5. Kvam, P. Length bias in the measurements of carbon nanotubes. Technometrics 2008, 50, 462–467. [Google Scholar] [CrossRef]
  6. Shang, Y.L. Multi-type directed scale-free percolation. Commun. Theor. Phys. 2012, 57, 701–716. [Google Scholar] [CrossRef]
  7. Khattree, R. Characterization of inverse-Gaussian and gamma distributions through their length-biased distributions. IEEE Trans. Reliab. 1989, 38, 610–611. [Google Scholar] [CrossRef]
  8. Akman, O.; Gupta, R.C. A comparison of various estimators of the mean of an inverse Gaussian distribution. J. Stat. Comput. Simul. 1992, 40, 71–81. [Google Scholar] [CrossRef]
  9. Gupta, R.C.; Akman, O. On the reliability studies of a weighted inverse Gaussian model. J. Stat. Plann. Inference 1995, 48, 69–83. [Google Scholar] [CrossRef]
  10. Naik, S. On the convoluted gamma to length-biased inverse Gaussian distribution and application in financial modeling. J. Stat. Manag. Syst. 2021, 24, 1581–1600. [Google Scholar] [CrossRef]
  11. Budsaba, K.; Phaphan, W. Parameter estimation for re-parameterized length-biased inverse Gaussian distribution. IJMCS 2022, 17, 107–121. [Google Scholar]
  12. Jorgensen, B.; Seshadri, V.; Whitmore, G.A. On the mixture of the inverse Gaussian distribution with its complementary reciprocal. Scand. J. Stat. 1991, 18, 77–89. [Google Scholar]
  13. Kotz, S.; Leiva, V.; Sanhueza, A. Two new mixture models related to the inverse Gaussian distribution. Methodol. Comput. Appl. Probab. 2010, 12, 199–212. [Google Scholar] [CrossRef]
  14. Gupta, R.C.; Kundu, D. Weighted inverse Gaussian—A versatile lifetime model. J. Appl. Stat. 2011, 38, 2695–2708. [Google Scholar] [CrossRef]
  15. Birnbaum, Z.W.; Saunders, S.C. A new family of life distribution. J. Appl. Probab. 1969, 6, 319–327. [Google Scholar] [CrossRef]
  16. Ahmed, S.E.; Budsaba, K.; Lisawadi, S.; Volodin, A. Parametric Estimation for the Birnbaum-Saunders Lifetime Distribution Based on New Parametrization. Thail. Stat. Thail. 2008, 6, 213–240. [Google Scholar]
  17. Bowonrattanaset, P.; Budsaba, K. Some properties of the three-parameter Crack distribution. Thail. Stat. 2011, 9, 195–203. [Google Scholar]
  18. Saengthong, P.; Bodhisuwan, W. A new two-parameter crack distribution. J. Appl. Sci. 2014, 14, 758–766. [Google Scholar] [CrossRef]
  19. Lisawadi, S. Parameter Estimation for the Two-Sided BS and IG Distributions: And Laws of Large Numbers for Arrays under a Condition of Uniform Integrability; VDM Verlag: Zweibrücken, Germany, 2009. [Google Scholar]
  20. Simmachan, T.; Budsaba, K.; Volodin, A. On two-sided length biased inverse Gaussian distribution. Chiang Mai J. Sci. 2018, 45, 2826–2837. [Google Scholar]
  21. Chhikara, R.S.; Folks, J.L. The Inverse Gaussian distribution as a lifetime model. Technometrics 1977, 19, 461–468. [Google Scholar] [CrossRef]
  22. Pichetverachai, P. Bootstrap Confidence Intervals for a Population Mean of Crack Distribution. Ph.D. Thesis, Thammasat University, Pathum Thani, Thailand, 2015. [Google Scholar]
  23. Phaphan, W. R package for the two-parameters crack distribution. IJMCS 2021, 16, 1455–1467. [Google Scholar]
  24. R Core Team. Available online: http//www.R-project.org (accessed on 1 August 2022).
  25. Backblaze Hard Drive Data and Stats. Available online: https://www.backblaze.com/b2/hard-drive-test-data.html (accessed on 1 March 2021).
  26. Chananet, C.; Phaphan, W. On the new weight parameter of the mixture Pareto distribution and its application to real data. Appl. Sci. Eng. Prog. 2021, 14, 460–467. [Google Scholar] [CrossRef]
Figure 1. PDFs for the TS-LBIG distribution for several values of λ (lambda).
Figure 1. PDFs for the TS-LBIG distribution for several values of λ (lambda).
Symmetry 14 01965 g001
Figure 2. PDFs for the TS-LBIG distribution for several values of θ (theta).
Figure 2. PDFs for the TS-LBIG distribution for several values of θ (theta).
Symmetry 14 01965 g002
Figure 3. Bar chart for the bias of estimator λ (lambda) and θ (theta).
Figure 3. Bar chart for the bias of estimator λ (lambda) and θ (theta).
Symmetry 14 01965 g003
Figure 4. Bar chart for the MSE of estimator λ (lambda) and θ (theta).
Figure 4. Bar chart for the MSE of estimator λ (lambda) and θ (theta).
Symmetry 14 01965 g004
Table 1. The average estimates, the bias, the mean squared errors, and the simulated variance of the proposed estimators λ ^ and θ ^ for n = 10 .
Table 1. The average estimates, the bias, the mean squared errors, and the simulated variance of the proposed estimators λ ^ and θ ^ for n = 10 .
λ θ λ ^ θ ^ Bias  ( λ ^ ) Bias  ( θ ^ ) MSE  ( λ ^ ) MSE  ( θ ^ ) Var  ( λ ^ ) Var  ( θ ^ )
10.51.7220.4300.722−0.0702.3550.0571.8340.052
11.7840.8370.784−0.1632.1200.2511.5050.224
31.7162.5340.716−0.4661.6962.0941.1831.877
51.6754.2800.675−0.7201.6495.2891.1934.770
101.6828.6150.682−1.3851.89323.4921.42821.575
30.54.5260.4421.526−0.05812.5040.05210.1740.049
14.8810.8381.881−0.16214.4270.21810.8890.192
34.5362.6041.536−0.3969.1351.9406.7751.783
54.5874.3611.587−0.63911.2435.2058.7234.797
104.5868.6401.586−1.3609.84122.3477.32520.498
50.57.5040.4512.504−0.04928.8310.06322.5620.060
17.7460.8752.746−0.12540.7520.22033.2110.205
37.5192.6572.519−0.34332.7101.82326.3671.705
57.2394.4822.239−0.51822.5894.97317.5754.705
107.4988.6662.498−1.33425.35019.06019.10917.281
100.514.8420.4364.842−0.064100.5110.05177.0670.047
114.3250.8874.325−0.11385.9400.19167.2350.178
314.9802.6564.980−0.344154.6571.975129.8581.857
515.4664.3765.466−0.624163.5005.670133.6205.280
1014.1749.1054.174−0.895106.91018.67589.48817.873
Table 2. The average estimates, the bias, the mean squared errors, and the simulated variance of the original estimators of λ ˜ and θ ˜ for n = 10 .
Table 2. The average estimates, the bias, the mean squared errors, and the simulated variance of the original estimators of λ ˜ and θ ˜ for n = 10 .
λ θ λ ^ θ ^ Bias  ( λ ^ ) Bias  ( θ ^ ) MSE  ( λ ^ ) MSE  ( θ ^ ) Var  ( λ ^ ) Var  ( θ ^ )
10.51.8274.0000.8273.5002.44417.4951.7605.244
11.8871.9280.8870.9282.2201.9621.4331.102
31.8210.6570.821−2.3431.7905.6161.1170.125
51.7810.3970.781−4.6031.73921.2311.1290.045
101.7880.2030.788−9.7971.98095.9861.3590.012
30.54.5810.6541.5810.15412.5730.13010.0740.107
14.9330.3101.933−0.69014.5150.50010.7790.024
34.5900.1081.590−2.8929.2108.3686.6820.003
54.6410.0651.641−4.93511.31824.3598.6240.001
104.6400.0321.640−9.9689.91799.3547.2290.000
50.57.5410.2622.541−0.23828.8880.07722.4320.021
17.7820.1272.782−0.87340.8210.76733.0810.004
37.5550.0432.555−2.95732.7738.74626.2450.000
57.2760.0262.276−4.97422.65024.74517.4700.000
107.5340.0122.534−9.98825.42299.75119.0010.000
100.514.8620.0664.862−0.434100.5790.18976.9440.001
114.3450.0344.345−0.96686.0000.93467.1210.000
315.0000.0115.000−2.989154.7188.933129.7200.000
515.4860.0075.486−4.993163.56624.934133.4710.000
1014.1950.0034.195−9.997106.96099.93189.3660.000
Table 3. The average estimates, the bias, the mean squared errors, and the simulated variance of the proposed estimators λ ^ and θ ^ for n = 50 .
Table 3. The average estimates, the bias, the mean squared errors, and the simulated variance of the proposed estimators λ ^ and θ ^ for n = 50 .
λ θ λ ^ θ ^ Bias  ( λ ^ ) Bias  ( θ ^ ) MSE  ( λ ^ ) MSE  ( θ ^ ) Var  ( λ ^ ) Var  ( θ ^ )
10.51.1300.4850.130−0.0150.1420.0170.1250.017
11.1200.9640.120−0.0360.1270.0580.1130.057
31.1412.8860.141−0.1140.1380.5250.1190.512
51.1384.8320.138−0.1680.1501.6270.1311.599
101.1439.5800.143−0.4200.1365.8990.1165.722
30.53.2850.4840.285−0.0160.7050.0120.6230.012
13.2510.9790.251−0.0210.7250.0520.6610.052
33.2982.9050.298−0.0950.7430.4690.6540.460
53.2954.8220.295−0.1780.7921.3200.7051.289
103.2329.8200.232−0.1800.6775.1970.6235.165
50.55.3930.4880.393−0.0121.6940.0121.5390.011
15.4430.9700.443−0.0301.8730.0501.6770.049
35.4042.9260.404−0.0741.8420.4141.6780.408
55.4324.8530.432−0.1471.7581.1771.5711.156
105.4039.7240.403−0.2761.6544.8501.4924.774
100.510.7290.4880.729−0.0126.0580.0115.5270.010
110.6670.9830.667−0.0176.3560.0445.9100.044
310.6922.9350.692−0.0655.9690.3725.4900.368
510.8094.8410.809−0.1596.4791.0855.8241.059
1010.6289.8430.628−0.1575.9474.1765.5534.151
Table 4. The average estimates, the bias, the mean squared errors, and the simulated variance of the original estimators of λ ˜ and θ ˜ for n = 50 .
Table 4. The average estimates, the bias, the mean squared errors, and the simulated variance of the original estimators of λ ˜ and θ ˜ for n = 50 .
λ θ λ ^ θ ^ Bias  ( λ ^ ) Bias  ( θ ^ ) MSE  ( λ ^ ) MSE  ( θ ^ ) Var  ( λ ^ ) Var  ( θ ^ )
10.51.2514.6000.2514.1000.17518.4840.1121.670
11.2412.3190.2411.3190.1592.1210.1010.383
31.2600.7550.260−2.2450.1745.0820.1060.040
51.2580.4580.258−4.5420.18420.6450.1180.017
101.2630.2260.263−9.7740.17395.5410.1040.003
30.53.3460.7210.3460.2210.7260.0760.6070.028
13.3120.3650.312−0.6350.7410.4100.6440.007
33.3580.1200.358−2.8800.7658.2970.6370.001
53.3550.0730.355−4.9270.81324.2800.6870.000
103.2930.0370.293−9.9630.69299.2670.6060.000
50.55.4330.2810.433−0.2191.7090.0521.5210.004
15.4830.1400.483−0.8601.8910.7411.6580.001
35.4450.0470.445−2.9531.8578.7211.6590.000
55.4730.0280.473−4.9721.77624.7211.5530.000
105.4440.0140.444−9.9861.67099.7191.4730.000
100.510.7510.0740.751−0.4266.0720.1825.5080.000
110.6900.0370.690−0.9636.3650.9275.8890.000
310.7140.0120.714−2.9885.9818.9265.4710.000
510.8310.0070.831−4.9936.49524.9265.8040.000
1010.6500.0040.650−9.9965.95699.9255.5340.000
Table 5. The average estimates, the bias, the mean squared errors, and the simulated variance of the proposed estimators λ ^ and θ ^ for n = 100 .
Table 5. The average estimates, the bias, the mean squared errors, and the simulated variance of the proposed estimators λ ^ and θ ^ for n = 100 .
λ θ λ ^ θ ^ Bias  ( λ ^ ) Bias  ( θ ^ ) MSE  ( λ ^ ) MSE  ( θ ^ ) Var  ( λ ^ ) Var  ( θ ^ )
10.51.0640.4950.064−0.0050.0650.0090.0610.009
11.0680.9820.068−0.0180.0640.0320.0590.032
31.0642.9510.064−0.0490.0630.2970.0590.295
51.0724.9230.072−0.0770.0630.8750.0580.869
101.0729.8430.072−0.1570.0683.5250.0623.501
30.53.1340.4920.134−0.0080.3080.0060.2900.006
13.1450.9830.145−0.0170.3210.0270.3000.027
33.1482.9410.148−0.0590.3010.2180.2790.214
53.1134.9690.113−0.0310.3200.7260.3070.725
103.1159.9330.115−0.0670.3292.8630.3162.858
50.55.2000.4930.200−0.0070.7460.0060.7060.006
15.1520.9960.152−0.0040.7470.0240.7240.024
35.2332.9500.233−0.0500.8610.2260.8060.223
55.2264.9120.226−0.0880.7600.6130.7090.606
105.2389.8070.238−0.1930.8172.5060.7602.469
100.510.3100.4960.310−0.0042.5130.0052.4160.005
110.4020.9840.402−0.0162.6620.0222.5000.021
310.4072.9540.407−0.0462.8320.2052.6660.203
510.3364.9480.336−0.0522.6880.5582.5750.556
1010.3939.8320.393−0.1682.5322.1652.3772.136
Table 6. The average estimates, the bias, the mean squared errors, and the simulated variance of the original estimators of λ ˜ and θ ˜ for n = 100 .
Table 6. The average estimates, the bias, the mean squared errors, and the simulated variance of the original estimators of λ ˜ and θ ˜ for n = 100 .
λ θ λ ^ θ ^ Bias  ( λ ^ ) Bias  ( θ ^ ) MSE  ( λ ^ ) MSE  ( θ ^ ) Var  ( λ ^ ) Var  ( θ ^ )
10.51.1874.6560.1874.1560.08918.0940.0540.820
11.1912.3260.1911.3260.0891.9540.0530.196
31.1870.7800.187−2.2200.0874.9530.0520.024
51.1940.4620.194−4.5380.08920.6030.0510.008
101.1950.2310.195−9.7690.09395.4270.0550.002
30.53.1950.7360.1950.2360.3190.0710.2810.015
13.2060.3670.206−0.6330.3330.4040.2910.004
33.2090.1220.209−2.8780.3158.2840.2710.000
53.1750.0740.175−4.9260.32924.2640.2980.000
103.1760.0370.176−9.9630.33899.2600.3070.000
50.55.2410.2840.241−0.2160.7550.0480.6970.002
15.1930.1440.193−0.8560.7520.7340.7140.001
35.2740.0470.274−2.9530.8718.7190.7960.000
55.2670.0280.267−4.9720.77124.7180.7000.000
105.2780.0140.278−9.9860.82899.7170.7510.000
100.510.3330.0750.333−0.4252.5180.1802.4070.000
110.4240.0370.424−0.9632.6710.9272.4910.000
310.4300.0120.430−2.9882.8418.9252.6560.000
510.3590.0080.359−4.9922.69424.9252.5650.000
1010.4160.0040.416−9.9962.54199.9252.3680.000
Table 7. The MLE of the model parameters for the hard drive failure dataset, and AIC measure.
Table 7. The MLE of the model parameters for the hard drive failure dataset, and AIC measure.
Fitting Dist.Estimate ParametersAIC
λ θ
TCR110.98910.100013.39908
BS0.0010.10067.6742
IG0.0010.00193.44156
LBIG26.7980115.684589.106067
TS-LBIG14.7766357.080657.748027
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Simmachan, T.; Phaphan, W. Generalization of Two-Sided Length Biased Inverse Gaussian Distributions and Applications. Symmetry 2022, 14, 1965. https://doi.org/10.3390/sym14101965

AMA Style

Simmachan T, Phaphan W. Generalization of Two-Sided Length Biased Inverse Gaussian Distributions and Applications. Symmetry. 2022; 14(10):1965. https://doi.org/10.3390/sym14101965

Chicago/Turabian Style

Simmachan, Teerawat, and Wikanda Phaphan. 2022. "Generalization of Two-Sided Length Biased Inverse Gaussian Distributions and Applications" Symmetry 14, no. 10: 1965. https://doi.org/10.3390/sym14101965

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop