Next Article in Journal
Q-Learnheuristics: Towards Data-Driven Balanced Metaheuristics
Next Article in Special Issue
Conditions for the Existence of Absolutely Optimal Portfolios
Previous Article in Journal
Exploring Personal and Contextual Variables of the Global Entrepreneurship Monitor through the Rasch Mathematical Model
Previous Article in Special Issue
Reliability Properties of the NDL Family of Discrete Distributions with Its Inference
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A New Flexible Family of Continuous Distributions: The Additive Odd-G Family

by
Emrah Altun
1,
Mustafa Ç. Korkmaz
2,
Mahmoud El-Morshedy
3,4 and
Mohamed S. Eliwa
4,*
1
Department of Mathematics, Bartin University, Bartin 74100, Turkey
2
Department of Measurement and Evaluation, Artvin Çoruh University, Artvin 08000, Turkey
3
Department of Mathematics, College of Science and Humanities in Al-Kharj, Prince Sattam bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia
4
Department of Mathematics, Faculty of Science, Mansoura University, Mansoura 35516, Egypt
*
Author to whom correspondence should be addressed.
Mathematics 2021, 9(16), 1837; https://doi.org/10.3390/math9161837
Submission received: 21 June 2021 / Revised: 25 July 2021 / Accepted: 28 July 2021 / Published: 4 August 2021
(This article belongs to the Special Issue Probability, Statistics and Their Applications 2021)

Abstract

:
This paper introduces a new family of distributions based on the additive model structure. Three submodels of the proposed family are studied in detail. Two simulation studies were performed to discuss the maximum likelihood estimators of the model parameters. The log location-scale regression model based on a new generalization of the Weibull distribution is introduced. Three datasets were used to show the importance of the proposed family. Based on the empirical results, we concluded that the proposed family is quite competitive compared to other models.

1. Introduction

The main aim of generating new probability distributions is to increase the modeling ability of the baseline distribution. With this aim, many distribution generators have been introduced in the literature. Some well-known distribution generators are the Marshall–Olkin-G family [1], the beta-G family [2], the gamma-G family [3], the Kumaraswamy-G (Kw-G) family [4], the generalized beta-generated family [5], the T-X family [6], and the two-sided generalized family [7], among others. In addition to these families, many odd families of the distribution can be cited such as the odd log-logistic-G (OLL-G) family [8], the odd Weibull-G (OW-G) family [9], the odd Burr G family [10], the generalized odd log-logistic-G (GOLL-G) family [11], another generalized odd log-logistic-G (GOLL2-G) family [12], the odd log-logistic Lindley-G (OLLLi-G) family [13], the odd Chen-G (OCh-G) family [14], the odd flexible Weibull-H (OFW-H) family [15], and the new Kumaraswamy generalized (NKw-G) family [16]. These families bring more flexibility to the baseline model for data modeling.
To motivate the family in this paper, we considered a serial system with two independent components and supposed that the lifetime of the components follows the member of the OLL-G family and the member of the OW-G family with survival functions (sfs):
S O L L G x ; α , θ = G ¯ x ; θ α G x ; θ α + G ¯ x ; θ α
and:
S O W G x ; β , θ = exp G x ; θ G ¯ x ; θ β ,
respectively, where x , α , β > 0 are shape parameters, which control the tail, skewness, and kurtosis of the model, G x ; θ is the baseline cumulative distribution function (cdf), θ is the parameter vector of the baseline distribution, and G ¯ x ; θ = 1 G x ; θ is the sf of the baseline distribution. Hence, the sf of the system is given by:
S x ; α , β , θ = G ¯ x ; θ α G x ; θ α + G ¯ x ; θ α exp G x ; θ G ¯ x ; θ β .
With this definition in (3), we propose a new wider flexible family based on the OLL-G and the OW-G distributions by combining them in a serial system. The corresponding cdf and probability density function (pdf) of (3) are given by:
F x ; α , β , θ = 1 G ¯ x ; θ α G x ; θ α + G ¯ x ; θ α exp G x ; θ G ¯ x ; θ β
and:
f x ; α , β , θ = g x ; θ G ¯ x ; θ α 1 G x ; θ α + G ¯ x ; θ α α G x ; θ α 1 G x ; θ α + G ¯ x ; θ α + β G x ; θ β 1 G ¯ x ; θ β exp G x ; θ G ¯ x ; θ β
respectively, where g x , θ is the baseline pdf.
The hazard rate function (hrf) of this family is given by:
τ x ; α , β , θ = h x ; θ α G x ; θ α 1 G x ; θ α + G ¯ x ; θ α + β G x ; θ β 1 G ¯ x ; θ β = h O L L G x ; α , θ + h O W G x ; β , θ ,
where h x ; θ = g x ; θ G ¯ x ; θ is the hrf of the baseline distribution, h O L L G x ; α , θ and h O W G x ; β , θ are the hrfs of the OLL-G and OW-G families, respectively. It can be seen that the cdf, pdf, and hrf of this family are the structure of the additive model with the OLL-G and OW-G families. The additive distribution models include the various pdf and hrf shapes for data modeling and ensure better fitting to the dataset than the ordinary distribution model. They are especially useful for lifetime data modeling since the empirical approaches to real data are often nonmonotone hrf. In general, there are many additive models based on the Weibull distribution in the literature such as the additive Weibull, by Xie and Lai [17], the additive Burr XII, by Wang [18], the modified Weibull, by Sarhan and Zaidin [19], the extended additive Weibull, by Almalki and Yuan [20], the log-logistic Weibull, by Oluyede et al. [21], and the additive modified Weibull distributions, by He et al. [22]. Therefore, we call this new family the additive odd log-logistic odd Weibull-G (AOLLOW-G) family and denote it by A O L L O W - G ( α , β , θ ) . We want to point out that the structure of the AOLLOW-G family can be proposed as a new method to obtain a flexible distribution family. As a real interpretation of the suggested model, consider a device with a two independent series subdevice with Z 1 OLL-G and Z 2 OW-G, then the lifetime of this device is X = min { Z 1 , Z 2 } AOLLOW-G, which is the suggested family. We were also motivated to introduce this family because it exhibits increasing, decreasing, constant, unimodal, unimodal, then bathtub, as well as bathtub hazard rates and its other shapes in modeling lifetime data.
The remaining parts of the study can be summarized as follows. Section 2 contains the special members of the proposed family. In Section 3, we present the linear combinations of the proposed family in terms of the exponentiated-G family. Section 4 contains the statistical properties. The parameter estimation issue of the proposed family is discussed in Section 5. The log location-scale regression model based on a new generalization of the Weibull distribution is introduced in Section 6. Two simulation studies related to the finite sample behavior of the estimators of the proposed model are given in Section 7. The empirical findings of the presented study are given in Section 8. The study is concluded in Section 9.

2. Special Members of the Family

The AOLLOW-G family generates alternative extended distributions, and it contains some new subfamilies. For example, the additive odd-log logistic odd exponential-G family is obtained for β = 1 , and the additive odd-log logistic odd Rayleigh-G family is obtained for β = 2 . It is reduced to the additive G odd Weibull-G family for α = 1 , and it is the additive G odd exponential-G family for α = β = 1 . It also is the additive G odd Rayleigh-G family for α = 1 and β = 2 . Now, we present the three important submodels of this new family.

2.1. The AOLLOW-Normal Distribution

Let G x ; μ , σ = Φ x μ σ be the cdf of the normal distribution. The cdf of the AOLLOW-normal (AOLLOW-N) distribution is given by:
F x ; α , β , μ , σ = 1 1 Φ x μ σ α Φ x μ σ α + 1 Φ x μ σ α exp 1 Φ x μ σ 1 β .
Some possible plots of the AOLLOW-N density for selected parameter values are displayed in Figure 1. This figure shows that the pdf shapes of the AOLLOW-N can be trimodal, bimodal, unimodal, and skew-shaped. Therefore, we can say that new extended normal distribution has great flexible density shapes for data modeling.

2.2. The AOLLOW-Weibull Distribution

Let G x ; λ , γ = 1 exp λ x γ be the cdf of the Weibull distribution with shape parameter γ > 0 and scale parameter λ > 0 . The cdf of the AOLLOW-Weibull (AOLLOW-W) distribution is given by:
F x ; α , β , λ , γ = 1 e α λ x γ e α λ x γ + 1 e λ x γ α exp e λ x γ 1 β .
Figure 2 displays the pdf and hrf shapes of the AOLLOW-W. As seen from the figure, it is obvious that the AOLLOW-W distribution has left-right skewed and bimodal shapes. Its hrf structure is also very flexible and has the following shapes: increasing, decreasing, bathtub, decreasing-increasing-decreasing.
The saturation characteristic of the AOLLOW-W distribution can be studied in the sense of the Hausdorff distance, which is important for reliability theory, financial mathematics, and debugging theory. This important property of the AOLLOW-W distribution is planned as a future work of the present study. Detailed information can be found in Kyurkchiev [23] and Kyurkchiev and Markov [24].

2.3. The AOLLOW-Gamma Distribution

Let G x ; δ , λ = γ * λ , δ x Γ 1 λ be the cdf of the gamma distribution. The Γ · is a complete gamma function, and γ * λ , δ x is defined as:
γ * λ , δ x = 0 δ x t λ 1 e t d t
which is called as incomplete gamma function. The cdf of the AOLLOW-gamma (AOLLOW-Ga) distribution is:
F x ; α , β , λ , δ = 1 1 γ * λ , δ x Γ 1 λ α γ * λ , δ x Γ 1 λ α + 1 γ * λ , δ x Γ 1 λ α exp γ * λ , δ x Γ 1 λ 1 γ * λ , δ x Γ 1 λ β .
Figure 3 displays the pdf and hrf shapes of the AOLLOW-Ga distribution. We have the following results from these figures: (i) the pdf shapes of the AOLLOW-Ga distribution are left-right skewed and bimodal; (ii) the hrf shapes of the AOLLOW-Ga distribution are increasing, decreasing, bathtub, decreasing-increasing-decreasing-increasing, and increasing-decreasing-increasing. Since the new extended Weibull and gamma distributions have a very flexible density and hrf shapes, we can say that they can be useful especially in lifetime data modeling.

3. Useful Expansions

Let us define the pdf and cdf of the exponentiated-G (exp-G) family for the parent distribution G ( x ) . The cdf and pdf of the exp-G family are given by:
H c ( x ) = G ( x ) c ,
h c ( x ) = c g ( x ) G ( x ) c 1 ,
The cdf of the AOLLOW-G family can be expressed as follows by using the generalized binomial expansion:
F ( x ) = 1 G ¯ ( x ) α G ( x ) α + G ¯ ( x ) α exp { G ( x ) G ¯ ( x ) β } = 1 G ¯ ( x ) α G ( x ) α + G ¯ ( x ) α l = 0 ( 1 ) l l ! G ( x ) G ¯ ( x ) β l = 1 l , j = 0 ( 1 ) l + j ( α β l ) j l ! G ( x ) β l + j G ( x ) α + G ¯ ( x ) α
The second part of (9) can be written as follows:
G ( x ) β l + j G ( x ) α + G ¯ ( x ) α = k = 0 b k G ( x ) k k = 0 a k G ( x ) k = k = 0 c k G ( x ) k
where:
a k = i = k ( 1 ) i + k β l + j i i k b k = ( 1 ) k α k + i = k ( 1 ) i + k α i i k
c 0 = a 0 b 0
For k 1 ,
c k = 1 b 0 a k 1 b 0 r = 1 k b r c k r
Then, we can write:
F ( x ) = 1 k = 0 w k G ( x ) k = k = 0 d k G ( x ) k = k = 0 d k H k ( x )
where w k = l , j = 0 ( 1 ) l + j ( α β l ) j l ! c k ( α , β , l , j ) , d 0 = 1 w 0 , and for k 1 , d k = w k .
H k ( x ) = G ( x ) k denotes the exp-G with power parameter k.
We can write:
f ( x ) = k = 0 d k + 1 h k + 1 ( x )
and h k + 1 ( x ) = ( k + 1 ) g ( x ) G ( x ) k denote the pdf of the exp-G with power parameter k + 1 . The main result of this section is (14), which shows that the proposed family can be expressed as a linear combination of the exp-G densities. This property of the proposed family helps us obtain its statistical properties based on the exp-G densities.

4. Statistical Properties

This section deals with the statistical properties of the proposed family.

4.1. Quantile Function

Generating random variables from continuous probability distributions is generally performed by the quantile function (qf). The uth quantile, denoted by x u = Q u , of the AOLLOW-G distribution is obtained by the solution of the equation:
log 1 u G x u ; θ G ¯ x u ; θ α + 1 G x u ; θ G ¯ x u ; θ β = 0 ,
where u 0 , 1 . We can also write for the qf of any AOLLOW-G distribution:
x u = min { Q G ( u 1 / α u 1 / α + ( 1 u ) 1 / α ) , Q G ( ( log ( 1 u ) ) 1 / β 1 + ( log ( 1 u ) ) 1 / β ) } ,
where Q G ( . ) = G 1 ( . ) denotes the qf of the G baseline distribution and u 0 , 1 . Using (15) or (16), the random variables can be generated from any member of the proposed family.

4.2. Moments

Let Y k follow an exp-G distribution with the parameter k + 1 . Using (14), the nth moment of the AOLLOW-G family is:
E ( X n ) = k = 0 d k + 1 E ( Y k n ) .
Regarding the moments of exp-G distributions, one can visit the work of Nadarajah and Kotz [25].
Equation (17) can be expressed based on the G qf as follows:
E ( X n ) = k = 0 ( k + 1 ) d k + 1 τ ( n , k ) ,
where τ ( n , k ) = x n G ( x ) k g ( x ) d x = 0 1 Q G ( u ) n u k d u . Cordeiro and Nadarajah [26] provided an explicit expression for τ ( n , k ) for some beta generalized distributions.
For integer values of n, let μ n = E ( X n ) and μ = μ 1 = E ( X ) , then one can also find the nth central moment of the AOLLOW-G distribution as:
μ n = E ( X μ ) n = i = 0 n n i μ i ( μ ) n i .
Using (19), we calculate the skewness and kurtosis of the AOLLOW-G as follows:
S k e w n e s s ( X ) = μ 3 3 μ 2 μ 1 + 2 μ 1 3 μ 2 μ 1 2 3 2 ,
and:
K u r t o s i s ( X ) = μ 4 4 μ 1 μ 3 + 6 μ 1 2 μ 3 3 μ 1 4 μ 2 μ 1 2 ,
respectively. Figure 4 displays the skewness and kurtosis plots of the AOLLOW-N distribution. These figures help us to understand how additional shape parameters affect the shape of the distribution.

4.3. Generating Function

Let M X ( t ) = E ( e t X ) where X AOLLOW-G, then the first one simply comes from (14) as:
M X ( t ) = k = 0 d k + 1 M k ( t ) ,
where M k ( t ) is the moment generating function (mgf) of Y k . So, exp-G generating function is used to obtain M X ( t ) . From (14), we have:
M ( t ) = i = 0 ( k + 1 ) d k + 1 ρ ( t , k ) ,
where ρ ( t , k ) = e t x G ( x ) k g ( x ) dx = 0 1 exp [ t Q G ( u ) ] u k du .
Equation (23) can be used to obtain the mgfs of the different distributions obtained by means of the AOLLOW-G family.

5. Inference

The unknown parameters of the AOLLOW-G family are estimated by means of the maximum likelihood estimation (MLE) method. Let Ψ = ( α , β , θ ) be an unknown parameter vector where θ is a q × 1 parameter vector for the baseline distribution. Under these settings, the log-likelihood function of the AOLLOW-G family is:
= i = 1 n log g x i ; θ + α 1 i = 1 n log G ¯ x i ; θ i = 1 n log G x i ; θ α + G ¯ x i ; θ α i = 1 n log α G x i ; θ α 1 G x i ; θ α + G ¯ x i ; θ α + β G x i ; θ β 1 G ¯ x i ; θ β i = 1 n G x i ; θ G ¯ x i ; θ β .
One could prefer to obtain the score vectors corresponding to the likelihood function, given in (24). To do this, it is necessary to take partial derivatives of (24) with respect to the parameters such as U Ψ = Ψ = ( α , β , θ q ) . The simultaneous solution of these score vectors gives the MLE of the unknown parameter vector, Ψ . Note that the score vectors can be requested from the authors. Unfortunately, it is impossible to obtain the explicit solution of these nonlinear equation systems. Therefore, the log-likelihood has to be maximized by using the optimization algorithms. Here, we prefer the optim function of the R software for this purpose. The asymptotic confidence intervals of the parameters are constructed based on the observed information matrix.

6. Regression Modeling

In this section, a new location-scale regression model is introduced based on the AOLLOW-W distribution. For this aim, we use the log-transformation and some convenient parametrizations on the AOLLOW-W distribution. Let γ = 1 / σ and λ = exp μ and Y = log ( X ) . We have the following density for the random variable Y:
f ( y ) = 1 σ exp y μ σ exp y μ σ exp exp y μ σ α 1 1 exp exp y μ σ α + exp exp y μ σ α × α 1 exp exp y μ σ α 1 1 exp exp y μ σ α + exp exp y μ σ α + β 1 exp exp y μ σ β 1 exp exp y μ σ β × exp 1 exp exp y μ σ exp exp y μ σ β ,
where y . The parameters μ and σ > 0 are the location and scale parameters, respectively. The parameters α > 0 and β > 0 are the shape parameters. The density in (25) is denoted as Y ∼ LAOLLOW-W ( α , β , σ , μ ) . The pdf shapes of the LAOLLOW-W distribution are displayed in Figure 5, which shows that the LAOLLOW-W can be used to model different types of the datasets such as left-skewed, symmetric, and bimodal shapes.
The sf is:
S y = exp exp y μ σ α 1 exp exp y μ σ α + exp exp y μ σ α exp 1 exp exp y μ σ exp exp y μ σ β .
Besides, using the standardized random variable, Z = ( Y μ ) / σ , we have:
f ( z ) = exp z exp z exp exp z α 1 1 exp exp z α + exp exp z α × α 1 exp exp z α 1 1 exp exp z α + exp exp z α + β 1 exp exp z β 1 exp exp z β × exp 1 exp exp z exp exp z β .
Now, using the LAOLLOW-W density, we introduce a location-scale regression model by linking the covariates to the location of the random variable Y by means of the identity link function. Consider the regression structure:
y i = x i β + σ z i , i = 1 , , n ,
where x i = x i 1 , , x i k is the independent variable vector and y i is the dependent variable, following the density in (25). The regression parameters are represented by the vector, β = ( β 1 , , β k ) . The dependent variable is defined as y i = min { log ( t i ) , log ( c i ) } where t i and c i are the observed lifetime and censoring times, respectively. Let F and C be the sets representing the lifetime and censoring times. Based on these specifications, the log-likelihood function of the LAOLLOW-W regression model is:
τ = r log σ + i F z i u i α 1 i F u i i F log 1 exp u i α + exp u i α + i F log α 1 exp u i α 1 1 exp u i α + exp u i α + β 1 exp u i β 1 exp u i β i F 1 exp u i exp u i β α i C u i i C log 1 exp u i α + exp u i α i C 1 exp u i exp u i β ,
where τ = α , β , σ , β , u i = exp ( z i ) and z i = ( y i x i β ) / σ . Note that r is the number of uncensored observations. The MLE of the unknown parameter vector, τ ^ , is obtained based on the maximization of the given log-likelihood function in (29). The optim function of the R software is used to minimize the negative log-likelihood function. The standard errors of the estimated parameters are obtained by means of observed information, which is easily obtained by the Hess function of the R software.

Residual Analysis

Residual analysis is required to control the accuracy of the proposed regression model for the fitted dataset. For this goal, we consider two residuals. These are the martingale and modified deviance residuals. The martingale residual (see Fleming and Harrington [27]) of the LAOLLOW-W model is given by:
r M i = 1 + log exp u i α 1 exp u i α + exp u i α exp 1 exp u i exp u i β if i F , log exp u i α 1 exp u i α + exp u i α exp 1 exp u i exp u i β if i C ,
Since the martingale residuals are not distributed symmetrically, its interpretation has some difficulties. In this case, the modified deviance residuals, proposed by Therneau et al. [28], are generally used, which are defined as:
r D i = sign r M i 2 r M i + log 1 r M i 1 / 2 , if i F sign r M i 2 r M i 1 / 2 , if i C ,
where r ^ M i is the martingale residual.

7. Simulation Studies

7.1. Simulation Study 1

The AOLLOW-N distribution was used for the first simulation. The simulation was repeated N = 1000 times. The sample size was increased by 10 units such as n = 20 , 30 , , 1000 . The parameter values were α = 5 , β = 2 , μ = 1 , and σ = 2 . The results were evaluated based on the following metrics: estimated mean, bias, standard deviation (sd), and mean squared error (MSE). Our expectation was to see that when n is sufficiently large, the estimated biases and MSEs should be near the zero. The simulation results, displayed in Figure 6, verified our expectation. Therefore, the MLE method is the preferable method to obtain the unknown parameters of the AOLLOW-N distribution.

7.2. Simulation Study 2

The AOLLOW-W distribution was used for the second simulation. As in the first simulation, the simulation replication number was N = 1000 . The sample sizes used were: n = 20 , 60 , 100 . The results are discussed based on the estimated mean and sd of the MLEs. Table 1 contains the simulation results. The results in this table show that the estimated mean and sd decrease withthe sample size n.

8. Data Analysis

Three datasets were analyzed to compare the special cases of the AOLLOW-G family with existing models such as the Kw-G, GOLL-G, GOLL2-G, OLL-G, OW-G, and additive Weibull (AW) by Lemonte et al. [29]. The information criteria and goodness-of-fit test below were used to select the best model for the modeled data:
  • Akaike information criterion (AIC);
  • Kolmogorov–Smirnov (KS);
  • Cramer–von Mises ( W * );
  • Anderson–Darling ( A * ).
Furthermore, the estimated log-likelihood values ^ were used to select the best model. See Chen and Balakrishnan [30] for detailed information on W * and A * . The smallest values of these metrics show the best model for the data. Additionally, we used the total time on test (TTT) plot (Aarset [31]) to determine the shape of the empirical hrf. The TTT plots of the two datasets are given in Figure 7.

8.1. Stress Data

The data regarded the stress-rupture life of Kevlar 49/epoxy strands. Andrews and Herzberg [32], Cooray and Ananda [33], and Paraniaba et al. [34] analyzed the same dataset. Figure 7a shows that the empirical hrf of the data has a convex-concave-convex shape. Table 2 contains the estimated parameters, as well as goodness-of-fit statistics. From these results, we concluded that the AOLLOW-W distribution is the best choice among the others since it had the lowest values of the model selection criteria.
Figure 8 gives information about the suitability of the fitted distributions for the dataset graphically. From these figures, one can conclude that the AOLLOW-W distribution is a good choice for the data used. Additionally, only the estimated hrf of the AOLLOW-W overlaps with the result of the TTT plot.

8.2. Guinea Pig Data

The second dataset regarded the survival times (in days) of guinea pigs (Bjerkedal [35]). Gupta et al. [36], Korkmaz and Genç [37], and Korkmaz [38] analyzed the same dataset with various probability distributions. Figure 7b shows that the empirical hrf of the data is concave-convex-concave.
The estimated parameters and goodness-of-fit test results are given in Table 3. As in the previous application, the proposed model, AOLLOW-Ga, had the lowest values of the model selection criteria. Moreover, to strengthen the suitability of the AOLLOW-Ga distribution for the data, the fitted functions of the AOLLOW-Ga and other competitive distributions are plotted in Figure 9. It is obvious that the AOLLOW-Ga distribution is the best among the others and provided a nearly perfect fit to the data.

8.3. Stanford Heart Transplant Dataset

In the third application, the practicability of the LAOLLOW-W regression model was discussed based on the real data. Brito et al. [39] used the Stanford heart transplant dataset for the log-Topp–Leone odd log-logistic-Weibull (Log-TLOLL-W) regression model. We used the same dataset to compare the proposed model with the log-TLOLL-W model. The total individuals were n = 103 , and the percentage of the censored observations was 27%. The dependent variable survival time y i was modeled with the following covariates:
x 1 : year of acceptance to the program;
x 2 : age of the patient (years);
x 3 : previous surgery 1 = y e s , 0 = n o ;
x 4 : transplant 1 = y e s , 0 = n o .
We consider the regression structure:
y i = β 0 + β 1 x i 1 + β 2 x i 2 + β 3 x i 3 + β 4 x i 4 + σ z i ,
The above regression structure is fit by the log-Weibull, log-TLOLL-W, and log-AOLLOW-W models. The results of the fitted regression models are summarized in Table 4, giving the estimated parameters, standard errors, p-values, and the the model selection AIC and BIC. From these results, we concluded that the LAOLLOW-W regression model is the best choice among the others for the dataset used. The regression parameters, β 1 and β 2 were found to be statistically significant at the 1% level. Therefore, the survival time increased when the year of acceptance increased. Besides, when the age of the patient increased, the survival time decreased.

Model Accuracy

The accuracy of the fitted regression model was checked using the modified deviance residuals. Figure 10 displays the index and quantile–quantile (qq) plots. The expectation was that when the fitted model is statistically valid, the modified deviance residuals should be distributed as N(0,1). From the qq plot, it is easy to conclude that the LAOLLOW-W regression model is statistically valid. The index plot of the modified deviance residuals shows that there were no observations evaluated as possible outliers.

9. Concluding Remark

A new family of distributions was introduced with two extra shape parameters. The mathematical properties were studied in detail. Simulation studies were implemented. The location-scale regression model was also introduced. Three datasets were analyzed. The empirical results showed that the AOLLOW-G family gives better results than other famous G-families of distributions. As a future work of the present paper, we plan to develop a bivariate version of the AOLLOW-G family. Furthermore, this family can be used to generate new heavy-tailed distributions by using the Pareto as a baseline distribution. A new generalization of the Pareto distribution using the proposed family can be applied to financial datasets to forecast financial risk. We hope that this new family captures many application areas.

Author Contributions

Conceptualization, M.Ç.K., E.A., M.S.E. and M.E.-M.; methodology, M.Ç.K. and E.A.; software, M.Ç.K. and E.A.; validation, M.Ç.K., E.A., M.S.E. and M.E.-M.; formal analysis, M.Ç.K. and E.A.; writing—original draft preparation, M.Ç.K. and E.A.; writing—review and editing, M.S.E. and M.E.-M. All authors read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data sources are given in Section 8.

Acknowledgments

The authors would like to thank anonymous referees for their constructive comments. Special thanks to Morad Alizadeh for his helpful suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
Kw-GKumaraswamy-G
OW-Godd Weibull-G
OLL-Godd log-logistic-G
GOLL-Ggeneralized odd log-logistic-G
GOLL2-Ganother generalized odd log-logistic-G
sfsurvival function
sdstandard deviation
MSEmean squared error
cdfcumulative distribution function
pdfprobability density function
hrfhazard rate function
mgfmoment generating function
AOLLOW-Gadditive odd-log logistic odd Weibull-G
qfquantile function
qqquantile-quantile
TTTtotal time on test
AICAkaike information criterion
KSKolmogorov–Smirnov

References

  1. Marshall, A.W.; Olkin, I. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 1997, 84, 641–652. [Google Scholar] [CrossRef]
  2. Eugene, N.; Lee, C.; Famoye, F. Beta-normal distribution and its applications. Commun. Stat. Theory Methods 2002, 31, 497–512. [Google Scholar] [CrossRef]
  3. Zografos, K.; Balakrishnan, N. On families of beta-and generalized gamma-generated distributions and associated inference. Stat. Methodol. 2009, 6, 344–362. [Google Scholar] [CrossRef]
  4. Cordeiro, G.M.; Castro, M. A new family of generalized distributions. J. Stat. Comput. Simul. 2011, 81, 883–898. [Google Scholar] [CrossRef]
  5. Alexander, C.; Cordeiro, G.M.; Ortega, E.M.; Sarabia, J.M. Generalized beta-generated distributions. Comput. Stat. Data Anal. 2012, 56, 1880–1897. [Google Scholar] [CrossRef]
  6. Alzaatreh, A.; Lee, C.; Famoye, F. A new method for generating families of continuous distributions. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef] [Green Version]
  7. Korkmaz, M.Ç.; Genç, A.I. A new generalized two-sided class of distributions with an emphasis on two-sided generalized normal distribution. Commun. Stat. Simul. Comput. 2017, 46, 1441–1460. [Google Scholar] [CrossRef]
  8. Gleaton, J.U.; Lynch, J.D. Properties of generalized log-logistic families of lifetime distributions. J. Probab. Stat. Sci. 2006, 4, 51–64. [Google Scholar]
  9. Bourguignon, M.; Silva, R.B.; Cordeiro, G.M. The Weibull-G family of probability distributions. J. Data Sci. 2014, 12, 53–68. [Google Scholar] [CrossRef]
  10. Alizadeh, M.; Cordeiro, G.M.; Nascimento, A.D.; Lima, M.D.C.S.; Ortega, E.M. Odd-Burr generalized family of distributions with some applications. J. Stat. Comput. Simul. 2017, 87, 367–389. [Google Scholar] [CrossRef]
  11. Cordeiro, G.M.; Alizadeh, M.; Ozel, G.; Hosseini, B.; Ortega, E.M.M.; Altun, E. The generalized odd log-logistic family of distributions: Properties, regression models and applications. J. Stat. Comput. Simul. 2017, 87, 908–932. [Google Scholar] [CrossRef]
  12. Haghbin, H.; Ozel, G.; Alizadeh, M.; Hamedani, G.G. A new generalized odd log-logistic family of distributions. Commun. Stat. Theory Methods 2017, 46, 9897–9920. [Google Scholar] [CrossRef]
  13. Alizadeh, M.; Afify, A.Z.; Eliwa, M.S.; Ali, S. The odd log-logistic Lindley-G family of distributions: Properties, Bayesian and non-Bayesian estimation with applications. Comput. Stat. 2020, 35, 281–308. [Google Scholar] [CrossRef]
  14. El-Morshedy, M.; Eliwa, M.S.; Afify, A.Z. The odd Chen generator of distributions: Properties and estimation methods with applications in medicine and engineering. J. Natl. Sci. Found. Sri Lanka 2020, 48, 113–130. [Google Scholar]
  15. El-Morshedy, M.; Eliwa, M.S. The odd flexible Weibull-H family of distributions: Properties and estimation with applications to complete and upper record data. Filomat 2019, 33, 2635–2652. [Google Scholar] [CrossRef]
  16. Tahir, M.H.; Hussain, M.A.; Cordeiro, G.M.; El-Morshedy, M.; Eliwa, M.S. A new Kumaraswamy generalized family of distributions with properties, applications, and bivariate extension. Mathematics 2020, 8, 1989. [Google Scholar] [CrossRef]
  17. Xie, M.; Lai, C.D. Reliability analysis using an additive Weibull model with bathtub-shaped failure rate function. Reliabil. Eng. Syst. Saf. 1996, 52, 87–93. [Google Scholar] [CrossRef]
  18. Wang, F.K. A new model with bathtub-shaped failure rate using an additive Burr XII distribution. Reliabil. Eng. Syst. Saf. 2000, 70, 305–312. [Google Scholar] [CrossRef]
  19. Sarhan, A.M.; Zaindin, M. Modified Weibull distribution. Appl. Sci. 2009, 11, 123–136. [Google Scholar]
  20. Almalki, S.J.; Yuan, J. A new modified Weibull distribution. Reliabil. Eng. Syst. Saf. 2013, 111, 164–170. [Google Scholar] [CrossRef]
  21. Oluyede, B.; Foya, S.; Warahena-Liyanage, G.; Huang, S. The log-logistic weibull distribution with applications to lifetime data. Aust. J. Stat. 2016, 45, 43–69. [Google Scholar] [CrossRef]
  22. He, B.; Cui, W.; Du, X. An additive modified Weibull distribution. Reliabil. Eng. Syst. Saf. 2016, 145, 28–37. [Google Scholar] [CrossRef]
  23. Kyurkchiev, N. Comments on the epsilon and omega cumulative distributions: “Saturation in the hausdorff sense”. AIP Conf. Proc. 2021, 2321, 030020-1–030020-8. [Google Scholar]
  24. Kyurkchiev, N.; Markov, S. On the Hausdorff distance between the Heaviside step function and Verhulst logistic function. J. Math. Chem. 2016, 54, 109–119. [Google Scholar] [CrossRef]
  25. Nadarajah, S.; Kotz, S. The exponentiated-type distributions. Acta Appl. Math. 2006, 92, 97–111. [Google Scholar] [CrossRef]
  26. Cordeiro, G.M.; Nadarajah, S. Closed-form expressions for moments of a class of beta generalized distributions. Braz. J. Probab. Stat. 2011, 25, 14–33. [Google Scholar] [CrossRef]
  27. Fleming, T.R.; Harrington, D.P. Counting Process and Survival Analysis; John Wiley: New York, NY, USA, 1994. [Google Scholar]
  28. Therneau, T.M.; Grambsch, P.M.; Fleming, T.R. Martingale-based residuals for survival models. Biometrika 1990, 77, 147–160. [Google Scholar] [CrossRef]
  29. Lemonte, A.J.; Cordeiro, G.M.; Ortega, E.M. On the additive Weibull distribution. Commun. Stat. Theory Methods 2014, 43, 2066–2080. [Google Scholar] [CrossRef]
  30. Chen, G.; Balakrishnan, N. A general purpose approximate goodness-of-fit test. J. Qual. Technol. 1995, 27, 154–161. [Google Scholar] [CrossRef]
  31. Aarset, M.V. How to identify a bathtub hazard rate. IEEE Trans. Reliabil. 1987, 36, 106–108. [Google Scholar] [CrossRef]
  32. Andrews, D.F.; Herzberg, A.M. Data: A Collection of Problems from Many Fields for the Student and Research Worker; Springer: New York, NY, USA, 1985. [Google Scholar]
  33. Cooray, K.; Ananda, M.M. A generalization of the half-normal distribution with applications to lifetime data. Commun. Stat. Theory Methods 2008, 37, 1323–1337. [Google Scholar] [CrossRef]
  34. Paranaiba, P.F.; Ortega, E.M.; Cordeiro, G.M.; Pascoa, M.A.D. The Kumaraswamy Burr XII distribution: Theory and practice. J. Stat. Comput. Simul. 2013, 83, 2117–2143. [Google Scholar] [CrossRef]
  35. Bjerkedal, T. Acquisition of resistance in guinea pigs infected with different doses of virulent tubercle bacilli. Am. J. Hyg. 1960, 72, 130–148. [Google Scholar] [PubMed]
  36. Gupta, R.C.; Kannan, N.; Raychaudhuri, A. Analysis of lognormal survival data. Math. Biosci. 1997, 139, 103–115. [Google Scholar] [CrossRef]
  37. Korkmaz, M.Ç.; Genç, A.I. Two-Sided Generalized Exponential Distribution. Commun. Stat. Theory Methods 2015, 44, 5049–5070. [Google Scholar] [CrossRef]
  38. Korkmaz, M.Ç. A generalized skew slash distribution via gamma-normal distribution. Commun. Stat. Simul. Comput. 2017, 46, 1647–1660. [Google Scholar] [CrossRef]
  39. Brito, E.; Cordeiro, G.M.; Yousof, H.M.; Alizadeh, M.; Silva, G.O. The Topp Leone odd log-logistic family of distributions. J. Stat. Comput. Simul. 2017, 87, 3040–3058. [Google Scholar] [CrossRef]
Figure 1. The pdf plots of the AOLLOW-N distribution.
Figure 1. The pdf plots of the AOLLOW-N distribution.
Mathematics 09 01837 g001
Figure 2. The pdf and hrf plots of the AOLLOW-W distribution.
Figure 2. The pdf and hrf plots of the AOLLOW-W distribution.
Mathematics 09 01837 g002
Figure 3. The pdf and hrf plots of the AOLLOW-Ga distribution.
Figure 3. The pdf and hrf plots of the AOLLOW-Ga distribution.
Mathematics 09 01837 g003
Figure 4. The skewness and kurtosis plots for the selected AOLLOW-N distributions.
Figure 4. The skewness and kurtosis plots for the selected AOLLOW-N distributions.
Mathematics 09 01837 g004
Figure 5. The pdf plots of the LAOLLOW-W density.
Figure 5. The pdf plots of the LAOLLOW-W density.
Mathematics 09 01837 g005
Figure 6. Simulation results of the AOLLOW-N distribution.
Figure 6. Simulation results of the AOLLOW-N distribution.
Mathematics 09 01837 g006
Figure 7. TTT plots for stress data (a) and guinea pig data (b).
Figure 7. TTT plots for stress data (a) and guinea pig data (b).
Mathematics 09 01837 g007
Figure 8. The fitted plots for the first dataset.
Figure 8. The fitted plots for the first dataset.
Mathematics 09 01837 g008
Figure 9. The fitted plots for the second dataset.
Figure 9. The fitted plots for the second dataset.
Mathematics 09 01837 g009
Figure 10. The index (a) and qq (b) plots of the modified deviance residuals.
Figure 10. The index (a) and qq (b) plots of the modified deviance residuals.
Mathematics 09 01837 g010
Table 1. The simulation results of the AOLLOW-W distribution based on empirical mean and standard deviation (in parentheses).
Table 1. The simulation results of the AOLLOW-W distribution based on empirical mean and standard deviation (in parentheses).
Parameters n = 20 n = 60 n = 100
α , β , λ , γ α ^ β ^ λ ^ γ ^ α ^ β ^ λ ^ γ ^ α ^ β ^ λ ^ γ ^
0.5, 2, 0.5, 20.53272.07540.51562.18580.50762.01870.50602.06860.50672.02050.50222.0351
(0.2003)(0.4328)(0.0359)(0.3527)(0.0982)(0.2422)(0.0174)(0.2160)(0.0717)(0.1615)(0.0125)(0.1509)
1, 0.5, 0.5, 21.04070.54910.52322.53671.01670.54650.50402.15411.00180.53990.50162.1123
(0.6327)(0.2974)(0.0848)(0.7923)(0.4233)(0.1849)(0.0428)(0.5598)(0.3554)(0.1624)(0.0330)(0.4873)
5, 5, 0.5, 0.54.90995.07240.52660.54074.85265.09900.51770.52454.97845.01630.50740.5098
(0.3564)(0.2439)(0.0797)(0.1098)(0.5727)(0.4224)(0.0490)(0.0624)(0.1569)(0.1126)(0.0343)(0.0419)
2, 2, 2, 21.88512.35152.03642.25211.97402.15482.01252.07901.99962.08802.00512.0457
(0.7248)(0.5187)(0.1201)(0.5273)(0.5768)(0.3070)(0.0661)(0.2762)(0.5067)(0.0661)(0.0547)(0.2370)
1, 2, 3, 41.10582.17573.04274.35211.03552.04833.01274.13081.02162.02113.00754.0867
(0.4546)(0.7245)(0.0893)(0.5841)(0.2447)(0.3964)(0.0516)(0.3320)(0.2018)(0.3024)(0.0407)(0.2894)
4, 3, 2, 13.88583.02932.11361.25963.91763.00122.05391.11673.95752.97172.03681.0697
(0.7970)(0.9388)(0.2442)(0.5369)(0.6317)(0.7288)(0.1538)(0.2735)(0.4844)(0.5696)(0.1172)(0.1893)
Table 2. The results of the first data set (standard errors of the estimates are in parentheses).
Table 2. The results of the first data set (standard errors of the estimates are in parentheses).
Model α ^ β ^ λ ^ γ ^ ^ AIC KS A * W *
AOLLOW-W43.519910.95910.00020.0461999.9641207.92820.05050.37900.0460
(10.6311)(1.6396)(0.00001)(0.0017)
AW0.67030.78930.31391.2451102.8146213.62920.08260.93720.1601
(0.7347)(0.2712)(0.7209)(0.5576)
GOLL2-W0.88910.63991.61661.0111102.8434213.68690.09021.01400.1814
(0.1949)(0.1222)(0.2645)(0.1777)
GOLL-W1.17120.60620.61311.1123102.7667213.53350.07980.93480.1561
(0.9348)(0.8847)(0.9732)(0.4377)
Kw-W0.71970.24293.50481.0362102.6217213.24330.07520.84320.1376
(0.0053)(0.0245)(0.0041)(0.0106)
OLL-W0.8892 1.03961.0109102.8435211.68690.09031.01450.1816
(0.1944) (0.1286)(0.1771)
OW-W 6.24920.03300.1077102.8714211.74280.08470.97780.1686
(13.8005)(0.2478)(0.2391)
W 1.01010.9260102.9768209.95360.09061.12200.1963
(0.1141)(0.0726)
Table 3. The results of the second data set (standard errors of the estimates are in parentheses).
Table 3. The results of the second data set (standard errors of the estimates are in parentheses).
Model α ^ β ^ δ ^ λ ^ ^ AIC KS A * W *
AOLLOW-Ga0.05980.03710.441274.2805386.5875781.17510.08830.54830.1003
(0.0163)(0.0057)( 2.3 × 10 6 )(1.1 × 10−9)
GOLL2-Ga4.51233.4472 7.7 × 10 5 3.5404391.0022790.00430.08970.91380.1411
(1.3647)(2.5399)( 1 × 10 6 )(0.1279)
GOLL-Ga12.20280.0621 1.7 × 10 5 1.6275390.5474789.09480.09060.74930.1254
(1.1728)(0.0403)(0.0001)(0.9773)
Kw-Ga287.20960.37960.03060.0206390.3771788.75430.09860.94090.1761
(0.4384)(0.1826)(0.0140)(0.0099)
OLL-Ga10.3182 2.5 × 10 5 0.1202390.6752787.35030.08880.82850.1285
(0.7899) (0.0001)(0.1771)
OW-Ga 0.04650.392561.1120394.4674794.93480.22765.29881.1630
(0.0042)(0.0001)(0.1688)
Ga 0.02082.0810394.2476792.49520.13851.89600.3555
(0.0037)(0.3232)
Table 4. The results of the fitted regression models.
Table 4. The results of the fitted regression models.
Models
Log-WeibullLog-TLOLL-WLAOLLOW-W
ParametersEstimateS.E.p-ValueEstimateS.E.p-ValueEstimateS.E.p-Value
α ---2.3403.546-5.2444.840-
β ---24.0293.015-4.9865.745-
σ 1.4780.133-9.68012.526-8.2708.640-
β 0 1.6396.8350.811−0.6458.4590.9396.6893.1990.036
β 1 0.1040.0960.2790.0740.0970.4480.2360.0860.006
β 2 −0.0920.02<0.001−0.0530.020.009−0.0790.018<0.001
β 3 1.1260.6580.0871.6760.5970.005−0.0820.4700.861
β 4 2.5440.378<0.0012.3940.384<0.0010.2630.3550.458
171.2405164.684161.911
AIC354.481345.368339.822
BIC370.2894366.4458360.900
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Altun, E.; Korkmaz, M.Ç.; El-Morshedy, M.; Eliwa, M.S. A New Flexible Family of Continuous Distributions: The Additive Odd-G Family. Mathematics 2021, 9, 1837. https://doi.org/10.3390/math9161837

AMA Style

Altun E, Korkmaz MÇ, El-Morshedy M, Eliwa MS. A New Flexible Family of Continuous Distributions: The Additive Odd-G Family. Mathematics. 2021; 9(16):1837. https://doi.org/10.3390/math9161837

Chicago/Turabian Style

Altun, Emrah, Mustafa Ç. Korkmaz, Mahmoud El-Morshedy, and Mohamed S. Eliwa. 2021. "A New Flexible Family of Continuous Distributions: The Additive Odd-G Family" Mathematics 9, no. 16: 1837. https://doi.org/10.3390/math9161837

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop