Next Article in Journal
A Model-Based Design Method for the Correlation between Customer Feedback and Technical Design Parameters in the Context of Systems Engineering
Previous Article in Journal
Characterizing Prediction Uncertainty in Agricultural Modeling via a Coupled Statistical–Physical Framework
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Logit Truncated-Exponential Skew-Logistic Distribution with Properties and Applications

1
Department of Mathematics, Xi’an University of Technology, Xi’an 710054, China
2
College of Big Data and Internet, Shenzhen Technology University, Shenzhen 518118, China
3
Department of Mathematical Sciences, Eastern New Mexico University, Portales, NM 88130, USA
4
Department of Mathematical Sciences, New Mexico State University, Las Cruces, NM 88003, USA
*
Author to whom correspondence should be addressed.
Modelling 2021, 2(4), 776-794; https://doi.org/10.3390/modelling2040041
Submission received: 12 November 2021 / Revised: 5 December 2021 / Accepted: 13 December 2021 / Published: 15 December 2021

Abstract

:
In recent years, bounded distributions have attracted extensive attention. At the same time, various areas involve bounded interval data, such as proportion and ratio. In this paper, we propose a new bounded model, named logistic Truncated exponential skew logistic distribution. Some basic statistical properties of the proposed distribution are studied, including moments, mean residual life function, Renyi entropy, mean deviation, order statistics, exponential family, and quantile function. The maximum likelihood method is used to estimate the unknown parameters of the proposed distribution. More importantly, the applications to three real data sets mainly from the field of engineering science prove that the logistic Truncated exponential skew logistic distribution fits better than other bounded distributions.

1. Introduction

In daily life, data related to percentage and proportion are bounded. For example, the proportion of charitable donations in income needs a bounded continuous distribution to describe it; see Papke and Wooldridge [1]. In the statistical literature, Beta distribution is a famous bounded distribution. However, its cumulative distribution function (cdf) is not a closed expression and contains special functions. Scholars have therefore proposed bounded distributions that were more flexible than the Beta distribution. Topp and Leone [2] proposed the Topp–Leone distribution, which was useful to model life cycle phenomena and was used as an alternative to Beta distribution in many respects. Pourdarvish et al. [3] studied a generalization of Topp–Leone distribution referred to as the exponentiated Topp–Leone distribution. They discussed many aspects of this new model, such hazard rate function, the moments, and the order statistics, and applied it to milk yield data. Kumaraswamy [4] introduced another widely used distribution referred to as the Kumaraswamy distribution with bounded support, which was applicable to many natural phenomena in which the process values were bounded on both sides. Lemonte et al. [5] used the exponentiated Kumaraswamy distribution for reliability and life analysis and emphasized that its advantage over Beta distribution was a simple explicit formula of distribution and quantile function without involving any special function.
In recent years, there has been increasing interest in introducing new distributions defined on bounded support. Gómez-Déniz et al. [6] described that log-Lindley distribution could be effectively applied to different scenarios, such as actuarial settings or econometric analysis. Especially in insurance, the log-Lindley regression model could be regarded as an alternative to the classical Beta regression model. Altun and Hamedani [7] claimed that the advantage of log-xgamma distribution lied in its simple form and certain flexibility through risk rate function, and it could be used to model life cycle data sets. Korkmaz [8] showed that logit slash had a good fitting effect for high-kurtosis skewness data sets with outlier observations and that the proposed model could be an alternative to the classical bounded distributions available in the statistical literature to model percentage and proportion. Mazucheli et al. [9] demonstrated that the unit-Weibull regression model was more suitable for modeling health, accounting, and social science data sets than Beta and Kumaraswamy regression models. Altun [10] discussed the flexibility of log-weighted distribution through the educational-attainment and self-reported-health data sets of OECD countries and a new regression for bounded response variable was introduced based on the log-weighted distribution.
The Truncated-exponential skew-logistic (TESL) distribution is a member of the skew logistic distribution family, which was studied by Mirzadeh and Iranmanesh [11]. Compared with other skew logistic distributions, the TESL distribution has closed expressions of the the probability density function (pdf), cdf, and quantile function. In addition, the TESL distribution belongs to the exponential family. Therefore, the skewness parameter can be derived more easily. Meanwhile, the TESL distribution can be used to model unimodal data with certain skewness. Compared with other skew logistic distributions, the TESL distribution was better than other models. A random variable Y has a TESL distribution with parameters μ R , σ > 0 and λ R , denoted by TESL ( μ , σ , λ ) , if its pdf is given by
f T E S L y ; μ , σ , λ = λ exp y μ σ σ 1 exp ( λ ) 1 + exp y μ σ 2 exp λ 1 + exp y μ σ , y R .
The cdf of Y is given by
F T E S L y ; μ , σ , λ = 1 1 exp ( λ ) 1 exp λ 1 + exp y μ σ , y R .
In this paper, we introduce a new bounded distribution, named by the logit TESL (LTESL) distribution, which is generated from e Y 1 + e Y , where Y follows the TESL distribution introduced in the literature. The new distribution has closed and tractable properties, such as its pdf, and cdf and quantile functions have explicit forms. Since the pdf of the proposed distribution is explicit and straightforward, the parameters of the new distribution can be estimated by maximum-likelihood estimation. The application results show that compared with other bounded distributions on the unit interval, the new distribution provides better modeling ability.
The rest of the paper is organized as follows: Some important mathematical properties of the new distribution are derived in Section 2, such as cdf, moment, mean residual life function, Renyi entropy, mean deviation, order statistics, exponential family, and quantile function. The maximum-likelihood estimation of the new distribution is discussed in Section 3. Simulation studies to evaluate the proposed methods are conducted in Section 4. Three real data applications are studied in Section 5. Some conclusions are given in Section 6.

2. Logit Truncated-Exponential Skew-Logistic Distribution

This section describes some basic properties of the LTESL distribution, which is defined as follows.
Definition 1.
A random variable X has the LTESL distribution with parameters μ R , σ > 0 and λ R , denoted by L T E S L ( μ , σ , λ ) , if its pdf is given by
f x ; μ , σ , λ = λ σ x 1 x exp log x 1 x μ σ 1 exp ( λ ) 1 + exp log x 1 x μ σ 2 × exp λ 1 + exp log x 1 x μ σ , 0 < x < 1 .
Proposition 1.
Let X L T E S L ( μ , σ , λ ) ; the cdf of X is given by
F x ; μ , σ , λ = 1 1 exp ( λ ) 1 exp λ 1 + x 1 x 1 σ exp μ σ , 0 < x < 1 .
Proof of Proposition 1.
According to the variable substitution of the integral, we have
F x ; μ , σ , λ = λ σ 0 x 1 t ( 1 t ) exp log t 1 t μ σ 1 exp ( λ ) 1 + exp log t 1 t μ σ 2 × exp λ 1 + exp log t 1 t μ σ d t = λ log x 1 x μ σ exp ( ω ) 1 exp ( λ ) 1 + exp ( ω ) 2 exp λ 1 + exp ( ω ) d ω = λ 1 exp ( λ ) 1 + x 1 x 1 σ exp μ σ 1 u 2 exp λ u d u = 1 1 exp ( λ ) 1 exp λ 1 + x 1 x 1 σ exp μ σ ,
where ω = log t 1 t μ σ and u = 1 + e ω . Thus, the result is obtained. □
We point out that F x ; μ , σ , λ = F T E S L log x 1 x ; μ , σ , λ . Next, we give some important properties of the LTESL distribution.
Figure 1 shows the possible pdf curves of the LTESL distribution with different values of parameters. From these Figures, we can see that the LTESL distribution has the following pdf shapes: decreasing and unimodal. The LTESL distribution is a good choice to model unimodal skewness data set.
Proposition 2.
Let X L T E S L ( μ , σ , λ ) , the survival function (sf) of X is given by
S ( x ) = 1 F ( x ) = 1 1 1 exp ( λ ) 1 exp λ 1 + x 1 x 1 σ exp μ σ ,
the hazard rate function (hrf) of X is given by
h x ; μ , σ , λ = λ σ x ( 1 x ) g ( x ) 1 exp λ g ( x ) g 2 ( x ) × exp λ 1 + x 1 x 1 σ exp μ σ exp ( λ ) 1 ,
where g ( y ) = 1 + exp log y 1 y μ σ , and the reversed hazard rate of X is given by
r ( x ) = f ( x ) F ( x ) = λ σ x ( 1 x ) g ( x ) 1 exp λ g ( x ) g 2 ( x ) 1 exp λ 1 + x 1 x 1 σ exp μ σ 1 .
Figure 2 shows some hrf curves of the LTESL distribution with different values of parameters.
In mathematics and statistics, moment is a set of measures of variable distribution and morphological characteristics. For the next property, we deduce the r-order original moment of the LTESL distribution and further obtain its mean and variance.
Proposition 3.
Let X L T E S L ( μ , σ , λ ) ; the rth moment of X is given by
μ r = 1 r 1 exp ( λ ) 0 1 x r 1 1 exp λ 1 + x 1 x 1 σ exp μ σ d x .
Proof of Proposition 3.
According to the definition of rth moment, we have
μ r = E ( X r ) = Ω X r d p = Ω 0 X r x r 1 d x d p = Ω 0 I X > x r x r 1 d x d p = 0 r x r 1 Ω I X > x d p d x = 0 r x r 1 P ( X > x ) d x = 0 1 r x r 1 ( 1 F ( x ) ) d x = 1 0 1 r x r 1 F ( x ) d x = 1 r 1 exp ( λ ) 0 1 x r 1 1 exp λ 1 + x 1 x 1 σ exp μ σ d x ,
where I A is the indicator function and Ω is probability space. Hence, the result is obtained. □
Corollary 1.
Let X L T E S L ( μ , σ , λ ) ; the mean and the variance of X are given by
E ( X ) = 1 Δ 1 1 exp ( λ ) ,
V a r ( X ) = 1 1 exp ( λ ) 2 ( Δ 1 Δ 2 ) Δ 1 2 1 exp ( λ ) ,
where
Δ 1 = 0 1 1 exp λ 1 + x 1 x 1 σ exp μ σ d x ,
and
Δ 2 = 0 1 x 1 exp λ 1 + x 1 x 1 σ exp μ σ d x .
Entropy is used to describe the degree of uncertainty of random variables. In the environment where information is represented by probability distribution, entropy represents the degree of uneven distribution of information. Renyi entropy is of great significance in statistics and social ecology.
Proposition 4.
Let X L T E S L ( μ , σ , λ ) , the Renyi entropy is of X, R γ ( x ) , is given by
R γ ( x ) = 1 1 γ γ log λ σ ( 1 exp ( λ ) ) + log 0 1 x ( 1 x ) g ( x ) 1 exp λ g ( x ) g 2 ( x ) γ d x ,
where γ > 0 and γ 1 .
Proof of Proposition 4.
Using the definition of Renyi entropy, we have
R γ ( x ) = 1 1 γ log 0 f γ x ; μ , σ , λ d x = 1 1 γ log λ σ ( 1 exp ( λ ) ) γ 0 1 x ( 1 x ) g ( x ) 1 exp λ g ( x ) g 2 ( x ) γ d x = 1 1 γ γ log λ σ ( 1 exp ( λ ) ) + log 0 1 x ( 1 x ) g ( x ) 1 exp λ g ( x ) g 2 ( x ) γ d x .
Proposition 5.
The LTESL distribution belongs to the exponential family.
Proof of Proposition 5.
Assume X L T E S L ( μ , σ , λ ) . Let g ( x ) be a function defined the same way as in Proposition 2. Then the pdf of X becomes
f x ; μ , σ , λ = λ σ x ( 1 x ) g ( x ) 1 [ 1 exp ( λ ) ] g 2 ( x ) exp λ g ( x ) = 1 x ( 1 x ) 1 g ( x ) 1 g 2 ( x ) λ σ [ 1 exp ( λ ) ] exp λ 1 g ( x ) , 0 < x < 1 ,
which means the pdf of the LTESL distribution can be rewritten as
f x ; μ , σ , λ = h ( x ) c ( λ ) exp w ( λ ) t ( x ) ,
where t ( x ) = 1 g ( x ) , c ( λ ) = λ σ [ 1 exp ( λ ) ] , w ( λ ) = λ and h ( x ) = 1 x ( 1 x ) 1 g ( x ) 1 g 2 ( x ) . Therefore, the result is obtained. □
The mean residual life (MRL) function can be applied to reliability, survival analysis, economics, actuarial science, and other fields. It describes the expected survival time of an individual or object under the condition of survival at time t.
Proposition 6.
Let X L T E S L ( μ , σ , λ ) , the MRL function of X is given by
μ X ( t ) = 1 δ ( t ) exp ( λ ) exp ( λ ) t δ ( t ) + λ exp μ σ σ t 1 x 1 1 σ ( 1 x ) 2 1 σ δ ( x ) 1 + x 1 x 1 σ exp μ σ 2 d x ( 1 t ) exp ( λ ) ,
where δ ( y ) = exp λ 1 + y 1 y 1 σ exp μ σ .
Proof of Proposition 6.
Using the definition of MLR function, we have
E ( X t | X > t ) = 1 S ( t ) t 1 S ( x ) d x = 1 exp ( λ ) δ ( t ) exp ( λ ) [ ( 1 t ) exp ( λ ) 1 exp ( λ ) + 1 1 exp ( λ ) ( exp ( λ ) t δ ( t ) ( 1 t ) exp ( λ ) + λ exp μ σ σ t 1 x 1 1 σ ( 1 x ) 2 1 σ δ ( x ) 1 + x 1 x 1 σ exp μ σ 2 d x ) ] = 1 δ ( t ) exp ( λ ) exp ( λ ) t δ ( t ) + λ exp μ σ σ t 1 x 1 1 σ ( 1 x ) 2 1 σ δ ( x ) 1 + x 1 x 1 σ exp μ σ 2 d x ( 1 t ) exp ( λ ) .
Thus, the results are derived. □
Order statistics is a common and important statistic in mathematical statistics. It has a wide range of applications in the field of parametric and nonparametric statistics.
Proposition 7.
Let X 1 , X 2 , , X n be a random sample coming from L T E S L ( μ , σ , λ ) , and X ( 1 ) X ( 2 ) X ( n ) denote the corresponding order statistics. Then the pdf and cdf of the k t h order statistic X ( k ) are, respectively, given by
f x ( k ) , μ , σ , λ = λ n ! σ ( k 1 ) ! ( n k ) ! 1 x ( 1 x ) ( g ( x ) 1 ) exp λ g ( x ) g 2 ( x ) i = 0 n k ( 1 ) 2 i + k 1 n k i δ ( x ) 1 1 exp ( λ ) k + i 1 ,
F x ( k ) , μ , σ , λ = j = k n i = 0 n j ( 1 ) 2 i + j 1 n j n j i δ ( x ) 1 1 exp ( λ ) j + i 1 .
Proof of Proposition 7.
As we all know, the pdf and cdf of X ( k ) are, respectively, given by
f x ( k ) , μ , σ , λ = n ! ( k 1 ) ! ( n k ) ! ( F ( x ) ) k 1 ( 1 F ( x ) ) n k f ( x ) ,
F x ( k ) , μ , σ , λ = j = k n n j ( F ( x ) ) j ( 1 F ( x ) ) n j .
Hence, the results are obtained using Equations (1) and (2). □
The mean deviation can be used to measure the spread of a population. In the following proposition, we derive the mean deviation of the LTESL distribution.
Proposition 8.
Let X L T E S L ( μ , σ , λ ) , the mean deviation about the mean of X is given by
D ( m ) = 2 m ( δ ( m ) 1 ) λ g ( m ) ( ( u 1 ) σ exp ( μ ) + 1 ) 1 1 u 2 exp λ u d u 1 exp ( λ ) ,
where m = E ( X ) .
Proof of Proposition 8.
Using Equation (1) and D ( m ) = E [ | X m | ] , we have
D ( m ) = 0 m ( m x ) f ( x ) d x + m 1 ( x m ) f ( x ) d x = m 0 m f ( x ) d x m 1 f ( x ) d x + 0 m x f ( x ) d x + m 1 x f ( x ) d x = 2 m F ( m ) 0 m x f ( x ) d x = 2 m ( 1 δ ( m ) ) 1 exp ( λ ) λ 1 exp ( λ ) g ( m ) ( ( u 1 ) σ exp ( μ ) + 1 ) 1 1 u 2 exp λ u d u = 2 m ( δ ( m ) 1 ) λ g ( m ) ( ( u 1 ) σ exp ( μ ) + 1 ) 1 1 u 2 exp λ u d u 1 exp ( λ ) .
Thus, the result is obtained. □
In regression analysis, sometimes the mean value is not a simple analytic expression. Therefore, quantile can also be used as a choice of regression model. In addition, the quantile function can also generate random numbers.
Proposition 9.
Let X L T E S L ( μ , σ , λ ) , the quantile function of X, Q ( τ ) = F 1 ( τ ; μ , σ , λ ) , is given by
Q ( τ ) = λ log 1 τ [ 1 exp ( λ ) ] 1 σ exp ( μ ) + λ log 1 τ [ 1 exp ( λ ) ] 1 σ ,
where 0 < τ < 1 .
Proof of Proposition 9.
For 0 < τ < 1 , suppose there exists a x ( 0 , 1 ) such that x = F 1 ( τ ; μ , σ , λ ) = Q ( τ ) ; that is, F ( x ; μ , σ , λ ) = τ . Then
F x ; μ , σ , λ = 1 1 exp ( λ ) 1 exp λ 1 + x 1 x 1 σ exp μ σ = τ exp λ 1 + x 1 x 1 σ exp μ σ = 1 τ 1 exp ( λ ) λ 1 + x 1 x 1 σ exp μ σ = log 1 τ 1 exp ( λ ) x 1 x 1 σ = λ log 1 τ [ 1 exp ( λ ) ] 1 exp μ σ x = λ log 1 τ [ 1 exp ( λ ) ] 1 σ exp ( μ ) + λ log 1 τ [ 1 exp ( λ ) ] 1 σ .
Thus, the result is derived. □

3. Maximum-Likelihood Estimation

In this section, based on LTESL distribution, we consider maximum-likelihood estimation (MLE) to estimate its unknown parameters.
Let x 1 , x 2 , , x n be a random sample from L T E S L ( μ , σ , λ ) . The likelihood function is given by
L ( Θ | x 1 , x 2 , , x n ) = λ σ ( 1 exp ( λ ) ) n 1 i = 1 n x i ( 1 x i ) exp i = 1 n log x i 1 x i n μ σ i = 1 n 1 + exp log x i 1 x i n μ σ 2 × exp λ i = 1 n 1 1 + exp log x i 1 x i n μ σ ,
where Θ = ( μ , σ , λ ) is the parameter vector. The log-likelihood function is given by
( Θ | x 1 , x 2 , , x n ) = n log λ σ ( 1 exp ( λ ) ) + i = 1 n log 1 x i ( 1 x i ) i = 1 n log x i 1 x i n μ σ 2 i = 1 n log 1 + exp log x i 1 x i n μ σ λ i = 1 n 1 1 + exp log x i 1 x i n μ σ .
Therefore, the MLE μ ^ M L E , σ ^ M L E and λ ^ M L E of μ , σ , and λ can be obtained by solving the following equations. The partial derivatives, μ , σ , and λ and set them to be 0, we have
μ = 1 σ n + 2 i = 1 n exp log x i 1 x i μ σ 1 + exp log x i 1 x i μ σ λ i = 1 n exp log x i 1 x i μ σ 1 + exp log x i 1 x i μ σ 2 = 0 , σ = 1 σ 2 n ( σ μ ) + i = 1 n log x i 1 x i 2 i = 1 n log x i 1 x i μ exp log x i 1 x i μ σ 1 + exp log x i 1 x i μ σ λ i = 1 n log x i 1 x i μ exp log x i 1 x i μ σ 1 + exp log x i 1 x i μ σ 2 = 0 , λ = n λ i = 1 n 1 1 + exp log x i 1 x i μ σ = 0 .
Since the likelihood equation contains nonlinear functions, it is impossible to obtain the explicit form of the MLE. Therefore, numerical methods such as the Newton–Raphson method can be used to solve the above equations.
Fisher information can reflect the accuracy of parameter estimation. Therefore, we have that the Fisher information of the LTESL distribution in the following
I ( Θ ) = E μ μ μ σ μ λ σ μ σ σ σ λ λ μ λ σ λ λ ,
where
μ μ = 1 σ 2 i = 1 n g ( x i ) 1 g 2 ( x i ) 2 λ ( 2 g ( x i ) ) g ( x i ) , μ λ = 1 σ i = 1 n g ( x i ) 1 g 2 ( x i ) = λ μ , μ σ = 1 σ 2 n + i = 1 n g ( x i ) 1 g ( x i ) 2 λ g ( x i ) 1 σ 2 i = 1 n g ( x i ) 1 log x i 1 x i μ g 2 ( x i ) × 2 λ ( 2 g ( x i ) ) g ( x i ) = σ μ ,
σ σ = 1 σ 2 2 σ n ( σ μ ) + i = 1 n log x i 1 x i i = 1 n 2 + λ g ( x i ) g ( x i ) 1 log x i 1 x i μ g ( x i ) + 1 σ 2 i = 1 n g ( x i ) 1 log x i 1 x i μ 2 g 2 ( x i ) 1 + 2 g ( x i ) n , σ λ = 1 σ 2 i = 1 n g ( x i ) 1 log x i 1 x i μ g 2 ( x i ) = λ σ , λ λ = n λ 2 .

4. Simulation

In this section, the MLE method is considered to estimate the unknown parameters of LTESL distribution. We carry out simulation with R software, and the simulation experiments are repeated N = 1000 times. In addition, we can generate random numbers from LTESL distribution according to the following steps:
(i)
Set μ , σ and λ .
(ii)
Simulate u U n i f o r m ( 0 , 1 ) .
(iii)
Compute Y = F T E S L 1 ( u ) = σ log log ( 1 u ( 1 exp ( λ ) ) ) λ + log ( 1 u ( 1 exp ( λ ) ) ) + μ , then Y follows T E S L ( μ , σ , λ ) .
(iv)
Compute X = e Y 1 + e Y , then X follows L T E S L ( μ , σ , λ ) .
The evaluation of point estimation is based on the empirical mean and the standard deviation (SD) of each sample size. We set the sample size at n = 20 , 30 , 50 , 100 , 200 , 500 , and 1000, and Θ = ( 1.5 , 1 , 1 ) , ( 1.5 , 2.5 , 1 ) , ( 2 , 1 , 0.5 ) , ( 3 , 2 , 4 ) , and ( 5 , 4 , 6 ) . The mean values of the parameters and the corresponding SD are shown in Table 1.
From Table 1, it can be seen that as the sample size n increases, the mean value of estimators become closer to the true values and the SD decreases in all cases. In addition, the estimation is quite stable and the estimated values of all estimators tend to be more accurate.

5. Applications

In this section, we use three real data sets to verify the practicability of the LTESL distribution and compare it with some other bounded distributions, including the Beta distribution, Kumaraswamy (Kw) distribution, unit-Weibull (UW) distribution, and exponentiated Topp-Leone (ETL) distribution. The parameter estimates, minus log-likelihood, Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC), and Kolmogorov–Smirnov (K-S) statistics ( D n ) are listed in Table 2, Table 3 and Table 4. The D n of K-S is given by
D n = sup i = 1 , , n | F n ( x ( i ) ; Θ ^ ) F ( x ( i ) ; Θ ^ ) | ,
where Θ = ( μ , σ , λ ) is MLE of Θ , F n is the empirical distribution for the sample data, and x ( i ) are the ordered data defined before. Meanwhile, the AIC and BIC are defined as
A I C = 2 ( Θ ^ ) + 2 k ,
B I C = 2 ( Θ ^ ) + k log ( n ) ,
where ( Θ ^ ) is the log-likelihood function for Θ , k is the number of parameters of the model, and n is the size of the data set. The minimum values for AIC or BIC and maximum value for p-Value represent the best-fitted model on the data sets. Compare the LTESL distribution with the distributions defined on ( 0 , 1 ) interval as follows.
(1)
Beta distribution:
f 1 ( x ) = 1 B ( μ , σ ) x μ 1 ( 1 x ) σ 1 , μ , σ > 0 ,
where B ( μ , σ ) is the beta function.
(2)
Kw distribution:
f 2 ( x ) = μ σ x μ 1 ( 1 x μ ) σ 1 , μ , σ > 0 .
(3)
UW distribution:
f 3 ( x ) = 1 x μ σ ( l o g ( x ) ) σ 1 e μ ( l o g ( x ) ) σ , μ , σ > 0 .
(4)
ETL distribution:
f 4 ( x ) = 2 μ σ ( 1 x ) [ x ( 2 x ) ] μ 1 [ 1 x μ ( 2 x ) μ ] σ 1 , μ , σ > 0 .
Data set 1: The first data set is related to the perforation operation on the metal plate, which has been considered by Dey et al. [12]. It represents 50 burr observations on the metal plate, for which the hole diameter is 9 mm and the sheet thickness is 2 mm. These data are related to the two different machines being compared. The data are: 0.06, 0.12, 0.14, 0.04, 0.14, 0.16, 0.08, 0.26, 0.32, 0.22, 0.16, 0.12, 0.24, 0.06, 0.02, 0.18, 0.22, 0.14, 0.22, 0.16, 0.12, 0.24, 0.06, 0.02, 0.18, 0.22, 0.14, 0.02, 0.18, 0.22, 0.14, 0.06, 0.04, 0.14, 0.22, 0.14, 0.06, 0.04, 0.16, 0.24, 0.16, 0.32, 0.18, 0.24, 0.22, 0.04, 0.14, 0.26, 0.18, 0.16.
Table 2 shows the parameter estimator results of the LTESL, Beta, Kw, UW, and ETL distributions for data set 1. The corresponding minus log-likelihood, AIC, BIC, and p-Value are also presented. As seen from Table 2, we find that the LTESL distribution has the minimum values for AIC or BIC, and maximum value for p-Value compared other bounded distributions.
Figure 3 shows the empirical distribution function for data set 1 and the estimated bounded distributions. From Figure 3, the LTESL distribution provides a qualified fit for data set 1. The distribution of sample points is roughly the same as the LTESL distribution by the Q-Q plot presented in Figure 4.
Data set 2: The second data set covers the effective failure time of 20 components, which are taken from Nigm et al. [13]. These data are given as follows: 0.0009, 0.0040, 0.0142, 0.0221, 0.0261, 0.0418, 0.0473, 0.0834, 0.1091, 0.1252, 0.1404, 0.1498, 0.1750, 0.2031, 0.2099, 0.2168, 0.2981, 0.3465, 0.4035, 0.6143.
Table 3 shows the estimated value of the parameters, AIC, BIC, and p-value of these bounded distributions. It can be seen that LTESL distribution provides the best fit among these bounded distributions.
Figure 5 shows the fitted cdf’s of the LTESL, Beta, Kw, UW, and ETL distributions for data set 2 and suggests that the fit of the LTESL distribution is reasonable. The corresponding Q-Q plots are also displayed in Figure 6.
Data set 3: The third data set considers 30 failures of an aircraft air conditioning system (in hours), which has been studied by Linhart and Zucchini [14]. This data set has recently been normalized to obtain data between 0 and 1 by Bantan et al. [15]. The data are: 0.086792453, 0.984905660, 0.328301887, 0.026415094, 0.452830189, 0.052830189, 0.233962264, 0.177358491, 0.849056604, 0.267924528, 0.928301887, 0.079245283, 0.158490566, 0.075471698, 0.018867925, 0.045283019, 0.452830189, 0.041509434, 0.011320755, 0.052830189, 0.267924528, 0.041509434, 0.052830189, 0.041509434, 0.060377358, 0.339622642, 0.003773585, 0.060377358, 0.196226415, 0.358490566.
Table 4 shows the parameter estimates, AIC, BIC, and p-Value for all fitted bounded distributions. According to Table 4, the LTESL distribution is more suitable than other bounded distributions for optimal data fitting.
Figure 7 shows the fitted models for data set 3. In Figure 7, we note that the LTESL distribution has better fitting than other bounded distributions. In addition, the corresponding Q-Q plots are also displayed in Figure 8.

6. Discussion

A new three-parameter LTESL distribution on bounded interval is introduced. The basic statistical properties of LTESL distribution are studied in detail, including moment, MRL function, Renyi entropy, mean deviation, order statistics, exponential family and quantile function. The maximum likelihood method for unknown parameter estimation of the proposed distribution is discussed through a simulation study. According to the analysis of real data sets in three engineering science fields, the results show that the new distribution has a better fitting than Beta, Kw, UW, and ETL distributions. Moreover, the LTESL distribution can be a good choice for processing some data in the field of engineering science.

Author Contributions

W.T.: Conceptualization, Methodology, Validation, Investigation, Resources, Supervision, Project Administration, Visualization, Writing review and editing; L.P.: Software, Formal analysis, Data curation, Writing—original draft preparation, Visualization; T.T. and X.C.: Software, Methodology, Visualization. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Datasets are provided in the paper.

Acknowledgments

The authors would like to thank the editor, and three anonymous referees for their careful reading of this article and for their constructive suggestions which considerably improved this article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Papke, L.E.; Wooldridge, J.M. Econometric methods for fractional response variables with an application to 401 (k) plan participation rates. J. Appl. Econom. 1996, 11, 619–632. [Google Scholar] [CrossRef] [Green Version]
  2. Topp, C.W.; Leone, F.C. A family of J-shaped frequency functions. J. Am. Stat. Assoc. 1955, 50, 209–219. [Google Scholar] [CrossRef]
  3. Pourdarvish, A.; Mirmostafaee, S.; Naderi, K. The exponentiated Topp-Leone distribution: Properties and application. J. Appl. Environ. Biol. Sci. 2015, 5, 251–256. [Google Scholar]
  4. Kumaraswamy, P. A generalized probability density function for double-bounded random processes. J. Hydrol. 1980, 46, 79–88. [Google Scholar] [CrossRef]
  5. Lemonte, A.J.; Barreto-Souza, W.; Cordeiro, G.M. The exponentiated Kumaraswamy distribution and its log-transform. Braz. J. Probab. Stat. 2013, 27, 31–53. [Google Scholar] [CrossRef]
  6. Gómez-Déniz, E.; Sordo, M.A.; Calder-Ojeda, E. The Log-Lindley distribution as an alternative to the beta regression model with applications in insurance. Insur. Math. Econ. 2014, 54, 49–57. [Google Scholar] [CrossRef]
  7. Altun, E.; Hamedani, G.G. The log-xgamma distribution with inference and application. J. Soc. Fr. Stat. 2018, 159, 40–55. [Google Scholar]
  8. Korkmaz, M.Ç. A new heavy-tailed distribution defined on the bounded interval: The logit slash distribution and its application. J. Appl. Stat. 2020, 47, 2097–2119. [Google Scholar] [CrossRef]
  9. Mazucheli, J.; Menezes, A.F.B.; Fernandes, L.B.; de Oliveira, R.P.; Ghitany, M.E. The unit-Weibull distribution as an alternative to the Kumaraswamy distribution for the modeling of quantiles conditional on covariates. J. Appl. Stat. 2020, 47, 954–974. [Google Scholar] [CrossRef]
  10. Altun, E. The log-weighted exponential regression model: Alternative to the beta regression model. Commun. Stat.-Theory Methods 2021, 50, 2306–2321. [Google Scholar] [CrossRef]
  11. Mirzadeh, S.; Iranmanesh, A. A new class of skew-logistic distribution. Math. Sci. 2019, 13, 375–385. [Google Scholar] [CrossRef] [Green Version]
  12. Dey, S.; Mazucheli, J.; Nadarajah, S. Kumaraswamy distribution: Different methods of estimation. Comput. Appl. Math. 2018, 37, 2094–2111. [Google Scholar] [CrossRef]
  13. Nigm, A.M.; Al-Hussaini, E.K.; Jaheen, Z.F. Bayesian one-sample prediction of future observations under Pareto distribution. Statistics 2003, 37, 527–536. [Google Scholar] [CrossRef]
  14. Linhart, H.; Zucchini, W. Model Selection; John Wiley Sons: Hoboken, NJ, USA, 1986. [Google Scholar]
  15. Bantan, R.A.; Chesneau, C.; Jamal, F.; Elgarhy, M.; Tahir, M.H.; Ali, A.; Zubair, M.; Anam, S. Some new facts about the unit-Rayleigh distribution with applications. Mathematics 2020, 8, 1954. [Google Scholar] [CrossRef]
Figure 1. The pdf curves of LTESL( μ , σ , λ ) for different parameters.
Figure 1. The pdf curves of LTESL( μ , σ , λ ) for different parameters.
Modelling 02 00041 g001
Figure 2. The hrf curves of LTESL( μ , σ , λ ) for different parameters.
Figure 2. The hrf curves of LTESL( μ , σ , λ ) for different parameters.
Modelling 02 00041 g002
Figure 3. Estimated cdfs for data set 1.
Figure 3. Estimated cdfs for data set 1.
Modelling 02 00041 g003
Figure 4. Q-Q plot for data set 1.
Figure 4. Q-Q plot for data set 1.
Modelling 02 00041 g004aModelling 02 00041 g004b
Figure 5. Estimated cdfs for data set 2.
Figure 5. Estimated cdfs for data set 2.
Modelling 02 00041 g005
Figure 6. Q-Q plots for data set 2.
Figure 6. Q-Q plots for data set 2.
Modelling 02 00041 g006
Figure 7. Estimated cdfs for data set 3.
Figure 7. Estimated cdfs for data set 3.
Modelling 02 00041 g007
Figure 8. Q-Q plot for data set 3.
Figure 8. Q-Q plot for data set 3.
Modelling 02 00041 g008aModelling 02 00041 g008b
Table 1. Parameter estimation of μ , σ , and λ under different samples.
Table 1. Parameter estimation of μ , σ , and λ under different samples.
n μ σ λ μ ^ ( SD ) σ ^ ( SD ) λ ^ ( SD )
201.5111.6977 (0.3829)1.0625 (0.1074)1.3455 (0.2259)
1.52.511.7625 (0.4773)2.4766 (0.4686)1.3550 (0.4907)
−210.5−1.8101 (0.4166)1.0638 (0.1090)0.8605 (0.4454)
3243.0849 (0.6539)2.0994 (0.1882)4.1419 (1.0866)
5464.3876 (3.0847)3.8286 (0.7388)6.4131 (3.2937)
301.5111.6859 (0.3303)1.0602 (0.0911)1.3248 (0.2335)
1.52.511.7364 (0.4327)2.4824 (0.3788)1.2900 (0.3991)
−210.5−1.8222 (0.3509)1.0497 (0.0887)0.8521 (0.4058)
3243.0685 (0.5513)2.0988 (0.1758)4.1102 (0.9704)
5464.4552 (3.0493)3.8496 (0.6641)6.2602 (3.1702)
501.5111.6675 (0.2621)1.0439 (0.0685)1.3241 (0.2327)
1.52.511.6896 (0.3775)2.4642 (0.3013)1.2377 (0.3371)
−210.5−1.8375 (0.3204)1.0449 (0.0694)0.8008 (0.3981)
3243.0671 (0.5375)2.0633 (0.1155)4.0944 (0.8906)
5464.6089 (2.9256)3.8809 (0.5423)6.3159 (2.9512)
1001.5111.6498 (0.1863)1.0336 (0.0508)1.2673 (0.1904)
1.52.511.6414 (0.3251)2.4752 (0.2070)1.1693 (0.2482)
−210.5−1.8869 (0.2727)1.0329 (0.0501)0.7207 (0.4003)
3243.0612 (0.5065)2.0494 (0.0882)4.0814 (0.7698)
5464.7179 (2.4738)3.9321 (0.4217)6.1701 (2.4982)
2001.5111.6055 (0.1407)1.0255 (0.0367)1.2033 (0.1506)
1.52.511.6173 (0.2528)2.4899 (0.1435)1.1330 (0.1496)
−210.5−1.9161 (0.2115)1.0246 (0.0367)0.6526 (0.3547)
3243.0527 (0.4729)2.0395 (0.0652)4.0809 (0.6951)
5464.8571 (1.7233)3.9687 (0.2688)6.0921 (1.6962)
5001.5111.5424 (0.0811)1.0135 (0.0213)1.0932 (0.0657)
1.52.511.5315 (0.1407)2.4957 (0.0914)1.0526 (0.0784)
−210.5−1.9879 (0.1547)1.0135 (0.0209)0.5145 (0.2679)
3243.0489 (0.3303)2.0250(0.0427)4.0473 (0.4864)
5464.8724 (1.3613)3.9602 (0.2689)6.0313 (1.3095)
10001.5111.5134 (0.0551)1.0101 (0.0153)1.0196 (0.0142)
1.52.511.5016 (0.1031)2.5001 (0.0642)1.0150 (0.0222)
−210.5−1.9998 (0.1288)1.0104 (0.0156)0.5007 (0.2368)
3243.0241 (0.2209)2.0162 (0.0285)4.0245 (0.3483)
5464.9665 (0.9635)3.9914 (0.4599)6.0643 (1.0431)
Table 2. The MLEs and the goodness of fit statistics for data set 1.
Table 2. The MLEs and the goodness of fit statistics for data set 1.
Models μ σ λ -LoglikeAICBIC D n p-Value
LTESL−0.53140.52298.033939.875085.749991.48600.13600.2868
Beta1.59248.1127-53.3012110.6023114.42640.16590.1137
Kw1.489512.9468-55.1892114.3783118.20240.17060.0795
UW0.08392.9690-49.9912103.9823107.80630.20140.0297
ETL1.98889.6583-57.1006118.2132122.03720.16330.1242
Table 3. The MLEs and the goodness of fit statistics for data set 2.
Table 3. The MLEs and the goodness of fit statistics for data set 2.
Models μ σ λ -LoglikeAICBIC D n p-Value
LTESL0.05211.12825.50458.792823.585626.57280.09670.9828
Beta0.72353.8239-17.202238.440440.43190.09900.9782
Kw0.77483.4567-17.170738.341440.33290.09840.9797
UW0.15461.7223-16.414536.829138.82050.14280.7580
ETL0.73341.8797-17.199038.397940.38940.10750.9558
Table 4. The MLEs and the goodness-of-fit statistics for data set 3.
Table 4. The MLEs and the goodness-of-fit statistics for data set 3.
Models μ σ λ -LoglikeAICBIC D n p-Value
LTESL−1.60041.07640.68509.257524.515028.71860.12530.6874
Beta0.50821.3509-13. 42930.485833.28820.19130.1953
Kw0.53671.3541-13.534831.069633.87200.18790.2116
UW0.27751.4576-15.192234.384537.18690.17330.2934
ETL0.52390.8074-12.573229.146431.94870.20460.1406
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Pang, L.; Tian, W.; Tong, T.; Chen, X. Logit Truncated-Exponential Skew-Logistic Distribution with Properties and Applications. Modelling 2021, 2, 776-794. https://doi.org/10.3390/modelling2040041

AMA Style

Pang L, Tian W, Tong T, Chen X. Logit Truncated-Exponential Skew-Logistic Distribution with Properties and Applications. Modelling. 2021; 2(4):776-794. https://doi.org/10.3390/modelling2040041

Chicago/Turabian Style

Pang, Liyuan, Weizhong Tian, Tingting Tong, and Xiangfei Chen. 2021. "Logit Truncated-Exponential Skew-Logistic Distribution with Properties and Applications" Modelling 2, no. 4: 776-794. https://doi.org/10.3390/modelling2040041

Article Metrics

Back to TopTop