Next Article in Journal
Computer Network Redundancy Reduction Using Video Compression
Next Article in Special Issue
Robust Semi-Parametric Inference for Two-Stage Production Models: A Beta Regression Approach
Previous Article in Journal
X-ray Tests of General Relativity with Black Holes
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Discrete Exponentiated-Chen Model and Its Applications

1
Department of Mathematical Sciences, College of Science, Princess Nourah bint Abdulrahman University, Riyadh 11671, Saudi Arabia
2
Department of Statistics, Al-Azhar University, Cairo 11884, Egypt
3
Applied Statistics Laboratory, Department of Industrial Engineering, Pusan National University, Busan 46241, Republic of Korea
4
Faculty of Technology and Development, Zagazig University, Zagazig 44519, Egypt
*
Author to whom correspondence should be addressed.
Symmetry 2023, 15(6), 1278; https://doi.org/10.3390/sym15061278
Submission received: 28 May 2023 / Revised: 9 June 2023 / Accepted: 15 June 2023 / Published: 18 June 2023

Abstract

:
A novel discrete exponentiated Chen (DEC) distribution, which is a subset of the continuous exponentiated Chen distribution, is proposed. The offered model is more adaptable to analyzing a wide range of data than traditional and recently published models. Several important statistical and reliability characteristics of the DEC model are introduced. In the presence of Type-II censored data, the maximum likelihood and asymptotic confidence interval estimators of the model parameters are acquired. Two various bootstrapping estimators of the DEC parameters are also obtained. To examine the efficacy of the adopted methods, several simulations are implemented. To further clarify the offered model in the life scenario, the two applications, based on the number of vehicle fatalities in South Carolina in 2012 and the final exam marks in 2004 at the Indian Institute of Technology at Kanpur, are analyzed. The analysis findings showed that the DEC model is the most effective model for fitting the supplied data sets compared to eleven well-known models in literature, including: Poisson, geometric, negative binomial, discrete-Weibull, discrete Burr Type XII, discrete generalized exponential, discrete-gamma, discrete Burr Hatke, discrete Nadarajah-Haghighi, discrete modified-Weibull, and exponentiated discrete-Weibull models. Ultimately, the new model is recommended to be applied in many fields of real practice.

1. Introduction

When it becomes impossible or impracticable to evaluate a product’s or piece of equipment’s lifespan on a continuous scale, the discretization phenomenon occurs. This applies to many domains, including clinical trials, engineering, economics, and other associated sciences; see [1]. The lifespan of a copy machine is determined by the total number of copies it produces. In survival analysis, the lifetime periods for people with brain tumors, or the duration between remission and recurrence, may be documented as the number of days. In a reliability framework, the functional state of a system is checked every unit of time. Observed statistics represent the number of time units achieved prior to a failure. In addition to lifespan statistics, the count phenomenon may be seen in a variety of real-world contexts, including the number of earthquakes in a year, accidents, ecological species types, insurance claims, and more. Discrete distributions are quite helpful for modeling lifespan data in such circumstances. In this situation, the geometric and negative binomial distributions are recognized as discrete substitutes for the exponential and gamma distributions. In some real-world circumstances, these distributions do not, however, fit discrete data. Many continuous lifespan distributions have been discretized in the literature in recent decades as a result of the demand for more believable discrete distributions to simulate discrete data in a range of real-world settings.
Consider the discrete Weibull distribution, which is discussed in refs. [2,3], the discrete Rayleigh distribution, which is discussed in refs. [4,5], the discrete inverse, which is discussed in ref. [6], the exponentiated discrete Weibull distribution in ref. [7], and the discrete Weibull geometric distribution, see [8]. To simulate data with a bathtub shape and a growing failure rate in recent years, ref. [9] developed a discrete additive Perks-Weibull distribution. The discrete Nadarajah and Haghighi distributions for one variable were initially proposed by ref. [10]. To evaluate discrete rising failure and count data, ref. [11] presented a discrete Perks distribution. A brand-new discrete Lindley distribution with two parameters and an unusual bathtub-shaped hazard rate characteristic was described by ref. [12]. For over- and under-dispersed data, ref. [13] presents the discrete Gompertz-G family. Ref. [14] estimated the quantity of new coronavirus cases using a discrete generalized Lindley distribution. A discrete equivalent of the peculiar Weibull-G family of distributions was shown by ref. [15]. They spoke about traditional and Bayesian estimates and showed how the suggested model may be used with count data sets. A discrete equivalent of the Teissier distribution with application to data counting was recently studied by ref. [16].
Even though there are a lot of discretized distributions in the literature, there could be situations when they conflict with discrete data modeling. There is therefore still plenty of potential for creating novel discretized distributions that are appropriate in various situations. In light of this, we suggest a discretized variant of the exponentiated Chen distribution proposed by ref. [17]. To discretize continuous distribution, a variety of techniques can be applied; see [18]. The following is the most typical procedure for finding the discrete analog of a continuous distribution. Many discrete distributions have been introduced using this approach to discretization, including the natural discrete Lindley [19], the discrete Marshal-Olkin Weibull [20], the uniform Poisson-Ailamujia [21], the discrete inverted Topp-Leone [22], and the discrete power-Ailamujia [23].
If the underlying random variable (RV) X has the survival function (SF), S X ( x ) P ( X x ) , the underlying RV Y has the probability mass function (PMF) ( Y = [ X ] , biggest integer less than or equal to X )
P Y = y = P y X < y + 1 = P X y P X y + 1 = S X y S X y + 1 , y = 0,1 , 2 , , .
The technique described above generates a discrete distribution by using the same functional form as the continuous version of the SF. However, this discrete variant retains certain aspects of dependability. There is a strong motivation to apply this technique in order to create discrete versions of existing continuous distributions. On numerous occasions, constraints such as limited time or resources hinder the collection of complete data sets. Incomplete data can involve censored data, which can be further explored in ref. [24]. Various censoring algorithms have been documented in the literature to analyze these types of data sets. The commonly employed censoring techniques include Type I and Type II censoring. Type I censoring involves recording the event only if it occurs before a predetermined time, while Type II censoring continues the research until a specific number of subjects experience failure. Random censoring, on the other hand, is a distinct type of censorship where individuals can be censored at any point during the experiment and at different time intervals. In the context of clinical trials or medical research, random censorship could occur when patients withdraw early without completing the prescribed treatment. Further information on censoring techniques, their extensions, and their analysis can be found in ref. [25]. Accurate analysis of randomly censored lifespan data is crucial to derive reliable insights and meaningful research findings. Such data sets are frequently encountered in fields such as biology, reliability studies, and medical science. Typically, these data involve right-censoring, as it is impractical to observe individuals until their deaths or because participants may drop out of the study. In regard to right-censoring with discrete observations, it has been examined by [26]. Notably, ref. [27] investigated various inferential approaches for the exponentiated discrete Weibull model with censored data.
Therefore, the main objectives of the DEC model are:
  • The first advantage of this distribution over many other one- or two-parameter discrete distributions is that it gives the various hazard rate forms, such as decreasing, increasing, or increasing-constant. So, by virtue of these hazard rates, the proposed model is suitable for modeling various data sets;
  • It offers various PMF shapes that may not be adequately modeled by other competitive models that are appropriate to model positively skewed, negatively skewed, or symmetric data;
  • Several statistical and reliability characteristics, including moments, probability functions, reliability indices, hazard functions, order statistics, etc., are introduced;
  • Analysis results from two real applications showed that the DEC distribution fits the given data sets satisfactorily compared to the other eleven discrete distribution models in the literature;
  • Maximum likelihood and bootstrapping estimation methods are considered to estimate the proposed parameters in the presence of data collected under the Type-II censored strategy;
  • Through extensive Monte Carlo simulations using various accuracy criteria, namely mean squared errors, mean absolute biases, and average interval lengths, the performance of the acquired estimators is evaluated. It may seem reasonable to recommend the use of Type-II censoring for estimating unknown parameters.
The remainder of the paper is organized as follows: In Section 2, materials and methods are provided. Monte Carlo results and real data analysis are illustrated in Section 3 and Section 4, respectively. Finally, we provide some concluding remarks in Section 5.

2. Materials and Methods

The SF of the continuous exponentiated Chen distribution is provided by:
S x = 1 1 e β 1 e x λ θ , β , λ , θ > 0 , x 0 ,
subsequently, the PMF of the DEC distribution becomes
P x = 1 e β 1 e x + 1 λ θ 1 e β 1 e x λ θ , x = 0,1 , 2 , , ,
where β , λ , θ > 0 , are the shape parameters. It is easy to show that the PMF in Equation (3) is accurate, i.e., x = 0 P X = x = 1 .
For specified values of β ,   λ , and θ based on their domains, Figure 1 displays the PMF and hazard rate function (HRF) shapes of the DEC distribution. It shows that the PMF can be unimodal (for β = 0.1 ,   λ = 0.4 ,   θ = 4.2 ), right skewed (for β = 0.2 ,   λ = 0.4 ,   θ = 1.8 ) or left skewed (for β = 0.1 ,   λ = 0.3 ,   θ = 7.5 ). It also shows that the HRF of the new model exhibits different shapes, such as the decreasing (for β = 6 ,   λ = 0.2 ,   θ = 3 ), increasing (for β = 0.1 ,   λ = 0.6 ,   θ = 2.2 ) or increasing-constant (for β = 0.1 ,   λ = 0.4 ,   θ = 2.5 ). As a result, the proposed model is suitable for modeling different real-world data sets.
For PMF in Equation (3), the cumulative distribution function (CDF) of DEC is given by:
F x = 1 e β 1 e ( x + 1 ) λ θ f o r   x 0 0 f o r   x < 0
where x is the floor function that gives the greatest integer less than or equal to x . Then the quantile function can be defined as Q ( y ) = i n f { x : F ( x ) y } . For more details on this quantile function, see Section 14 of ref. [28] and Theorem 2.1.10 of ref. [29]. It should be noted here that F 0 = 1 e β 1 e θ , and the proportion of non-zero values is given by 1 F 0 .
Lemma 1. 
Let  x  denote the floor function and letxbe a ceiling operator that provides the least number larger (or equal to) than x. Then, for any integer  n  (or m), and real number  x , we have the following:
(i) 
n x  if and only if  n x .
(ii) 
x m  if and only if  x m .
Proof. 
According to ref. [30],
(i)
Since x x , n x implies n x x . Thus, we have n x . Next, we need to prove that n x implies n x . Its contrapositive statement is that n < x implies n < x . Since x 1 < x x , we can rewrite x = x + δ , where 0 δ < 1 . Since n and x are both integers, it is immediate from n < x that we can set n = x k where k = 1,2 , . If we assume that n < x is not satisfied (that is, n x ), then we have x k x which results in δ = x x k . Thus, we have δ 1 since k = 1,2 , , which violates the condition 0 δ < 1 .
(ii)
Since x x , we have x m if x m . Next, we need to prove that x m implies x m . Its contrapositive statement is that x < m implies x < m . Since x x < x + 1 , we can rewrite x = x δ , where 0 δ < 1 . Since m and x are both integers, it is immediate from x < m that we can set m = x + k where k = 1,2 , . If we assume that x < m is not satisfied (that is x m ), then we have x x + k , which results in δ = x x k . Thus, we have δ 1 , which violates the condition 0 δ < 1 . □
However, the quantile function, Q ( y ) = F 1 ( y ) , of the DEC distribution is given by
Q ( y ) = l n 1 1 β l n ( 1 y 1 / θ ) 1 / λ 1 .
Proof. 
It is immediate from F ( x ) y that we have
x l n 1 1 β l n ( 1 y 1 / θ ) 1 / λ 1
Since x is an integer, we denote n = x for notational convenience. Then, applying Lemma 1(i) to (1), we know that F ( x ) y is equivalent to
x l n 1 1 β l n ( 1 y 1 / θ ) 1 / λ 1
Since the last term in Equation (6) is an integer, we denote
m = l n 1 1 β l n ( 1 y 1 / θ ) 1 / λ 1 ,
for notational convenience. That is, we have x m . By applying Lemma 1(ii), we have x m , which implies
x l n 1 1 β l n ( 1 y 1 / θ ) 1 / λ 1
Then F ( x ) y is equivalent to
x l n 1 1 β l n ( 1 y 1 / θ ) 1 / λ 1 .
Thus, we have
Q ( y ) = i n f { x : F ( x ) y } = i n f x : x l n 1 1 β l n ( 1 y 1 / θ ) 1 / λ 1 = l n 1 1 β l n ( 1 y 1 / θ ) 1 / λ 1 ,
which completes the proof. It is noteworthy that the quantile function is left-continuous while the CDF is right-continuous ref. [31]. □

2.1. Statistical Properties

2.1.1. Random Number Generation, Skewness, and Kurtosis

The popular quantile-based formulae for skewness or kurtosis were established by refs. [32,33]. These indicators stand out in part because they can be computed even for distributions lacking moments and are less impacted by outliers. Following ref. [32], the skewness ( S k ) can be expressed as:
S k = Q 3 4 + Q 1 4 2 Q 1 2 Q 3 4 Q 1 4 .
The Moors kurtosis [33], denoted by K u , suggests looking like this:
K u = Q 7 8 Q 5 8 + Q 3 8 Q 1 8 Q 3 4 Q 1 4 .
Using Equation (5) in the aforementioned formulas, it is simple to derive the S k and K u for the DEC distribution.

2.1.2. Moments

Calculating a probability distribution’s mean, variance, S k , K u , and other properties involves using the distribution’s moments. The r th raw moments of the DEC distribution may be obtained as follows:
ω ` r = E X r = x x + 1 r x r 1 F x = x x + 1 r x r 1 1 e β 1 e x λ θ .
Hence, following [9], the DEC distribution’s initial four raw moments are obtained by substituting r = 1,2 , 3,4 using Equation (13).
Using the raw moments from the following relations, we can quickly determine the S k and K u as follows:
S k = ω ` 3 3 ω ` 2 ω ` 1 + 2 ω ` 1 3 V X 3 / 2   a n d   K u = ω ` 4 4 ω ` 2 ω ` 1 + 6 ω ` 2 ω ` 1 2 3 ω ` 1 4 V X 2 .
The ratio of the standard deviation to the mean is known as the coefficient of variation ( C V ). The variance-to-mean ratio serves as the index of dispersion ( I D ). Keep in mind that the C V is always scale-invariant and dimensionless. The I D , on the other hand, is only dimensionless when it pertains to a dimensionless quantity, such as a count, as is the case in practice. It is not scale-invariant.
Although both C V and I D are for non-negative variables, they are applied in various situations. Most frequently, discrete variables with non-negative integer values, like counts, are represented by the sample and theoretical I D s. A method for detecting whether data is uniformly, unequally, or overly spread is the in I D , such as I D > 1 , which refers to over-dispersion, I D < 1 ; refers to the opposite (under-dispersion), and I D = 1 refers to equi-dispersion. The following is the I D for the DEC model:
I D = V X E ( X ) ,
where V X and E X are the variance and mean of X , respectively.
If one needs to compare the variance of two independent samples, the coefficient of variation ( C V ), which is a relative measure of variance, is usually applied. Greater variability is indicated by a high C V value. However, the C V for the DEC distribution is given by
C V = V X 0.5 E ( X ) .
The aforementioned equations do not have a closed form, so we utilize R software to statistically illustrate these traits. Figure 2 displays some shapes of the S k and K u measures of the proposed model. It indicates that both measures depend excessively on the values of θ . Using a complete random sample of size n = 100 , based on different combinations of DEC parameters, Table 1 displays some numerical findings for the mean ( M ), variance ( V ), I D , C V , S k , and K u of the DEC distribution and indicates that:
  • For fixed β and θ , as λ increases, the values of M , V , I D , C V , S k , and K u decrease;
  • For fixed β and λ , as θ increases, the values of M and V increase, whereas I D , C V , S k , and K u decrease;
  • For fixed θ and λ , as β increases, the values of M , V , and I D decrease, whereas C V , S k , and K u increase.

2.1.3. Survival and Hazard Rate Functions

The DEC distribution’s SF ( S ( · ) ) and HRF ( h ( · ) ) are provided, respectively, by
S x = P X x = 1 1 e β 1 e x λ θ , x = 0,1 , 2 , , ,
and
h x = P X = x | X x = 1 e β 1 e x + 1 λ θ 1 e β 1 e x λ θ 1 1 e β 1 e x λ θ , x = 0,1 , 2 , , .
The recommended distribution is more flexible to evaluate a broad range of data than conventional and recently published models as a result of the distinctive forms of HRF. The second rate of hazard (symbolized by s r h · ) and reversed hazard rate (symbolized by r h r · ) function of the suggested model are also explored.
s r h x = P X = x | X x = 1 e β 1 e x + 1 λ θ 1 e β 1 e x λ θ 1 e β 1 e x + 1 λ θ , x = 0,1 , 2 , , ,
and
r h r x = l n S x S x + 1 = l n 1 1 e β 1 e x λ θ 1 1 e β 1 e x + 1 λ θ , x = 0,1 , 2 , , .

2.1.4. Mean Past Life Function

The predicted inactivity time function, also known as the mean-past-life (MPL) function, is symbolised by the sign m * i and it measures the amount of time that has elapsed since X failed, presuming that the system failed before i . It has various uses, including actuarial studies, the dependability concept, survival evaluation, and forensic science. In a discrete setting, the MPL function is defined as:
m * i = E i X | X < i = 1 F i 1 j = 1 i F j 1 ; i = 1,2 , 3 , , .
By substituting Equation (4) into m * i , we can quickly calculate the MPL for the suggested model.

2.1.5. Stress-Strength Reliability

Stress-strength (SS) reliability describes the life of a component that has random strength (say X ) and is subjected to a random stress (say Z ). The component fails when the stress applied to it exceeds its strength, but it continues to operate satisfactorily whenever R = P [ X > Z ] . If the applied stress is greater than the component’s strength, it instantly fails; otherwise, it performs properly. The SS system paradigm has several uses in disciplines including engineering, medicine, psychology, and other related domains; see [34,35,36,37]. A thorough analysis of SS models is provided in ref. [38]. For discrete independent RVs X and Z , the SS reliability is defined as:
R = P X > Z = x = 0 P X x F Z x ,
where, respectively, P X x and F Z x stand for the PMF and CDF of the independent discrete RVs, X and Z . Let the numbers X ~ D E C ( β 1 , λ 1 , θ 1 ) and X ~ D E C ( β 2 , λ 2 , θ 2 ) be genuine. Next, using Equations (3) and (4), we get
R = x = 0 1 e β 1 1 e x + 1 λ 1 θ 1 1 e β 1 1 e x λ 1 θ 1 1 e β 2 1 e x + 1 λ 2 θ 2 .
Since the SS reliability parameter cannot be expressed in a clear formula, it can be easily evaluated via any software.

2.1.6. Order Statistics

Particularly in the study of survival, order statistics are critical for constructing tolerance ranges for distributions and identifying population features. Let X 1 , X 2 , . . . , X n represent a sample of size, n , taken at random sample from the DEC β , λ , θ . Let the associated order statistics be represented by X 1 , X 2 , . . . , X n . The CDF of the r th order statistic is thus provided by, let’s say, υ = X ( r ) as:
F r υ = i = r n n i F i υ 1 F υ n i = i = r n k = 0 n i 1 k n i n i k F i + k π υ = i = r n k = 0 n i 1 k n i n i k 1 e β 1 e x + 1 λ θ i + k .
The corresponding PMF for r th order statistic is
F r υ = F r υ F r υ 1 = i = r n k = 0 n i 1 k n i n i k 1 e β 1 e υ + 1 λ θ i + k i = r n k = 0 n i 1 k n i n i k 1 e β 1 e υ λ θ i + k
Setting r = 1 or r = n in Equation (9), the PMF of m i n X 1 , , X n or m a x X 1 , , X n can be easily obtained.

2.1.7. Entropy

The average amount of “information,” “surprise,” or “uncertainty” included in a RV’s probable outcomes is measured by its entropy, in accordance with the principles of information theory. The Renyi entropy (RE) (see [39]) is a fundamental entropy. It is a key measure of complexity and uncertainty in many fields, such as statistical inference, physics, econometrics, and pattern recognition in computer science. For the DEC distribution, the RE ( f o r   ρ > 0 , ρ 1 ) can be specified as:
R E ρ = 1 1 ρ l n x = 0 P x ρ x = ρ 1 ρ l n x = 0 1 e β 1 e x + 1 λ θ 1 e β 1 e x λ θ ,
where ρ R is a deformation parameter. As a specific instance of RE as ρ 1 , Shannon entropy (Sh.E), which is given by S h . E = E [ l n P ( X ) ] , may be produced.

2.2. Maximum Likelihood Method

One of the most commonly used methods for classical point estimation is the maximum likelihood technique. The maximum likelihood estimate is the point in the parametric space where the likelihood function is greatest. Due to its clear and adaptable logic, it has become the accepted procedure for statistical inference. The model parameters’ maximum likelihood estimators (MLEs) are obtained based on Type II censoring in this section. When Type II censoring happens, an experiment contains a predefined number of participants or items, n, and ends the experiment after a certain number, r , are seen to have failed; the remaining subjects are then right-censored. On the other hand, under Type II censoring, the experiment has been stopped at the x r observation. The probability function of the three parameters based on Type-II censoring is calculated from the CDF in Equation (4) and the associated PMF in Equation (3) as follows:
L β , λ , θ i = 1 r P x i S x r n r i = 1 r 1 e β 1 e x i + 1 λ θ 1 e β 1 e x i λ θ 1 e β 1 e x r λ θ n r .
The likelihood function’s logarithm, 𝓁 of β , λ and θ is given by
𝓁 i = 1 r l n 1 e β 1 e x i + 1 λ θ 1 e β 1 e x i λ θ + n r ln 1 e β 1 e x r λ θ .
Following are the derivatives of 𝓁 with respect to the unknown parameters:
𝓁 β = i = 1 r θ 1 e x i λ ϑ x i ; β , λ θ 1 ( 1 ϑ x i ; β , λ ) θ 1 e x i + 1 λ ξ x i ; β , λ θ 1 ( 1 ξ x i ; β , λ ) ξ x i ; β , λ θ ϑ x i ; β , λ θ θ ( 1 e x r λ ) ϑ x r ; β , λ 1 ( 1 ϑ x r ; β , λ )
𝓁 λ = i = 1 r θ 1 e x i + 1 λ x i + 1 λ l n x i + 1 e x i + 1 λ 1 ξ x i ; β , λ θ 1 ( 1 ξ x i ; β , λ ) θ 1 e x i λ e x i λ l n x i e x i λ 1 ϑ x i ; β , λ θ 1 e β 1 e x i λ ξ x i ; β , λ θ ϑ x i ; β , λ θ θ 1 e x r λ x r λ l n x r e x r λ 1 ϑ x r ; β , λ 1 1 ϑ x r ; β , λ ,
𝓁 θ = i = 1 r ξ x i ; β , λ θ l n ξ x i ; β , λ ϑ x i ; β , λ θ l n ϑ x i ; β , λ ξ x i ; β , λ θ ϑ x i ; β , λ θ ( n r ) l n ϑ x r ; β , λ ,
where ϑ x i ; β , λ = 1 e x p ( β ( 1 e x i λ ) ) and ξ x i ; β , λ = 1 e x p ( β ( 1 e x i + 1 λ ) ) .
Now, a set of nonlinear equations exists with unknown parameters β , λ , and θ . It is evident that it is difficult to obtain a closed-form solution. So, the above-described nonlinear system may be solved numerically using an iterative method like Newton-Raphson.

2.3. Bootstrap Confidence Interval

The bootstrap approach, initially introduced by ref. [40], is a highly broad resampling strategy for estimating the distributions of statistics based on independent observations. As an alternative to asymptotic approaches, the bootstrap method is gaining popularity since it has been shown to be effective in several situations. Here, we create a bootstrap-t interval and a percentile bootstrap for the variables  β , λ , and θ .
(1)
Percentile Bootstrap Confidence Interval (B-P):
  • Calculate the MLE of ψ = β , λ , θ ;
  • To acquire the bootstrap estimate using ψ to obtain the bootstrap estimate of β (say β ^ b ), λ (say λ ^ b ), and θ (say θ ^ b ) using the bootstrap sample;
  • Repeat step (ii) B times to obtain β ^ b ( 1 ) , β ^ b ( 2 ) , , β ^ b ( B ) , λ ^ b ( 1 ) , λ ^ b ( 2 ) , , λ ^ b ( B ) and θ ^ b ( 1 ) , θ ^ b ( 2 ) , , θ ^ b ( B ) ;
  • Sort β ^ b [ 1 ] , β ^ b [ 2 ] , , β ^ b [ B ] , λ ^ b [ 1 ] , λ ^ b [ 2 ] , , λ ^ b [ B ] , and θ ^ b [ 1 ] , θ ^ b [ 2 ] , , θ ^ b [ B ] in ascending-order as β ^ b [ 1 ] , β ^ b [ 2 ] , , β ^ b [ B ] , λ ^ b [ 1 ] , λ ^ b [ 2 ] , , λ ^ b [ B ] , and θ ^ b [ 1 ] , θ ^ b [ 2 ] , , θ ^ b [ B ] , respectively;
  • At a significant level φ , the two-sided 100 1 φ % percentile bootstrap confidence intervals of the unknown parameters β , λ ,   a n d   θ are provided by β ^ b [ B φ / 2 ] , β ^ b [ B 1 φ / 2 ] , λ ^ b [ B φ / 2 ] , λ ^ b [ B 1 φ / 2 ] and θ ^ b B φ / 2 , θ ^ b B 1 φ / 2 , respectively.
(2)
Bootstrap-t Confidence Interval (B-T):
For the unidentified parameters β , λ , and θ , a two-sided 100 ( 1 φ ) % percentile bootstrap-t confidence interval is provided by
  • The same as steps (i) and (ii) in B-P;
  • The Fisher information matrix may be used to calculate the t-statistic of ψ = β , λ , θ as;
  • T = ψ ^ b ψ ^ V ψ ^ b 1 / 2 , where β ^ b , λ ^ b , and θ ^ b are the bootstrap estimates of β , λ , and θ respectively, using the bootstrap sample;
  • where V ψ ^ b is the asymptotic variances of ψ ^ k b ;
  • Repeat step (ii) B times to get T 1 , T 2 , , T B ;
  • Sort T 1 , T 2 , , T B in ascending order, such as T 1 < T 2 < < T B ;
  • For the unidentified parameters β , λ , and   θ , a two-sided 100 ( 1 φ ) % percentile bootstrap-t confidence interval is provided by
    β ^ + T 1 B φ / 2 V β ^ 1 / 2 ,         β ^ + T 1 B 1 φ / 2 V β ^ 1 / 2 ,
    λ ^ + T 2 B φ / 2 V λ ^ 1 / 2 ,         λ ^ + T 2 B 1 φ / 2 V λ ^ 1 / 2
    and
    θ ^ + T 3 B φ / 2 V θ ^ 1 / 2 ,   θ ^ + T 3 B 1 φ / 2 V θ ^ 1 / 2

3. Numerical Comparisons

To verify the performance of the suggested estimators, established in the preceding Section 2.2, of the parameters β , λ , and θ , Monte Carlo simulations are conducted in this section. Briefly, we shall provide the simulation scenario. Following that, some discussions on the simulation results are provided.

3.1. Simulation Scenario

To evaluate the efficiency of the acquired maximum likelihood estimates and of the bootstrap interval estimates of β , λ , and θ , in this subsection, several Monte Carlo simulation studies are conducted. Now, to collect a Type-II censored sample from the DEC model, we propose the following steps:
Step 1:
Set the actual values of β , λ , and θ .
Step 2:
Determine the specific values of n (total test units) and r (effective sample size).
Step 3:
Generate pseudo-random values with size n from D E C ( β , λ , θ ) using
x i = l n 1 1 β l n ( 1 u i 1 / θ ) 1 / λ 1 , i = 1,2 , , n ,
where u i ( 0,1 ) denotes the uniform random variate.
Step 4:
Sort the outputs in Step-3, and for a specific r obtain Type-II censored sample.
Step 5:
For each set ( n , r ) , compute the MLEs and 100 1 φ % B-P/B-T interval estimates of β , λ , and θ .
Now, by adopting three sets of DEC parameters ( β , λ , θ ) namely Set-1:(0.1, 0.4, 0.8), Set-2:(0.2, 0.8, 1.5), and Set-3:(1.5, 0.8, 0.4), a large 2000 Type-II censored samples are generated based on various options of n and r . Here, several choices of n such as n ( = 40 , 60 , 80 , 100 , 120 , 160 , 200 ) are used, where r (effective sample size) is used as a failure percentage (FP) such as r / n × 100 % = 50 , 75 , 100 % for each n , all acquired estimates of β , λ , and θ are evaluated. Obviously, the Type-II censored sample generated at F P = 100 % implies the complete sample. Following the bootstrap (B-P/B-T) interval estimates of β , λ , and θ are also obtained using B = 10,000 repetitions. The same actual values of β , λ , and θ are also considered as initial values for calculating the acquired MLEs of the same unknown parameters.
Specifically, for each group ( n , r ) , the average estimates (AEs) with their mean squared errors (MSEs), mean absolute biases (MABs), and average interval lengths (AILs) of β (for example) are obtained using the following formulae:
A E = 1 N j = 1 N β j , M S E = 1 N j = 1 N β j β 2 , M A B = 1 N j = 1 N β ( j ) β   and   A I L 1 φ β = 1 N j = 1 N U β j L β j ,
where L ( · ) and U ( · ) denote the lower and upper bounds of the interval estimate, N is the number of replications, β ( j ) is the calculated estimate of β at i th iteration. In a similar pattern, the AEs, MSEs, MABs, and AILs of λ and θ can be easily developed. We further recommend that the coverage probability criterion be considered when comparing the interval estimates; in our evaluations, this criterion is averted for brevity. All numerical evaluations are performed via R 4.2.2 programming software through the ‘maxLik’ package introduced by ref. [41]. All numerical results of β , λ , and θ are obtained and reported in Table 2, Table 3 and Table 4, respectively.

3.2. Simulation Discussions

This subsection focuses on several evaluations of the performance of the suggested point and interval estimation methods. From Table 2, Table 3 and Table 4, we can draw the following conclusions:
  • Generally, in terms of lowest MSE, MAB, and AIL values, the acquired estimates of the unknown parameters β , λ , and θ behave well;
  • As n increases, the MSEs, MABs, and AILs of β , λ , and θ tend to decrease. This finding confirms the consistency feature of the associated estimates when the required sample size is increased. A similar pattern is observed when the FP increases;
  • For fixed ( n , r ) , the interval estimates obtained from the B-T procedure have shorter AILs compared to those obtained from the B-P procedure. This result holds for all given parametric sets;
  • As the true values of β , λ , and θ increase, for each setting, the MSEs, MABs, and AILs of all unknown parameters β , λ , or θ increase;
  • As the true values of β and θ increase, with fixing λ , the MSEs, MABs, and AILs of β increase while those of λ and θ decrease;
  • In summary, the simulated results showed that the suggested estimation approaches of the DEC parameters perform well under Type-II censoring and can be easily extended to other models or other censoring plans;
  • Heat-map visualization is a way of graphically representing numerical data where the value of each data point is indicated using specified colors. Therefore, utilizing the heat-map tool in R 4.2.2 programming software, the calculated point (or interval) estimates of β , λ , or θ using Set-1 and Set-2 (for instant) are shown in Figure 3, Figure 4 and Figure 5, respectively.
Table 2. The AEs, MSEs, MABs, and AILs of β .
Table 2. The AEs, MSEs, MABs, and AILs of β .
n FP%AEMSEMABAILAEMSEMABAILAEMSEMABAIL
B-PB-T B-PB-T B-PB-T
Set   123
4050%0.04170.02200.08850.45300.45270.08290.04910.17560.88660.88711.74230.19200.32972.14192.1215
75%0.05190.01140.08440.27650.27590.13050.03320.16310.61680.61501.82290.17690.32371.84401.8268
100%0.11700.01020.08040.24100.24030.23750.03280.14600.48510.48551.44410.15810.31231.81481.7971
6050%0.03340.01240.08660.37110.37090.03380.03590.17200.65800.65811.77920.14790.32001.78031.7630
75%0.03320.01080.08070.25210.25180.22790.03160.16010.55940.55941.81790.11860.31951.77141.7544
100%0.11110.00780.06900.19390.19320.25730.02810.12940.25570.25601.50370.10990.31741.17691.1656
8050%0.02330.00750.08260.30460.30390.02640.03360.16960.60590.60601.65070.10940.31230.93690.9258
75%0.03050.00650.07410.13580.13470.21620.03090.14230.49350.49351.78610.10660.29160.92120.9123
100%0.10590.00620.05830.12450.12360.27370.02310.11050.23430.23421.41400.10480.28890.84390.8358
10050%0.02060.00720.08240.25430.25360.04530.03020.16250.55220.55221.80570.10330.28040.75290.7457
75%0.03320.00570.07110.11870.11810.16280.02760.12580.49190.49141.81670.10360.27790.74120.7341
100%0.10350.00430.04940.10340.10270.21170.01610.09280.20590.20581.46830.10230.27620.72690.7199
12050%0.01870.00710.08200.23940.23860.03340.02850.14710.51050.51011.76680.09950.27020.68360.6771
75%0.03430.00550.07090.11590.11500.24710.01700.10790.44690.44691.60290.09260.24880.66910.6626
100%0.10360.00380.04470.08780.08700.21390.01350.08610.18190.18181.40030.09160.24190.60450.5987
16050%0.01870.00690.08140.21130.21040.10770.01250.10160.36920.36931.75000.08710.22470.60010.5943
75%0.03000.00530.06880.08700.08630.12840.01230.09780.28330.28301.51320.08580.22440.52990.5248
100%0.10300.00270.03930.06620.06570.21010.00990.07470.17820.17811.36200.06890.20760.32210.3190
20050%0.02100.00650.07910.17200.17120.10360.01160.10080.34040.34031.66540.05710.18010.24320.2409
75%0.03350.00490.06720.08770.08660.10720.01170.09640.21330.21331.59550.04570.16300.22340.2213
100%0.10230.00210.03580.06670.06550.20860.00780.06870.10320.10321.27930.03980.15860.21550.2118
Table 3. The AEs, MSEs, MABs and AILs of λ .
Table 3. The AEs, MSEs, MABs and AILs of λ .
n FP%AEMSEMABAILAEMSEMABAILAEMSEMABAIL
B-PB-T B-PB-T B-PB-T
Set   123
4050%0.69160.15830.36860.68390.68221.27330.92130.98251.41471.41390.92210.43490.64020.79280.7843
75%0.59870.09110.27220.56990.56871.18410.82640.87061.38711.38680.85050.43160.65200.77920.7714
100%0.43600.01070.07770.37450.37390.88240.07370.19251.05141.05101.13500.40450.61970.76830.7606
6050%0.86810.14980.36200.65590.65231.07060.90010.92801.19271.19190.99900.38560.61410.75070.7439
75%0.67100.05950.21860.52920.52720.90840.74320.82180.85970.85910.85940.37980.58880.73090.7236
100%0.42140.00580.05940.28070.27860.84230.03450.14020.72420.72351.15920.36410.53500.71560.7113
8050%0.76190.13660.35080.54290.54081.72800.88040.90941.07491.07441.09390.35670.55920.70580.7016
75%0.61830.05720.20320.39150.38961.62180.57850.72920.76370.76360.88310.33320.54220.69870.6949
100%0.41730.00420.05040.24780.24690.83730.02770.12220.65000.64921.14890.31810.55040.67860.6746
10050%0.75080.13250.34910.45230.45131.58400.79950.85090.98720.98690.89660.25870.45110.67730.6732
75%0.58180.03930.18220.32310.32171.50300.52120.70300.74590.74500.90980.20850.37480.63050.6267
100%0.41320.00300.04270.21000.20890.82590.01760.10160.52570.52481.44020.16620.29390.62730.6235
12050%0.74910.11350.29410.40660.40511.56240.71540.78490.88340.88250.94740.13850.26330.62490.6212
75%0.56220.03360.17300.27410.27211.52920.48000.68310.69310.69231.12510.10640.19900.59310.5908
100%0.41030.00240.03780.19430.19330.81710.01420.09220.49070.49021.14200.09820.17740.58670.5844
16050%0.68050.08440.28050.30250.30131.48310.11780.31030.54570.54501.06330.08180.15210.56300.5608
75%0.57300.03110.16250.23980.23871.06620.27250.40390.46010.45931.13420.06720.13980.47190.4680
100%0.40740.00180.03240.16890.16790.81270.01020.07920.38600.38491.04140.06310.12660.30590.3047
20050%0.63670.05990.23670.24910.24831.11020.08820.27670.51930.51841.07480.05990.11310.27830.2769
75%0.54820.02460.14820.21420.21321.06550.08390.26560.44740.44651.13500.03910.08940.26740.2670
100%0.40590.00140.02980.14520.14460.81000.00850.07230.36470.36391.05200.03370.08050.25870.2485
Table 4. The AEs, MSEs, MABs and AILs of θ .
Table 4. The AEs, MSEs, MABs and AILs of θ .
n FP%AEMSEMABAILAEMSEMABAILAEMSEMABAIL
B-PB-T B-PB-T B-PB-T
Set   123
4050%0.44460.78540.54062.15972.15940.98291.99991.35282.92492.92260.37980.15940.39861.10721.1050
75%0.53940.43910.52891.58231.58101.11071.70401.25281.61331.61300.37540.15700.39430.99120.9894
100%0.99340.37750.48201.38191.38251.33961.50400.99321.43231.43120.32550.15660.39500.98830.9861
6050%0.33050.50040.53881.61011.60990.99101.29921.16681.88581.88470.31930.14790.37740.94840.9465
75%0.40600.38530.44651.26981.26841.17381.00251.00511.49801.49530.36290.14700.37870.80630.8047
100%0.88130.24310.36431.05881.05741.27060.96740.93291.06751.06410.37080.14280.36920.73080.7294
8050%0.32570.26180.49831.27351.27511.23891.04951.02451.78361.78290.33250.14070.36360.71570.7143
75%0.41990.18950.40290.76210.76201.49240.96040.76031.39421.39370.35030.12000.32450.69470.6946
100%0.84240.18430.29250.68550.68461.74270.53980.60400.94180.94080.32360.11890.30780.66030.6594
10050%0.31960.25430.48881.14131.14151.26521.01760.97901.56161.56070.35610.11760.33040.63050.6293
75%0.44800.16110.36990.70460.70421.56130.87580.67011.15641.18160.34330.09490.25980.61610.6147
100%0.82450.10850.24140.55350.55311.66140.44500.58240.89650.89430.39840.09140.28030.55960.5586
12050%0.31280.25350.48561.00881.00911.35610.96510.93701.20061.20100.38290.08350.17060.54970.5486
75%0.46200.14940.36490.66460.66521.39510.68210.57891.18101.15490.32230.07890.22880.51550.5136
100%0.82240.09640.22030.50340.50321.65410.41440.55470.83010.82860.33110.05740.19390.38220.3810
16050%0.34160.22230.45870.93460.93471.43120.59390.72731.18891.18720.34680.05480.16820.36340.3627
75%0.43860.14410.35200.52550.52561.48740.46990.56501.10131.10020.38090.05330.17820.34810.3483
100%0.81600.06590.19370.43640.43621.60940.39430.46970.74150.74020.33320.05040.17710.31230.3117
20050%0.37080.19530.42940.89130.89201.53610.40750.57551.01861.01900.35910.04640.14770.30790.3073
75%0.46550.12800.33730.49370.49411.58880.36910.54640.79320.79320.31730.03750.13970.29650.2947
100%0.81250.04970.17070.39970.40011.65090.33060.42650.63760.63590.30220.03570.12820.29010.2884
Figure 3. Heat-maps for the Monte Carlo results of β . (a) Set-1. (b) Set-2.
Figure 3. Heat-maps for the Monte Carlo results of β . (a) Set-1. (b) Set-2.
Symmetry 15 01278 g003aSymmetry 15 01278 g003b
Figure 4. Heat-maps for the Monte Carlo results of λ . (a) Set-1. (b) Set-2.
Figure 4. Heat-maps for the Monte Carlo results of λ . (a) Set-1. (b) Set-2.
Symmetry 15 01278 g004aSymmetry 15 01278 g004b
Figure 5. Heat-maps for the Monte Carlo results of θ . (a) Set-1. (b) Set-2.
Figure 5. Heat-maps for the Monte Carlo results of θ . (a) Set-1. (b) Set-2.
Symmetry 15 01278 g005aSymmetry 15 01278 g005b

4. Real-Life Applications

This section presents the analysis of two applications utilizing different real data sets in order to (i) examine the usefulness and adaptability of the offered model to real phenomena; (ii) demonstrate the applicability of the inferential results to a real practical situation; and (iii) evaluate to see whether the proposed model is a better choice than the other eleven models. The first application examines the number of vehicle fatalities for thirty-nine counties in the state of South Carolina in 2012, acquired from the National Highway Traffic Safety Administration’s (www-fars.nhtsa.dot.gov, accessed on 13 January 2023). Firstly, this data set (denoted by Data-I) was reported by ref. [42]. Another application provides an analysis of the final exam marks in 2004 of 48 slow-paced students in mathematics at the Indian Institute of Technology at Kanpur. This data set (denoted by Data-II) is taken from ref. [43] and reanalyzed by ref. [44]. It is better to point out here that the number of vehicle deaths or check marks have the same philosophy of separate data points as in different cases in nature, such as weights, heights, ages, etc. The data sets I and II are reported in Table 5, while their statistics, namely: min, max, (first, second, and third) quartiles Q i , i = 1 , 2,3 , mode, mean, standard deviation (St.D), S k , and Ku, are listed in Table 6.
To demonstrate the validity and superiority of the proposed model, based on the first and second datasets, the DEC probability model is compared alongside the other eleven competing models in the literature, namely: Poisson (P ( θ ) ) by ref. [45], discrete Weibull (DW ( λ , θ ) ) by ref. [44], negative binomial (NB ( n , θ ) ) and geometric (G ( p ) ) discussed by ref. [46], discrete Burr Type XII (DB ( λ , θ ) by ref. [4], discrete generalized-exponential (DGE ( λ , θ ) ) by ref. [47], discrete gamma (DG ( λ , θ ) ) by ref. [6], discrete Burr Hatke (DBH ( θ ) ) by ref. [48], discrete Nadarajah-Haghighi (DNH ( λ , θ ) ) by ref. [10], discrete modified Weibull (DMW ( α , λ , θ ) ) by ref. [49], and exponentiated discrete Weibull (EDW ( α , λ , θ ) ) by ref. [7] distributions.
To specify the best model, several criteria are used, namely: negative log-likelihood (NLL = 𝓁 ^ ), Akaike ( A = 2 π 2 𝓁 ^ ), consistent Akaike ( C A = 2 𝓁 ^ + 2 n π n π 1 1 ), Bayesian ( B = 2 𝓁 ^ + π l o g ( n ) ) and Hannan-Quinn (HQ = 2 𝓁 ^ + π l o g ( l o g ( n ) ) ) information criteria, where π is the length of the model parameter vector. Besides them, the Kolmogorov-Smirnov (K-S) statistic with its p -value is also considered. Obviously, the best probability model distribution gives the best fit for a given set of data if it has the highest p -value and the lowest values of all other measures. Via R programming software 4.2.2, by installing the ‘AdequacyModel’ package proposed by ref. [50], the maximum likelihood estimates (with their standard errors (St.Es)) as well as the fitted model selection criteria are presented in Table 7.
It is evident from the Data-I fit in Table 7 that the DEC distribution, which has the lowest statistical values and the highest p -value among all the fitted competitive models, is the best model. Also, from the Data-II fit in Table 7, it is clear that the DEC distribution, with respect to the p -value, is the best distribution among all compared models, while the other three-parameter DMW and EDW distributions perform better with respect to the other given criteria. Further, in Figure 6, the corresponding probability-probability (PP) plot is a visual plot showing the relationship between observed cumulative probability (OCP) and expected cumulative probability (ECP) of the DEC distribution and its competing distributions. It also supports our findings in Table 7. Various visualization goodness tools, namely (i) histogram with fitted PDF, (ii) fitted CDF, (iii) quantile-quantile (QQ), and (iv) total time test (TTT) plots, are utilized to show the goodness-of-fit of a theoretical model to observed data; see Figure 7. It is evident, from both data sets I and II, that (i) the fitted DEC density captures the general pattern of the histograms well; (ii) all estimated DEC distribution (or quantile) points are quite close to the straight line; and (iii) the DEC failure rate has an increasing shape. As a result, from the empirical results, we can conclude that the new DEC distribution provides a significantly good fit compared to the other traditional (or newly developed) discrete statistical models.
Now, to evaluate the acquired estimates of β , λ , and θ developed by maximum likelihood and bootstrap (based on 10,000 replications) methods in a real scenario, three Type-II censored samples from each data set in Table 5 are generated based on different values of r ; see Table 8. For each generated sample in Table 8, the MLEs (with their St.Es) and 95% B-P/B-T interval estimates (with their interval widths (IWs)) of β , λ , and θ are calculated and presented in Table 9. It indicates that the maximum likelihood results behave well, and interval estimates derived from the B-T method perform better than those derived from the other. Further, when r increases, both St.Es and IWs associated with all DEC parameters are narrowed. The proposed applications used, among others, two sets of common real data to show how the proposed model works well in practice.
Potential application areas of the new DEC distribution could include those already considered, and other real (or simulated) data could be easily incorporated. Furthermore, the numerical findings developed from the real data sets support the same findings investigated in our Monte Carlo simulations.

5. Concluding Remarks

In this study, a new three-parameter discrete model, named the discrete exponentiated Chen distribution, is introduced. The probability mass function of the new model can be “unimodal”, “right-skewed” or “left-skewed” as can its hazard rate function, which can be “decreasing“, “increasing”, or “increasing-constant”. In particular, the statistical characteristics of the model we propose are found. These consist of order statistics, entropy, skewness, kurtosis, moments, quantiles, medians, and other statistics. In the presence of Type II censored data, the maximum likelihood approach was used to fit the model parameters. In order to assess the efficacy of the method provided in this article, several simulations have been undertaken. Finally, to further explain the suggested model, two real data sets are provided. The first application examines the number of vehicle fatalities in 39 counties in South Carolina in 2012, as reported by the National Highway Traffic Safety Administration. The second application examines the final exam results of 48 slow learners who attended the Indian Institute of Technology in Kanpur in 2004. As a summary, based on the given data sets, we have shown that the offered model furnishes a better fit than some existing discrete distributions, including: Poisson, geometric, negative binomial, discrete Weibull, discrete Burr Type XII, discrete generalized-exponential, discrete gamma, discrete Burr Hatke, discrete Nadarajah-Haghighi, discrete modified Weibull, and exponentiated discrete Weibull models. Thus, we can conclude that the proposed estimation procedures provide a good explanation of the proposed distribution in the presence of data collected under Type II censoring. In future work, one can easily examine the stress strength, reliability, or failure rate of the proposed model based on other sampling schemes, such as progressive Type-II censoring. Finally, we hope that the proposed distribution attracts a wider set of applications in different sectors.

Author Contributions

Conceptualization, R.A. and H.R.; methodology, R.A., H.R. and A.E.; software, A.E.; funding acquisition, R.A.; supervision, R.A. and A.E.; Writing—original draft, R.A., H.R. and C.P. Writing—review and editing; R.A., H.R., C.P. and A.E. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Princess Nourah bint Abdulrahman University Researchers Supporting Project number PNURSP2023R50), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would desire to express their gratitude to the editor and the anonymous referees for useful advice and helpful comments. The authors would also like to express their full thanks to Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia, for supporting (PNURSP2023R50) this work.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Alizadeh, M.; Khan, M.N.; Rasekhi, M.; Hamedani, G.G. A New Generalized Modified Weibull Distribution. Stat. Optim. Inf. Comput. 2021, 9, 17–34. [Google Scholar] [CrossRef]
  2. Nakagawa, T.; Osaki, S. The discrete Weibull distribution. IEEE Trans. Reliab. 1975, 24, 300–301. [Google Scholar] [CrossRef]
  3. Roy, D. Discrete Rayleigh distribution. IEEE Trans. Reliab. 2004, 53, 255–260. [Google Scholar] [CrossRef]
  4. Krishna, H.; Pundir, P.S. Discrete Burr and discrete Pareto distributions. Stat. Methodol. 2009, 6, 177–188. [Google Scholar] [CrossRef]
  5. Jazi, M.A.; Lai, C.D.; Alamatsaz, M.H. A discrete inverse Weibull distribution and estimation of its parameters. Stat. Methodol. 2010, 7, 121–132. [Google Scholar] [CrossRef]
  6. Chakraborty, S.; Chakravarty, D. Discrete gamma distributions: Properties and parameter estimations. Commun. Stat. Theory Methods 2012, 41, 3301–3324. [Google Scholar] [CrossRef]
  7. Nekoukhou, V.; Bidram, H. The exponentiated discrete Weibull distribution. SORT 2015, 39, 127–146. [Google Scholar]
  8. Jayakumar, K.; Babu, M.G. Discrete Weibull geometric distribution and its properties. Commun. Stat. Theory Methods 2018, 47, 1767–1783. [Google Scholar] [CrossRef]
  9. Tyagi, A.; Choudhary, N.; Singh, B. Discrete additive Perks-Weibull distribution: Properties and applications. Life Cycle Reliab. Saf. Eng. 2019, 8, 183–199. [Google Scholar] [CrossRef]
  10. Shafqat, M.; Ali, S.; Shah, I.; Dey, S. Univariate Discrete Nadarajah and Haghighi Distribution: Properties and Different Methods of Estimation. Statistica 2020, 80, 301–330. [Google Scholar]
  11. Tyagi, A.; Choudhary, N.; Singh, B. A new discrete distribution: Theory and applications to discrete failure lifetime and count data. J. Appl. Probab. Stat. 2020, 15, 117–143. [Google Scholar]
  12. El-Morshedy, M.; Eliwa, M.S.; Nagy, H. A new two-parameter exponentiated discrete Lindley distribution: Properties, estimation and applications. J. Appl. Stat. 2020, 47, 354–375. [Google Scholar] [CrossRef] [PubMed]
  13. Eliwa, M.S.; Alhussain, Z.A.; El-Morshedy, M. Discrete Gompertz-G family of distributions for over-and under-dispersed data with properties, estimation, and applications. Mathematics 2020, 8, 358. [Google Scholar] [CrossRef] [Green Version]
  14. Calderín-Ojeda, E.; Gómez-Déniz, E. An extension of the discrete Lindley distribution with applications. J. Korean Stat. Soc. 2013, 42, 371–373. [Google Scholar] [CrossRef]
  15. El-Morshedy, M.; Eliwa, M.S.; Tyagi, A. A discrete analogue of odd Weibull-G family of distributions: Properties, classical and Bayesian estimation with applications to count data. J. Appl. Stat. 2022, 49, 2928–2952. [Google Scholar] [CrossRef]
  16. Singh, B.; Agiwal, V.; Nayal, A.S.; Tyagi, A. A Discrete Analogue of Teissier Distribution: Properties and Classical Estimation with Application to Count Data. Reliab. Theory Appl. 2022, 17, 340–355. [Google Scholar]
  17. Dey, S.; Kumar, D.; Ramos, L.P.; Louzada, F. Exponentiated Chen distribution: Properties and estimation. Commun. Stat. B Simul. Comput. 2017, 46, 8118–8139. [Google Scholar] [CrossRef]
  18. Chakraborty, S. Generating discrete analogues of continuous probability distributions-A survey of methods and constructions. J. Stat. Distrib. Appl. 2015, 2, 6. [Google Scholar] [CrossRef] [Green Version]
  19. Al-Babtain, A.A.; Ahmed, A.H.N.; Afify, A.Z. A new discrete analog of the continuous Lindley distribution, with reliability applications. Entropy 2020, 22, 603. [Google Scholar] [CrossRef]
  20. Opone, F.C.; Izekor, E.K.; Akata, I.U.; Osagiede, F.E. A Discrete analogue of the continuous Marshall-Olkin Weibull distribution with application to count data. Earthline J. Math. Sci. 2020, 5, 415–428. [Google Scholar] [CrossRef]
  21. Aljohani, H.M.; Akdoğan, Y.; Cordeiro, G.M.; Afify, A.Z. The uniform Poisson-Ailamujia distribution: Actuarial measures and applications in biological science. Symmetry 2021, 13, 1258. [Google Scholar] [CrossRef]
  22. Eldeeb, A.S.; Ahsan-Ul-Haq, M.; Babar, A. A discrete analog of inverted Topp- Leone distribution: Properties, estimation and applications. Int. J. Anal. Appl. 2021, 19, 695–708. [Google Scholar]
  23. Alghamdi, A.S.; Ahsan-Ul-Haq, M.; Babar, A.; Aljohani, H.M.; Afify, A.Z. The discrete power-Ailamujia distribution: Properties, inference, and applications. AIMS Math. 2022, 7, 8344–8360. [Google Scholar] [CrossRef]
  24. Kemp, A.W. The discrete half-normal distribution. Adv. Math. Stat. Model. 2006, 9, 353–360. [Google Scholar]
  25. Lawless, J.F. Statistical Models and Methods for Lifetime Data; Wiley: New York, NY, USA, 1982. [Google Scholar]
  26. Rezaei, A.H.; Arghami, N.R. Right censoring in a discrete life model. Metrika 2002, 55, 151–160. [Google Scholar] [CrossRef]
  27. Achcar, J.A.; Martinez, E.Z.; de Freitas, B.C.L.; de Oliveira, M.V. Classical and Bayesian inference approaches for the exponentiated discrete Weibull model with censored data and a cure fraction. Pak. J. Stat. Oper. Res. 2021, 17, 467–481. [Google Scholar] [CrossRef]
  28. Billingsley, P. Probability and Measure; anniversary edition; John Wiley & Sons: New York, NY, USA, 2012. [Google Scholar]
  29. Casella, G.; Berger, R.L. Statistical Inference, 2nd ed.; Duxbury: Pacific Grove, CA, USA, 2002. [Google Scholar]
  30. Graham, R.L.; Knuth, D.E.; Patashnik, O. Concrete Mathematics: A Foundation for Computer Science, 2nd ed.; Addison-Wesley Longman Publishing Co., Inc.: Boston, MA, USA, 1994. [Google Scholar]
  31. Csörgő, M. Quantile Processes with Statistical Applications; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 1983. [Google Scholar]
  32. Kenney, J.F.; Keeping, E.S. Mathematics of Statistics, Part 1, 3rd ed.; Van Nostrand Company: Princeton, NJ, USA, 1962. [Google Scholar]
  33. Moors, J.J.A. A quantile alternative for kurtosis. J. R. Stat. Soc. Ser. D 1988, 37, 25–32. [Google Scholar] [CrossRef] [Green Version]
  34. Çetinkaya, Ç.; Genç, A. Stress–strength reliability estimation under the standard two-sided power distribution. Appl. Math. Model. 2019, 65, 72–88. [Google Scholar] [CrossRef]
  35. Çetinkaya, Ç. Reliability estimation of a stress-strength model with non-identical component strengths under generalized progressive hybrid censoring scheme. Statistics 2021, 55, 250–275. [Google Scholar] [CrossRef]
  36. Asadi, S.; Panahi, H. Estimation of stress–strength reliability based on censored data and its evaluation for coating processes. Qual. Technol. Quant. Manag. 2022, 19, 379–401. [Google Scholar] [CrossRef]
  37. Panahi, H. Reliability analysis for stress–strength model from inverted exponentiated Rayleigh based on the hybrid censored sample. Int. J. Qual. Reliab. Manag. 2023, 40, 1412–1428. [Google Scholar] [CrossRef]
  38. Choudhary, N.; Tyagi, A.; Singh, B. Estimation of R = P[Y < X < Z] under Progressive Type-II Censored Data from Weibull Distribution. Lobachevskii J. Math. 2021, 42, 318–335. [Google Scholar] [CrossRef]
  39. Renyi, A. On measures of entropy and information. Math. Stat. Probab. 1961, 1, 547–561. [Google Scholar]
  40. Efron, B. Bootstrap Methods: Another Look at the Jackknife. Ann. Stat. 1979, 7, 1–26. [Google Scholar] [CrossRef]
  41. Henningsen, A.; Toomet, O. MaxLik: A package for maximum likelihood estimation in R. Comput. Stat. 2011, 26, 443–458. [Google Scholar] [CrossRef]
  42. Mann, S.P. Introductoty Statistics; John Wiley & Sons: New York, NY, USA, 2016. [Google Scholar]
  43. Gupta, R.D.; Kundu, D. A new class of weighted exponential distributions. Statistics 2009, 43, 621–634. [Google Scholar] [CrossRef]
  44. Bakouch, H.S.; Jazi, M.A.; Nadarajah, S. A new discrete distribution. Statistics 2014, 48, 200–240. [Google Scholar] [CrossRef]
  45. Poisson, S.D. Recherches sur la Probabilité des Jugements en Matière Criminelle et en Matière Civile: Précédées des Règles Générales du Calcul des Probabilités; Bachelier: Paris, France, 1837. [Google Scholar]
  46. Johnson, N.L.; Kemp, A.W.; Kotz, S. Univariate Discrete Distributions; John Wiley & Sons: New York, NY, USA, 2005. [Google Scholar]
  47. Nekoukhou, V.; Alamatsaz, M.H.; Bidram, H. Discrete generalized exponential distribution of a second type. Statistics 2013, 47, 876–887. [Google Scholar] [CrossRef]
  48. El-Morshedy, M.; Eliwa, M.S.; Altun, E. Discrete Burr-Hatke distribution with properties, estimation methods and regression model. IEEE Access 2020, 8, 74359–74370. [Google Scholar] [CrossRef]
  49. Almalki, S.J.; Nadarajah, S. A new discrete modified Weibull distribution. IEEE Trans. Reliab. 2014, 63, 68–80. [Google Scholar] [CrossRef]
  50. Marinho, P.R.D.; Silva, R.B.; Bourguignon, M.; Cordeiro, G.M.; Nadarajah, S. AdequacyModel: An R package for probability distributions and general purpose optimization. PLoS ONE 2019, 14, e0221487. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. The PMF and HRF plots of the DEC model. (a) PMF. (b) HRF.
Figure 1. The PMF and HRF plots of the DEC model. (a) PMF. (b) HRF.
Symmetry 15 01278 g001
Figure 2. 3-D plot for skewness and kurtosis of the DEC model. (a) For θ = 0.1 . (b) For θ = 1.5 .
Figure 2. 3-D plot for skewness and kurtosis of the DEC model. (a) For θ = 0.1 . (b) For θ = 1.5 .
Symmetry 15 01278 g002
Figure 6. The PP plots of the DEC model and its competing models from data sets I and II. (a) Data-I. (b) Data-II.
Figure 6. The PP plots of the DEC model and its competing models from data sets I and II. (a) Data-I. (b) Data-II.
Symmetry 15 01278 g006aSymmetry 15 01278 g006b
Figure 7. Plots of data histograms with fitted PDF, fitted CDF, QQ, and TTT of the DEC models. (a) Data-I. (b) Data-II.
Figure 7. Plots of data histograms with fitted PDF, fitted CDF, QQ, and TTT of the DEC models. (a) Data-I. (b) Data-II.
Symmetry 15 01278 g007aSymmetry 15 01278 g007b
Table 1. Some statistics for the DEC distribution using different combinations of its parameters.
Table 1. Some statistics for the DEC distribution using different combinations of its parameters.
λ 0.10.51.5
β θ 123412341234
1 M 2.32074.41336.30088.00350.23530.43640.60910.75820.17940.32660.44740.5465
V 95.134174.98242.48300.020.31930.52020.64320.71530.14720.21990.24720.2478
I D 40.99439.64838.48537.4861.35671.19221.05610.94340.82060.67340.55260.4535
C V 4.20302.99732.47142.16422.40121.65291.31681.11552.13891.43601.11140.9110
S k 6.04414.24063.43902.96092.84121.83831.38481.12121.67140.73960.21170.1868
K u 50.63829.62423.03720.04215.98411.55511.54312.9583.79361.54711.04491.0350
2 M 0.22810.45340.67610.89610.03420.06750.09970.13090.03220.06330.09350.1226
V 6.534012.94119.22625.3910.03740.07160.10270.13100.03110.05930.08470.1076
I D 28.64928.54228.43828.3351.09231.06091.03061.00120.96780.93670.90650.8774
C V 11.2087.93416.48565.62325.64773.96603.21592.76595.48453.84633.11462.6749
S k 20.61014.58511.91810.3296.07674.17073.30252.77125.30223.58632.79352.3011
K u 546.12278.22188.98144.4045.19122.82515.45611.84329.11313.8628.80376.2950
3 M 0.02300.04590.06880.09170.00590.01170.01750.02320.00580.01150.01720.0229
V 0.38400.76681.14831.52860.00600.01190.01770.02340.00570.01140.01690.0224
I D 16.69516.68816.68116.6741.02471.01901.01331.00760.99420.98850.98280.9771
C V 26.94019.05715.56613.48613.2249.33767.61356.584413.1259.26747.55596.5341
S k 66.28846.90638.32533.21413.5339.50607.71006.632313.0499.15957.42356.3810
K u 6359.33189.82133.31605.1194.2996.64864.10647.841171.2784.89756.10841.717
Table 5. Complete data sets I and II.
Table 5. Complete data sets I and II.
Data-I22, 26, 17, 4, 48, 9, 9, 31, 27, 20, 12, 6, 5, 14, 9, 16, 3, 33, 9, 20, 68, 13, 51, 13, 2, 4,17, 16, 6, 52, 50, 48, 23, 12, 13, 10, 15, 8, 1
Data-II29, 25, 50, 15, 13, 27, 15, 18, 7, 7, 8, 19, 12, 18, 5, 21, 15, 86, 21, 15, 14, 39, 15, 14, 70, 44, 6, 23, 58, 19, 50, 23, 11, 6, 34, 18, 28, 34, 12, 37, 4, 60, 20, 23, 40, 65, 19, 31
Table 6. Vital summary statistics of data sets I and II.
Table 6. Vital summary statistics of data sets I and II.
DataMinMax Q 1 Q 2 Q 3 ModeMeanSt.DSkew.Kurt.
I16891424.5919.53816.5071.2883.798
II4861419.5341525.89618.6051.3324.323
Table 7. Summary fit (St.E) of the DEC model and its competing models from data sets I and II.
Table 7. Summary fit (St.E) of the DEC model and its competing models from data sets I and II.
Model α λ θ NLLACABH-QK-S ( p -Value)
Data-I
DEC0.5207 (0.4120)0.2215 (0.0599)10.139 (11.773)153.121309.781309.889311.445310.3780.088 (0.920)
DMW0.6727 (0.0947)1.2965 (0.3646)1.2965 (0.3646)210.594427.187427.732432.801429.3080.204 (0.036)
EDW0.7930 (0.3019)0.7278 (0.3622)0.9389 (0.0518)196.492398.984399.529404.597401.1050.092 (0.914)
DGE1.7338 (0.4101)0.0695 (0.0130)-153.365310.730311.064314.057311.9240.111 (0.721)
DNH1.9281 (1.1305)0.0200 (0.0155)-154.907313.814314.147317.141315.0080.110 (0.730)
DB14.531 (46.924)0.0260 (0.0838)-214.879431.758431.866433.421432.3540.758 (0.001)
DW1.2974 (0.1582)21.791 (2.8466)-180.934365.868366.201369.195367.0620.338 (0.003)
DG0.0816 (0.0198)1.6343 (0.3401)-153.907311.814312.148315.141313.0080.126 (0.570)
NB19.534 (2.5126)1.6827 (0.3936)-153.458310.917311.250314.244312.1100.116 (0.667)
DBH0.9985 (0.0064)--153.609311.218311.551314.545312.4110.097 (0.854)
G0.0487 (0.0076)--155.904313.809313.917315.472314.4060.162 (0.257)
P19.539 (0.7078)--324.089650.178650.286651.841650.7750.407 (0.002)
Data-II
DEC0.6483 (0.5218)0.2064 (0.0569)25.021 (36.898)196.448397.903398.170400.984399.3170.080 (0.917)
DMW0.9429 (0.0826)0.8011 (1.7965)1.0309 (0.0087)153.699309.939310.085314.390311.1900.135 (0.479)
EDW0.7051 (0.3785)0.6433 (0.3169)8.0766 (11.453)153.153312.307312.993317.298314.0970.090 (0.912)
DGE2.6662 (0.6283)0.0657 (0.0102)-196.951398.897399.199401.645399.8200.108 (0.632)
DNH3.7619 (2.1185)0.0070 (0.0046)-201.743407.485407.752411.228408.8990.155 (0.201)
DB4.7414 (13.418)0.0693 (0.1964)-297.676597.352597.439599.223598.0590.837 (0.001)
DW1.5458 (0.1666)29.549 (2.9245)-247.486498.971499.238502.714500.3850.403 (0.002)
DG0.0879 (0.0188)2.3093 (0.4419)-198.609401.217401.484404.960402.6310.132 (0.371)
NB25.896 (2.5012)2.4440 (0.5219)-197.296398.593398.860402.335400.0070.115 (0.553)
DBH0.9990 (0.0046)--197.522399.044399.310402.786400.4580.095 (0.774)
G0.0372 (0.0053)--205.111412.222412.309414.093412.9290.222 (0.017)
P25.896 (0.7345)--396.589795.178795.265797.050795.8860.400 (0.004)
Table 8. Type-II censored samples from data sets I and II.
Table 8. Type-II censored samples from data sets I and II.
DataSample ( n , r ) Censored Data
IS1(39,15)1, 2, 3, 4, 4, 5, 6, 6, 8, 9, 9, 9, 9, 10, 12
S2(39,25)1, 2, 3, 4, 4, 5, 6, 6, 8, 9, 9, 9, 9, 10, 12, 12, 13, 13, 13, 14, 15, 16, 16, 17, 17
S3(39,35)1, 2, 3, 4, 4, 5, 6, 6, 8, 9, 9, 9, 9, 10, 12, 12, 13, 13, 13, 14, 15, 16, 16, 17, 17, 20, 20, 22, 23, 26, 27, 31, 33, 48, 48
IIS1(40,10)4, 5, 6, 6, 7, 7, 8, 11, 12, 12
S2(40,20)4, 5, 6, 6, 7, 7, 8, 11, 12, 12, 13, 14, 14, 15, 15, 15, 15, 15, 18, 18
S3(40,30)4, 5, 6, 6, 7, 7, 8, 11, 12, 12, 13, 14, 14, 15, 15, 15, 15, 15, 18, 18, 18, 19, 19, 19, 20, 21, 21, 23, 23, 23
Table 9. Point and Interval estimates of β , λ , and θ from data sets I and II.
Table 9. Point and Interval estimates of β , λ , and θ from data sets I and II.
DataSamplePar.MLE95% B-P95% B-T
Est.St.ELowerUpperIWLowerUpperIW
IS1 β 0.156090.265270.000821.759081.758261.691493.450151.75866
λ 0.338390.141740.215110.865440.650330.215220.865350.65013
θ 109.4366.269170.49886111.171110.673107.700218.373110.674
S2 β 1.725480.146440.043411.303761.260350.888352.149921.26157
λ 0.108910.020400.419460.874450.454990.419130.874400.45527
θ 37.01915.713751.6492587.996986.34761.6475987.459885.8122
S3 β 1.096700.120540.000010.178780.178770.133310.312180.17886
λ 0.158420.016560.166180.439150.272970.166170.439030.27286
θ 3.380604.225000.294013.571613.277603.182526.467143.28463
IIS1 β 0.085320.177750.000481.054581.054101.509582.563981.05440
λ 0.386390.142330.255370.754330.498970.255410.754370.49897
θ 107.0059.623470.68500108.049107.364105.961213.325107.364
S2 β 1.282230.153830.493571.131660.638090.493501.131670.63817
λ 0.158280.022610.451650.799430.347770.451670.799410.34774
θ 3.534674.925230.343643.213462.869823.845196.725402.88021
S3 β 0.009810.005290.000010.073410.073400.096950.170640.07369
λ 0.525410.004010.145000.340210.195210.710230.905770.19553
θ 50.01590.9523249.155551.14731.9918049.154551.14751.99299
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Alotaibi, R.; Rezk, H.; Park, C.; Elshahhat, A. The Discrete Exponentiated-Chen Model and Its Applications. Symmetry 2023, 15, 1278. https://doi.org/10.3390/sym15061278

AMA Style

Alotaibi R, Rezk H, Park C, Elshahhat A. The Discrete Exponentiated-Chen Model and Its Applications. Symmetry. 2023; 15(6):1278. https://doi.org/10.3390/sym15061278

Chicago/Turabian Style

Alotaibi, Refah, Hoda Rezk, Chanseok Park, and Ahmed Elshahhat. 2023. "The Discrete Exponentiated-Chen Model and Its Applications" Symmetry 15, no. 6: 1278. https://doi.org/10.3390/sym15061278

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop