Next Article in Journal
Measure of Similarity between GMMs Based on Autoencoder-Generated Gaussian Component Representations
Next Article in Special Issue
A Certain Class of Equi-Statistical Convergence in the Sense of the Deferred Power-Series Method
Previous Article in Journal
Statistical Analysis and Applications of Adaptive Progressively Type-II Hybrid Poisson–Exponential Censored Data
Previous Article in Special Issue
Consistency of the Estimator for the Common Mean in Fixed-Effect Meta-Analyses
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bivariate Discrete Odd Generalized Exponential Generator of Distributions for Count Data: Copula Technique, Mathematical Theory, and Applications

1
Department of Mathematical Sciences, College of Science, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
2
Department of Statistics and Operation Research, College of Science, Qassim University, Buraydah 51482, Saudi Arabia
3
Department of Mathematics, Faculty of Science, Mansoura University, Mansoura 35516, Egypt
4
Department of Mathematics, College of Science and Humanities in Al-Kharj, Prince Sattam bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia
5
Department of Statistics and Computer Sciences, Faculty of Science, Mansoura University, Mansoura 35516, Egypt
*
Author to whom correspondence should be addressed.
Axioms 2023, 12(6), 534; https://doi.org/10.3390/axioms12060534
Submission received: 6 April 2023 / Revised: 24 May 2023 / Accepted: 25 May 2023 / Published: 29 May 2023

Abstract

:
In this article, a new family of bivariate discrete distributions is proposed based on the copula concept, in the so-called bivariate discrete odd generalized exponential-G family. Some distributional properties, including the joint probability mass function, joint survival function, joint failure rate function, median correlation coefficient, and conditional expectation, are derived. After proposing the general class, one special model of the new bivariate family is discussed in detail. The maximum likelihood approach is utilized to estimate the family parameters. A detailed simulation study is carried out to examine the bias and mean square error of maximum likelihood estimators. Finally, the importance of the new bivariate family is explained by means of two distinctive real data sets in various fields.

1. Introduction

Probability distributions are statistical functions that describe all possible values of a random variable and their likelihood, within a specific range. These are a result of the data generating process of an occurrence or its probability density function (PDF). Probability distributions are useful in modeling our environment to acquire estimates of the likelihood that a specific event will occur or to determine the variability of occurrence. There are many types of probability distributions, including binomial distribution, uniform distribution, Poisson distribution, normal distribution, lognormal distribution, beta distribution, exponential, and bivariate distribution. In our study, we highlight the exponential distribution (E) as a basic probability model for the proposed generator. The E model is a fundamental and widely recognized probability distribution in research. For lifetime phenomenon and reliability studies, the E distribution is rarely utilized due to the constant failure rate (FR); however, its memoryless property proves useful in queuing theory. Different extensions or generalizations of the E distribution have been introduced in the literature, with the generalized exponential (GE) garnering increased attention. The cumulative distribution function (CDF) of the GE distribution can be formulated as expression
F GE ( x ; ξ , ν ) = 1 e ξ x ν ; x 0 ,
where ξ > 0 is the scale parameter and ν > 0 is the shape parameter. Based on the technique of Alzaatreh et al. [1] for generating families of distributions, Tahir et al. [2] proposed the new odd generator of distributions called the odd generalized exponential-G (OGE-G), which exhibits flexible FR shapes such as increasing, decreasing, bathtub or upside-down bathtub. The CDF of the OGE-G family is given by the following formula
F OGE - G ( x ; ξ , ν , Θ ) = 1 e ξ Ψ ( x ; Θ ) ν ; x 0 ,
where Ψ ( x ; Θ ) = G ( x ; Θ ) / ( 1 G ( x ; Θ ) ) , G ( x ; Θ ) is a non-negative function of any model that depends on positive parameter vector Θ > 0 . For more detail around the odd ratio, Ψ ( x ; Θ ) (see Tahir et al., [2]). Due to the flexibility of Alzaatreh et al. [1] approach, several authors utilized this technique to generate new models for analyzing different types of data sets in various fields, for instance, see Silva et al. [3], Alizadeh et al. [4], Korkmaz et al. [5], Djibrila [6], Reyad et al. [7], Alizadeh et al. [8], among others.
The above probability models can be applied to discuss and analyze univariate data. In some applied sciences, bivariate data can be generated from different fields. Bivariate data refer to a data set that contains exactly two variables. In statistics, bivariate data represents data where each value of one variable is paired with a corresponding value of the other variable. Typically, it would be of interest to investigate the possible association between the two variables. A probability distribution involving two random variables is called a bivariate probability distribution. The bivariate (BV) distributions have been introduced, developed, and discussed by many authors and have wide applications in various fields like engineering, weather, sports, drought, among others. More detail is given in Balakrishnan and Lai [9]. The construction of the BV discrete and continuous distributions are mainly via (I) the copulas (II) compounding (III) marginals (IV) reduction and (V) conditioning, for instance, Johnson and Tenenbein [10], Quesada and Rodrguez [11], Fang et al. [12], Durante [13], Kundu and Gupta [14], Sarabia et al. [15], Roozegar and Jafari [16], Eliwa et al. [17], among others.
In several cases, the BV lifetimes need to be recorded on a discrete scale rather than on a continuous one. Due to the previous reasons, discretizing continuous BV distributions has received much attention in the statistical literature, for instance, Lee and Cha [18], Kundu and Nekoukhou [19], El-Morshedy et al. [20], Nekoukhou et al. [21], De Oliveira and Achcar [22], among others. Herein, we focus on the copulas approach. A copula function for bivariate discrete probability distributions is a function that connects the joint probability mass function of two discrete variables to their marginal probability mass functions. A copula function can be obtained by discretizing a continuous copula function, which is a function that links a multivariate distribution function to its one-dimensional marginal distribution functions. Alternatively, a copula function can be derived from a specific bivariate discrete distribution, such as the bivariate geometric, binomial, Poisson, or negative binomial distributions. A copula function for bivariate discrete probability distributions can be used to model the dependence structure between the variables and to simulate bivariate discrete data with given marginals and correlation; for more details, see, Kobus and Kurek, [23], Najarzadegan et al., [24], and Yamaguchi and Maruo, [25]. One of the important constructions of the bivariate distribution in survival and lifetime models is frailty models. So, both copulas and frailty are important in lifetime models (see Emura et al., [26]). The copula function can be formulated as follows
C ( F 1 ( x 1 ) , F 2 ( x 2 ) ) = G ( x 1 , x 2 ) for all x 1 , x 2 = 0 , 1 , ,
where G ( x 1 , x 2 ) represents the CDF of a given BV continuous model on ( 0 , ) 2 , whereas F 1 ( x 1 ) and F 2 ( x 2 ) are the marginals. There are some known parametric copulas such as the following: the normal copulas, Farlie-Gumbel-Morgenstern (FGM) copulas, Marshall and Olkin (MO) copulas, Cuadras and Augé (CA) copulas (see Cuadras and Augé, [27]), among others. The CA copulas family was proposed as an extension of the BV distributions, defined as follows
C θ ( x 1 , x 2 ) = min { x 1 , x 2 } max { x 1 , x 2 } 1 θ ,
where 0 θ 1 and 0 x 1 , x 2 1 . The parameter θ measures the degree of dependence and plays the role of the parameter of upper-tail dependence of C θ ( x 1 , x 2 ) . The CA copulas family is utilized in a variety of modeling’s for exchangeable random vectors because it is symmetric in its marginals. The aim of this paper is to propose a new discrete BV family of distributions based on the CA copulas approach, in the so-called BV discrete odd generalized exponential-G (BDsOGE-G) family, where the marginals have the OGE-G families.
The article is organized as follows: In Section 2, the BDsOGE-G family of distributions is introduced. Some mathematical and statistical properties are derived in Section 3. In Section 4, the BDsOGE-Weibull distribution is discussed in detail. The parameters of the new bivariate family are estimated via the maximum likelihood technique in Section 5. In Section 6, a simulation study is performed to discuss the performance of the maximum likelihood estimators. Two distinctive data sets are analyzed to discuss the flexibility of the proposed family in Section 7. Finally, some concluding remarks and future work are listed in Section 8.

2. The BDOGE-G Class

Recall that, in Equation (2), the CDF and probability mass function (PMF) of the discrete OGE-G (DsOGE-G) family can be formulated as follows
F DsOGE - G ( x ; p , ν , Θ ) = 1 p Ψ ( x + 1 ; Θ ) ν ; x = 0 , 1 , 2 , 3 ,
and
f DsOGE - G ( x ; p , ν , Θ ) = 1 p Ψ ( x + 1 ; Θ ) ν 1 p Ψ ( x ; Θ ) ν ; x = 0 , 1 , 2 , 3 , ,
respectively, where p = e ξ , 0 < p < 1 , ν > 0 and Θ is the vector of parameters ( 1 × k ). Equation (6) can be writen as follows
f DsOGE - G ( x ; p , ν , Θ ) = k = 1 ν ( 1 ) k + 1 ν k p k Ψ ( x ; Θ ) p k Ψ ( x + 1 ; Θ ) .
Utilizing the CA copulas approach with γ = 1 θ , the joint PMF of the BDsOGE-G can be expressed as
f X 1 , X 2 ( x 1 , x 2 ) = P 1 ( x 1 , x 2 ) if x 1 < x 2 P 2 ( x 1 , x 2 ) if x 2 < x 1 P 3 ( x ) if x 1 = x 2 = x ,
where
P 1 ( x 1 , x 2 ) = 1 p Ψ ( x 1 + 1 ; Θ ) ν 1 p Ψ ( x 1 ; Θ ) ν 1 p Ψ ( x 2 + 1 ; Θ ) ν γ 1 p Ψ ( x 2 ; Θ ) ν γ = f DsOGE - G ( x 1 ; p , ν , Θ ) f DsOGE - G ( x 2 ; p , ν γ , Θ ) ,
P 2 ( x 1 , x 2 ) = 1 p Ψ ( x 1 + 1 ; Θ ) ν γ 1 p Ψ ( x 1 ; Θ ) ν γ 1 p Ψ ( x 2 + 1 ; Θ ) ν 1 p Ψ ( x 2 ; Θ ) ν = f DsOGE - G ( x 1 ; p , ν γ , Θ ) f DsOGE - G ( x 2 ; p , ν , Θ ) ,
and
P 3 ( x ) = 1 p Ψ ( x + 1 ; Θ ) ν γ 1 p Ψ ( x + 1 ; Θ ) ν 1 p Ψ ( x ; Θ ) ν 1 p Ψ ( x ; Θ ) ν × 1 p Ψ ( x + 1 ; Θ ) ν γ 1 p Ψ ( x ; Θ ) ν γ = 1 p Ψ ( x + 1 ; Θ ) ν γ f DsOGE - G ( x ; p , ν , Θ ) 1 p Ψ ( x ; Θ ) ν f DsOGE - G ( x ; p , ν γ , Θ ) .
The joint CDF of the BDsOGE-G family can be proposed as
F X 1 , X 2 ( x 1 , x 2 ) = 1 p Ψ ( x 1 + 1 ; Θ ) ν γ 1 p Ψ ( x 2 + 1 ; Θ ) ν γ 1 p Ψ ( z + 1 ; Θ ) ν ( 1 γ ) ; x 1 , x 2 = 0 , 1 , 2 , 3 ,
where z = min { x 1 , x 2 } . The marginal CDF of the BDsOGE-G family is
F DsOGE - G ( x ; p , ν , Θ ) = 1 p Ψ ( x i + 1 ; Θ ) ν ; i = 1 , 2 .
If ( X 1 , X 2 ) BDsOGE-G ( p , ν , γ , Θ ) , then X 1 and X 2 are positive quadrant dependent (PQD) where
Pr ( X 1 x 1 , X 2 x 2 ) Pr ( X 1 x 1 ) Pr ( X 2 x 2 ) , for all x 1 and x 2 .
The joint RF is given by
S X 1 , X 2 ( x 1 , x 2 ) = S 1 ( x 1 , x 2 ) if x 1 < x 2 S 2 ( x 1 , x 2 ) if x 2 < x 1 S 3 ( x ) if x 1 = x 2 = x ,
where
S 1 ( x 1 , x 2 ) = 1 1 p Ψ ( x 2 + 1 ; Θ ) ν 1 p Ψ ( x 1 + 1 ; Θ ) ν 1 1 p Ψ ( x 2 + 1 ; Θ ) ν γ ,
S 2 ( x 1 , x 2 ) = 1 1 p Ψ ( x 1 + 1 ; Θ ) ν 1 p Ψ ( x 2 + 1 ; Θ ) ν 1 1 p Ψ ( x 1 + 1 ; Θ ) ν γ ,
and
S 3 ( x ) = 1 2 1 p Ψ ( x + 1 ; Θ ) ν + 1 p Ψ ( x + 1 ; Θ ) ν ( 1 + γ ) .
The joint HRF function can be expressed by using h X 1 , X 2 ( x 1 , x 2 ) = f X 1 , X 2 ( x 1 , x 2 ) / S X 1 , X 2 ( x 1 1 , x 2 1 ) whereas the joint reversed HRF function can be proposed via v X 1 , X 2 ( x 1 , x 2 ) = f X 1 , X 2 ( x 1 , x 2 ) / F X 1 , X 2 ( x 1 , x 2 ) . If the bivariate vector X have the BDsOGE-G model, then the distribution for each W = max { X 1 , X 2 } and V = min { X 1 , X 2 } can be written as follows
F W ( z ) = F DsOGE - G ( z ; p , ν 1 + ν 2 + ν 3 , Θ )
and
F V ( z ) = F DsOGE - G ( z ; p , ν 1 + ν 3 , Θ ) + F DsOGE - G ( z ; p , ν 2 + ν 3 , Θ ) F W ( z ) ,
respectively.

3. Distributional Properties

3.1. Median Correlation Coefficient

In statistics and probability theory, the median is a type of summary statistics used to give important information about a certain data point or population. It is the value separating the higher half from the lower half of an ordered data sample, a population, or a probability distribution. For a bivariate version, the median correlation coefficient, say L X 1 , X 2 , can be expressed as a form L X 1 , X 2 = 4 F X 1 , X 2 ( L X 1 , L X 2 ) 1 , where L X 1 and L X 2 denote the median of X 1 and X 2 , respectively. If X 1 DsOGE-G ( Θ , ν 1 + ν 3 ) and X 2 DsOGE-G ( Θ , ν 2 + ν 3 ) ,then
L X 1 , X 2 = 4 F DsOGE - G L X 2 ; p , ν 2 , Θ F DsOGE - G L X 1 ; p , ν 1 + ν 3 , Θ 1 if x 1 x 2 4 F DsOGE - G L X 1 ; p , ν 1 , Θ F DsOGE - G L X 2 ; p , ν 2 + ν 3 , Θ 1 if x 1 > x 2 ,
where L X i ; i = 1 , 2 is the quantile function for the marginals.

3.2. The Conditional CDF of X 1 Given X 2 = x 2 ( X 2 x 2 )

A conditional distribution is a probability distribution for a sub-population. In other words, it shows the probability that a randomly selected item in a sub-population has a characteristic of interest. Conditional distributions have many applications in statistics and machine learning. For example, they can be used in regressing structured response variables. In addition, conditional distributions can be embedded into a Hilbert space, which is potentially useful in applications where conditional distributions are the key quantities of interest. Assuming X follows the BDsOGE-G family, the conditional CDF of ( X 1 X 2 = x 2 ) , say F X 1 X 2 = x 2 ( x 1 ) , is given by
F X 1 X 2 = x 2 ( x 1 x 2 ) = F 1 ( x 1 x 2 ) if 0 x 1 < x 2 F 2 ( x 1 x 2 ) if 0 x 2 < x 1 F 3 ( x 1 x 2 ) if 0 x 1 = x 2 = x ,
where
F 1 ( x 1 x 2 ) = 1 p Ψ ( x 1 + 1 ; Θ ) ν 1 p Ψ ( x 2 + 1 ; Θ ) ν γ 1 p Ψ ( x 2 ; Θ ) ν γ 1 p Ψ ( x 2 + 1 ; Θ ) ν 1 p Ψ ( x 2 ; Θ ) ν ,
F 2 ( x 1 x 2 ) = 1 p Ψ ( x 1 + 1 ; Θ ) ν γ ,
and
F 3 ( x 1 x 2 ) = 1 p Ψ ( x + 1 ; Θ ) ν γ ,
where F X 1 X 2 = x 2 ( x 1 x 2 ) = P ( X 1 x 1 , X 2 = x 2 ) P ( X 2 = x 2 ) . Similarly, the conditional CDF of X 1 X 2 x 2 , say F X 1 X 2 x 2 ( x 1 x 2 ) , is given by
F X 1 X 2 x 2 ( x 1 x 2 ) = 1 p Ψ ( x 1 + 1 ; Θ ) ν 1 p Ψ ( x 2 + 1 ; Θ ) ν ( 1 γ ) if 0 x 1 < x 2 , 1 p Ψ ( x 1 + 1 ; Θ ) ν γ if 0 x 2 < x 1 , 1 p Ψ ( x + 1 ; Θ ) ν γ if 0 x 1 = x 2 = x .

3.3. The Conditional Expectation of X 1 Given X 2 = x 2

In probability theory, the conditional expectation of a random variable is its expected value given that a certain set of “conditions” is known to occur. The conditional expectation is the expected value of a random variable, computed with respect to a conditional probability distribution. To derive the conditional expectation of X 1 , given X 2 = x 2 , the PMF of X 1 given X 2 = x 2 should be calculated first. Assuming X follow the BDsOGE-G family, the PMF of X 1 given X 2 = x 2 can be expressed as
f X 1 X 2 = x 2 ( x 1 x 2 ) = f 1 ( x 1 x 2 ) if 0 x 1 < x 2 f 2 ( x 1 x 2 ) if 0 x 2 < x 1 f 3 ( x 1 x 2 ) if 0 x 1 = x 2 = x ,
where
f 1 ( x 1 x 2 ) = f DsOGE - G ( x 1 ; p , ν , Θ ) f DsOGE - G ( x 2 ; p , ν γ , Θ ) f DsOGE - G ( x 2 ; p , ν , Θ ) ,
f 2 ( x 1 x 2 ) = f DsOGE - G ( x 1 ; p , ν γ , Θ ) ,
and
f 3 ( x 1 x 2 ) = 1 p Ψ ( x + 1 ; Θ ) ν γ f DsOGE - G ( x ; p , ν , Θ ) 1 p Ψ ( x ; Θ ) ν f DsOGE - G ( x ; p , ν γ , Θ ) f DsOGE - G ( x ; p , ν , Θ ) .
Thus, the conditional expectation of X 1 X 2 = x 2 , say E ( X 1 X 2 = x 2 ) , is given by
E ( X 1 X 2 = x 2 ) = i = 0 ν γ x 1 = x 2 + 1 ( 1 ) i x 1 ν γ i p i Ψ ( x 1 + 1 ; Θ ) p i Ψ ( x 1 ; Θ ) + x 2 1 p Ψ ( x 2 + 1 ; Θ ) ν γ + i = 0 ν x 1 = 0 x 2 1 ( 1 ) i x 1 ν i p i Ψ ( x 1 + 1 ; Θ ) p i Ψ ( x 1 ; Θ ) 1 p Ψ ( x 2 + 1 ; Θ ) ν γ 1 p Ψ ( x 2 ; Θ ) ν γ 1 p Ψ ( x 2 + 1 ; Θ ) ν 1 p Ψ ( x 2 ; Θ ) ν x 2 1 p Ψ ( x 2 ; Θ ) ν 1 p Ψ ( x 2 + 1 ; Θ ) ν γ 1 p Ψ ( x 2 ; Θ ) ν γ 1 p Ψ ( x 2 + 1 ; Θ ) ν 1 p Ψ ( x 2 ; Θ ) ν ,
where E ( X 1 X 2 = x 2 ) = x 1 = 0 x 1 f X 1 X 2 = x 2 ( x 1 x 2 ) . The accurate estimation of conditional expectations is an important problem arising in different branches of science and engineering as well as finance, economics and various business applications (see Casella and Berger, [28]; Pfeiffer, [29]; and Emura et al., [26]).

4. The BDsOGE-Weibull (BDsOGEW) Distribution

Considering the CDF of Weibull distribution with one positive shape parameter a, the joint PMF of the BDsOGEW model is
f X 1 , X 2 ( x 1 , x 2 ) = P 1 ( x 1 , x 2 ) if x 1 < x 2 P 2 ( x 1 , x 2 ) if x 2 < x 1 P 3 ( x ) if x 1 = x 2 = x ,
where
P 1 ( x 1 , x 2 ) = 1 p ( e ( x 1 + 1 ) a 1 ) ν 1 p ( e x 1 a 1 ) ν 1 p ( e ( x 2 + 1 ) a 1 ) ν γ 1 p ( e x 2 a 1 ) ν γ ,
P 2 ( x 1 , x 2 ) = 1 p ( e ( x 1 + 1 ) a 1 ) ν γ 1 p ( e x 1 a 1 ) ν γ 1 p ( e ( x 2 + 1 ) a 1 ) ν 1 p ( e x 2 a 1 ) ν
and
P 3 ( x ) = 1 p ( e ( x + 1 ) a 1 ) ν γ 1 p ( e ( x + 1 ) a 1 ) ν 1 p ( e x a 1 ) ν 1 p ( e x a 1 ) ν × 1 p ( e ( x + 1 ) a 1 ) ν γ 1 p ( e x a 1 ) ν γ .
Figure 1 shows the joint PMF of the BDsOGEW model for various values of the parameters.
It is worth noting that the joint probability mass function (PMF) can exhibit either a unimodal or a decreasing surface shape. This flexibility allows it to be used for modeling and analyzing both asymmetric and symmetric data. The corresponding joint CDF and joint RF to Equation (17) can be formulated as
F X 1 , X 2 ( x 1 , x 2 ) = 1 p ( e x 1 a 1 ) ν γ 1 p ( e x 2 a 1 ) ν γ 1 p ( e z a 1 ) ν ( 1 γ )
and
S X 1 , X 2 ( x 1 , x 2 ) = 1 1 p ( e x 1 a 1 ) ν 1 p ( e x 2 a 1 ) ν + 1 p ( e x 1 a 1 ) ν γ × 1 p ( e x 2 a 1 ) ν γ 1 p ( e z a 1 ) ν ( 1 γ ) ,
respectively. Figure 2 and Figure 3 illustrate the joint RF and joint HRF of the proposed bivariate model based on different values of its parameters.
It is noted that the joint HRF can be either a unimodal, constant, or decreasing surface. Thus, the joint HRF of this model can be used to analyze several types of data in various fields. Figure 4 lists different plots of the conditional PMF of X 1 , given that x 2 = 5 , 15 , 25 under various specific schemes Figure 4a: a = 0.4 , p = 0.8 , γ = 1.3 , v = 10 and Figure 4b: a = 0.4 , p = 0.8 , γ = 0.3 , v = 1.8 . Whereas the attitude of the conditional PMF of X 2 , given that x 1 = 5 , 15 , 25 can be proposed through schemes Figure 5a: a = 0.4 , p = 0.8 , γ = 1.3 , v = 10 and Figure 5b: a = 0.4 , p = 0.8 , γ = 0.3 , v = 1.8 in Figure 5.

5. Point and Interval Estimations

5.1. Maximum Likelihood Estimation (MLE)

In statistics, MLE is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed statistical model, the observed data is most probable. MLE has applications in many fields such as physics, engineering, economics, finance, and biology. In this section, the approach of MLE is used to estimate the unknown parameters p , ν , γ and Θ of the BDsOGE-G family. Suppose that a sample size n, of the form ( x 11 , x 21 ) , ( x 12 , x 22 ) , , ( x 1 n , x 2 n ) from BDsOGE-G family. Under the following notations: I 1 = { x 1 j < x 2 j } , I 2 = { x 2 j < x 1 j } , I 0 = { x 1 j = x 2 j = x j } , I = I 1 I 2 I 0 , I 1 = n 1 , I 2 = n 2 , I 0 = n 0 and n = n 1 + n 2 + n 0 , the likelihood function can be expressed as
L ( p , ν , γ , Θ ) = j I 1 P 1 ( x 1 j , x 2 j ) j I 2 P 2 ( x 1 j , x 2 j ) j I 3 P 3 ( x j ) .
The log-likelihood function becomes
l ( p , ν , γ , Θ ) = j I 1 ln g 1 ( x 1 j ; ν ) + j I 1 ln g 1 ( x 2 j ; ν γ ) + j I 2 ln g 1 ( x 1 j ; ν γ ) + j I 2 ln g 1 ( x 2 j ; ν ) + j I 0 ln 1 p Ψ ( x j + 1 ; Θ ) ν γ g 1 ( x j ; ν ) 1 p Ψ ( x j ; Θ ) ν g 1 ( x j ; ν γ ) ,
where
g 1 ( x ; ν ) = 1 p Ψ ( x + 1 ; Θ ) ν 1 p Ψ ( x ; Θ ) ν .
The MLEs of the parameters p , ν , γ and Θ can be obtained by computing the first partial derivatives of Equation (19) with respect to p , ν , γ and Θ , respectively, and setting the resulting equations equal to zero. The first partial derivatives are
l ( p , ν , γ , Θ ) p = j I 1 g 2 ( x 1 j , ν ) g 2 ( x 1 j + 1 , ν ) g 1 ( x 1 j ; ν ) + j I 1 g 2 ( x 2 j , ν γ ) g 2 ( x 2 j + 1 , ν γ ) g 1 ( x 2 j ; ν γ ) + j I 2 g 2 ( x 1 j , ν γ ) g 2 ( x 1 j + 1 , ν γ ) g 1 ( x 1 j ; ν γ ) + j I 2 g 2 ( x 2 j , ν ) g 2 ( x 2 j + 1 , ν ) g 1 ( x 2 j ; ν ) + j I 0 1 p Ψ ( x j + 1 ; Θ ) ν γ ( g 2 ( x j , ν ) g 2 ( x j + 1 , ν ) ) g 2 ( x j + 1 , ν γ ) g 1 ( x j ; ν ) 1 p Ψ ( x j + 1 ; Θ ) ν γ g 1 ( x j ; ν ) 1 p Ψ ( x j ; Θ ) ν g 1 ( x j ; ν γ ) j I 0 1 p Ψ ( x j ; Θ ) ν ( g 2 ( x j , ν γ ) g 2 ( x j + 1 , ν γ ) ) g 2 ( x j , ν ) g 1 ( x j ; ν γ ) 1 p Ψ ( x j + 1 ; Θ ) ν γ g 1 ( x j ; ν ) 1 p Ψ ( x j ; Θ ) ν g 1 ( x j ; ν γ ) ,
l ( p , ν , γ , Θ ) ν = j I 1 g 3 ( x 1 j , ν ) g 3 ( x 1 j + 1 , ν ) g 1 ( x 1 j ; ν ) + j I 1 γ ( g 3 ( x 2 j , ν γ ) g 3 ( x 2 j + 1 , ν γ ) ) g 1 ( x 2 j ; ν γ ) + j I 2 γ ( g 3 ( x 1 j , ν γ ) g 3 ( x 1 j + 1 , ν γ ) ) g 1 ( x 1 j ; ν γ ) + j I 2 g 3 ( x 2 j , ν ) g 3 ( x 2 j + 1 , ν ) g 1 ( x 2 j ; ν ) + j I 0 1 p Ψ ( x j + 1 ; Θ ) ν γ ( g 3 ( x j , ν ) g 3 ( x j + 1 , ν ) ) γ g 3 ( x j + 1 , ν γ ) g 1 ( x j ; ν ) 1 p Ψ ( x j + 1 ; Θ ) ν γ g 1 ( x j ; ν ) 1 p Ψ ( x j ; Θ ) ν g 1 ( x j ; ν γ ) j I 0 γ 1 p Ψ ( x j ; Θ ) ν ( g 3 ( x j , ν γ ) g 3 ( x j + 1 , ν γ ) ) g 3 ( x j , ν ) g 1 ( x j ; ν γ ) 1 p Ψ ( x j + 1 ; Θ ) ν γ g 1 ( x j ; ν ) 1 p Ψ ( x j ; Θ ) ν g 1 ( x j ; ν γ ) ,
l ( p , ν , γ , Θ ) γ = j I 1 ν ( g 3 ( x 2 j , ν γ ) g 3 ( x 2 j + 1 , ν γ ) ) g 1 ( x 2 j ; ν γ ) + j I 2 ν ( g 3 ( x 1 j , ν γ ) g 3 ( x 1 j + 1 , ν γ ) ) g 1 ( x 1 j ; ν γ ) + j I 0 ν g 3 ( x j + 1 , ν γ ) g 1 ( x j ; ν ) ν 1 p Ψ ( x j ; Θ ) ν ( g 3 ( x j , ν γ ) g 3 ( x j + 1 , ν γ ) ) 1 p Ψ ( x j + 1 ; Θ ) ν γ g 1 ( x j ; ν ) 1 p Ψ ( x j ; Θ ) ν g 1 ( x j ; ν γ )
and
l ( p , ν , γ , Θ ) Θ k = j I 1 g 4 ( x 1 j , ν ) g 4 ( x 1 j + 1 , ν ) g 1 ( x 1 j ; ν ) + j I 1 g 4 ( x 2 j , ν γ ) g 4 ( x 2 j + 1 , ν γ ) g 1 ( x 2 j ; ν γ ) + j I 2 g 4 ( x 1 j , ν γ ) g 4 ( x 1 j + 1 , ν γ ) g 1 ( x 1 j ; ν γ ) + j I 2 g 4 ( x 2 j , ν ) g 4 ( x 2 j + 1 , ν ) g 1 ( x 2 j ; ν ) + j I 0 1 p Ψ ( x j + 1 ; Θ ) ν γ ( g 4 ( x j , ν ) g 4 ( x j + 1 , ν ) ) g 4 ( x j + 1 , ν γ ) g 1 ( x j ; ν ) 1 p Ψ ( x j + 1 ; Θ ) ν γ g 1 ( x j ; ν ) 1 p Ψ ( x j ; Θ ) ν g 1 ( x j ; ν γ ) j I 0 1 p Ψ ( x j ; Θ ) ν ( g 4 ( x j , ν γ ) g 4 ( x j + 1 , ν γ ) ) g 4 ( x j , ν ) g 1 ( x j ; ν γ ) 1 p Ψ ( x j ; Θ ) ν γ g 1 ( x j ; ν ) 1 p Ψ ( x j ; Θ ) ν g 1 ( x j ; ν γ ) .
where
g 2 ( x ; ν ) = ν p Ψ ( x ; Θ ) 1 1 p Ψ ( x ; Θ ) ν 1 , g 3 ( x ; ν ) = ln 1 p Ψ ( x ; Θ ) 1 p Ψ ( x ; Θ ) ν , g 4 ( x ; ν ) = ν H · ( x ; Θ ) ln p p Ψ ( x ; Θ ) 1 p Ψ ( x ; Θ ) ν 1 ,
where H · ( x ; Θ ) = Θ k Ψ ( x ; Θ ) and Θ k is a vector of the parameters of the baseline model ( k = 1 , 2 , 3 , ). The MLEs of the parameters p , ν , γ and Θ can be obtained by solving the above system of non-linear equations. The solution of these equations is not easy to solve. Thus, the “NMaximize” function in the Mathematica software was utilized. The “NMaximize” is a built-in function that attempts to find a global maximum of a function or an expression numerically see Appendix A.

5.2. Asymptotic Confidence Intervals

Asymptotic confidence intervals (ACIs) are confidence intervals (CIs) that are based on the normal approximation of the sampling distribution of a statistic. They are also called large-sample CIs because they are only valid when the sample size is large enough. The asymptotic normal distribution of the MLE is the most widely utilized approach for establishing confidence bounds for the model parameters. With respect to the Fisher information matrix (FIM), denoted as I ( Y ) , which consists of the negative second derivatives of the l ( p , ν , γ , Θ k ) evaluated at Y ^ = ( p ^ , ν ^ , γ ^ , Θ k ^ ) , the asymptotic variance-covariance matrix (AVCM) of the MLE of the parameters, assuming that the AVCM of the parameter vector can be formulated as
I ( Y ^ ) = E 2 l ( Y ) p 2 2 l ( Y ) ν p 2 l ( Y ) ν 2 2 l ( Y ) γ p 2 l ( Y ) γ ν 2 l ( Y ) γ 2 2 l ( Y ) Θ k p 2 l ( Y ) Θ k ν 2 l ( Y ) Θ k γ 2 l ( Y ) Θ k 2 ,
where V a r ( Y ^ ) = I 1 ( Y ^ ) . Based on the asymptotic normality of the MLE, a 100 ( 1 α ) % confidence interval for parameter Y can be constructed as follows: p ^ ± Z α / 2 V a r ( p ) , ν ^ ± Z α / 2 V a r ( ν ) , γ ^ ± Z α / 2 V a r ( γ ) , and Θ k ^ ± Z α / 2 V a r ( Θ k ) . The second derivatives of the likelihood function can be calculated with Maple software.

6. Simulation: Estimators Performance

Simulation for bivariate discrete data in statistics is a technique to generate random data that follows a joint probability distribution of two discrete variables. There are different methods to simulate bivariate discrete data, depending on the type and structure of the distribution. One common method is to use copulas, which are functions that link univariate marginal distributions to a multivariate distribution. Copulas can capture the dependence structure between the variables and allow for flexible modeling of different types of correlations. Another method is to use discretization of continuous bivariate distributions. This involves rounding or truncating the continuous values to discrete values, and adjusting the probabilities accordingly. A third method is to use specific bivariate discrete distributions. Depending on the type of estimator, different metrics may be used to measure the performance of the estimator. These measures may include bias, mean squared error (MSE), confidence intervals (CIs), and coverage probability (CP) of the 95% CIs. In this segment, the second approach has been used. The performance of the MLE is tested under different schemes for the BDsOGEW parameter as follows:
  • Scheme I: (∀ p = 0.1 , ν = 0.3 , γ = 0.5 , a = 0.2 | n 1 = 20, n 2 = 50, n 3 = 150, n 4 = 300, n 5 = 500, n 6 = 700);
  • Scheme II: (∀ p = 0.2 , ν = 0.5 , γ = 0.3 , a = 0.5 | n 1 = 20, n 2 = 50, n 3 = 150, n 4 = 300, n 5 = 500, n 6 = 700);
  • Scheme III: (∀ p = 0.5 , ν = 0.3 , γ = 0.7 ,   a = 0.9 | n 1 = 20, n 2 = 50, n 3 = 150, n 4 = 300, n 5 = 500, n 6 = 700)
The numerical assessments are performed depending on the bias, MSE, CP, and 95 % CIs “lower bound (LB) and upper bound (UB)” using software R package. In general, an estimator is considered good if it has low bias and low variance. The empirical results are reported in Table 1, Table 2 and Table 3.
According to the simulation results/performance as n + , the bias and MSE decrease; and consequently, an unbiased estimator was achieved for large samples under consistency condition. Thus, the maximum likelihood approach can be used effectively to estimate the BDsOGEW parameters.

7. Data Analysis

In this section, we clear the experimental importance of the BDsOGE-G family by using three applications. In each data set, we compare the fits of the proposed BDsOGEW distribution with well-known bivariate models. The tested distributions are compared using some criteria namely, the negative maximized log-likelihood ( l ), Akaike information criterion (AIC), and Hannan-Quinn information criterion (HQIC). The AIC is an estimator of prediction error and thereby relative quality of statistical models for a given set of data. It estimates the quality of each model, relative to each of the other models. AIC provides a means for model selection. The HQIC is a criterion for model selection. It is an alternative to AIC and Bayesian information criterion.

7.1. Data Set I: Nasal Drainage Severity Score

This data set represents the efficacy of steam inhalation in the treatment of common cold symptoms “ 0 = no symptoms; 1 = mild symptoms; 2 = moderate symptoms; 3 = severe symptoms” (see Davis, [30]). Figure 6 shows scatter and violin plots of the Nasal drainage severity score data.
Before analyzing the bivariate data, we first fit the marginals X 1 and X 2 separately and min ( X 1 , X 2 ) . The MLEs of the parameters ν , p and α of the corresponding univariate distribution for X 1 , X 2 and min ( X 1 , X 2 ) are ( 1.915 , 0.809 , 0.911 ), ( 5.073 , 0.516 , 0.601 ) and ( 1.591 , 0.769 , 0.923 ), respectively. The l values are 78.855 , 78.031 and 76.781 , respectively. Moreover, the p-values ranged from 0.658 to 0.759 . Now, we compare the BDsOGEW distribution with some competitive distributions like bivariate discrete exponential (BDsE), bivariate Poisson with four parameters (BPo-4P), independent bivariate Poisson (IBPo), bivariate discrete inverse exponential (BDsIE), bivariate discrete inverse Rayleigh (BDsIR), and bivariate discrete inverse Weibull (BDsIW) distributions. Table 4 lists the MLEs and some goodness-of-fit measures (GOFM).
It is observed that the BDsOGEW model is the best among all tested models because it has the smallest values among l , AIC, and HQIC when compared to the other competitive models. The 95 % CIs for the BDsOGEW parameters γ , v , p , a can be listed as [ 0.378 , 0.770 ] , [ 2.165 , 2.748 ] , [ 0.624 , 0.820 ] , and [ 0.594 , 0.965 ] , respectively. Figure 7 shows the estimated joint PMF for the BDsOGEW distribution and the other competitive models, which support the results of Table 4.
Figure 8, Figure 9 and Figure 10 show the profile l for each parameter as well as contour plots. It has been found that the maximum likelihood estimates are unique.

7.2. Data Set II: Football Score

The data set consists of a football match score in talian football matches from 1996 to 2011, between “ACF Fiorentina” ( X 1 ) and “Juventus” ( X 2 ) . The data source is “http://www.worldfootball.net/competition/ita-serie-a/ (accessed on 16 March 2023)”. Figure 11 shows scatter and violin plots of the football score data.
We first fit the marginals X 1 and X 2 separately and min ( X 1 , X 2 ) . The MLEs of the parameters ν , p and α of the corresponding univariate distribution for X 1 , X 2 and min ( X 1 , X 2 ) are ( 29.269 , 0.165 , 0.357 ), ( 5.083 , 0.534 , 0.631 ) and ( 28.051 , 0.152 , 0.384 ), respectively. The l values are 65.898 , 67.856 and 64.723 , respectively. Moreover, the p-values ranged from 0.786 to 0.801 . Now, we compare the BDsOGEW distribution with some competitive distributions like BDsE, bivariate discrete Rayleigh (BDsR), bivariate discrete Weibull (BDsW), bivariate Poisson with minimum operator (BPo min ), bivariate Poisson with three parameters (BPo-3P), inverse bivariate Poisson (IBPo), BDsIE, and BDsIR distributions. Table 5 lists the MLEs and some GOFM.
It is noted that the BDsOGEW distribution is the best among all tested models. The 95 % CIs for the BDsOGEW parameters γ , v , p , a can be reported as [ 0.620 , 0.897 ] , [ 23.529 , 24.084 ] , [ 0.0611 , 0.3648 ] , and [ 0.253 , 0.544 ] , respectively. Figure 12 shows the estimated joint PMF for the BDsOGEW distribution and the other competitive distributions, which prove the results of Table 5.
Figure 13, Figure 14 and Figure 15 prove that the maximum likelihood estimates are unique.

8. Concluding Remarks and Future Work

In this paper, a flexible bivariate odd discrete generator has been introduced based on the copula concept. Some of its statistical properties have been investigated. After reporting the general class, one special model of the new bivariate family has been studied in-detail. It was found that the new joint probability mass function can be utilized to model asymmetric as well as symmetric data surfaces. Moreover, the hazard rate function of the newly created distribution can be used to discuss various types of failures, including increasing-, decreasing-, bathtub-, and unimodal-shaped surface. The bivariate model parameters have been estimated utilizing the maximum likelihood technique. A simulation has been performed to test the performance of the maximum likelihood estimators based on different sample sizes, and it was found that the maximum likelihood approach could be used to discuss the real data herein. Finally, two distinctive real data sets, “Nasal drainage severity score” and “Football score”, have been analyzed, and it was found that the proposed bivariate family has worked quite well in modeling the data. As a future work, the bivariate quantile residual life and multivariate extension of the proposed model with its applications will be discussed.

Author Contributions

Conceptualization, M.S.E. and M.E.-M.; Methodology, H.S.S. and H.N.A.; Software, M.S.E., L.A.A.-E. and M.E.-M.; Validation, A.A.K.; Formal analysis, L.A.A.-E. and M.S.E.; Investigation, H.S.S., A.A.K. and M.E.-M.; Resources, L.A.A.-E., H.S.S. and A.A.K.; Data curation, M.S.E. and H.N.A.; Writing—review & editing, L.A.A.-E., M.S.E. and H.N.A.; Visualization, H.S.S. and A.A.K.; Supervision, M.S.E. and M.E.-M. All authors have read and agreed to the published version of the manuscript.

Funding

Princess Nourah bint Abdulrahman University Researchers Supporting Project and Prince Sattam bin Abdulaziz Universities under project numbers (PNURSP2023R443) and (PSAU/2023/R/1444), respectively.

Data Availability Statement

The data sets are available in the paper.

Acknowledgments

Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2023R443), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. This study is supported via funding from Prince Sattam bin Abdulaziz University, project number (PSAU/2023/R/1444).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

  • The PMF of the BDsOGE-G
Utilizing Equations (3)–(6), the join PMF of the BDsOGE-G can be expressed as
f X 1 , X 2 ( x 1 , x 2 ) = C ( F X 1 ( x 1 + 1 ) , F X 2 ( x 2 + 1 ) ) C ( F X 1 ( x 1 + 1 ) , F X 2 ( x 2 ) ) C ( F X 1 ( x 1 ) , F X 2 ( x 2 + 1 ) ) + C ( F X 1 ( x 1 ) , F X 2 ( x 2 ) ) = P 1 ( x 1 , x 2 ) if x 1 < x 2 P 2 ( x 1 , x 2 ) if x 2 < x 1 P 3 ( x ) if x 1 = x 2 = x ,
where
P 1 ( x 1 , x 2 ) = 1 p Ψ ( x 1 + 1 ; Θ ) ν 1 p Ψ ( x 2 + 1 ; Θ ) ν γ 1 p Ψ ( x 1 + 1 ; Θ ) ν 1 p Ψ ( x 2 ; Θ ) ν γ 1 p Ψ ( x 1 ; Θ ) ν 1 p Ψ ( x 2 + 1 ; Θ ) ν γ + 1 p Ψ ( x 1 ; Θ ) ν 1 p Ψ ( x 2 ; Θ ) ν γ = 1 p Ψ ( x 1 + 1 ; Θ ) ν [ 1 p Ψ ( x 2 + 1 ; Θ ) ν γ 1 p Ψ ( x 2 ; Θ ) ν γ ] 1 p Ψ ( x 1 ; Θ ) ν [ 1 p Ψ ( x 2 + 1 ; Θ ) ν γ 1 p Ψ ( x 2 ; Θ ) ν γ ] = 1 p Ψ ( x 1 + 1 ; Θ ) ν 1 p Ψ ( x 1 ; Θ ) ν 1 p Ψ ( x 2 + 1 ; Θ ) ν γ 1 p Ψ ( x 2 ; Θ ) ν γ = f DsOGE - G ( x 1 ; p , ν , Θ ) f DsOGE - G ( x 2 ; p , ν , γ , Θ ) ,
P 2 ( x 1 , x 2 ) = 1 p Ψ ( x 2 + 1 ; Θ ) ν 1 p Ψ ( x 1 + 1 ; Θ ) ν γ 1 p Ψ ( x 2 ; Θ ) ν 1 p Ψ ( x 1 + 1 ; Θ ) ν γ 1 p Ψ ( x 2 + 1 ; Θ ) ν 1 p Ψ ( x 2 ; Θ ) ν + 1 p Ψ ( x 1 ; Θ ) ν γ 1 p Ψ ( x 2 ; Θ ) ν = 1 p Ψ ( x 1 + 1 ; Θ ) ν γ [ 1 p Ψ ( x 2 + 1 ; Θ ) ν 1 p Ψ ( x 2 ; Θ ) ν ] 1 p Ψ ( x 1 ; Θ ) ν γ [ 1 p Ψ ( x 2 + 1 ; Θ ) ν 1 p Ψ ( x 2 ; Θ ) ν ] = 1 p Ψ ( x 1 + 1 ; Θ ) ν γ 1 p Ψ ( x 1 ; Θ ) ν γ 1 p Ψ ( x 2 + 1 ; Θ ) ν 1 p Ψ ( x 2 ; Θ ) ν = f DsOGE - G ( x 1 ; p , ν , γ , Θ ) f DsOGE - G ( x 2 ; p , ν , Θ )
and
P 3 ( x ) = 1 p Ψ ( x + 1 ; Θ ) ν γ 1 p Ψ ( x + 1 ; Θ ) ν 1 p Ψ ( x + 1 ; Θ ) ν γ 1 p Ψ ( x ; Θ ) ν 1 p Ψ ( x ; Θ ) ν 1 p Ψ ( x + 1 ; Θ ) ν γ + 1 p Ψ ( x ; Θ ) ν 1 p Ψ ( x ; Θ ) ν γ = 1 p Ψ ( x + 1 ; Θ ) ν γ 1 p Ψ ( x + 1 ; Θ ) ν 1 p Ψ ( x ; Θ ) ν 1 p Ψ ( x ; Θ ) ν × 1 p Ψ ( x + 1 ; Θ ) ν γ 1 p Ψ ( x ; Θ ) ν γ = 1 p Ψ ( x + 1 ; Θ ) ν γ f DsOGE - G ( x ; p , ν , Θ ) 1 p Ψ ( x ; Θ ) ν f DsOGE - G ( x ; p , ν , γ , Θ ) .
  • Estimation code for real data
L o k [ a _ , p _ v _ , γ _ ] : = i = 1 n 1 Log 1 p 1 + e ( x 1 [ [ i ] ] + 1 ) a v 1 p 1 + e ( x 1 [ [ i ] ] ) a v
+ i = 1 n 1 Log 1 p 1 + e ( x 2 [ [ i ] ] + 1 ) a v * γ 1 p 1 + e ( x 2 [ [ i ] ] ) a v * γ
+ i = 1 n 2 Log 1 p 1 + e ( Y 1 [ [ i ] ] + 1 ) a v * γ 1 p 1 + e ( Y 1 [ [ i ] ] ) a v * γ
+ i = 1 n 2 Log 1 p 1 + e ( Y 2 [ [ i ] ] + 1 ) a v 1 p 1 + e ( Y 2 [ [ i ] ] ) a v
+ i = 1 n 3 Log 1 p 1 + e ( z [ [ i ] ] + 1 ) a v * γ * 1 p 1 + e ( z [ [ i ] ] + 1 ) a v 1 p 1 + e ( z [ [ i ] ] ) a v 1 p 1 + e ( z [ [ i ] ] ) a v * 1 p 1 + e ( z [ [ i ] ] + 1 ) a v * γ 1 p 1 + e ( z [ [ i ] ] ) a v * γ
parameter = 4;
large = {{2, 0}, {3, 2}, {1, 0}, {2, 1}, {2, 1}, {3, 1}, {2, 1}, {2, 0}};
small = {{0, 2}, {2, 3}, {1, 3}, {2, 3}, {0, 1}};
equal = {{1, 1}, {0, 0}, {1, 1}, {1, 1}, {2, 2}, {1, 1}, {2, 2}, {1, 1}, {2, 2}, {1, 1}, {2, 2}, {1, 1}, {2, 2}, {1, 1}, {1, 1}, {1, 1}, {3, 3}};
Unprotect[Power]; Power[0 | 0., 0 | 0.] = 1; Protect[Power];
Y1 = large[[All, 1]];
Y2 = large[[All, 2]];
n1 = Length[x1]
x1 = small[[All, 1]];
x2 = small[[All, 2]];
n2 = Length[Y1]
z = equal[[All, 1]];
n3 = Length[z]
NMaximize { L o k [ a _ , p _ v _ , γ _ ] , 0 < p < 1 , 0 < a < , 0 < v < , 0 < γ < 1 } , { a , p , v , γ }

References

  1. Alzaatreh, A.; Lee, C.; Famoye, F. A new method for generating families of continuous distributions. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef]
  2. Tahir, M.H.; Cordeiro, G.M.; Alizadeh, M.; Mansoor, M.; Zubair, M.; Hamedani, G.G. The odd generalized exponential family of distributions with applications. J. Stat. Appl. 2015, 2, 1. [Google Scholar] [CrossRef]
  3. Silva, G.F.S.; Percontini, A.; de Brito, E.; Ramos, M.W.; Venâncio, R.; Cordeiro, G.M. The odd Lindley-G family of distributions. Austrian J. Stat. 2017, 46, 65–87. [Google Scholar] [CrossRef]
  4. Alizadeh, M.; Ghosh, I.; Yousof, H.M.; Rasekhi, M.; Hamedani, G.G. The generalized odd generalized exponential family of distributions: Properties, characterizations and applications. J. Data Sci. 2017, 15, 443–465. [Google Scholar] [CrossRef]
  5. Korkmaz, M.C.; Yousof, H.M.; Hamedani, G.G. The exponential Lindley odd log-logistic-G family: Properties, characterizations and applications. J. Stat. Theory Appl. 2018, 17, 554–571. [Google Scholar] [CrossRef]
  6. Djibrila, S. The generalized odd inverted exponential-G family of distributions: Properties and applications. Eurasian Bull. Math. 2019, 2, 86–110. [Google Scholar]
  7. Reyad, H.; Othman, S.; Ul Haq, M.A. The transmuted generalized odd generalized exponential-G family of distributions: Theory and applications. J. Data Sci. 2019, 17, 279–300. [Google Scholar] [CrossRef]
  8. Alizadeh, M.; Afify, A.Z.; Eliwa, M.S.; Ali, S. The odd log-logistic Lindley-G family of distributions: Properties, Bayesian and non-Bayesian estimation with applications. Comput. Stat. 2020, 35, 281–308. [Google Scholar] [CrossRef]
  9. Balakrishnan, N.; Lai, C.D. Continuous Bivariate Distributions, 2nd ed.; Wiley: New York, NY, USA, 2009; Volume 1. [Google Scholar]
  10. Johnson, M.E.; Tenenbein, A. A bivariate distribution family with specified marginals. J. Am. Assoc. 1981, 76, 198–201. [Google Scholar] [CrossRef]
  11. Quesada-Molina, J.J.; Rodrguez-Lallena, J.A. Bivariate copulas with quadratic sections. Journaltitle Nonparametr. Stat. 1995, 5, 323–337. [Google Scholar] [CrossRef]
  12. Fang, K.T.; Fang, H.B.; Rosen, D.V. A family of bivariate distributions with non-elliptical contours. Commun.-Stat.-Theory Methods 2000, 29, 1885–1898. [Google Scholar] [CrossRef]
  13. Durante, F. A new family of symmetric bivariate copulas. Comptes Rendus Math. 2007, 344, 195–198. [Google Scholar] [CrossRef]
  14. Kundu, D.; Gupta, R.D. A class of bivariate models with proportional reversed hazard marginals. Sankhya B 2010, 72, 236–253. [Google Scholar] [CrossRef]
  15. Sarabia, J.M.; Prieto, F.; Jorda, V. Bivariate beta-generated distributions with applications to well-being data. J. Stat. Distrib. Appl. 2014, 1, 15. [Google Scholar] [CrossRef]
  16. Roozegar, R.; Jafari, A.A. On bivariate exponentiated extended Weibull family of distributions. arXiv 2015, arXiv:1507.07535. [Google Scholar] [CrossRef]
  17. Eliwa, M.S.; Alhussain, Z.A.; Ahmed, E.A.; Salah, M.M.; Ahmed, H.H.; El-Morshedy, M. Bivariate Gompertz generator of distributions: Statistical properties and estimation with application to model football data. J. Natl. Sci. Found. Sri Lanka 2020, 48. [Google Scholar] [CrossRef]
  18. Lee, H.; Cha, J.H. On two general classes of discrete bivariate distributions. Am. Stat. 2015, 69, 221–230. [Google Scholar] [CrossRef]
  19. Kundu, D.; Nekoukhou, V. Univariate and bivariate geometric discrete generalized exponential distributions. J. Stat. Theory Pract. 2018, 12, 595–614. [Google Scholar] [CrossRef]
  20. El-Morshedy, M.; Eliwa, M.S.; El-Gohary, A.; Khalil, A.A. Bivariate exponentiated discrete Weibull distribution: Statistical properties, estimation, simulation and applications. Math. Sci. 2020, 14, 29–42. [Google Scholar] [CrossRef]
  21. Nekoukhou, V.; Khalifeh, A.; Bidram, H. A bivariate discrete inverse resilience family of distributions with resilience marginals. J. Appl. Stat. 2020, 48, 1071–1091. [Google Scholar] [CrossRef] [PubMed]
  22. De Oliveira, R.P.; Achcar, J.A. A new flexible bivariate discrete Rayleigh distribution generated by the Marshall-Olkin family. Model Assist. Stat. Appl. 2020, 15, 19–34. [Google Scholar] [CrossRef]
  23. Kobus, M.; Kurek, R. Copula-based measurement of interdependence for discrete distributions. J. Math. 2018, 79, 27–39. [Google Scholar] [CrossRef]
  24. Najarzadegan, H.; Alamatsaz, M.H.; Kazemi, I. Discrete bivariate distributions generated by copulas: Dbeew distribution. J. Stat. Theory Pract. 2019, 13, 1–30. [Google Scholar] [CrossRef]
  25. Yamaguchi, Y.; Maruo, K. Bivariate beta-binomial model using Gaussian copula for bivariate meta-analysis of two binary outcomes with low incidence. Jpn. J. Stat. Data Sci. 2019, 2, 347–373. [Google Scholar] [CrossRef]
  26. Emura, T.; Matsui, S.; Rondeau, V. Survival Analysis with Correlated Endpoints: Joint Frailty-Copula Models; Springer: Berlin/Heidelberg, Germany, 2019. [Google Scholar]
  27. Cuadras, C.M.; Augé, J. A continuous general multivariate distribution and its properties. Commun.-Stat.-Theory Methods 1981, 10, 339–353. [Google Scholar] [CrossRef]
  28. Casella, G.; Berger, R.L. Statistical Inference; Duxbury Press: Pacific Grove, CA, USA, 2002. [Google Scholar]
  29. Pfeiffer, P.E. Conditional Independence in Applied Probability; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
  30. Davis, C.S. Statistical Methods for the Analysis of Repeated Measures Data; Springer: New York, NY, USA, 2002. [Google Scholar]
Figure 1. The joint PMF of the BDsOGEW model.
Figure 1. The joint PMF of the BDsOGEW model.
Axioms 12 00534 g001
Figure 2. The joint RF of the BDsOGEW model.
Figure 2. The joint RF of the BDsOGEW model.
Axioms 12 00534 g002
Figure 3. The joint HRF of the BDsOGEW model.
Figure 3. The joint HRF of the BDsOGEW model.
Axioms 12 00534 g003
Figure 4. Some scatter plots of the conditional PMF for X 1 given x 2 .
Figure 4. Some scatter plots of the conditional PMF for X 1 given x 2 .
Axioms 12 00534 g004
Figure 5. Some scatter plots of the conditional PMF for X 2 given x 1 .
Figure 5. Some scatter plots of the conditional PMF for X 2 given x 1 .
Axioms 12 00534 g005
Figure 6. Scatter and violin plots of data set I.
Figure 6. Scatter and violin plots of data set I.
Axioms 12 00534 g006
Figure 7. The estimated joint PMF based on data set I.
Figure 7. The estimated joint PMF based on data set I.
Axioms 12 00534 g007
Figure 8. The profile l for parameters a, γ , p and v for data set I.
Figure 8. The profile l for parameters a, γ , p and v for data set I.
Axioms 12 00534 g008
Figure 9. A 3-dimensional surface profile likelihood of a, γ , p and v based on data set I.
Figure 9. A 3-dimensional surface profile likelihood of a, γ , p and v based on data set I.
Axioms 12 00534 g009
Figure 10. Contour diagrams of the model estimators based on data set I.
Figure 10. Contour diagrams of the model estimators based on data set I.
Axioms 12 00534 g010
Figure 11. Scatter and violin plots of data set II.
Figure 11. Scatter and violin plots of data set II.
Axioms 12 00534 g011
Figure 12. The estimated joint PMF based on data set II.
Figure 12. The estimated joint PMF based on data set II.
Axioms 12 00534 g012
Figure 13. The profile l of a, γ , p and v according to data set II.
Figure 13. The profile l of a, γ , p and v according to data set II.
Axioms 12 00534 g013
Figure 14. A 3-dimensional surface profile likelihood of a, γ , p and v based on data set II.
Figure 14. A 3-dimensional surface profile likelihood of a, γ , p and v based on data set II.
Axioms 12 00534 g014
Figure 15. Contour diagrams of the model estimators based on data set II.
Figure 15. Contour diagrams of the model estimators based on data set II.
Axioms 12 00534 g015
Table 1. Simulation results for scheme I.
Table 1. Simulation results for scheme I.
n 1 = 20 n 2 = 50
p ν γ a p ν γ a
Bias 0.10125 0.21458 0.18256 0.25203 0.05236 0.18256 0.12469 0.19636
MSE 0.09856 0.18256 0.15246 0.21552 0.04254 0.15635 0.09850 0.16504
CP 0.88719 0.92335 0.93419 0.83963 0.88436 0.92144 0.93223 0.83745
95 % CI L B 0.06523 0.24714 0.42302 0.14025 0.07136 0.25748 0.44120 0.15699
95 % CI U B 0.14569 0.35125 0.59368 0.27145 0.13825 0.34283 0.56774 0.26632
n 3 = 150 n 4 = 300
p ν γ a p ν γ a
Bias 0.01365 0.12145 0.08254 0.14231 0.00858 0.07156 0.0236 0.11258
MSE 0.01258 0.10395 0.06632 0.12569 0.00652 0.05236 0.01428 0.09254
CP 0.88253 0.92074 0.93064 0.83164 0.88162 0.91886 0.92846 0.82854
95 % CI L B 0.07512 0.26314 0.46256 0.16369 0.08124 0.27142 0.47428 0.17402
95 % CI U B 0.12415 0.34472 0.54239 0.24012 0.11932 0.33748 0.53201 0.23148
n 5 = 500 n 6 = 700
p ν γ a p ν γ a
Bias 0.00103 0.01254 0.00125 0.06317 0.00003 0.00235 0.00024 0.00301
MSE 0.00084 0.00825 0.00052 0.01426 0.00001 0.00082 0.00011 0.00082
CP 0.87949 0.91278 0.92676 0.81746 0.87552 0.91075 0.92019 0.81298
95 % CI L B 0.08412 0.28301 0.48241 0.18012 0.09174 0.28102 0.49102 0.19125
95 % CI U B 0.11363 0.32823 0.53294 0.21989 0.10857 0.31025 0.52138 0.21011
Table 2. Simulation results for scheme II.
Table 2. Simulation results for scheme II.
n 1 = 20 n 2 = 50
p ν γ a p ν γ a
Bias 0.12369 0.23529 0.13085 0.16625 0.10238 0.20136 0.09820 0.14823
MSE 0.10925 0.22968 0.12821 0.15530 0.09569 0.18014 0.07742 0.13224
CP 0.91203 0.82734 0.94537 0.85016 0.91027 0.82256 0.94521 0.84771
95 % CI L B 0.15365 0.40123 0.22303 0.46369 0.16325 0.42013 0.24120 0.47102
95 % CI U B 0.26636 0.59325 0.38204 0.54215 0.24932 0.57012 0.36025 0.53325
n 3 = 150 n 4 = 300
p ν γ a p ν γ a
Bias 0.08256 0.16328 0.04230 0.12012 0.02137 0.11205 0.00803 0.08825
MSE 0.06636 0.14200 0.03047 0.09825 0.01452 0.09852 0.00625 0.06328
CP 0.90734 0.82019 0.94338 0.84309 0.90193 0.81578 0.94219 0.84290
95 % CI L B 0.17714 0.43025 0.25202 0.48158 0.18230 0.45201 0.28323 0.48623
95 % CI U B 0.23852 0.55236 0.34525 0.52745 0.22825 0.54414 0.32256 0.51445
n 5 = 500 n 6 = 700
p ν γ a p ν γ a
Bias 0.00615 0.02167 0.00012 0.00215 0.00023 0.00273 0.00003 0.00052
MSE 0.00321 0.00825 0.00008 0.00102 0.00004 0.00019 0.00001 0.00008
CP 0.89723 0.81426 0.94139 0.84311 0.89227 0.81337 0.93772 0.84382
95 % CI L B 0.18936 0.46236 0.29201 0.49012 0.19102 0.47525 0.29025 0.49523
95 % CI U B 0.21985 0.53926 0.30941 0.51125 0.20824 0.53101 0.30125 0.50585
Table 3. Simulation results for scheme III.
Table 3. Simulation results for scheme III.
n 1 = 20 n 2 = 50
p ν γ a p ν γ a
Bias 0.09858 0.24223 0.19302 0.16014 0.07145 0.19073 0.16325 0.14085
MSE 0.07145 0.21014 0.16328 0.14025 0.05241 0.17452 0.14145 0.13748
CP 0.92747 0.81457 0.97194 0.95712 0.91265 0.82602 0.97188 0.95639
95 % CI L B 0.46568 0.19365 0.58636 0.75145 0.47188 0.22325 0.61258 0.79569
95 % CI U B 0.54258 0.41256 0.82414 1.15369 0.53521 0.38525 0.79201 1.10254
n 3 = 150 n 4 = 300
p ν γ a p ν γ a
Bias 0.02035 0.14625 0.13205 0.12638 0.00413 0.12025 0.11015 0.11365
MSE 0.01996 0.13236 0.11859 0.12025 0.00251 0.11301 0.09852 0.09852
CP 0.91746 0.82746 0.96984 0.95612 0.92957 0.88230 0.96901 0.95598
95 % CI L B 0.48194 0.24189 0.64302 0.82638 0.48831 0.25885 0.66358 0.86677
95 % CI U B 0.52825 0.34125 0.76012 0.98510 0.51254 0.33236 0.73258 0.93748
n 5 = 500 n 6 = 700
p ν γ a p ν γ a
Bias 0.00074 0.08236 0.08858 0.09014 0.00002 0.00714 0.00752 0.01173
MSE 0.00066 0.06326 0.07968 0.06852 0.00001 0.00513 0.00524 0.00721
CP 0.90166 0.88109 0.96836 0.95583 0.90654 0.88112 0.96554 0.95566
95 % CI L B 0.49134 0.27256 0.68225 0.87480 0.49014 0.28541 0.69221 0.88369
95 % CI U B 0.51015 0.32748 0.71529 0.93254 0.50895 0.31526 0.70418 0.91596
Table 4. The MLEs and GOFM for data set I.
Table 4. The MLEs and GOFM for data set I.
ModelMLEs l AICHQIC
BDsE ν ^ 1 = 0.846 ,   ν ^ 2 = 0.792 ,   ν ^ 3 = 0.693 88.002 182.004 183.349
BPo-4P λ ^ 1 = 0.262 ,   γ ^ 1 = 0.165 ,   λ ^ 2 = 0.405 ,   γ ^ 2 = 2.971 77.664 163.328 165.121
IBPo λ ^ 1 = 1.499 ,   λ ^ 2 = 1.367 92.478 188.956 189.853
BDsIE ν ^ 1 = 0.501 ,   ν ^ 2 = 0.622 ,   ν ^ 3 = 0.383 92.482 190.964 192.309
BDsIR ν ^ 1 = 0.262 ,   ν ^ 2 = 0.405 ,   ν ^ 3 = 0.363 78.659 163.318 164.663
BDsIW ν ^ 1 = 0.192 ,   ν ^ 2 = 0.337 ,   ν ^ 3 = 0.360 ,   ζ ^ = 2.453 76.513 161.026 162.815
BDsOGEW γ ^ = 0.574 ,   ν ^ = 2.457 ,   p ^ = 0.722 ,   a ^ = 0.780 71.162 150.324 152.117
Table 5. The MLEs and GOFM for data set II.
Table 5. The MLEs and GOFM for data set II.
ModelMLEs l AICHQIC
BDsE ν ^ 1 = 0.625 ,   ν ^ 2 = 0.812 ,   ν ^ 3 = 0.713 75.421 156.842 157.929
BDsR ν ^ 1 = 0.790 ,   ν ^ 2 = 0.872 ,   ν ^ 3 = 0.905 63.931 133.862 134.949
BDsW ν ^ 1 = 1.36 ,   ν ^ 2 = 2.10 ,   ν ^ 3 = 2.27 ,   ζ ^ = 2.125 63.911 135.822 137.271
BPo min ν ^ 1 = 1.36 ,   ν ^ 2 = 2.10 ,   ν ^ 3 = 2.27 64.228 134.456 135.543
BPo-3P γ ^ 1 = 1.08 ,   γ ^ 2 = 1.38 ,   γ ^ 3 = 0.70 64.932 135.864 136.951
IBPo λ ^ 1 = 1.08 ,   λ ^ 2 = 1.38 67.623 139.246 139.971
BDsIE ν ^ 1 = 0.669 ,   ν ^ 2 = 0.388 ,   ν ^ 3 = 0.514 78.541 163.082 164.169
BDsIR ν ^ 1 = 0.493 ,   ν ^ 2 = 0.212 ,   ν ^ 3 = 0.561 64.102 134.204 135.291
BDsOGEW γ ^ = 0.759 ,   ν ^ = 23.807 ,   p ^ = 0.213 ,   a ^ = 0.399 62.589 133.178 134.627
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Al-Essa, L.A.; Eliwa, M.S.; Shahen, H.S.; Khalil, A.A.; Alqifari, H.N.; El-Morshedy, M. Bivariate Discrete Odd Generalized Exponential Generator of Distributions for Count Data: Copula Technique, Mathematical Theory, and Applications. Axioms 2023, 12, 534. https://doi.org/10.3390/axioms12060534

AMA Style

Al-Essa LA, Eliwa MS, Shahen HS, Khalil AA, Alqifari HN, El-Morshedy M. Bivariate Discrete Odd Generalized Exponential Generator of Distributions for Count Data: Copula Technique, Mathematical Theory, and Applications. Axioms. 2023; 12(6):534. https://doi.org/10.3390/axioms12060534

Chicago/Turabian Style

Al-Essa, Laila A., Mohamed S. Eliwa, Hend S. Shahen, Amal A. Khalil, Hana N. Alqifari, and Mahmoud El-Morshedy. 2023. "Bivariate Discrete Odd Generalized Exponential Generator of Distributions for Count Data: Copula Technique, Mathematical Theory, and Applications" Axioms 12, no. 6: 534. https://doi.org/10.3390/axioms12060534

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop