Bivariate Discrete Odd Generalized Exponential Generator of Distributions for Count Data: Copula Technique, Mathematical Theory, and Applications

Al-Essa, Laila A.; Eliwa, Mohamed S.; Shahen, Hend S.; Khalil, Amal A.; Alqifari, Hana N.; El-Morshedy, Mahmoud

doi:10.3390/axioms12060534

Open AccessArticle

Bivariate Discrete Odd Generalized Exponential Generator of Distributions for Count Data: Copula Technique, Mathematical Theory, and Applications

by

Laila A. Al-Essa

¹,

Mohamed S. Eliwa

^2,3

,

Hend S. Shahen

³,

Amal A. Khalil

³,

Hana N. Alqifari

² and

Mahmoud El-Morshedy

^4,5,*

¹

Department of Mathematical Sciences, College of Science, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia

²

Department of Statistics and Operation Research, College of Science, Qassim University, Buraydah 51482, Saudi Arabia

³

Department of Mathematics, Faculty of Science, Mansoura University, Mansoura 35516, Egypt

⁴

Department of Mathematics, College of Science and Humanities in Al-Kharj, Prince Sattam bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia

⁵

Department of Statistics and Computer Sciences, Faculty of Science, Mansoura University, Mansoura 35516, Egypt

^*

Author to whom correspondence should be addressed.

Axioms 2023, 12(6), 534; https://doi.org/10.3390/axioms12060534

Submission received: 6 April 2023 / Revised: 24 May 2023 / Accepted: 25 May 2023 / Published: 29 May 2023

(This article belongs to the Special Issue Mathematical Tools and Techniques Applicable to Probability Theory and Statistics II)

Download

Browse Figures

Versions Notes

Abstract

:

In this article, a new family of bivariate discrete distributions is proposed based on the copula concept, in the so-called bivariate discrete odd generalized exponential-G family. Some distributional properties, including the joint probability mass function, joint survival function, joint failure rate function, median correlation coefficient, and conditional expectation, are derived. After proposing the general class, one special model of the new bivariate family is discussed in detail. The maximum likelihood approach is utilized to estimate the family parameters. A detailed simulation study is carried out to examine the bias and mean square error of maximum likelihood estimators. Finally, the importance of the new bivariate family is explained by means of two distinctive real data sets in various fields.

Keywords:

copula technique; bivariate discrete distributions; failure analysis; conditional expectation; simulation; maximum likelihood estimators; statistics and numerical data

MSC:

60E05; 62E10

1. Introduction

Probability distributions are statistical functions that describe all possible values of a random variable and their likelihood, within a specific range. These are a result of the data generating process of an occurrence or its probability density function (PDF). Probability distributions are useful in modeling our environment to acquire estimates of the likelihood that a specific event will occur or to determine the variability of occurrence. There are many types of probability distributions, including binomial distribution, uniform distribution, Poisson distribution, normal distribution, lognormal distribution, beta distribution, exponential, and bivariate distribution. In our study, we highlight the exponential distribution (E) as a basic probability model for the proposed generator. The E model is a fundamental and widely recognized probability distribution in research. For lifetime phenomenon and reliability studies, the E distribution is rarely utilized due to the constant failure rate (FR); however, its memoryless property proves useful in queuing theory. Different extensions or generalizations of the E distribution have been introduced in the literature, with the generalized exponential (GE) garnering increased attention. The cumulative distribution function (CDF) of the GE distribution can be formulated as expression

F_{GE} (x; ξ, ν) = {(1 - e^{- ξ x})}^{ν}; x \geq 0,

(1)

where

ξ > 0

is the scale parameter and

ν > 0

is the shape parameter. Based on the technique of Alzaatreh et al. [1] for generating families of distributions, Tahir et al. [2] proposed the new odd generator of distributions called the odd generalized exponential-G (OGE-G), which exhibits flexible FR shapes such as increasing, decreasing, bathtub or upside-down bathtub. The CDF of the OGE-G family is given by the following formula

F_{OGE - G} (x; ξ, ν, Θ) = {(1 - e^{- ξ Ψ (x; Θ)})}^{ν}; x \geq 0,

(2)

where

Ψ (x; Θ) = G (x; Θ) / (1 - G (x; Θ))

,

G (x; Θ)

is a non-negative function of any model that depends on positive parameter vector

Θ > 0

. For more detail around the odd ratio,

Ψ (x; Θ)

(see Tahir et al., [2]). Due to the flexibility of Alzaatreh et al. [1] approach, several authors utilized this technique to generate new models for analyzing different types of data sets in various fields, for instance, see Silva et al. [3], Alizadeh et al. [4], Korkmaz et al. [5], Djibrila [6], Reyad et al. [7], Alizadeh et al. [8], among others.

The above probability models can be applied to discuss and analyze univariate data. In some applied sciences, bivariate data can be generated from different fields. Bivariate data refer to a data set that contains exactly two variables. In statistics, bivariate data represents data where each value of one variable is paired with a corresponding value of the other variable. Typically, it would be of interest to investigate the possible association between the two variables. A probability distribution involving two random variables is called a bivariate probability distribution. The bivariate (BV) distributions have been introduced, developed, and discussed by many authors and have wide applications in various fields like engineering, weather, sports, drought, among others. More detail is given in Balakrishnan and Lai [9]. The construction of the BV discrete and continuous distributions are mainly via (I) the copulas (II) compounding (III) marginals (IV) reduction and (V) conditioning, for instance, Johnson and Tenenbein [10], Quesada and Rodrguez [11], Fang et al. [12], Durante [13], Kundu and Gupta [14], Sarabia et al. [15], Roozegar and Jafari [16], Eliwa et al. [17], among others.

In several cases, the BV lifetimes need to be recorded on a discrete scale rather than on a continuous one. Due to the previous reasons, discretizing continuous BV distributions has received much attention in the statistical literature, for instance, Lee and Cha [18], Kundu and Nekoukhou [19], El-Morshedy et al. [20], Nekoukhou et al. [21], De Oliveira and Achcar [22], among others. Herein, we focus on the copulas approach. A copula function for bivariate discrete probability distributions is a function that connects the joint probability mass function of two discrete variables to their marginal probability mass functions. A copula function can be obtained by discretizing a continuous copula function, which is a function that links a multivariate distribution function to its one-dimensional marginal distribution functions. Alternatively, a copula function can be derived from a specific bivariate discrete distribution, such as the bivariate geometric, binomial, Poisson, or negative binomial distributions. A copula function for bivariate discrete probability distributions can be used to model the dependence structure between the variables and to simulate bivariate discrete data with given marginals and correlation; for more details, see, Kobus and Kurek, [23], Najarzadegan et al., [24], and Yamaguchi and Maruo, [25]. One of the important constructions of the bivariate distribution in survival and lifetime models is frailty models. So, both copulas and frailty are important in lifetime models (see Emura et al., [26]). The copula function can be formulated as follows

C (F_{1} (x_{1}), F_{2} (x_{2})) = G (x_{1}, x_{2}) for all x_{1}, x_{2} = 0, 1, \dots,

(3)

where

G (x_{1}, x_{2})

represents the CDF of a given BV continuous model on

{(0, \infty)}^{2}

, whereas

F_{1} (x_{1})

and

F_{2} (x_{2})

are the marginals. There are some known parametric copulas such as the following: the normal copulas, Farlie-Gumbel-Morgenstern (FGM) copulas, Marshall and Olkin (MO) copulas, Cuadras and Augé (CA) copulas (see Cuadras and Augé, [27]), among others. The CA copulas family was proposed as an extension of the BV distributions, defined as follows

C_{θ} (x_{1}, x_{2}) = min {x_{1}, x_{2}} max {x_{1}, x_{2}}^{1 - θ},

(4)

where

0 \leq θ \leq 1

and

0 \leq x_{1}, x_{2} \leq 1 .

The parameter

θ

measures the degree of dependence and plays the role of the parameter of upper-tail dependence of

C_{θ} (x_{1}, x_{2})

. The CA copulas family is utilized in a variety of modeling’s for exchangeable random vectors because it is symmetric in its marginals. The aim of this paper is to propose a new discrete BV family of distributions based on the CA copulas approach, in the so-called BV discrete odd generalized exponential-G (BDsOGE-G) family, where the marginals have the OGE-G families.

The article is organized as follows: In Section 2, the BDsOGE-G family of distributions is introduced. Some mathematical and statistical properties are derived in Section 3. In Section 4, the BDsOGE-Weibull distribution is discussed in detail. The parameters of the new bivariate family are estimated via the maximum likelihood technique in Section 5. In Section 6, a simulation study is performed to discuss the performance of the maximum likelihood estimators. Two distinctive data sets are analyzed to discuss the flexibility of the proposed family in Section 7. Finally, some concluding remarks and future work are listed in Section 8.

2. The BDOGE-G Class

Recall that, in Equation (2), the CDF and probability mass function (PMF) of the discrete OGE-G (DsOGE-G) family can be formulated as follows

F_{DsOGE - G} (x; p, ν, Θ) = {(1 - p^{Ψ (x + 1; Θ)})}^{ν}; x = 0, 1, 2, 3, \dots

(5)

and

f_{DsOGE - G} (x; p, ν, Θ) = {(1 - p^{Ψ (x + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x; Θ)})}^{ν}; x = 0, 1, 2, 3, \dots,

(6)

respectively, where

p = e^{- ξ},

0 < p < 1,

ν > 0

and

Θ

is the vector of parameters (

1 \times k

). Equation (6) can be writen as follows

f_{DsOGE - G} (x; p, ν, Θ) = \sum_{k = 1}^{ν} {(- 1)}^{k + 1} (\binom{ν}{k}) [p^{k Ψ (x; Θ)} - p^{k Ψ (x + 1; Θ)}] .

(7)

Utilizing the CA copulas approach with

γ = 1 - θ

, the joint PMF of the BDsOGE-G can be expressed as

f_{X_{1}, X_{2}} (x_{1}, x_{2}) = \{\begin{matrix} P_{1} (x_{1}, x_{2}) if x_{1} < x_{2} \\ P_{2} (x_{1}, x_{2}) if x_{2} < x_{1} \\ P_{3} (x) if x_{1} = x_{2} = x, \end{matrix}

(8)

where

\begin{matrix} P_{1} (x_{1}, x_{2}) & = & [{(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x_{1}; Θ)})}^{ν}] [{(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν γ} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν γ}] \\ = & f_{DsOGE - G} (x_{1}; p, ν, Θ) f_{DsOGE - G} (x_{2}; p, ν γ, Θ), \end{matrix}

\begin{matrix} P_{2} (x_{1}, x_{2}) & = & [{(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν γ} - {(1 - p^{Ψ (x_{1}; Θ)})}^{ν γ}] [{(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν}] \\ = & f_{DsOGE - G} (x_{1}; p, ν γ, Θ) f_{DsOGE - G} (x_{2}; p, ν, Θ), \end{matrix}

and

\begin{matrix} P_{3} (x) & = & {(1 - p^{Ψ (x + 1; Θ)})}^{ν γ} [{(1 - p^{Ψ (x + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x; Θ)})}^{ν}] - {(1 - p^{Ψ (x; Θ)})}^{ν} \\ \times [{(1 - p^{Ψ (x + 1; Θ)})}^{ν γ} - {(1 - p^{Ψ (x; Θ)})}^{ν γ}] \\ = & {(1 - p^{Ψ (x + 1; Θ)})}^{ν γ} f_{DsOGE - G} (x; p, ν, Θ) - {(1 - p^{Ψ (x; Θ)})}^{ν} f_{DsOGE - G} (x; p, ν γ, Θ) . \end{matrix}

The joint CDF of the BDsOGE-G family can be proposed as

F_{X_{1}, X_{2}} (x_{1}, x_{2}) = {(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν γ} {(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν γ} {(1 - p^{Ψ (z + 1; Θ)})}^{ν (1 - γ)}; x_{1}, x_{2} = 0, 1, 2, 3 \dots,

where

z = min {x_{1}, x_{2}} .

The marginal CDF of the BDsOGE-G family is

F_{DsOGE - G} (x; p, ν, Θ) = {(1 - p^{Ψ (x_{i} + 1; Θ)})}^{ν}; i = 1, 2 .

(9)

If

(X_{1}, X_{2}) \sim

BDsOGE-G

(p, ν, γ, Θ),

then

X_{1}

and

X_{2}

are positive quadrant dependent (PQD) where

Pr (X_{1} \leq x_{1}, X_{2} \leq x_{2}) \geq Pr (X_{1} \leq x_{1}) Pr (X_{2} \leq x_{2}), for all x_{1} and x_{2} .

The joint RF is given by

S_{X_{1}, X_{2}} (x_{1}, x_{2}) = \{\begin{matrix} S_{1} (x_{1}, x_{2}) if x_{1} < x_{2} \\ S_{2} (x_{1}, x_{2}) if x_{2} < x_{1} \\ S_{3} (x) if x_{1} = x_{2} = x, \end{matrix}

(10)

where

S_{1} (x_{1}, x_{2}) = 1 - {(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν} (1 - {(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν γ}),

S_{2} (x_{1}, x_{2}) = 1 - {(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν} (1 - {(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν γ}),

and

S_{3} (x) = 1 - 2 {(1 - p^{Ψ (x + 1; Θ)})}^{ν} + {(1 - p^{Ψ (x + 1; Θ)})}^{ν (1 + γ)} .

The joint HRF function can be expressed by using

h_{X_{1}, X_{2}} (x_{1}, x_{2}) = f_{X_{1}, X_{2}} (x_{1}, x_{2}) / S_{X_{1}, X_{2}}

(x_{1} - 1, x_{2} - 1)

whereas the joint reversed HRF function can be proposed via

v_{X_{1}, X_{2}} (x_{1}, x_{2}) = f_{X_{1}, X_{2}} (x_{1}, x_{2}) / F_{X_{1}, X_{2}} (x_{1}, x_{2})

. If the bivariate vector X have the BDsOGE-G model, then the distribution for each

W = max {X_{1}, X_{2}}

and

V = min {X_{1}, X_{2}}

can be written as follows

F_{W} (z) = F_{DsOGE - G} (z; p, ν_{1} + ν_{2} + ν_{3}, Θ)

(11)

and

F_{V} (z) = F_{DsOGE - G} (z; p, ν_{1} + ν_{3}, Θ) + F_{DsOGE - G} (z; p, ν_{2} + ν_{3}, Θ) - F_{W} (z),

(12)

respectively.

3. Distributional Properties

3.1. Median Correlation Coefficient

In statistics and probability theory, the median is a type of summary statistics used to give important information about a certain data point or population. It is the value separating the higher half from the lower half of an ordered data sample, a population, or a probability distribution. For a bivariate version, the median correlation coefficient, say

L_{X_{1}, X_{2}},

can be expressed as a form

L_{X_{1}, X_{2}} = 4 F_{X_{1}, X_{2}} (L_{X_{1}}, L_{X_{2}}) - 1,

where

L_{X_{1}}

and

L_{X_{2}}

denote the median of

X_{1}

and

X_{2},

respectively. If

X_{1} \sim

DsOGE-G

(Θ, ν_{1} + ν_{3})

and

X_{2} \sim

DsOGE-G

(Θ, ν_{2} + ν_{3})

,then

L_{X_{1}, X_{2}} = \{\begin{matrix} 4 F_{DsOGE - G} (L_{X_{2}}; p, ν_{2}, Θ) F_{DsOGE - G} (L_{X_{1}}; p, ν_{1} + ν_{3}, Θ) - 1 if x_{1} \leq x_{2} \\ 4 F_{DsOGE - G} (L_{X_{1}}; p, ν_{1}, Θ) F_{DsOGE - G} (L_{X_{2}}; p, ν_{2} + ν_{3}, Θ) - 1 if x_{1} > x_{2}, \end{matrix}

(13)

where

L_{X_{i}}; i = 1, 2

is the quantile function for the marginals.

3.2. The Conditional CDF of $X_{1}$ Given $X_{2} = x_{2}$ ( $X_{2} \leq x_{2}$ )

A conditional distribution is a probability distribution for a sub-population. In other words, it shows the probability that a randomly selected item in a sub-population has a characteristic of interest. Conditional distributions have many applications in statistics and machine learning. For example, they can be used in regressing structured response variables. In addition, conditional distributions can be embedded into a Hilbert space, which is potentially useful in applications where conditional distributions are the key quantities of interest. Assuming X follows the BDsOGE-G family, the conditional CDF of

(X_{1} ∣ X_{2} = x_{2}),

say

F_{X_{1} ∣ X_{2} = x_{2}} (x_{1}),

is given by

F_{X_{1} ∣ X_{2} = x_{2}} (x_{1} ∣ x_{2}) = \{\begin{matrix} F_{1} (x_{1} ∣ x_{2}) if 0 \leq x_{1} < x_{2} \\ F_{2} (x_{1} ∣ x_{2}) if 0 \leq x_{2} < x_{1} \\ F_{3} (x_{1} ∣ x_{2}) if 0 \leq x_{1} = x_{2} = x, \end{matrix}

(14)

where

F_{1} (x_{1} ∣ x_{2}) = \frac{{(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν} [{(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν γ} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν γ}]}{[{(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν}]},

F_{2} (x_{1} ∣ x_{2}) = {(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν γ},

and

F_{3} (x_{1} ∣ x_{2}) = {(1 - p^{Ψ (x + 1; Θ)})}^{ν γ},

where

F_{X_{1} ∣ X_{2} = x_{2}} (x_{1} ∣ x_{2}) = \frac{P (X_{1} \leq x_{1}, X_{2} = x_{2})}{P (X_{2} = x_{2})} .

Similarly, the conditional CDF of

X_{1} ∣ X_{2} \leq x_{2},

say

F_{X_{1} ∣ X_{2} \leq x_{2}} (x_{1} ∣ x_{2}),

is given by

F_{X_{1} ∣ X_{2} \leq x_{2}} (x_{1} ∣ x_{2}) = \{\begin{matrix} {(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν} {(1 - p^{Ψ (x_{2} + 1; Θ)})}^{- ν (1 - γ)} if 0 \leq x_{1} < x_{2}, \\ {(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν γ} if 0 \leq x_{2} < x_{1}, \\ {(1 - p^{Ψ (x + 1; Θ)})}^{ν γ} if 0 \leq x_{1} = x_{2} = x . \end{matrix}

(15)

3.3. The Conditional Expectation of $X_{1}$ Given $X_{2} = x_{2}$

In probability theory, the conditional expectation of a random variable is its expected value given that a certain set of “conditions” is known to occur. The conditional expectation is the expected value of a random variable, computed with respect to a conditional probability distribution. To derive the conditional expectation of

X_{1}

, given

X_{2} = x_{2}

, the PMF of

X_{1}

given

X_{2} = x_{2}

should be calculated first. Assuming X follow the BDsOGE-G family, the PMF of

X_{1}

given

X_{2} = x_{2}

can be expressed as

f_{X_{1} ∣ X_{2} = x_{2}} (x_{1} ∣ x_{2}) = \{\begin{matrix} f_{1} (x_{1} ∣ x_{2}) if 0 \leq x_{1} < x_{2} \\ f_{2} (x_{1} ∣ x_{2}) if 0 \leq x_{2} < x_{1} \\ f_{3} (x_{1} ∣ x_{2}) if 0 \leq x_{1} = x_{2} = x, \end{matrix}

(16)

where

f_{1} (x_{1} ∣ x_{2}) = \frac{f_{DsOGE - G} (x_{1}; p, ν, Θ) f_{DsOGE - G} (x_{2}; p, ν γ, Θ)}{f_{DsOGE - G} (x_{2}; p, ν, Θ)},

f_{2} (x_{1} ∣ x_{2}) = f_{DsOGE - G} (x_{1}; p, ν γ, Θ),

and

f_{3} (x_{1} ∣ x_{2}) = \frac{{(1 - p^{Ψ (x + 1; Θ)})}^{ν γ} f_{DsOGE - G} (x; p, ν, Θ) - {(1 - p^{Ψ (x; Θ)})}^{ν} f_{DsOGE - G} (x; p, ν γ, Θ)}{f_{DsOGE - G} (x; p, ν, Θ)} .

Thus, the conditional expectation of

X_{1} ∣ X_{2} = x_{2},

say

E (X_{1} ∣ X_{2} = x_{2}),

is given by

\begin{matrix} E (X_{1} & ∣ & X_{2} = x_{2}) = \overset{ν γ}{\sum_{i = 0}} \overset{\infty}{\sum_{x_{1} = x_{2} + 1}} {(- 1)}^{i} x_{1} (\binom{ν γ}{i}) [p^{i Ψ (x_{1} + 1; Θ)} - p^{i Ψ (x_{1}; Θ)}] + x_{2} {(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν γ} \\ + \frac{\sum_{i = 0}^{ν} \sum_{x_{1} = 0}^{x_{2} - 1} {(- 1)}^{i} x_{1} (\binom{ν}{i}) [p^{i Ψ (x_{1} + 1; Θ)} - p^{i Ψ (x_{1}; Θ)}] [{(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν γ} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν γ}]}{[{(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν}]} \\ - \frac{x_{2} {(1 - p^{Ψ (x_{2}; Θ)})}^{ν} [{(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν γ} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν γ}]}{[{(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν}]}, \end{matrix}

where E

(X_{1} ∣ X_{2} = x_{2}) = \sum_{x_{1} = 0}^{\infty} x_{1} f_{X_{1} ∣ X_{2} = x_{2}} (x_{1} ∣ x_{2})

. The accurate estimation of conditional expectations is an important problem arising in different branches of science and engineering as well as finance, economics and various business applications (see Casella and Berger, [28]; Pfeiffer, [29]; and Emura et al., [26]).

4. The BDsOGE-Weibull (BDsOGEW) Distribution

Considering the CDF of Weibull distribution with one positive shape parameter a, the joint PMF of the BDsOGEW model is

f_{X_{1}, X_{2}} (x_{1}, x_{2}) = \{\begin{matrix} P_{1} (x_{1}, x_{2}) if x_{1} < x_{2} \\ P_{2} (x_{1}, x_{2}) if x_{2} < x_{1} \\ P_{3} (x) if x_{1} = x_{2} = x, \end{matrix}

(17)

where

P_{1} (x_{1}, x_{2}) = [{(1 - p^{(e^{{(x_{1} + 1)}^{a}} - 1)})}^{ν} - {(1 - p^{(e^{x_{1}^{a}} - 1)})}^{ν}] [{(1 - p^{(e^{{(x_{2} + 1)}^{a}} - 1)})}^{ν γ} - {(1 - p^{(e^{x_{2}^{a}} - 1)})}^{ν γ}],

P_{2} (x_{1}, x_{2}) = [{(1 - p^{(e^{{(x_{1} + 1)}^{a}} - 1)})}^{ν γ} - {(1 - p^{(e^{x_{1}^{a}} - 1)})}^{ν γ}] [{(1 - p^{(e^{{(x_{2} + 1)}^{a}} - 1)})}^{ν} - {(1 - p^{(e^{x_{2}^{a}} - 1)})}^{ν}]

and

\begin{matrix} P_{3} (x) & = & {(1 - p^{(e^{{(x + 1)}^{a}} - 1)})}^{ν γ} [{(1 - p^{(e^{{(x + 1)}^{a}} - 1)})}^{ν} - {(1 - p^{(e^{x^{a}} - 1)})}^{ν}] - {(1 - p^{(e^{x^{a}} - 1)})}^{ν} \times \\ [{(1 - p^{(e^{{(x + 1)}^{a}} - 1)})}^{ν γ} - {(1 - p^{(e^{x^{a}} - 1)})}^{ν γ}] . \end{matrix}

Figure 1 shows the joint PMF of the BDsOGEW model for various values of the parameters.

It is worth noting that the joint probability mass function (PMF) can exhibit either a unimodal or a decreasing surface shape. This flexibility allows it to be used for modeling and analyzing both asymmetric and symmetric data. The corresponding joint CDF and joint RF to Equation (17) can be formulated as

F_{X_{1}, X_{2}} (x_{1}, x_{2}) = {(1 - p^{(e^{x_{1}^{a}} - 1)})}^{ν γ} {(1 - p^{(e^{x_{2}^{a}} - 1)})}^{ν γ} {(1 - p^{(e^{z^{a}} - 1)})}^{ν (1 - γ)}

and

\begin{matrix} S_{X_{1}, X_{2}} (x_{1}, x_{2}) & = & 1 - {(1 - p^{(e^{x_{1}^{a}} - 1)})}^{ν} - {(1 - p^{(e^{x_{2}^{a}} - 1)})}^{ν} + {(1 - p^{(e^{x_{1}^{a}} - 1)})}^{ν γ} \\ \times {(1 - p^{(e^{x_{2}^{a}} - 1)})}^{ν γ} {(1 - p^{(e^{z^{a}} - 1)})}^{ν (1 - γ)}, \end{matrix}

respectively. Figure 2 and Figure 3 illustrate the joint RF and joint HRF of the proposed bivariate model based on different values of its parameters.

It is noted that the joint HRF can be either a unimodal, constant, or decreasing surface. Thus, the joint HRF of this model can be used to analyze several types of data in various fields. Figure 4 lists different plots of the conditional PMF of

X_{1,}

given that

x_{2} = 5,

15,

25 under various specific schemes Figure 4a:

a = 0.4,

p = 0.8,

γ = 1.3,

v = 10

and Figure 4b:

a = 0.4,

p = 0.8,

γ = 0.3,

v = 1.8

. Whereas the attitude of the conditional PMF of

X_{2,}

given that

x_{1} = 5,

15,

25 can be proposed through schemes Figure 5a:

a = 0.4,

p = 0.8,

γ = 1.3,

v = 10

and Figure 5b:

a = 0.4,

p = 0.8,

γ = 0.3,

v = 1.8

in Figure 5.

5. Point and Interval Estimations

5.1. Maximum Likelihood Estimation (MLE)

In statistics, MLE is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed statistical model, the observed data is most probable. MLE has applications in many fields such as physics, engineering, economics, finance, and biology. In this section, the approach of MLE is used to estimate the unknown parameters

p,

ν,

γ

and

Θ

of the BDsOGE-G family. Suppose that a sample size n, of the form

\{(x_{11}, x_{21}), (x_{12}, x_{22}), \dots, (x_{1 n}, x_{2 n})\}

from BDsOGE-G family. Under the following notations:

I_{1} = {x_{1 j} < x_{2 j}},

I_{2} = {x_{2 j} < x_{1 j}},

I_{0} = {x_{1 j} = x_{2 j} = x_{j}},

I = I_{1} \cup I_{2} \cup I_{0},

|I_{1}| = n_{1},

|I_{2}| = n_{2},

|I_{0}| = n_{0}

and

n = n_{1} + n_{2} + n_{0},

the likelihood function can be expressed as

L (p, ν, γ, Θ) = \prod_{j \in I_{1}} P_{1} (x_{1 j}, x_{2 j}) \prod_{j \in I_{2}} P_{2} (x_{1 j}, x_{2 j}) \prod_{j \in I_{3}} P_{3} (x_{j}) .

(18)

The log-likelihood function becomes

\begin{matrix} l (p, ν, γ, Θ) & = & \sum_{j \in I_{1}} ln (g_{1} (x_{1 j}; ν)) + \sum_{j \in I_{1}} ln (g_{1} (x_{2 j}; ν γ)) \\ + \sum_{j \in I_{2}} ln (g_{1} (x_{1 j}; ν γ)) + \sum_{j \in I_{2}} ln (g_{1} (x_{2 j}; ν)) \\ + \sum_{j \in I_{0}} ln ({(1 - p^{Ψ (x_{j} + 1; Θ)})}^{ν γ} g_{1} (x_{j}; ν) - {(1 - p^{Ψ (x_{j}; Θ)})}^{ν} g_{1} (x_{j}; ν γ)), \end{matrix}

(19)

where

g_{1} (x; ν) = {(1 - p^{Ψ (x + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x; Θ)})}^{ν} .

The MLEs of the parameters

p,

ν,

γ

and

Θ

can be obtained by computing the first partial derivatives of Equation (19) with respect to

p,

ν,

γ

and

Θ

, respectively, and setting the resulting equations equal to zero. The first partial derivatives are

\begin{matrix} \frac{\partial l (p, ν, γ, Θ)}{\partial p} & = & \sum_{j \in I_{1}} \frac{g_{2} (x_{1 j}, ν) - g_{2} (x_{1 j} + 1, ν)}{g_{1} (x_{1 j}; ν)} + \sum_{j \in I_{1}} \frac{g_{2} (x_{2 j}, ν γ) - g_{2} (x_{2 j} + 1, ν γ)}{g_{1} (x_{2 j}; ν γ)} \\ + \sum_{j \in I_{2}} \frac{g_{2} (x_{1 j}, ν γ) - g_{2} (x_{1 j} + 1, ν γ)}{g_{1} (x_{1 j}; ν γ)} + \sum_{j \in I_{2}} \frac{g_{2} (x_{2 j}, ν) - g_{2} (x_{2 j} + 1, ν)}{g_{1} (x_{2 j}; ν)} \\ + \sum_{j \in I_{0}} \frac{{(1 - p^{Ψ (x_{j} + 1; Θ)})}^{ν γ} (g_{2} (x_{j}, ν) - g_{2} (x_{j} + 1, ν)) - g_{2} (x_{j} + 1, ν γ) g_{1} (x_{j}; ν)}{{(1 - p^{Ψ (x_{j} + 1; Θ)})}^{ν γ} g_{1} (x_{j}; ν) - {(1 - p^{Ψ (x_{j}; Θ)})}^{ν} g_{1} (x_{j}; ν γ)} \\ - \sum_{j \in I_{0}} \frac{{(1 - p^{Ψ (x_{j}; Θ)})}^{ν} (g_{2} (x_{j}, ν γ) - g_{2} (x_{j} + 1, ν γ)) - g_{2} (x_{j}, ν) g_{1} (x_{j}; ν γ)}{{(1 - p^{Ψ (x_{j} + 1; Θ)})}^{ν γ} g_{1} (x_{j}; ν) - {(1 - p^{Ψ (x_{j}; Θ)})}^{ν} g_{1} (x_{j}; ν γ)}, \end{matrix}

\begin{matrix} \frac{\partial l (p, ν, γ, Θ)}{\partial ν} & = & \sum_{j \in I_{1}} \frac{g_{3} (x_{1 j}, ν) - g_{3} (x_{1 j} + 1, ν)}{g_{1} (x_{1 j}; ν)} + \sum_{j \in I_{1}} \frac{γ (g_{3} (x_{2 j}, ν γ) - g_{3} (x_{2 j} + 1, ν γ))}{g_{1} (x_{2 j}; ν γ)} \\ + \sum_{j \in I_{2}} \frac{γ (g_{3} (x_{1 j}, ν γ) - g_{3} (x_{1 j} + 1, ν γ))}{g_{1} (x_{1 j}; ν γ)} + \sum_{j \in I_{2}} \frac{g_{3} (x_{2 j}, ν) - g_{3} (x_{2 j} + 1, ν)}{g_{1} (x_{2 j}; ν)} \\ + \sum_{j \in I_{0}} \frac{{(1 - p^{Ψ (x_{j} + 1; Θ)})}^{ν γ} (g_{3} (x_{j}, ν) - g_{3} (x_{j} + 1, ν)) - γ g_{3} (x_{j} + 1, ν γ) g_{1} (x_{j}; ν)}{{(1 - p^{Ψ (x_{j} + 1; Θ)})}^{ν γ} g_{1} (x_{j}; ν) - {(1 - p^{Ψ (x_{j}; Θ)})}^{ν} g_{1} (x_{j}; ν γ)} \\ - \sum_{j \in I_{0}} \frac{γ {(1 - p^{Ψ (x_{j}; Θ)})}^{ν} (g_{3} (x_{j}, ν γ) - g_{3} (x_{j} + 1, ν γ)) - g_{3} (x_{j}, ν) g_{1} (x_{j}; ν γ)}{{(1 - p^{Ψ (x_{j} + 1; Θ)})}^{ν γ} g_{1} (x_{j}; ν) - {(1 - p^{Ψ (x_{j}; Θ)})}^{ν} g_{1} (x_{j}; ν γ)}, \end{matrix}

\begin{matrix} \frac{\partial l (p, ν, γ, Θ)}{\partial γ} & = & \sum_{j \in I_{1}} \frac{ν (g_{3} (x_{2 j}, ν γ) - g_{3} (x_{2 j} + 1, ν γ))}{g_{1} (x_{2 j}; ν γ)} + \sum_{j \in I_{2}} \frac{ν (g_{3} (x_{1 j}, ν γ) - g_{3} (x_{1 j} + 1, ν γ))}{g_{1} (x_{1 j}; ν γ)} \\ + \sum_{j \in I_{0}} \frac{- ν g_{3} (x_{j} + 1, ν γ) g_{1} (x_{j}; ν) - ν {(1 - p^{Ψ (x_{j}; Θ)})}^{ν} (g_{3} (x_{j}, ν γ) - g_{3} (x_{j} + 1, ν γ))}{{(1 - p^{Ψ (x_{j} + 1; Θ)})}^{ν γ} g_{1} (x_{j}; ν) - {(1 - p^{Ψ (x_{j}; Θ)})}^{ν} g_{1} (x_{j}; ν γ)} \end{matrix}

and

\begin{matrix} \frac{\partial l (p, ν, γ, Θ)}{\partial Θ_{k}} & = & \sum_{j \in I_{1}} \frac{g_{4} (x_{1 j}, ν) - g_{4} (x_{1 j} + 1, ν)}{g_{1} (x_{1 j}; ν)} + \sum_{j \in I_{1}} \frac{g_{4} (x_{2 j}, ν γ) - g_{4} (x_{2 j} + 1, ν γ)}{g_{1} (x_{2 j}; ν γ)} \\ + \sum_{j \in I_{2}} \frac{g_{4} (x_{1 j}, ν γ) - g_{4} (x_{1 j} + 1, ν γ)}{g_{1} (x_{1 j}; ν γ)} + \sum_{j \in I_{2}} \frac{g_{4} (x_{2 j}, ν) - g_{4} (x_{2 j} + 1, ν)}{g_{1} (x_{2 j}; ν)} \\ + \sum_{j \in I_{0}} \frac{{(1 - p^{Ψ (x_{j} + 1; Θ)})}^{ν γ} (g_{4} (x_{j}, ν) - g_{4} (x_{j} + 1, ν)) - g_{4} (x_{j} + 1, ν γ) g_{1} (x_{j}; ν)}{{(1 - p^{Ψ (x_{j} + 1; Θ)})}^{ν γ} g_{1} (x_{j}; ν) - {(1 - p^{Ψ (x_{j}; Θ)})}^{ν} g_{1} (x_{j}; ν γ)} \\ - \sum_{j \in I_{0}} \frac{{(1 - p^{Ψ (x_{j}; Θ)})}^{ν} (g_{4} (x_{j}, ν γ) - g_{4} (x_{j} + 1, ν γ)) - g_{4} (x_{j}, ν) g_{1} (x_{j}; ν γ)}{{(1 - p^{Ψ (x_{j}; Θ)})}^{ν γ} g_{1} (x_{j}; ν) - {(1 - p^{Ψ (x_{j}; Θ)})}^{ν} g_{1} (x_{j}; ν γ)} . \end{matrix}

where

\begin{matrix} g_{2} (x; ν) & = & ν p^{Ψ (x; Θ) - 1} {(1 - p^{Ψ (x; Θ)})}^{ν - 1}, \\ g_{3} (x; ν) & = & ln (1 - p^{Ψ (x; Θ)}) {(1 - p^{Ψ (x; Θ)})}^{ν}, \\ g_{4} (x; ν) & = & ν \overset{\cdot}{H} (x; Θ) ln p p^{Ψ (x; Θ)} {(1 - p^{Ψ (x; Θ)})}^{ν - 1}, \end{matrix}

where

\overset{\cdot}{H} (x; Θ) = \frac{\partial}{\partial Θ_{k}} Ψ (x; Θ)

and

Θ_{k}

is a vector of the parameters of the baseline model (

k = 1, 2, 3, \dots

). The MLEs of the parameters

p,

ν,

γ

and

Θ

can be obtained by solving the above system of non-linear equations. The solution of these equations is not easy to solve. Thus, the “NMaximize” function in the Mathematica software was utilized. The “NMaximize” is a built-in function that attempts to find a global maximum of a function or an expression numerically see Appendix A.

5.2. Asymptotic Confidence Intervals

Asymptotic confidence intervals (ACIs) are confidence intervals (CIs) that are based on the normal approximation of the sampling distribution of a statistic. They are also called large-sample CIs because they are only valid when the sample size is large enough. The asymptotic normal distribution of the MLE is the most widely utilized approach for establishing confidence bounds for the model parameters. With respect to the Fisher information matrix (FIM), denoted as

I (Y)

, which consists of the negative second derivatives of the

l (p, ν, γ, Θ_{k})

evaluated at

\hat{Y} = (\hat{p}, \hat{ν}, \hat{γ}, \hat{Θ_{k}})

, the asymptotic variance-covariance matrix (AVCM) of the MLE of the parameters, assuming that the AVCM of the parameter vector can be formulated as

I (\hat{Y}) = - E [\begin{matrix} \frac{\partial^{2} l (Y)}{\partial p^{2}} \\ \frac{\partial^{2} l (Y)}{\partial ν \partial p} & \frac{\partial^{2} l (Y)}{\partial ν^{2}} \\ \frac{\partial^{2} l (Y)}{\partial γ \partial p} & \frac{\partial^{2} l (Y)}{\partial γ \partial ν} & \frac{\partial^{2} l (Y)}{\partial γ^{2}} \\ \frac{\partial^{2} l (Y)}{\partial Θ_{k} \partial p} & \frac{\partial^{2} l (Y)}{\partial Θ_{k} \partial ν} & \frac{\partial^{2} l (Y)}{\partial Θ_{k} \partial γ} & \frac{\partial^{2} l (Y)}{\partial Θ_{k}^{2}} \end{matrix}],

(20)

where

V a r (\hat{Y}) = I^{- 1} (\hat{Y})

. Based on the asymptotic normality of the MLE, a

100 (1 - α) %

confidence interval for parameter

Y

can be constructed as follows:

\hat{p} \pm Z_{α / 2} \sqrt{V a r (p)}

,

\hat{ν} \pm Z_{α / 2} \sqrt{V a r (ν)}

,

\hat{γ} \pm Z_{α / 2} \sqrt{V a r (γ)}

, and

\hat{Θ_{k}} \pm Z_{α / 2} \sqrt{V a r (Θ_{k})}

. The second derivatives of the likelihood function can be calculated with Maple software.

6. Simulation: Estimators Performance

Simulation for bivariate discrete data in statistics is a technique to generate random data that follows a joint probability distribution of two discrete variables. There are different methods to simulate bivariate discrete data, depending on the type and structure of the distribution. One common method is to use copulas, which are functions that link univariate marginal distributions to a multivariate distribution. Copulas can capture the dependence structure between the variables and allow for flexible modeling of different types of correlations. Another method is to use discretization of continuous bivariate distributions. This involves rounding or truncating the continuous values to discrete values, and adjusting the probabilities accordingly. A third method is to use specific bivariate discrete distributions. Depending on the type of estimator, different metrics may be used to measure the performance of the estimator. These measures may include bias, mean squared error (MSE), confidence intervals (CIs), and coverage probability (CP) of the 95% CIs. In this segment, the second approach has been used. The performance of the MLE is tested under different schemes for the BDsOGEW parameter as follows:

Scheme I: (∀ $p$ = $0.1, ν$ = $0.3, γ$ = $0.5,$ $a$ = $0.2$ | $n_{1}$ = 20, $n_{2}$ = 50, $n_{3}$ = 150, $n_{4}$ = 300, $n_{5}$ = 500, $n_{6}$ = 700);
Scheme II: (∀ $p$ = $0.2, ν$ = $0.5, γ$ = $0.3,$ $a$ = $0.5$ | $n_{1}$ = 20, $n_{2}$ = 50, $n_{3}$ = 150, $n_{4}$ = 300, $n_{5}$ = 500, $n_{6}$ = 700);
Scheme III: (∀ $p$ = $0.5, ν$ = $0.3, γ$ = $0.7,$ $a$ = $0.9$ | $n_{1}$ = 20, $n_{2}$ = 50, $n_{3}$ = 150, $n_{4}$ = 300, $n_{5}$ = 500, $n_{6}$ = 700)

The numerical assessments are performed depending on the bias, MSE, CP, and

95 %

CIs “lower bound (LB) and upper bound (UB)” using software R package. In general, an estimator is considered good if it has low bias and low variance. The empirical results are reported in Table 1, Table 2 and Table 3.

According to the simulation results/performance as

n \to + \infty

, the bias and MSE decrease; and consequently, an unbiased estimator was achieved for large samples under consistency condition. Thus, the maximum likelihood approach can be used effectively to estimate the BDsOGEW parameters.

7. Data Analysis

In this section, we clear the experimental importance of the BDsOGE-G family by using three applications. In each data set, we compare the fits of the proposed BDsOGEW distribution with well-known bivariate models. The tested distributions are compared using some criteria namely, the negative maximized log-likelihood (

- l

), Akaike information criterion (AIC), and Hannan-Quinn information criterion (HQIC). The AIC is an estimator of prediction error and thereby relative quality of statistical models for a given set of data. It estimates the quality of each model, relative to each of the other models. AIC provides a means for model selection. The HQIC is a criterion for model selection. It is an alternative to AIC and Bayesian information criterion.

7.1. Data Set I: Nasal Drainage Severity Score

This data set represents the efficacy of steam inhalation in the treatment of common cold symptoms “

0 =

no symptoms;

1 =

mild symptoms;

2 =

moderate symptoms;

3 =

severe symptoms” (see Davis, [30]). Figure 6 shows scatter and violin plots of the Nasal drainage severity score data.

Before analyzing the bivariate data, we first fit the marginals

X_{1}

and

X_{2}

separately and

min (X_{1}, X_{2})

. The MLEs of the parameters

ν, p

and

α

of the corresponding univariate distribution for

X_{1}

,

X_{2}

and

min (X_{1}, X_{2})

are (

1.915, 0.809, 0.911

), (

5.073, 0.516, 0.601

) and (

1.591, 0.769, 0.923

), respectively. The

- l

values are

78.855

,

78.031

and

76.781,

respectively. Moreover, the p-values ranged from

0.658

to

0.759

. Now, we compare the BDsOGEW distribution with some competitive distributions like bivariate discrete exponential (BDsE), bivariate Poisson with four parameters (BPo-4P), independent bivariate Poisson (IBPo), bivariate discrete inverse exponential (BDsIE), bivariate discrete inverse Rayleigh (BDsIR), and bivariate discrete inverse Weibull (BDsIW) distributions. Table 4 lists the MLEs and some goodness-of-fit measures (GOFM).

It is observed that the BDsOGEW model is the best among all tested models because it has the smallest values among

- l

, AIC, and HQIC when compared to the other competitive models. The

95 %

CIs for the BDsOGEW parameters

γ, v, p, a

can be listed as

[0.378, 0.770]

,

[2.165, 2.748]

,

[0.624, 0.820]

, and

[0.594, 0.965]

, respectively. Figure 7 shows the estimated joint PMF for the BDsOGEW distribution and the other competitive models, which support the results of Table 4.

Figure 8, Figure 9 and Figure 10 show the profile l for each parameter as well as contour plots. It has been found that the maximum likelihood estimates are unique.

7.2. Data Set II: Football Score

The data set consists of a football match score in talian football matches from 1996 to 2011, between “ACF Fiorentina” (

X_{1}

) and “Juventus” (

X_{2})

. The data source is “http://www.worldfootball.net/competition/ita-serie-a/ (accessed on 16 March 2023)”. Figure 11 shows scatter and violin plots of the football score data.

We first fit the marginals

X_{1}

and

X_{2}

separately and

min (X_{1}, X_{2})

. The MLEs of the parameters

ν, p

and

α

of the corresponding univariate distribution for

X_{1}

,

X_{2}

and

min (X_{1}, X_{2})

are (

29.269, 0.165, 0.357

), (

5.083, 0.534, 0.631

) and (

28.051, 0.152, 0.384

), respectively. The

- l

values are

65.898

,

67.856

and

64.723,

respectively. Moreover, the p-values ranged from

0.786

to

0.801

. Now, we compare the BDsOGEW distribution with some competitive distributions like BDsE, bivariate discrete Rayleigh (BDsR), bivariate discrete Weibull (BDsW), bivariate Poisson with minimum operator (BPo

_{min}

), bivariate Poisson with three parameters (BPo-3P), inverse bivariate Poisson (IBPo), BDsIE, and BDsIR distributions. Table 5 lists the MLEs and some GOFM.

It is noted that the BDsOGEW distribution is the best among all tested models. The

95 %

CIs for the BDsOGEW parameters

γ, v, p, a

can be reported as

[0.620, 0.897]

,

[23.529, 24.084]

,

[0.0611, 0.3648]

, and

[0.253, 0.544]

, respectively. Figure 12 shows the estimated joint PMF for the BDsOGEW distribution and the other competitive distributions, which prove the results of Table 5.

Figure 13, Figure 14 and Figure 15 prove that the maximum likelihood estimates are unique.

8. Concluding Remarks and Future Work

In this paper, a flexible bivariate odd discrete generator has been introduced based on the copula concept. Some of its statistical properties have been investigated. After reporting the general class, one special model of the new bivariate family has been studied in-detail. It was found that the new joint probability mass function can be utilized to model asymmetric as well as symmetric data surfaces. Moreover, the hazard rate function of the newly created distribution can be used to discuss various types of failures, including increasing-, decreasing-, bathtub-, and unimodal-shaped surface. The bivariate model parameters have been estimated utilizing the maximum likelihood technique. A simulation has been performed to test the performance of the maximum likelihood estimators based on different sample sizes, and it was found that the maximum likelihood approach could be used to discuss the real data herein. Finally, two distinctive real data sets, “Nasal drainage severity score” and “Football score”, have been analyzed, and it was found that the proposed bivariate family has worked quite well in modeling the data. As a future work, the bivariate quantile residual life and multivariate extension of the proposed model with its applications will be discussed.

Author Contributions

Conceptualization, M.S.E. and M.E.-M.; Methodology, H.S.S. and H.N.A.; Software, M.S.E., L.A.A.-E. and M.E.-M.; Validation, A.A.K.; Formal analysis, L.A.A.-E. and M.S.E.; Investigation, H.S.S., A.A.K. and M.E.-M.; Resources, L.A.A.-E., H.S.S. and A.A.K.; Data curation, M.S.E. and H.N.A.; Writing—review & editing, L.A.A.-E., M.S.E. and H.N.A.; Visualization, H.S.S. and A.A.K.; Supervision, M.S.E. and M.E.-M. All authors have read and agreed to the published version of the manuscript.

Funding

Princess Nourah bint Abdulrahman University Researchers Supporting Project and Prince Sattam bin Abdulaziz Universities under project numbers (PNURSP2023R443) and (PSAU/2023/R/1444), respectively.

Data Availability Statement

The data sets are available in the paper.

Acknowledgments

Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2023R443), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. This study is supported via funding from Prince Sattam bin Abdulaziz University, project number (PSAU/2023/R/1444).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

The PMF of the BDsOGE-G

Utilizing Equations (3)–(6), the join PMF of the BDsOGE-G can be expressed as

\begin{matrix} f_{X_{1}, X_{2}} (x_{1}, x_{2}) & = & C (F_{X_{1}} (x_{1} + 1), F_{X_{2}} (x_{2} + 1)) - C (F_{X_{1}} (x_{1} + 1), F_{X_{2}} (x_{2})) \\ - C (F_{X_{1}} (x_{1}), F_{X_{2}} (x_{2} + 1)) + C (F_{X_{1}} (x_{1}), F_{X_{2}} (x_{2})) \\ = & \{\begin{matrix} P_{1} (x_{1}, x_{2}) if x_{1} < x_{2} \\ P_{2} (x_{1}, x_{2}) if x_{2} < x_{1} \\ P_{3} (x) if x_{1} = x_{2} = x, \end{matrix} \end{matrix}

where

\begin{matrix} P_{1} (x_{1}, x_{2}) & = & {(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν} {(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν γ} - {(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν} {(1 - p^{Ψ (x_{2}; Θ)})}^{ν γ} \\ - {(1 - p^{Ψ (x_{1}; Θ)})}^{ν} {(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν γ} + {(1 - p^{Ψ (x_{1}; Θ)})}^{ν} {(1 - p^{Ψ (x_{2}; Θ)})}^{ν γ} \\ = & {(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν} [{(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν γ} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν γ}] - \\ {(1 - p^{Ψ (x_{1}; Θ)})}^{ν} [{(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν γ} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν γ}] \\ = & [{(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x_{1}; Θ)})}^{ν}] [{(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν γ} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν γ}] \\ = & f_{DsOGE - G} (x_{1}; p, ν, Θ) f_{DsOGE - G} (x_{2}; p, ν, γ, Θ), \end{matrix}

\begin{matrix} P_{2} (x_{1}, x_{2}) & = & {(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν} {(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν γ} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν} {(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν γ} \\ - {(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν} {(1 - p^{Ψ (x_{2}; Θ)})}^{ν} + {(1 - p^{Ψ (x_{1}; Θ)})}^{ν γ} {(1 - p^{Ψ (x_{2}; Θ)})}^{ν} \\ = & {(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν γ} [{(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν}] - \\ {(1 - p^{Ψ (x_{1}; Θ)})}^{ν γ} [{(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν}] \\ = & [{(1 - p^{Ψ (x_{1} + 1; Θ)})}^{ν γ} - {(1 - p^{Ψ (x_{1}; Θ)})}^{ν γ}] [{(1 - p^{Ψ (x_{2} + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x_{2}; Θ)})}^{ν}] \\ = & f_{DsOGE - G} (x_{1}; p, ν, γ, Θ) f_{DsOGE - G} (x_{2}; p, ν, Θ) \end{matrix}

and

\begin{matrix} P_{3} (x) & = & {(1 - p^{Ψ (x + 1; Θ)})}^{ν γ} {(1 - p^{Ψ (x + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x + 1; Θ)})}^{ν γ} {(1 - p^{Ψ (x; Θ)})}^{ν} \\ - {(1 - p^{Ψ (x; Θ)})}^{ν} {(1 - p^{Ψ (x + 1; Θ)})}^{ν γ} + {(1 - p^{Ψ (x; Θ)})}^{ν} {(1 - p^{Ψ (x; Θ)})}^{ν γ} \\ = & {(1 - p^{Ψ (x + 1; Θ)})}^{ν γ} [{(1 - p^{Ψ (x + 1; Θ)})}^{ν} - {(1 - p^{Ψ (x; Θ)})}^{ν}] - {(1 - p^{Ψ (x; Θ)})}^{ν} \\ \times [{(1 - p^{Ψ (x + 1; Θ)})}^{ν γ} - {(1 - p^{Ψ (x; Θ)})}^{ν γ}] \\ = & {(1 - p^{Ψ (x + 1; Θ)})}^{ν γ} f_{DsOGE - G} (x; p, ν, Θ) - {(1 - p^{Ψ (x; Θ)})}^{ν} f_{DsOGE - G} (x; p, ν, γ, Θ) . \end{matrix}

Estimation code for real data

L o k [a_, p_v_, γ_] : = \sum_{i = 1}^{n 1}

Log

[{(1 - p^{(- 1 + e^{{(x 1 [[i]] + 1)}^{a}})})}^{v} - {(1 - p^{(- 1 + e^{{(x 1 [[i]])}^{a}})})}^{v}]

+ \sum_{i = 1}^{n 1}

Log

[{(1 - p^{(- 1 + e^{{(x 2 [[i]] + 1)}^{a}})})}^{v * γ} - {(1 - p^{(- 1 + e^{{(x 2 [[i]])}^{a}})})}^{v * γ}]

+ \sum_{i = 1}^{n 2}

Log

[{(1 - p^{(- 1 + e^{{(Y 1 [[i]] + 1)}^{a}})})}^{v * γ} - {(1 - p^{(- 1 + e^{{(Y 1 [[i]])}^{a}})})}^{v * γ}]

+ \sum_{i = 1}^{n 2}

Log

[{(1 - p^{(- 1 + e^{{(Y 2 [[i]] + 1)}^{a}})})}^{v} - {(1 - p^{(- 1 + e^{{(Y 2 [[i]])}^{a}})})}^{v}]

+ \sum_{i = 1}^{n 3}

Log

[\begin{matrix} {(1 - p^{(- 1 + e^{{(z [[i]] + 1)}^{a}})})}^{v * γ} * ({(1 - p^{(- 1 + e^{{(z [[i]] + 1)}^{a}})})}^{v} - {(1 - p^{(- 1 + e^{{(z [[i]])}^{a}})})}^{v}) \\ - {(1 - p^{(- 1 + e^{{(z [[i]])}^{a}})})}^{v} * ({(1 - p^{(- 1 + e^{{(z [[i]] + 1)}^{a}})})}^{v * γ} - {(1 - p^{(- 1 + e^{{(z [[i]])}^{a}})})}^{v * γ}) \end{matrix}]

parameter = 4;

large = {{2, 0}, {3, 2}, {1, 0}, {2, 1}, {2, 1}, {3, 1}, {2, 1}, {2, 0}};

small = {{0, 2}, {2, 3}, {1, 3}, {2, 3}, {0, 1}};

equal = {{1, 1}, {0, 0}, {1, 1}, {1, 1}, {2, 2}, {1, 1}, {2, 2}, {1, 1}, {2, 2}, {1, 1}, {2, 2}, {1, 1}, {2, 2}, {1, 1}, {1, 1}, {1, 1}, {3, 3}};

Unprotect[Power]; Power[0 | 0., 0 | 0.] = 1; Protect[Power];

Y1 = large[[All, 1]];

Y2 = large[[All, 2]];

n1 = Length[x1]

x1 = small[[All, 1]];

x2 = small[[All, 2]];

n2 = Length[Y1]

z = equal[[All, 1]];

n3 = Length[z]

NMaximize

[{L o k [a_, p_v_, γ_], 0 < p < 1, 0 < a < \infty, 0 < v < \infty, 0 < γ < 1}, {a, p, v, γ}]

References

Alzaatreh, A.; Lee, C.; Famoye, F. A new method for generating families of continuous distributions. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef]
Tahir, M.H.; Cordeiro, G.M.; Alizadeh, M.; Mansoor, M.; Zubair, M.; Hamedani, G.G. The odd generalized exponential family of distributions with applications. J. Stat. Appl. 2015, 2, 1. [Google Scholar] [CrossRef]
Silva, G.F.S.; Percontini, A.; de Brito, E.; Ramos, M.W.; Venâncio, R.; Cordeiro, G.M. The odd Lindley-G family of distributions. Austrian J. Stat. 2017, 46, 65–87. [Google Scholar] [CrossRef]
Alizadeh, M.; Ghosh, I.; Yousof, H.M.; Rasekhi, M.; Hamedani, G.G. The generalized odd generalized exponential family of distributions: Properties, characterizations and applications. J. Data Sci. 2017, 15, 443–465. [Google Scholar] [CrossRef]
Korkmaz, M.C.; Yousof, H.M.; Hamedani, G.G. The exponential Lindley odd log-logistic-G family: Properties, characterizations and applications. J. Stat. Theory Appl. 2018, 17, 554–571. [Google Scholar] [CrossRef]
Djibrila, S. The generalized odd inverted exponential-G family of distributions: Properties and applications. Eurasian Bull. Math. 2019, 2, 86–110. [Google Scholar]
Reyad, H.; Othman, S.; Ul Haq, M.A. The transmuted generalized odd generalized exponential-G family of distributions: Theory and applications. J. Data Sci. 2019, 17, 279–300. [Google Scholar] [CrossRef]
Alizadeh, M.; Afify, A.Z.; Eliwa, M.S.; Ali, S. The odd log-logistic Lindley-G family of distributions: Properties, Bayesian and non-Bayesian estimation with applications. Comput. Stat. 2020, 35, 281–308. [Google Scholar] [CrossRef]
Balakrishnan, N.; Lai, C.D. Continuous Bivariate Distributions, 2nd ed.; Wiley: New York, NY, USA, 2009; Volume 1. [Google Scholar]
Johnson, M.E.; Tenenbein, A. A bivariate distribution family with specified marginals. J. Am. Assoc. 1981, 76, 198–201. [Google Scholar] [CrossRef]
Quesada-Molina, J.J.; Rodrguez-Lallena, J.A. Bivariate copulas with quadratic sections. Journaltitle Nonparametr. Stat. 1995, 5, 323–337. [Google Scholar] [CrossRef]
Fang, K.T.; Fang, H.B.; Rosen, D.V. A family of bivariate distributions with non-elliptical contours. Commun.-Stat.-Theory Methods 2000, 29, 1885–1898. [Google Scholar] [CrossRef]
Durante, F. A new family of symmetric bivariate copulas. Comptes Rendus Math. 2007, 344, 195–198. [Google Scholar] [CrossRef]
Kundu, D.; Gupta, R.D. A class of bivariate models with proportional reversed hazard marginals. Sankhya B 2010, 72, 236–253. [Google Scholar] [CrossRef]
Sarabia, J.M.; Prieto, F.; Jorda, V. Bivariate beta-generated distributions with applications to well-being data. J. Stat. Distrib. Appl. 2014, 1, 15. [Google Scholar] [CrossRef]
Roozegar, R.; Jafari, A.A. On bivariate exponentiated extended Weibull family of distributions. arXiv 2015, arXiv:1507.07535. [Google Scholar] [CrossRef]
Eliwa, M.S.; Alhussain, Z.A.; Ahmed, E.A.; Salah, M.M.; Ahmed, H.H.; El-Morshedy, M. Bivariate Gompertz generator of distributions: Statistical properties and estimation with application to model football data. J. Natl. Sci. Found. Sri Lanka 2020, 48. [Google Scholar] [CrossRef]
Lee, H.; Cha, J.H. On two general classes of discrete bivariate distributions. Am. Stat. 2015, 69, 221–230. [Google Scholar] [CrossRef]
Kundu, D.; Nekoukhou, V. Univariate and bivariate geometric discrete generalized exponential distributions. J. Stat. Theory Pract. 2018, 12, 595–614. [Google Scholar] [CrossRef]
El-Morshedy, M.; Eliwa, M.S.; El-Gohary, A.; Khalil, A.A. Bivariate exponentiated discrete Weibull distribution: Statistical properties, estimation, simulation and applications. Math. Sci. 2020, 14, 29–42. [Google Scholar] [CrossRef]
Nekoukhou, V.; Khalifeh, A.; Bidram, H. A bivariate discrete inverse resilience family of distributions with resilience marginals. J. Appl. Stat. 2020, 48, 1071–1091. [Google Scholar] [CrossRef] [PubMed]
De Oliveira, R.P.; Achcar, J.A. A new flexible bivariate discrete Rayleigh distribution generated by the Marshall-Olkin family. Model Assist. Stat. Appl. 2020, 15, 19–34. [Google Scholar] [CrossRef]
Kobus, M.; Kurek, R. Copula-based measurement of interdependence for discrete distributions. J. Math. 2018, 79, 27–39. [Google Scholar] [CrossRef]
Najarzadegan, H.; Alamatsaz, M.H.; Kazemi, I. Discrete bivariate distributions generated by copulas: Dbeew distribution. J. Stat. Theory Pract. 2019, 13, 1–30. [Google Scholar] [CrossRef]
Yamaguchi, Y.; Maruo, K. Bivariate beta-binomial model using Gaussian copula for bivariate meta-analysis of two binary outcomes with low incidence. Jpn. J. Stat. Data Sci. 2019, 2, 347–373. [Google Scholar] [CrossRef]
Emura, T.; Matsui, S.; Rondeau, V. Survival Analysis with Correlated Endpoints: Joint Frailty-Copula Models; Springer: Berlin/Heidelberg, Germany, 2019. [Google Scholar]
Cuadras, C.M.; Augé, J. A continuous general multivariate distribution and its properties. Commun.-Stat.-Theory Methods 1981, 10, 339–353. [Google Scholar] [CrossRef]
Casella, G.; Berger, R.L. Statistical Inference; Duxbury Press: Pacific Grove, CA, USA, 2002. [Google Scholar]
Pfeiffer, P.E. Conditional Independence in Applied Probability; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
Davis, C.S. Statistical Methods for the Analysis of Repeated Measures Data; Springer: New York, NY, USA, 2002. [Google Scholar]

Figure 1. The joint PMF of the BDsOGEW model.

Figure 2. The joint RF of the BDsOGEW model.

Figure 3. The joint HRF of the BDsOGEW model.

Figure 4. Some scatter plots of the conditional PMF for

X_{1}

given

x_{2}

.

Figure 4. Some scatter plots of the conditional PMF for

X_{1}

given

x_{2}

.

Figure 5. Some scatter plots of the conditional PMF for

X_{2}

given

x_{1}

.

Figure 5. Some scatter plots of the conditional PMF for

X_{2}

given

x_{1}

.

Figure 6. Scatter and violin plots of data set I.

Figure 7. The estimated joint PMF based on data set I.

Figure 8. The profile l for parameters a,

γ

, p and v for data set I.

Figure 8. The profile l for parameters a,

γ

, p and v for data set I.

Figure 9. A 3-dimensional surface profile likelihood of a,

γ

, p and v based on data set I.

Figure 9. A 3-dimensional surface profile likelihood of a,

γ

, p and v based on data set I.

Figure 10. Contour diagrams of the model estimators based on data set I.

Figure 11. Scatter and violin plots of data set II.

Figure 12. The estimated joint PMF based on data set II.

Figure 13. The profile l of a,

γ

, p and v according to data set II.

Figure 13. The profile l of a,

γ

, p and v according to data set II.

Figure 14. A 3-dimensional surface profile likelihood of a,

γ

, p and v based on data set II.

Figure 14. A 3-dimensional surface profile likelihood of a,

γ

, p and v based on data set II.

Figure 15. Contour diagrams of the model estimators based on data set II.

Table 1. Simulation results for scheme I.

	$n_{1} = 20$				$n_{2} = 50$
	p	$ν$	$γ$	$a$	p	$ν$	$γ$	$a$
Bias	$0.10125$	$0.21458$	$0.18256$	$0.25203$	$0.05236$	$0.18256$	$0.12469$	$0.19636$
MSE	$0.09856$	$0.18256$	$0.15246$	$0.21552$	$0.04254$	$0.15635$	$0.09850$	$0.16504$
CP	$0.88719$	$0.92335$	$0.93419$	$0.83963$	$0.88436$	$0.92144$	$0.93223$	$0.83745$
$95 %$ CI $_{L B}$	$0.06523$	$0.24714$	$0.42302$	$0.14025$	$0.07136$	$0.25748$	$0.44120$	$0.15699$
$95 %$ CI $_{U B}$	$0.14569$	$0.35125$	$0.59368$	$0.27145$	$0.13825$	$0.34283$	$0.56774$	$0.26632$
	$n_{3} = 150$				$n_{4} = 300$
	p	$ν$	$γ$	$a$	p	$ν$	$γ$	$a$
Bias	$0.01365$	$0.12145$	$0.08254$	$0.14231$	$0.00858$	$0.07156$	$0.0236$	$0.11258$
MSE	$0.01258$	$0.10395$	$0.06632$	$0.12569$	$0.00652$	$0.05236$	$0.01428$	$0.09254$
CP	$0.88253$	$0.92074$	$0.93064$	$0.83164$	$0.88162$	$0.91886$	$0.92846$	$0.82854$
$95 %$ CI $_{L B}$	$0.07512$	$0.26314$	$0.46256$	$0.16369$	$0.08124$	$0.27142$	$0.47428$	$0.17402$
$95 %$ CI $_{U B}$	$0.12415$	$0.34472$	$0.54239$	$0.24012$	$0.11932$	$0.33748$	$0.53201$	$0.23148$
	$n_{5} = 500$				$n_{6} = 700$
	p	$ν$	$γ$	$a$	p	$ν$	$γ$	$a$
Bias	$0.00103$	$0.01254$	$0.00125$	$0.06317$	$0.00003$	$0.00235$	$0.00024$	$0.00301$
MSE	$0.00084$	$0.00825$	$0.00052$	$0.01426$	$0.00001$	$0.00082$	$0.00011$	$0.00082$
CP	$0.87949$	$0.91278$	$0.92676$	$0.81746$	$0.87552$	$0.91075$	$0.92019$	$0.81298$
$95 %$ CI $_{L B}$	$0.08412$	$0.28301$	$0.48241$	$0.18012$	$0.09174$	$0.28102$	$0.49102$	$0.19125$
$95 %$ CI $_{U B}$	$0.11363$	$0.32823$	$0.53294$	$0.21989$	$0.10857$	$0.31025$	$0.52138$	$0.21011$

Table 2. Simulation results for scheme II.

	$n_{1} = 20$				$n_{2} = 50$
	p	$ν$	$γ$	$a$	p	$ν$	$γ$	$a$
Bias	$0.12369$	$0.23529$	$0.13085$	$0.16625$	$0.10238$	$0.20136$	$0.09820$	$0.14823$
MSE	$0.10925$	$0.22968$	$0.12821$	$0.15530$	$0.09569$	$0.18014$	$0.07742$	$0.13224$
CP	$0.91203$	$0.82734$	$0.94537$	$0.85016$	$0.91027$	$0.82256$	$0.94521$	$0.84771$
$95 %$ CI $_{L B}$	$0.15365$	$0.40123$	$0.22303$	$0.46369$	$0.16325$	$0.42013$	$0.24120$	$0.47102$
$95 %$ CI $_{U B}$	$0.26636$	$0.59325$	$0.38204$	$0.54215$	$0.24932$	$0.57012$	$0.36025$	$0.53325$
	$n_{3} = 150$				$n_{4} = 300$
	p	$ν$	$γ$	$a$	p	$ν$	$γ$	$a$
Bias	$0.08256$	$0.16328$	$0.04230$	$0.12012$	$0.02137$	$0.11205$	$0.00803$	$0.08825$
MSE	$0.06636$	$0.14200$	$0.03047$	$0.09825$	$0.01452$	$0.09852$	$0.00625$	$0.06328$
CP	$0.90734$	$0.82019$	$0.94338$	$0.84309$	$0.90193$	$0.81578$	$0.94219$	$0.84290$
$95 %$ CI $_{L B}$	$0.17714$	$0.43025$	$0.25202$	$0.48158$	$0.18230$	$0.45201$	$0.28323$	$0.48623$
$95 %$ CI $_{U B}$	$0.23852$	$0.55236$	$0.34525$	$0.52745$	$0.22825$	$0.54414$	$0.32256$	$0.51445$
	$n_{5} = 500$				$n_{6} = 700$
	p	$ν$	$γ$	$a$	p	$ν$	$γ$	$a$
Bias	$0.00615$	$0.02167$	$0.00012$	$0.00215$	$0.00023$	$0.00273$	$0.00003$	$0.00052$
MSE	$0.00321$	$0.00825$	$0.00008$	$0.00102$	$0.00004$	$0.00019$	$0.00001$	$0.00008$
CP	$0.89723$	$0.81426$	$0.94139$	$0.84311$	$0.89227$	$0.81337$	$0.93772$	$0.84382$
$95 %$ CI $_{L B}$	$0.18936$	$0.46236$	$0.29201$	$0.49012$	$0.19102$	$0.47525$	$0.29025$	$0.49523$
$95 %$ CI $_{U B}$	$0.21985$	$0.53926$	$0.30941$	$0.51125$	$0.20824$	$0.53101$	$0.30125$	$0.50585$

Table 3. Simulation results for scheme III.

	$n_{1} = 20$				$n_{2} = 50$
	p	$ν$	$γ$	$a$	p	$ν$	$γ$	$a$
Bias	$0.09858$	$0.24223$	$0.19302$	$0.16014$	$0.07145$	$0.19073$	$0.16325$	$0.14085$
MSE	$0.07145$	$0.21014$	$0.16328$	$0.14025$	$0.05241$	$0.17452$	$0.14145$	$0.13748$
CP	$0.92747$	$0.81457$	$0.97194$	$0.95712$	$0.91265$	$0.82602$	$0.97188$	$0.95639$
$95 %$ CI $_{L B}$	$0.46568$	$0.19365$	$0.58636$	$0.75145$	$0.47188$	$0.22325$	$0.61258$	$0.79569$
$95 %$ CI $_{U B}$	$0.54258$	$0.41256$	$0.82414$	$1.15369$	$0.53521$	$0.38525$	$0.79201$	$1.10254$
	$n_{3} = 150$				$n_{4} = 300$
	p	$ν$	$γ$	$a$	p	$ν$	$γ$	$a$
Bias	$0.02035$	$0.14625$	$0.13205$	$0.12638$	$0.00413$	$0.12025$	$0.11015$	$0.11365$
MSE	$0.01996$	$0.13236$	$0.11859$	$0.12025$	$0.00251$	$0.11301$	$0.09852$	$0.09852$
CP	$0.91746$	$0.82746$	$0.96984$	$0.95612$	$0.92957$	$0.88230$	$0.96901$	$0.95598$
$95 %$ CI $_{L B}$	$0.48194$	$0.24189$	$0.64302$	$0.82638$	$0.48831$	$0.25885$	$0.66358$	$0.86677$
$95 %$ CI $_{U B}$	$0.52825$	$0.34125$	$0.76012$	$0.98510$	$0.51254$	$0.33236$	$0.73258$	$0.93748$
	$n_{5} = 500$				$n_{6} = 700$
	p	$ν$	$γ$	$a$	p	$ν$	$γ$	$a$
Bias	$0.00074$	$0.08236$	$0.08858$	$0.09014$	$0.00002$	$0.00714$	$0.00752$	$0.01173$
MSE	$0.00066$	$0.06326$	$0.07968$	$0.06852$	$0.00001$	$0.00513$	$0.00524$	$0.00721$
CP	$0.90166$	$0.88109$	$0.96836$	$0.95583$	$0.90654$	$0.88112$	$0.96554$	$0.95566$
$95 %$ CI $_{L B}$	$0.49134$	$0.27256$	$0.68225$	$0.87480$	$0.49014$	$0.28541$	$0.69221$	$0.88369$
$95 %$ CI $_{U B}$	$0.51015$	$0.32748$	$0.71529$	$0.93254$	$0.50895$	$0.31526$	$0.70418$	$0.91596$

Table 4. The MLEs and GOFM for data set I.

Model	MLEs	$- l$	AIC	HQIC
BDsE	${\hat{ν}}_{1} = 0.846, {\hat{ν}}_{2} = 0.792, {\hat{ν}}_{3} = 0.693$	$88.002$	$182.004$	$183.349$
BPo-4P	${\hat{λ}}_{1} = 0.262, {\hat{γ}}_{1} = 0.165, {\hat{λ}}_{2} = 0.405, {\hat{γ}}_{2} = 2.971$	$77.664$	$163.328$	$165.121$
IBPo	${\hat{λ}}_{1} = 1.499, {\hat{λ}}_{2} = 1.367$	$92.478$	$188.956$	$189.853$
BDsIE	${\hat{ν}}_{1} = 0.501, {\hat{ν}}_{2} = 0.622, {\hat{ν}}_{3} = 0.383$	$92.482$	$190.964$	$192.309$
BDsIR	${\hat{ν}}_{1} = 0.262, {\hat{ν}}_{2} = 0.405, {\hat{ν}}_{3} = 0.363$	$78.659$	$163.318$	$164.663$
BDsIW	${\hat{ν}}_{1} = 0.192, {\hat{ν}}_{2} = 0.337, {\hat{ν}}_{3} = 0.360, \hat{ζ} = 2.453$	$76.513$	$161.026$	$162.815$
BDsOGEW	$\hat{γ} = 0.574, \hat{ν} = 2.457, \hat{p} = 0.722, \hat{a} = 0.780$	$71.162$	$150.324$	$152.117$

Table 5. The MLEs and GOFM for data set II.

Model	MLEs	$- l$	AIC	HQIC
BDsE	${\hat{ν}}_{1} = 0.625, {\hat{ν}}_{2} = 0.812, {\hat{ν}}_{3} = 0.713$	$75.421$	$156.842$	$157.929$
BDsR	${\hat{ν}}_{1} = 0.790, {\hat{ν}}_{2} = 0.872, {\hat{ν}}_{3} = 0.905$	$63.931$	$133.862$	$134.949$
BDsW	${\hat{ν}}_{1} = 1.36, {\hat{ν}}_{2} = 2.10, {\hat{ν}}_{3} = 2.27, \hat{ζ} = 2.125$	$63.911$	$135.822$	$137.271$
BPo $_{min}$	${\hat{ν}}_{1} = 1.36, {\hat{ν}}_{2} = 2.10, {\hat{ν}}_{3} = 2.27$	$64.228$	$134.456$	$135.543$
BPo-3P	${\hat{γ}}_{1} = 1.08, {\hat{γ}}_{2} = 1.38, {\hat{γ}}_{3} = 0.70$	$64.932$	$135.864$	$136.951$
IBPo	${\hat{λ}}_{1} = 1.08, {\hat{λ}}_{2} = 1.38$	$67.623$	$139.246$	$139.971$
BDsIE	${\hat{ν}}_{1} = 0.669, {\hat{ν}}_{2} = 0.388, {\hat{ν}}_{3} = 0.514$	$78.541$	$163.082$	$164.169$
BDsIR	${\hat{ν}}_{1} = 0.493, {\hat{ν}}_{2} = 0.212, {\hat{ν}}_{3} = 0.561$	$64.102$	$134.204$	$135.291$
BDsOGEW	$\hat{γ} = 0.759, \hat{ν} = 23.807, \hat{p} = 0.213, \hat{a} = 0.399$	$62.589$	$133.178$	$134.627$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Al-Essa, L.A.; Eliwa, M.S.; Shahen, H.S.; Khalil, A.A.; Alqifari, H.N.; El-Morshedy, M. Bivariate Discrete Odd Generalized Exponential Generator of Distributions for Count Data: Copula Technique, Mathematical Theory, and Applications. Axioms 2023, 12, 534. https://doi.org/10.3390/axioms12060534

AMA Style

Al-Essa LA, Eliwa MS, Shahen HS, Khalil AA, Alqifari HN, El-Morshedy M. Bivariate Discrete Odd Generalized Exponential Generator of Distributions for Count Data: Copula Technique, Mathematical Theory, and Applications. Axioms. 2023; 12(6):534. https://doi.org/10.3390/axioms12060534

Chicago/Turabian Style

Al-Essa, Laila A., Mohamed S. Eliwa, Hend S. Shahen, Amal A. Khalil, Hana N. Alqifari, and Mahmoud El-Morshedy. 2023. "Bivariate Discrete Odd Generalized Exponential Generator of Distributions for Count Data: Copula Technique, Mathematical Theory, and Applications" Axioms 12, no. 6: 534. https://doi.org/10.3390/axioms12060534

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Bivariate Discrete Odd Generalized Exponential Generator of Distributions for Count Data: Copula Technique, Mathematical Theory, and Applications

Abstract

1. Introduction

2. The BDOGE-G Class

3. Distributional Properties

3.1. Median Correlation Coefficient

3.2. The Conditional CDF of $X_{1}$ Given $X_{2} = x_{2}$ ( $X_{2} \leq x_{2}$ )

3.3. The Conditional Expectation of $X_{1}$ Given $X_{2} = x_{2}$

4. The BDsOGE-Weibull (BDsOGEW) Distribution

5. Point and Interval Estimations

5.1. Maximum Likelihood Estimation (MLE)

5.2. Asymptotic Confidence Intervals

6. Simulation: Estimators Performance

7. Data Analysis

7.1. Data Set I: Nasal Drainage Severity Score

7.2. Data Set II: Football Score

8. Concluding Remarks and Future Work

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Bivariate Discrete Odd Generalized Exponential Generator of Distributions for Count Data: Copula Technique, Mathematical Theory, and Applications

Abstract

1. Introduction

2. The BDOGE-G Class

3. Distributional Properties

3.1. Median Correlation Coefficient

3.2. The Conditional CDF of X 1 Given X 2 = x 2 ( X 2 ≤ x 2 )

3.3. The Conditional Expectation of X 1 Given X 2 = x 2

4. The BDsOGE-Weibull (BDsOGEW) Distribution

5. Point and Interval Estimations

5.1. Maximum Likelihood Estimation (MLE)

5.2. Asymptotic Confidence Intervals

6. Simulation: Estimators Performance

7. Data Analysis

7.1. Data Set I: Nasal Drainage Severity Score

7.2. Data Set II: Football Score

8. Concluding Remarks and Future Work

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3.2. The Conditional CDF of $X_{1}$ Given $X_{2} = x_{2}$ ( $X_{2} \leq x_{2}$ )

3.3. The Conditional Expectation of $X_{1}$ Given $X_{2} = x_{2}$