A Class of Exponentiated Regression Model for Non Negative Censored Data with an Application to Antibody Response to Vaccine

Martínez-Flórez, Guillermo; Vergara-Cardozo, Sandra; Tovar-Falón, Roger

doi:10.3390/sym13081419

Open AccessArticle

A Class of Exponentiated Regression Model for Non Negative Censored Data with an Application to Antibody Response to Vaccine

by

Guillermo Martínez-Flórez

^1,†

,

Sandra Vergara-Cardozo

^2,† and

Roger Tovar-Falón

^1,*,†

¹

Departamento de Matemáticas y Estadística, Facultad de Ciencias Básicas, Universidad de Córdoba, Montería 230027, Colombia

²

Departamento de Estadística, Facultad de Ciencias, Universidad Nacional de Colombia, Bogotá 111321, Colombia

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Symmetry 2021, 13(8), 1419; https://doi.org/10.3390/sym13081419

Submission received: 9 July 2021 / Revised: 30 July 2021 / Accepted: 2 August 2021 / Published: 3 August 2021

(This article belongs to the Special Issue Symmetric and Asymmetric Distributions: Theoretical Developments and Applications Ⅲ)

Download

Browse Figures

Versions Notes

Abstract

:

In this paper, an asymmetric regression model for censored non-negative data based on the centred exponentiated log-skew-normal and Bernoulli distributions mixture is introduced. To connect the discrete part with the continuous distribution, the logit link function is used. The parameters of the model are estimated by using the likelihood maximum method. The score function and the information matrix are shown in detail. Antibody data from a study of the measles vaccine are used to illustrate applicability of the proposed model, and it was found the best fit to the data with respect to an others models used in the literature.

Keywords:

centred exponentiated log-skew-normal distribution; censored data; asymmetry two-part model

1. Introduction

Statistical models for dealing with the issue of random variables with limited or censored responses have been approached by different authors, standing out among others, the censored normal (CN) model, widely known in the literature as the Tobit model, Tobin [1]. The CN or normal Tobit (NT) model is defined from the considering the random variable

y_{i} = max {y_{i}^{*}, 0}

, with

y_{i}^{*} = x_{i}^{⊤} β + ε_{i}

, for

i = 1, 2, \dots, n

, where

β = {(β_{1}, β_{2}, \dots, β_{p})}^{⊤}

is a

p \times 1

unknown parameter vector,

x_{i} = {(x_{1 i}, x_{2 i}, \dots, x_{p i})}^{⊤}

is a

p \times 1

vector of known independent variables, and the error term

ε_{i} \sim N (0, σ^{2}),

i = 1, \dots, n

. This can be written as:

y_{i} = \{\begin{matrix} x_{i}^{⊤} β + ε_{i}, & if x_{i}^{⊤} β + ε_{i} > 0, \\ 0, & otherwise \end{matrix}

(1)

The model (1) has been used in applications in various areas of knowledge, often in situations with excess of zeros in the censoring value; however, the probability of censoring is not well estimated by the Tobit model since the tails of the data distribution are heavier than the normal distribution and other assumptions of the error distribution are not satisfied. A similar situation is presented in the case of data with non-negative support, where usually log-normal Tobit model (LNT) is used.

In some situations, the degrees of asymmetry and kurtosis of the distribution of the errors in the model can not be captured by the normal or the log-normal models. In this case it is not advisable to fit the CN or censored log-normal (CLN) models. To consider more flexible models, such as the skew-normal (SN) model by Azzalini [2] or the power-normal (PN) model of Durrans [3] is another solution. In the case of non-negative data, it can be considered the log-skew-normal alpha-power model by Martínez-Flórez et al. [4] or the log-power-normal model of Martínez-Flórez et al. [5].

The Tobit model extension for censored data with high degree of kurtosis was proposed by Arellano-Valle et al. [6], while the case of censored data with high or low degree of asymmetry was studied by Martínez-Flórez et al. [7], the latter is known as the Tobit power-normal model (TPN). When the censored part does not fit well with the Tobit model, mixture of distributions can be used, in this situation as in the usual censored distribution, the mean and the variance of the response variable is associated with the linear predictor. In addition, the proportion of censored data can be explained by using the binomial model with the logit or probit link function. This type of model has been used in many areas of knowledge such as economics, biology, agriculture, medicine, among others. Cragg [8], Moulton and Halsey [9], Moulton and Halsey [10] and Chai and Bailey [11], for example, use a mixture of distribution (which is denoted by “/”) between a Bernoulli distribution and other continuous distribution.

The probability density function (PDF) of the random variable

Y_{i}

proposed by Cragg [8], which is often called “two-part model”, is given by

g (y_{i}) = p_{i} I_{i} + (1 - p_{i}) f (y_{i}) (1 - I_{i}),

(2)

where

p_{i}

is the probability that determines the relative contribution made by the point mass distribution to the overall mixture distribution, f is a density function with positive support and

I_{i}

is an indicator variable given by

I_{i} = 0

if

y_{i} > T

and

I_{i} = 1

if

y_{i} \leq T

. The model (2) is more informative than the Tobit model because the two components are determined by different stochastic processes, see Chai and Bailey [11].

The two-part model of Cragg [8] was generalized by Moulton and Halsey [9], by introducing a new model in which the response limit can result from a censorship interval of the PDF f, that is, a zero point can result in mass or may be a value of f in the censorship interval

(0, T)

, where T is constant. Specifically, the model proposed by [9] is represented by the PDF given by

g_{_{F}} (y_{i}) = [p_{i} + (1 - p_{i}) F (T)] I_{i} + (1 - p_{i}) f (y_{i}) (1 - I_{i}),

(3)

where

F (\cdot)

is the cumulative distribution function (CDF) associated to the PDF

f (\cdot)

. In particular, a Bernoulli variable can be used with the logit or probit link functions, see Cragg [8]. Moulton and Halsey [9] consider asymmetric log-normal and log-gamma data, while the log-skew-normal model (LSN) and log-power-normal model (LPN) with limited response are studied by Chai and Bailey [11] and Martínez-Flórez et al. [5], respectively.

2. Models for Asymmetric Data

Important proposals to modelling data with high/low degree of asymmetry and/or kurtosis in relation to the normal model have arisen in recent decades. Two of these proposals widely discussed in the statistical literature are the SN model of Azzalini [2] and PN model by Durrans [3]. The SN model, with asymmetry parameter

λ

, which is denoted by

Z \sim SN (λ)

has PDF given by

ϕ_{SN} (z; λ) = 2 ϕ (z) Φ (λ z), z \in R,

(4)

where

λ \in R

and,

ϕ (\cdot)

and

Φ (\cdot)

represent the PDF and CDF of the standard normal distribution, respectively. The

λ

parameter controls the asymmetry in the model. The associated CDF to the PDF in (4) is given by

Φ_{SN} (z; λ) = \int_{- \infty}^{z} ϕ_{SN} (t; λ) d t = Φ (z) - 2 T (z, λ), z \in R,

(5)

where

T (\cdot, λ)

is the Owen’s function, see [12]. The PN model is denoted by

Z \sim PN (α)

and has the PDF given by

f_{PN} (z; α) = α ϕ (z) {\{Φ (z)\}}^{α - 1}, z \in R,

(6)

where

α \in R^{+}

is a shape parameter. The model (6) was introduced by Durrans [3] and it has had multiple applications in the situations where the data distribution presents high or low asymmetry and/or kurtosis, which can not be fitted by the normal distribution.

The extension to the location and scale version of the SN model is obtained by applying the transformation

Y = ξ + η Z

, where

ξ \in R

is a location parameter,

η > 0

is a scale parameter, and

Z \sim SN (λ) .

This is denoted by

Y \sim SN (ξ, η, λ) .

In a similar way, the extension of the location and scale version of the PN model is obtained, which is denoted by

Y \sim PN (ξ, η, α) .

A special feature of the SN and PN models is that both containing the normal model as special case when

λ = 0

and

α = 1

, respectively, highlighting that the SN model has a range of asymmetry higher than the PN model, and at the same time, the PN model has a higher range of kurtosis than the SN model, see Pewsey et al. [13]. Therefore, it is natural to expect that respective extensions for positive data have similar characteristics. Martínez-Flórez et al. [5] studied the extension of the PN model to the case of positive data and they denoted it the log-power-normal (LPN) model, while the extension of the SN distribution for positive data was studied by Azzalini et al. [14], which is denominated the log-skew-normal (LSN) model.

The main difficult with the SN model, which does not present the PN model is that, for

λ = 0

the Fisher information matrix for the parameters vector

(ξ, η, λ)

is singular, see Azzalini [2]. Hence, the regularity conditions are not satisfied in general and the usual

\sqrt{n}

-property for the maximum likelihood estimator is kept only for

λ \neq 0

. The information matrix problem for the case

λ = 0

has been addressed by using the methodology proposed by Rotnitzky et al. [15], who devised an iterative algorithm that under certain conditions leads to a non-singular information matrix for

λ = 0

.

The singularity of the Fisher information matrix has been found in multiple extensions of SN model, such as the LSN model, the skew-exponential power distribution, DiCiccio and Monti [16] and the skew-flexible-normal model, Gómez et al. [17], to name some of these cases. The log-skew-normal alpha-power (LSNAP) distribution is an extension of LPN model, which is obtained by replacing in the LPN model, the PDF and the CDF of the normal distribution for the PDF and the CDF of the SN distribution. This proposal is based on the non-singularity property of the information matrix of the PN model, Pewsey et al. [13], and the flexibility in terms of asymmetry in the SN model, Azzalini [2]. So, it is natural that a new model based on these two distributions can fit the distributions with higher or lower asymmetry than the fitted by the LSN model and/or higher or lower kurtosis than the fitted by the LPN model.

The PDF of the location and scale version of a random variable with LSNAP distribution is given by

f_{LSNAP} (y; ξ, η, λ, α) = \frac{α}{η y} ϕ_{SN} (z; λ) {[Φ_{SN} (z; λ)]}^{α - 1}, y \in R^{+},

(7)

where

z = (log (y) - ξ) / η

, with

ξ \in R

being the location parameter and,

η > 0

the scale parameter. The functions

ϕ_{SN} (\cdot)

and

Φ_{SN} (\cdot)

are the PDF and CDF of the SN distribution given in (4) and (5), respectively. The LSPN distribution is represented by the notation

Y \sim LSNAP (ξ, η, λ, α)

. One can see that, the model in (7) contains as special cases, the log-normal (LN) model when

λ = 0

and

α = 1

; the LSN model when

α = 1

, and the LPN model when

λ = 0

. Thus, the LSPN model is more flexible in terms of asymmetry and kurtosis than the LN, LSN and LPN models.

The Centred Parametrization of the Skew-Normal Model

Facing the problem of the singularity of the information matrix of the SN model and the consequences in the estimation process of the parameters when

λ = 0

, Arellano-Valle and Azzalini [18] proposed an alternative parametrization to the SN model of Azzalini [2]. This new parametrization starts from the definition of the variable

Y = μ + σ \frac{Z - E (Z)}{\sqrt{Var (Z)}},

where

μ \in R

and

σ \in R^{+}

are parameters of the random variable Y, and

Z \sim SN (λ)

. This representation is called centred parametrization, since

E (Y) = μ

and

Var (Y) = σ^{2}

. The centred parametrization of the SN model is denoted by

Y \sim {SN}_{c} (μ, σ, γ_{1})

, where the parameters

μ \in R

,

σ \in R^{+}

and

γ_{1} \in (- 0.9953, 0.9953)

, represent the mean, standard deviation and coefficient of asymmetry of Y, respectively. One can see that, if

Z \sim SN (λ)

, then

E (Z) = b δ

and

Var (Z) = 1 - {(b δ)}^{2}

, where

b = \sqrt{2 / π}

and

δ = λ / \sqrt{1 + λ^{2}}

.

Thus, we have that the random variable Y is allowed to be written in the form

Y = λ_{1} + λ_{2} Z

, which follows a SN distribution of location and scale version denoted by

SN (λ_{1}, λ_{2}, λ)

, where

λ_{1} = μ - c σ γ_{1}^{1 / 3}, λ_{2} = σ \sqrt{1 + c^{2} γ_{1}^{2 / 3}} and λ = \frac{c γ_{1}^{1 / 3}}{\sqrt{b^{2} + c^{2} (b^{2} - 1) γ_{1}^{2 / 3}}}

(8)

with

c = {2 / (4 - π)}^{1 / 3}

.

Under the centred parametrization of the SN model, the Fisher information matrix can be written as

I_{γ_{1}} = D I_{λ} D

, where

D

is a matrix representing the derivative of the parameters

λ_{1}

,

λ_{2}

and

λ

of the standard representation, regarding to the new parameters

μ

,

σ

and

γ_{1}

. In addition, when

λ \to 0

, the information matrix converges to the diagonal matrix

Σ_{c} = diag (σ^{2}, σ^{2} / 2, 6)

, which guarantees the existence and uniqueness of the maximum likelihood estimator (MLE) of the parameters

λ_{1}

and

λ_{2}

, for each fixed value of

λ

.

Given the properties of the LSN and LPN models, the Bernoulli/log-skew-normal (BLSN) and the Bernoulli/log-power-normal (BLPN) mixture models are alternatives to the Bernoulli/log-normal (BLN) model for the case of positive data when the distribution of the continuous part presents greater or lower asymmetry and/or kurtosis than the LN model. Thus, the BLSN and BLPN mixture models are more flexible than the BLN mixture model. Details of the inferential properties of the MLE for the BSLN mixture model when

λ = 0

are not presented by Chai and Bailey [11], since it is expected that the same difficulties are arisen in relation to the continuous part that is fitted through the LSN model.

In this paper, we introduced a new model to fit asymmetric data, more flexible than LN, LSN and LPN models, and with non-singular Fisher information matrix. This model is obtained by replacing in the LPN model, the normal distribution by the

{SN}_{c} (μ, σ, γ_{1})

distribution. From the introduced model, a new regression model for censored data is proposed, which is a mixture of the proposed asymmetric model and a random variable with logit link function. The new model is more flexible in terms of asymmetry and kurtosis than the proposed by Moulton and Halsey [9], Chai and Bailey [11] and Martínez-Flórez et al. [5]. Data from a safety and immunogenicity study of measles vaccine conducted in Haiti during 1987–1990, see Job et al. [19] are used as an illustration. Here, the goal of the study was to demonstrate that the higher titer vaccines could effectively immunize infants as young as 12 months of age.

The rest of the paper is organized as follows. In Section 3, the centred exponentiated log-skew-normal distribution for censored data is presented. A small simulation study to evaluate the asymptotic properties of the parameter estimators is presented. In Section 4, the Bernoulli/centred log-skew-normal alpha-power mixture model is introduced. The inference process is carried out by using the maximum likelihood method. In Section 5, an application with measles vaccine data is presented to illustrate the proposed model.

3. The Centred Exponentiated Log-Skew-Normal Family of Distribution for Censored Data

Based on the flexibility and the non-singularity of the Fisher information matrix of the

{SN}_{c}

model, the LPN model is extended to the case of the

{SN}_{c}

model. This extension is denominated the centred exponentiated log-skew-normal (

{ELSN}_{c}

) distribution. The PDF of the

{ELSN}_{c}

distribution, with parameters

μ

,

σ

,

γ_{1}

and

α

is given by

f_{{ELSN}_{c}} (y; μ, σ, γ_{1}, α) = \frac{α}{λ_{2} y} ϕ_{SN} (z; λ) {\{Φ_{SN} (z; λ)\}}^{α - 1}, \in R^{+}

(9)

where

z = \frac{log (y) - λ_{1}}{λ_{2}}

and

λ_{1},

λ_{2}

and

λ

defined as in (8). This model is denoted by

{ELSN}_{c} (μ, σ, γ_{1}, α)

. It is important to note that

{ELSN}_{c} (μ, σ, γ_{1}, α) \equiv LSNAP (λ_{1}, λ_{2}, λ, α)

with

λ_{1},

λ_{2}

and

λ

defined in (8), that is, the

{ELSN}_{c}

model can be assumed as a reparametrization of LSNAP model, which corrects the problem of singularity in the Fisher information matrix. In addition, if

γ_{1} = 0

, the

LPN (μ, σ, α)

model is obtained and, when

α = 1

, the centred log-skew-normal model follows, which is denoted by

{LSN}_{c} (μ, σ, γ_{1})

. Some forms of the PDF of the ELSN

_{c}

distribution are presented in the Figure 1. One can shown that, LSN

_{c}

model has non-singular information matrix. Finally, if

γ_{1} = 0

and

α = 1

, the log-normal model is obtained,

LN (μ, σ^{2})

. This shows that the

{ELSN}_{c}

model is more flexible in terms the asymmetry and kurtosis than LN, LSN and LPN models.

The importance of the proposed extension is that the information matrix of the model is non-singular, since for the parameter vector

θ = {(μ, σ, γ_{1}, α)}^{⊤}

, the information matrix is given by

I_{θ} = D I_{λ, α} D

, where

I_{λ, α}

is the information matrix of the model (7) given in Martínez-Flórez et al. [5] with

z = (log (y) - ξ) / η

, and

D = (\begin{matrix} 1 & - c γ_{1}^{1 / 3} & - \frac{1}{3} c σ γ_{1}^{- 2 / 3} & 0 \\ 0 & \sqrt{1 + c^{2} γ_{1}^{2 / 3}} & \frac{c^{2} σ γ_{1}^{- 1 / 3}}{3 \sqrt{1 + c^{2} γ_{1}^{2 / 3}}} & 0 \\ 0 & 0 & \frac{c b^{2} γ_{1}^{- 1 / 3}}{3 (b^{2} + c^{2} (b^{2} - 1) γ_{1}^{2 / 3})} & 0 \\ 0 & 0 & 0 & 1 \end{matrix}) .

therefore, when

λ \to 0

and

α = 1

it follows from Azzalini [2] and Pewsey et al. [13] that the information matrix of the

{ELSN}_{c}

model converges to

(\begin{matrix} Σ_{c} & I_{θ_{1} α} \\ I_{θ_{1} α}^{⊤} & 1 \end{matrix})

where

I_{θ_{1}, α}

represents the vector of mixed second derivatives of

α

and the rests of the parameters

θ_{1} = {(μ, σ, γ_{1})}^{⊤}

. This turns out to be non-singular matrix, since its columns (or rows) are linearly independent. Hence, the regularity conditions are satisfied in general and the usual

\sqrt{n}

-property for the MLE

\hat{θ}

of

θ

is satisfied for all

λ

and

α

. This result guarantees the asymptotic distribution of the MLE for large samples, allowing to make inferences for the parameters of the

{ELSN}_{c}

model, which is an advantage against to the LSN model whose information matrix is singular for

λ = 0

.

The Figure 2 and Figure 3 represent of the log-likelihood profiled of the

{ELSN}_{c} (0, 1, 0, 1) \equiv LPN (0, 1, 1) \equiv {LSN}_{c} (0, 1, 0) \equiv LN (0, 1)

distribution for samples sizes 50, 100 and 150. The graphics show a regularity in the behaviour of the log-likelihood function, which gives strong evidence for the existence and uniqueness of the MLE.

3.1. The ELSN $_{c}$ Regression Models

Azzalini [2] ensures that the properties of existence and uniqueness of the MLE model

{SN}_{c}

can be extended to the more general models case such as

y_{i} = x_{i}^{⊤} β + σ Z_{i}

,

i = 1, 2, \dots, n

, where

x_{i}

is a

p \times 1

vector of covariates,

β = {(β_{0}, β_{1}, \dots, β_{p})}^{⊤}

is an unknown vector of regression coefficients and

z_{1}, \dots, z_{n}

are independent and identically distributed random variables

SN (λ)

.

In this section, the location and scale version of the

{ELSN}_{c} (μ, σ, γ_{1}, α)

model, is extended to situations of the regression models, that is, we consider the regression model

log (y_{i}) = x_{i}^{⊤} β + ε_{i}, i = 1, 2, \dots, n,

(10)

where

ε_{i} \sim {ELSN}_{c} (0, σ, γ_{1}, α)

. The model (10) is denominated the centred exponentiated log-skew-normal regression (ELSNR

_{c}

) model and is denoted by

{ELSNR}_{c} (β^{⊤}, σ, γ_{1}, α)

. Estimates for the components of the parameters vector

{(β^{⊤}, σ, γ_{1}, α)}^{⊤}

of the ELSNR

_{c}

model can be obtained by using the maximum likelihood method.

To analyse the behaviour of the estimators of the parameters in the ELSNR

_{c}

model, we carried out a small Monte Carlo simulation study, so, we analysed the behaviour of the estimators in the model (10). Since the coefficients

β_{i}

, for

i = 0, 1, \dots, p

, have no restrictions on the values that can be assumed, without loss of generality we took

p = 1

and the particular values

β_{0} = 1.5,

β_{1} = 2.5

. Furthermore, without loss of generality, we took the value of the scale parameter equal to

σ = 1.0

; however, the following results can be obtained for any value of the scale parameter from the simple transformation

ε_{i} = σ δ_{i}

with

δ_{i} \sim {ELSN}_{c} (0, 1, γ_{1}, α)

. The values of shape parameter were taken as

α = 0.75, 1.5

to take into account different configurations in the form of the pdf of the random variable

ε_{i}

. Finally, we took values for the asymmetry parameter

γ_{1} = 0.25, 0.50, 0.75

, to take into account different degrees of asymmetry in the distribution of the data.

To analyse some statistical measures of the MLE, we considered small, moderate and large sample sizes:

n = 60, 70, 80

and 500, and 1000 iterations were performed for each sample size. The studied characteristics were the bias and the root of the mean square error (RMSE) of the MLEs of the parameters. All calculations and estimates were obtained by using optim function of R Development Core Team [20].

Table 1 presents the results of the simulation study, where it can be observed that the bias (in absolute value) and the RMSE of the MLEs tend to decrease when the sample size increases, which guarantees the asymptotic convergence of the MLEs. Another important fact, is the good estimation of the regression coefficients for all sample sizes considered, with a strong evidence that the model achieves to fit the high levels of skewness and kurtosis present in the response variable. On the other hand, the near zero values of the bias for the parameters

γ_{1}

and

α

for large sample sizes (

n = 500

), the values indicate that average iterations fences were true parameter value and therefore, there are no problems of identifiability in the estimation process.

3.2. The Censored ELSN $_{c}$ Distribution

In this section, the centred ELSN

_{c}

model for censored positive data is introduced. Suppose that random variable

Y^{*}

follows a

{ELSN}_{c} (μ, σ, λ, α)

model, and let

Y_{1}^{*}, Y_{2}^{*}, \dots, Y_{n}^{*}

a random sample of size n, where only those values of

Y^{*}

greater than constant T are recorded; and for values

Y^{*} \leq T

only the value T is recorded. The observed values, which we denote by

Y_{i}

can be written as

Y_{i} = \{\begin{matrix} T, & if Y_{i}^{*} \leq T, \\ Y_{i}^{*}, & if Y_{i}^{*} > T . \end{matrix}

The PDF of

Y_{i}

is

\begin{matrix} \begin{matrix} \Pr (Y_{i} = T) & = \Pr (Y_{i}^{*} \leq T) = {\{Φ_{SN} (t_{c}; λ)\}}^{α}, if Y_{i} = T, \\ Y_{i} & \sim {ELSN}_{c} (μ, σ, λ, α), if Y_{i} > T, \end{matrix} \end{matrix}

(11)

where

t_{c} = \frac{log (T) - λ_{1}}{λ_{2}},

with

λ,

λ_{1}

y

λ_{2}

defined in

(8)

. This model is represented by the notation

Y_{i} \sim {CELSN}_{c} (μ, σ, γ_{1}, α)

. One can see that, if

γ_{1} = 0 and α = 1

, the

{CELSN}_{c}

distribution is identical to the log-normal Tobit model, see Moulton and Halsey [9]. This shows that the

{CELSN}_{c}

model is much more flexible than the log-normal Tobit model. Furthermore, for

α = 1

, the centred LSN model for censored data follows, while for

γ_{1} = 0

the censored LPN model of Martínez-Flórez et al. [5] is obtained.

Extensions of the

{CELSN}_{c}

model to the case of regression models are defined in the same way, by assuming

ε_{i} \sim {ELSN}_{c} (0, σ, γ_{1}, α)

, and defining

t_{c i} = \frac{log (T) - x_{i}^{⊤} β + c σ γ_{1}^{1 / 3}}{λ_{2}} and t_{i} = \frac{log (y_{i}) - x_{i}^{⊤} β + c σ γ_{1}^{1 / 3}}{λ_{2}},

(12)

with

λ_{2}

defined as in (8).

4. The Bernoulli/ Centred Log-Skew-Normal Alpha-Power Mixture Model

This section aims to make an extension of the generalized two-part model presented by Moulton and Halsey [9], where the Logit/Log-normal model is proposed.

4.1. The Logit/Centred Log-Skew-Normal Alpha-Power Model

The extension of the Moulton and Halsey [9] model to the case of the

{ELSN}_{c}

distribution is obtained by following Martínez-Flórez et al. [5]. We assume the existence of two random variables which define two different stochastic processes, D with Bernoulli distribution and Y with

{ELSN}_{c}

distribution. According to the model

(3)

, the PDF is given by

\begin{matrix} g_{C} (y_{i}) = {(p_{0 i} + (1 - p_{0 i}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α})}^{I_{i}} {((1 - p_{0 i}) \frac{α}{λ_{2} y_{i}} ϕ_{SN} (t_{i}; λ) {\{Φ_{SN} (t_{i}; λ)\}}^{α - 1})}^{1 - I_{i}} \end{matrix}

where

λ_{2}

and

λ

were defined in Equation

(8)

.

One can get a more informative model if covariates are introduced to explain the response variable Y and covariates to explain the associated distribution to censored part, that is, the random variable D. Thus, we consider two sets of covariates,

If $I_{i} = 0$ , i.e., the non-censored part, the covariates vector will be denoted by $x_{(2) i} = {(1, X_{2 i 1}, X_{2 i 2}, \dots, X_{2 i q_{0}})}^{⊤}$ with the parameter vector given by $β_{(2)} = {(β_{20}, β_{21}, \dots, β_{2 q_{0}})}^{⊤}$ .
If $I_{i} = 1$ and $Y_{i} = T$ , i.e., the censored part, the covariates vector will be denoted by $x_{(1) i} = {(1, X_{1 i 1}, X_{1 i 2}, \dots, X_{1 i q_{1}})}^{⊤}$ with the parameter vector given by $β_{(1)} = {(β_{10}, β_{11}, \dots, β_{1 q_{1}})}^{⊤}$ .

If

I_{i} = 1

and

Y_{i} < T

, we will also have an associated distribution; however, in this case is assumed that there is no observations for

Y_{i} < T

due the censorship. For the variable D, we consider the logit link function, so that

logit (P [D = 1 ∣ x_{(1)}]) = x_{(1)}^{⊤} β_{(1)},

then, we have

p_{0 i} = {(1 + exp (x_{(1) i}^{⊤} β_{(1)}))}^{- 1},

For non-censored part is considered

Y_{i} \sim {ELSN}_{c} (x_{(2) i}^{⊤} β_{(2)}, σ_{2}, γ_{1}, α), Y_{i} > T .

(13)

The model (13) is a generalization of the models of Martínez-Flórez et al. [5], Moulton and Halsey [9] and Chai and Bailey [11]. It is denominated the Bernoulli/centred exponentiated log-skew-normal mixture model, and it will be represented by the notation

Y_{i} \sim {BELSNM}_{c} (β_{(1)}, β_{(2)}, σ, γ_{1}, α)

.

One can see that, when

γ_{1} = 0

, the

{BELSNM}_{c}

model is identical to the Logit/log-power-normal (logit/LPN) mixture model, for

α = 1

, the

{BELSNM}_{c}

model is identical to the Logit/centred log-skew-normal (Logit/LSN

_{c}

) mixture model,

γ_{1} = 0

and

α = 1

, the

{BELSNM}_{c}

model is identical to the Logit/log-normal (Logit/LN) mixture model, see Martínez-Flórez et al. [5], Chai and Bailey [11] and Moulton and Halsey [9]. It can be concluded from the above results and the characteristics of the

{ELSN}_{c}

model to fit positive data with higher (or lower) degree of asymmetry and kurtosis than LPN and LSN models, that, the

{BELSNM}_{c}

model is a great extension of the logit/log-normal model. This new distribution turns out to be more flexible in terms of asymmetry and kurtosis than the models of Moulton and Halsey [9], Chai and Bailey [11] and Martínez-Flórez et al. [5], becoming a great alternative to censored asymmetric positive data or distributions with excess of zeros.

Is necessary to emphasize that

{\hat{β}}_{(2) 0}

is the biased estimation for the intercept in the regression model. In fact, since

E (Y ∣ Y > 0) \neq X β

, then, to correct the bias, it is necessary to calculate

{\hat{β}}_{(2) 0}^{*} = {\hat{β}}_{(2) 0} + \hat{E} (\hat{e})

, where

E (e) = α η \int_{0}^{1} Φ_{ISN} (y) y^{α - 1} d y

, where

Φ_{ISN} (\cdot)

represents the inverse function of the

{SN}_{c}

distribution

Φ_{SN} (\cdot)

.

4.2. Fitting Model

The parameters vector

θ = {(β_{(1)}^{⊤}, β_{(2)}^{⊤}, σ, γ_{1}, α)}^{⊤}

for the

{BELSNM}_{c}

model can be estimated by using the maximum likelihood method. The log-likelihood function based on a random sample

Y_{1}, Y_{2}, \dots, Y_{n}

, with

Y_{i} \sim {BELSNM}_{c} (θ)

, given

X_{(1)}

,

X_{(2)}

is given by

\begin{matrix} \begin{matrix} ℓ (θ; X_{(1)}, X_{(2)}, Y) & = \sum_{i} I_{i} log [1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}] \\ + \sum_{i} (1 - I_{i}) [log (α) - log (λ_{2} y_{i}) + x_{(1) i}^{⊤} β_{(1)} \\ + log {ϕ_{SN} (t_{i}; λ)} + (α - 1) log {Φ_{SN} (t_{i}; λ)}] \\ - \sum_{i} log \{1 + exp (x_{(1) i}^{⊤} β_{(1)})\}, \end{matrix} \end{matrix}

(14)

where

t_{c i}

and

t_{i}

are as defined in (12). The equations scores obtained by equating the score function to zero are given by (for

j = 1, 2, \dots, q_{1}

and

k = 1, 2, \dots, q_{0}

).

\begin{matrix} - \sum_{i} I_{i} \frac{x_{1 i j} exp (x_{(1) i}^{⊤} β_{(1)})}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} \frac{1 - {\{Φ_{SN} (t_{c i}; λ)\}}^{α}}{1 + exp (x_{(1) i}^{⊤} β_{(1)})} \\ + \sum_{i} \frac{x_{1 i j}}{1 + exp (x_{(1) i}^{⊤} β_{(1)})} = 0, for j = 1, 2, \dots, q_{1} \\ - \sum_{i} I_{i} \frac{x_{2 i k} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} \\ - \frac{1}{λ_{2}} \sum_{i} (1 - I_{i}) x_{2 i k} \{t_{i} + λ ω (λ t_{i}) + (α - 1) ω_{λ} (t_{i})\} = 0, for k = 1, 2, \dots, q_{0} \end{matrix}

\begin{matrix} - \sum_{i} I_{i} \frac{t_{c i} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} \\ - \frac{1}{λ_{2}} \sum_{i} (1 - I_{i}) \times (1 - t_{i}^{2} + λ t_{i} ω (λ t_{i}) + (α - 1) t_{i} ω_{λ} (t_{i})) = 0 \\ - \sqrt{\frac{2}{π}} \frac{1}{1 + λ^{2}} \sum_{i} I_{i} \frac{exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α - 1} ϕ (\sqrt{1 + λ^{2}} t_{c i})}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} \\ + \sum_{i} (1 - I_{i}) (t_{i} ω (λ t_{i}) - \sqrt{\frac{2}{π}} \frac{α - 1}{1 + λ^{2}} ν_{λ} (t_{i})) = 0 \\ \sum_{i} I_{i} \frac{exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α} log \{Φ_{SN} (t_{c i}; λ)\}}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} \\ + \sum_{i} (1 - I_{i}) (\frac{1}{α} + log \{Φ_{SN} (t_{i}; λ)\}) = 0 \end{matrix}

where

ω (t) = ϕ (t) / Φ (t)

,

ω_{λ} (t) = ϕ_{SN} (t; λ) / Φ_{SN} (t; λ)

,

ν_{λ} (t) = ϕ (\sqrt{1 + λ^{2}} t) / Φ_{SN} (t; λ)

and

ϕ_{PSN} (\cdot; λ, α)

is the PDF of the

{ELSN}_{c}

model. The system of score equations has no closed form solution and have to be obtained numerically. The solutions of this system of equations provide the MLE of the parameters vector

{(β_{(1)}^{⊤}, β_{(2)}^{⊤}, σ, γ_{1}, α)}^{⊤}

. The log-likelihood function can be maximized by implementing some statistical packages such as R Development Core Team [20], which has the x-Optim, optim, nlem, x-maxLIK or maxLIK commands to maximize non-linear functions.

The initial values for the parameters

β_{(1)}

and

β_{(2)}

can be obtained from the fit of a Tobit model, while, initial values for

γ_{1}

and

α

can be obtained by fitting the

{ELSN}_{c}

model for the response variable Y. The standard errors of the MLE can be obtained as the square root of the inverse of the observed information matrix

J (\cdot)

, which converges asymptotically to the Fisher information matrix, this matrix is given by

J (β_{(1)}^{⊤}, β_{(2)}^{⊤}, σ, γ_{1}, α) = B^{⊤} J (β_{(1)}^{⊤}, β_{(2)}^{⊤}, η, λ, α) B

where the elements of the

J (β_{(1)}^{⊤}, β_{(2)}^{⊤}, η, λ, α)

matrix are in the Appendix A, and

B = (\begin{matrix} I_{q_{1} + 1} & 0_{q_{1} + 1} & 0_{(q_{1} + 1) \times q_{0}} & 0_{q_{1} + 1} & 0_{q_{1} + 1} & 0_{q_{1} + 1} \\ 0_{q_{1} + 1}^{⊤} & 1 & 0_{q_{0}}^{⊤} & - c γ_{1}^{1 / 3} & - \frac{1}{3} c σ γ_{1}^{- 2 / 3} & 0 \\ 0_{q_{0} \times (q_{1} + 1)} & 0_{q_{0}} & I_{q_{0}} & 0_{q_{0}} & 0_{q_{0}} & 0_{q_{0}} \\ 0_{q_{1} + 1}^{⊤} & 0 & 0_{q_{0}}^{⊤} & \sqrt{1 + c^{2} γ_{1}^{2 / 3}} & \frac{c^{2} σ γ_{1}^{- 1 / 3}}{3 \sqrt{1 + c^{2} γ_{1}^{2 / 3}}} & 0 \\ 0_{q_{1} + 1}^{⊤} & 0 & 0_{q_{0}}^{⊤} & 0 & \frac{c b^{2} γ_{1}^{- 1 / 3}}{3 (b^{2} + c^{2} (b^{2} - 1) γ_{1}^{2 / 3})} & 0 \\ 0_{q_{1} + 1}^{⊤} & 0 & 0_{q_{0}}^{⊤} & 0 & 0 & 1 \end{matrix}) .

Then, the Fisher information matrix is given by

I (β_{(1)}^{⊤}, β_{(2)}^{⊤}, σ, γ_{1}, α) = E (J (β_{(1)}^{⊤}, β_{(2)}^{⊤}, σ, γ_{1}, α)) = B^{⊤} I (β_{(1)}^{⊤}, β_{(2)}^{⊤}, λ_{2}, λ, α) B,

where

I (β_{(1)}^{⊤}, β_{(2)}^{⊤}, λ_{2}, λ, α) = E (J (β_{(1)}^{⊤}, β_{(2)}^{⊤}, λ_{2}, λ, α)) .

(15)

From the information matrix in (15), it can be obtained the asymptotic distribution of the MLE for large samples with covariance matrix

Σ = {(I (β_{(1)}^{⊤}, β_{(2)}^{⊤}, σ, γ_{1}, α))}^{- 1}

. Confidence intervals for the model parameters can be obtained from the MLE and the standard errors of the MLE.

5. An Application to Antibody Response to Vaccine

Data from a safety and immunogenicity study of measles vaccine conducted in Haiti during 1987–1990 are used as an illustration, see Job et al. [19]. In this case, the goal of the study was to demonstrate that the higher titer vaccines could effectively immunize infants as young as 12 months of age. The response variable was neutralization antibody and the covariates involved in the study were: EZ (vaccine type;

0 = :

Schwarz,

1 = :

Edmonston-Zagreb), HI (vaccine dose;

0 = :

medium,

1 = :

high) and FEM (gender;

0 = :

male,

1 = :

female). The sample size was 330 children, of which 86 were at or below the lower detection limit, (LDL). The number of expected zeros by considering the usual Tobit model was four. The response variable was the neutralization antibody, with LDL equal to 0.1 international units (UI), and the covariates involved in the study were encoded as

EZ = X_{1}

,

HI = X_{2}

and

FEM = X_{3}

.

The high asymmetry degree for values above

0.1

indicated by the sample asymmetry coefficient

(\sqrt{b_{1}})

reveals that it seems worthwhile trying to fit an asymmetric model for this data set, so we fit the Moulton and Halsey [9], Martínez-Flórez et al. [4] and the centred version of the Chai and Bailey [11] models, with

X_{(1)} = X_{(2)} = {(X_{1}, X_{2}, X_{3})}^{⊤}

, and the results are presented in Table 2. The parameter estimates of the fitted models were obtained by using the optim function of R Development Core Team [20]. The source codes of the fitted models can be obtained by requesting them by email to the authors.

We also fit the

{BELSNM}_{c}

model, initially only with covariates in the continuous part and subsequently with covariates in the two components. The fitted models are shown in Table 3. To compare the fitted models, we computed the Akaike information criterion [21], namely

A I C = - 2 ℓ (\cdot) + 2 p

, where p is the number of parameters for the considered model. The best model is the one with the smallest AIC value.

According to the AIC criterion, the best fit is presented by the

{BELSNM}_{c}

model. To corroborate the good performance of the

{BELSNM}_{c}

model, the proportion of the data set coming from units with low response was estimated. For the

{BELSNM}_{c}

model without covariates the estimator of the Bernoulli intercept is 1.106, so that the estimator of the proportion of observations at or below the detection limit is

100 \times 1 / [1 + exp (1.106)] = 24.86 %

which, compared to the observed

26.1 %

, indicates good agreement with the proposed model.

We also consider the problem of testing the null hypothesis of no difference between the

{BELSNM}_{c}

model and the censored log-normal (CLN) model, i.e.,

H_{0} : (γ_{1}, α) = (0, 1) versus H_{1} : (γ_{1}, α) \neq (0, 1)

We use the likelihood ratio statistic

Λ = \frac{L_{CLN} (\hat{θ})}{L_{{BELSNM}_{c}} (\hat{θ})} .

where

L_{F} (\cdot)

is the likelihood function under model F. Numerical evaluations indicate that

- 2 log (Λ) = - 2 (- 511.18 + 461.36) = 99.63,

which is greater than the 5% chi-square critical value with two degree of freedom,

χ_{2, 5 %}^{2} = 5.991

. Hence, the null hypothesis is rejected and we conclude that the

{BELSNM}_{c}

fits the data better than the logit/LN model.

Hypothesis testing for the Logit/LPN (CLPN) and Logit/LSN

_{c}

(CLSN

_{c}

) models against the

{BELSNM}_{c}

model are also conducted. Formally the hypotheses

H_{01} : γ_{1} = 0 versus H_{11} : γ_{1} \neq 0, and H_{02} : α = 1 versus H_{12} : α \neq 1

can be tested by using the statistics

Λ_{1} = \frac{L_{CLPN} (\hat{θ})}{L_{{BELSNM}_{c}} (\hat{θ})} and Λ_{2} = \frac{L_{{CLSN}_{c}} (\hat{θ})}{L_{{BELSNM}_{c}} (\hat{θ})} .

After numerical evaluations, we obtained

- 2 log (Λ_{1}) = 39.39 and - 2 log (Λ_{2}) = 11.66,

which are greater than the 5% chi-square critical value with one degree of freedom,

χ_{1, 5 %}^{2} = 3.8414

. The null hypothesis are rejected and we conclude that the

{BELSNM}_{c}

model fits the data better than the Logit/LPN and Logit/LSN

_{c}

models.

Using distributions LN, LPN and

{ELSN}_{c}

for the continuous part, the scaled residuals

e_{i} = (log (y_{i}) - x_{(2) i}^{⊤} {\hat{β}}_{(2)}) / \hat{η}

are evaluated and presented in the Figure 4 and Figure 5.

The figures reveal good performance of the

{BELSNM}_{c}

distribution, further indicating that it is a viable alternative for asymmetric data with censored responses.

6. Final Discussion

In this paper, a more flexible model than the Logit/LN, Logit/LSN and Logit/LPN distributions is proposed. The new model is able to fit data with greater degree asymmetry and kurtosis than the Moulton and Halsey [9], Chai and Bailey [11] and Martínez-Flórez et al. [5] models. The score function and the maximum likelihood estimator (MLE) of the model parameters are presented. A small Monte Carlo simulation study carried out showed a good performance of the MLE. An illustration with safety and immunogenicity data was presented in which the

{BELSNM}_{c}

model makes a better fit with respect to the Logit/LN, Logit/LSN and Logit/LPN models.

Among the main advantages that can be seen from the proposed models, there is greater flexibility with respect to the log-normal (log-Tobit), log-SN and log-PN models. On the other hand, the logit link function allows us to estimate the point mass probability with greater precision compared to the Tobit and log-Tobit models. As a disadvantage, the number of parameters in the model—although making it more flexible—also make it less parsimonious. However, even though the model is less parsimonious, it continues to be a good proposal, especially in cases where the asymmetry and kurtosis indices are high.

Author Contributions

Conceptualization, G.M.-F., S.V.-C. and R.T.-F.; Data curation, G.M.-F. and S.V.-C.; Formal analysis, G.M.-F., S.V.-C. and R.T.-F.; Funding acquisition, G.M.-F., S.V.-C. and R.T.-F.; Investigation, G.M.-F., S.V.-C. and R.T.-F.; Resources, G.M.-F., S.V.-C. and R.T.-F.; Software, G.M.-F. and S.V.-C.; Supervision, G.M.-F.; Validation, G.M.-F. and R.T.-F.; Visualization, G.M.-F., S.V.-C. and R.T.-F.; Writing—original draft, G.M.-F., S.V.-C. and R.T.-F.; Writing—review & editing, G.M.-F., S.V.-C. and R.T.-F. All authors have read and agreed to the published version of the manuscript.

Funding

The research of G. Martínez-Flórez and R. Tovar-Falón was supported by project: Resolución de Problemas de Situaciones Reales Usando Análisis Estadístico a través del Modelamiento Multidimensional de Tasas y Proporciones; Esquemas de Monitoreamiento para Datos Asimétricos no Normales y una Estrategia Didáctica para el Desarrollo del Pensamiento Lógico-Matemático. Universidad de Córdoba, Colombia, Code FCB-05-19.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Details about data available are given in Section 5.

Acknowledgments

G. Martínez-Flórez and R. Tovar-Falón acknowledge the support given by Universidad de Córdoba, Montería, Colombia. S. Vergara-Cardozo recognizes the support given by Universidad Nacional de Colombia, Sede Bogotá.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Letting:

ω (t) = ϕ (t) / Φ (t)

,

ω_{λ} (t) = ϕ_{SN} (t; λ) / Φ_{SN} (t; λ)

,

ν_{λ} (t) = ϕ (\sqrt{1 + λ^{2}} t) / Φ_{SN} (t; λ)

, the elements of the observed information matrix

J (β_{(1)}^{⊤}, β_{(2)}^{⊤}, λ_{2}, λ, α)

, denoted

j_{β_{(1) j} β_{(1) j^{'}}},

j_{β_{(2) k} β_{(1) j}}

,

\dots, j_{α α},

are given by

\begin{matrix} j_{β_{(1) j} β_{(1) j^{'}}} & = - \sum I_{i} \frac{x_{1 i j} x_{1 i j^{'}} exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}}{{[1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}]}^{2}} + \sum_{i} \frac{x_{1 i j} x_{1 i j^{'}} exp (x_{(1) i}^{⊤} β_{(1)})}{{\{1 + exp (x_{(1) i}^{⊤} β_{(1)})\}}^{2}} \\ j_{β_{(1) j} β_{(2) k}} & = \sum_{i} I_{i} \frac{x_{1 i j} x_{2 i k} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{{[1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}]}^{2}} \\ j_{η β_{(1) j}} & = \sum_{i} I_{i} \frac{x_{1 i j} t_{c i} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{{[1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}]}^{2}} \\ j_{λ β_{(1) j}} & = \sqrt{\frac{2}{π}} \frac{α}{1 + λ^{2}} \sum_{i} I_{i} \frac{x_{1 i j} exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α - 1} ϕ (\sqrt{1 + λ^{2}} t_{c i})}{{[1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}]}^{2}} \\ j_{α β_{(1) j}} & = - \sum_{i} I_{i} \frac{x_{1 i j} exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α} log \{Φ_{SN} (t_{c i}; λ)\}}{{[1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}]}^{2}} \end{matrix}

\begin{matrix} j_{β_{(2) k} β_{(2) k^{'}}} & = \frac{1}{λ_{2}} \sum_{i} I_{i} \frac{x_{2 i k} x_{2 i k^{'}} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} [t_{c i} - (α - 1) ω_{λ} (t_{i})] \\ - \sqrt{\frac{2}{π}} \frac{α λ}{λ_{2}} \sum_{i} I_{i} \frac{x_{2 i k} x_{2 i k^{'}} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α) ϕ (\sqrt{1 + λ^{2}} t_{i}) {(Φ_{SN} (t_{c i}; λ))}^{α - 1}}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} \\ + \sum_{i} I_{i} x_{2 i k} x_{2 i k^{'}} {[\frac{exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}}]}^{2} \\ + \frac{1}{λ_{2}^{2}} \sum_{i} (1 - I_{i}) x_{2 i k} x_{2 i k^{'}} [- 1 + λ^{3} t_{i} ω (λ t_{i}) + λ^{2} {(ω (λ t_{i}))}^{2}] \\ + \frac{α - 1}{λ_{2}^{2}} \sum_{i} (1 - I_{i}) x_{2 i k} x_{2 i k^{'}} [t_{i} ω_{λ} (t_{i}) + {(ω_{λ} (t_{i}))}^{2} - \sqrt{\frac{2}{π}} λ ν_{λ} (t_{i})] \end{matrix}

\begin{matrix} j_{η β_{(2) k}} & = \frac{1}{η} \sum_{i} I_{i} \frac{x_{2 i k} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} (- 1 + t_{c i}^{2} - α t_{c i} ω_{λ} (t_{c i})) \\ - \sqrt{\frac{2}{π}} \frac{α λ}{η^{2}} \sum_{i} I_{i} \frac{x_{2 i k} t_{c i} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ (\sqrt{1 + λ^{2}} t_{i}) {(Φ_{SN} (t_{c i}; λ))}^{α - 1}}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} \\ + \sum_{i} I_{i} x_{2 i k} t_{c i} {(\frac{exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}})}^{2} \\ + \frac{1}{η^{2}} \sum_{i} (1 - I_{i}) x_{2 i k} (2 t_{i} + (α - 1) (- 1 + t_{i}^{2} + t_{i} ω_{λ} (t_{i})) ω_{λ} (t_{i})) \\ + \frac{λ (α - 1)}{η^{2}} \sum_{i} (1 - I_{i}) x_{2 i k} t_{i} ν_{λ} (t_{i}) \\ + \frac{λ}{η^{2}} \sum_{i} (1 - I_{i}) x_{2 i k} ω (λ t_{i}) (- 1 + λ^{2} t_{i}^{2} + t_{i} ω (λ t_{i})) \end{matrix}

\begin{matrix} j_{λ β_{(2) k}} & = \sqrt{\frac{2}{π}} \sum_{i} I_{i} \frac{x_{2 i k} t_{c i} [1 + {\{Φ_{SN} (t_{c i}; λ)\}}^{α}] exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α) ϕ (\sqrt{1 + λ^{2}} t_{c i})}{{[1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}]}^{2}} \\ - \sqrt{\frac{2}{π}} \frac{1}{1 + λ^{2}} \sum_{i} I_{i} ({(Φ_{SN} (t_{c i}; λ))}^{α} - (α - 1)) \\ \times \frac{x_{2 i k} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α) ϕ (\sqrt{1 + λ^{2}} t_{c i})}{Φ_{SN} (t_{i}; λ) {(1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α})}^{2}} \\ + \sqrt{\frac{2}{π}} \frac{α - 1}{η} \sum_{i} (1 - I_{i}) x_{2 i k} ((1 - λ^{2} t_{i}^{2}) ω (λ t_{i}) - t_{i} {\{ω (λ t_{i})\}}^{2}) \\ + \sqrt{\frac{2}{π}} \frac{α - 1}{η} \sum_{i} (1 - I_{i}) x_{2 i k} ν_{λ} (t_{i}) (t_{i} + \frac{1}{1 + λ^{2}} ω_{λ} (t_{i})) \end{matrix}

\begin{matrix} j_{α β_{(2) k}} & = \sum_{i} I_{i} \frac{x_{2 i k} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{{[1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}]}^{2}} \\ \times (log (Φ_{SN} (t_{c i}; λ)) + α^{- 1} (1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α - 1})) \\ + \frac{1}{η} \sum_{i} (1 - I_{i}) x_{2 i k} ω_{λ} (t_{i}) \end{matrix}

\begin{matrix} j_{λ_{2} β_{(2) k}} & = \frac{1}{λ_{2}} \sum_{i} I_{i} \frac{x_{2 i k} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} (- 1 + t_{c i}^{2} - α t_{c i} ω_{λ} (t_{c i})) \\ - \sqrt{\frac{2}{π}} \frac{α λ}{λ_{2}^{2}} \sum_{i} I_{i} \frac{x_{2 i k} t_{c i} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ (\sqrt{1 + λ^{2}} t_{i}) {(Φ_{SN} (t_{c i}; λ))}^{α - 1}}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} \\ + \sum_{i} I_{i} x_{2 i k} t_{c i} {(\frac{exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}})}^{2} \\ + \frac{1}{λ_{2}^{2}} \sum_{i} (1 - I_{i}) x_{2 i k} [2 t_{i} + (α - 1) (- 1 + t_{i}^{2} + t_{i} ω_{λ} (t_{i})) ω_{λ} (t_{i})] \\ - \sqrt{\frac{2}{π}} \frac{λ (α - 1)}{λ_{2}^{2}} \sum_{i} (1 - I_{i}) x_{2 i k} t_{i} ν_{λ} (t_{i}) \\ + \frac{λ}{λ_{2}^{2}} \sum_{i} (1 - I_{i}) x_{2 i k} ω (λ t_{i}) (- 1 + λ^{2} t_{i}^{2} + t_{i} ω (λ t_{i})) \end{matrix}

\begin{matrix} j_{λ β_{(2) k}} & = \sqrt{\frac{2}{π}} \sum_{i} I_{i} \frac{x_{2 i k} t_{c i} (1 + {(Φ_{SN} (t_{c i}; λ))}^{α}) exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α) ϕ (\sqrt{1 + λ^{2}} t_{c i})}{{(1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α})}^{2}} \\ - \sqrt{\frac{2}{π}} \frac{1}{1 + λ^{2}} \sum_{i} I_{i} ({(Φ_{SN} (t_{c i}; λ))}^{α} - (α - 1)) \\ \times \frac{x_{2 i k} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α) ϕ (\sqrt{1 + λ^{2}} t_{c i})}{Φ_{SN} (t_{i}; λ) {(1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α})}^{2}} \\ + \sqrt{\frac{2}{π}} \frac{α - 1}{λ_{2}} \sum_{i} (1 - I_{i}) x_{2 i k} ((1 - λ^{2} t_{i}^{2}) ω (λ t_{i}) - t_{i} {(ω (λ t_{i}))}^{2}) \\ + \sqrt{\frac{2}{π}} \frac{α - 1}{λ_{2}} \sum_{i} (1 - I_{i}) x_{2 i k} ν_{λ} (t_{i}) (t_{i} + \frac{1}{1 + λ^{2}} ω_{λ} (t_{i})) \\ j_{α β_{(2) k}} & = \sum_{i} I_{i} \frac{x_{2 i k} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{{(1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α})}^{2}} \\ \times (log \{Φ_{SN} (t_{c i}; λ)\} + α^{- 1} (1 + exp (x_{(1) i}^{⊤} β_{(1)}) {(Φ_{SN} (t_{c i}; λ))}^{α - 1})) \\ + \frac{1}{λ_{2}} \sum_{i} (1 - I_{i}) x_{2 i k} ω_{λ} (t_{i}) \end{matrix}

\begin{matrix} j_{λ λ_{2}} & = \frac{α}{λ_{2}} \sqrt{\frac{2}{π}} \sum_{i} I_{i} \frac{t_{c i} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ (\sqrt{1 + λ^{2}} t_{c i}) {(Φ_{SN} (t_{c i}; λ))}^{α - 1}}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} \\ \times (t_{c i} - \frac{α - 1}{1 + λ^{2}} ω_{λ} (t_{c i}) + \frac{λ_{2} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}}) \\ + \frac{1}{λ_{2}} \sum_{i} (1 - I_{i}) t_{i} ((1 - λ^{2} t_{i}^{2}) ω (λ t_{i}) - t_{i} {(ω (λ t_{i}))}^{2}) \\ + \sqrt{\frac{2}{π}} \frac{α - 1}{λ_{2}} \sum_{i} (1 - I_{i}) t_{i} ν_{λ} (t_{i}) (t_{i} + \frac{1}{1 + λ^{2}} ω_{λ} (t_{i})) \end{matrix}

\begin{matrix} j_{λ λ} & = \sqrt{\frac{2}{π}} \sum_{i} I_{i} \frac{α exp (x_{(1) i}^{⊤} β_{(1)}) ϕ (\sqrt{1 + λ^{2}} t_{c i}) {(Φ_{SN} (t_{c i}; λ))}^{α - 1}}{(1 + λ^{2}) (1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α})} \\ \times (\sqrt{\frac{2}{π}} \frac{1}{1 + λ^{2}} ν_{λ} (t_{c i}) + \frac{2 δ}{\sqrt{1 + λ^{2}}} + λ t_{c i}^{2}) \\ - \frac{2}{π} \frac{α}{{(1 + λ^{2})}^{2}} \sum_{i} I_{i} {(\frac{exp (x_{(1) i}^{⊤} β_{(1)}) ϕ (\sqrt{1 + λ^{2}} t_{c i}) {(Φ_{SN} (t_{c i}; λ))}^{α - 1}}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}})}^{2} \\ + \frac{1}{λ_{2}} \sum_{i} (1 - I_{i}) t_{i}^{2} (λ t_{i} ω (λ t_{i}) + {(ω (λ t_{i}))}^{2}) \\ - \sqrt{\frac{2}{π}} \frac{α - 1}{1 + λ^{2}} \sum_{i} (1 - I_{i}) ν_{λ} (t_{i}) (- (\frac{2 δ}{\sqrt{1 + λ^{2}}} + λ t_{i}^{2}) + \sqrt{\frac{2}{π}} \frac{1}{1 + λ^{2}} ν_{λ} (t_{i})) \\ j_{α λ} & = \sqrt{\frac{2}{π}} \frac{1}{1 + λ^{2}} \sum_{i} I_{i} ((1 + α log (Φ_{SN} (t_{c i}; λ))) \\ - \frac{exp (x_{(1) i}^{⊤} β_{(1)}) {(Φ_{SN} (t_{c i}; λ))}^{α - 1} ϕ (\sqrt{1 + λ^{2}} t_{c i})}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} \\ \times \frac{α exp (x_{(1) i}^{⊤} β_{(1)}) {(Φ_{SN} (t_{c i}; λ))}^{α} log (Φ_{SN} (t_{c i}; λ))}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}}) \\ + \sqrt{\frac{2}{π}} \frac{1}{1 + λ^{2}} \sum_{i} (1 - I_{i}) ν_{λ} (t_{i}) \end{matrix}

\begin{matrix} j_{λ_{2} λ_{2}} & = \frac{1}{λ_{2}} \sum_{i} I_{i} \frac{t_{c i} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} (- 2 + t_{c i} ω_{λ} (t_{c i}) + t_{c i}^{2}) \\ + α^{- 1} \sum_{i} I_{i} + \frac{1}{λ_{2}^{2}} \sum_{i} (1 - I_{i}) [- 1 + 3 t_{i}^{2} \\ + λ t_{i} {(\frac{t_{c i} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}})}^{2} (2 + λ^{2} t_{i}^{2}) ω (λ t_{i}) + λ^{2} t_{i}^{2} {(ω (λ t_{i}))}^{2}] \\ + \frac{α - 1}{λ_{2}^{2}} \sum_{i} (1 - I_{i}) t_{i} ω_{λ} (t_{i}) (- 2 + t_{i}^{2} + t_{i} ω_{λ} (t_{i})) \end{matrix}

\begin{matrix} j_{α λ} & = \sum_{i} I_{i} \frac{t_{c i} log \{Φ_{SN} (t_{c i}; λ)\} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} \\ \times (1 - \frac{exp (x_{(1) i}^{⊤} β_{(1)}) {(Φ_{SN} (t_{c i}; λ))}^{α}}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}}) \\ + α^{- 1} \sum_{i} I_{i} \frac{t_{c i} exp (x_{(1) i}^{⊤} β_{(1)}) ϕ_{PSN} (t_{c i}; λ, α)}{1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α}} + \frac{1}{λ_{2}} \sum_{i} I_{i} t_{i} ω_{λ} (t_{i}) \\ j_{α α} & = - \sum_{i} I_{i} \frac{exp (x_{(1) i}^{⊤} β_{(1)}) {(Φ_{SN} (t_{c i}; λ))}^{α} {log}^{2} (Φ_{SN} (t_{c i}; λ))}{{(1 + exp (x_{(1) i}^{⊤} β_{(1)}) {\{Φ_{SN} (t_{c i}; λ)\}}^{α})}^{2}} + \sum_{i} (1 - I_{i}) \frac{1}{α^{2}} \end{matrix}

References

Tobin, J. Estimation of relationships for limited dependent variables. Econometrica 1958, 26, 24–36. [Google Scholar] [CrossRef] [Green Version]
Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
Durrans, S.R. Distributions of fractional order statistics in hydrology. Water Resour. Res. 1992, 28, 1649–1655. [Google Scholar] [CrossRef]
Martínez-Flórez, G.; Vergara-Cardozo, S.; González, L.M. The family of log-skew-normal alpha-power distributions using the precipitation data. Rev. Colomb. EstadÍstica 2013, 36, 351–361. [Google Scholar]
Martínez-Flórez, G.; Bolfarine, H.; Gómez, H.W. Asymmetric regression models with limited responses with an application to antibody response to vaccine. Biom. J. 2013, 55, 156–172. [Google Scholar] [CrossRef] [PubMed]
Arellano-Valle, R.B.; Castro, L.M.; González-Farias, G.; Muñóz-Gajardo, K.A. Student-t censored regression model: Properties and inference. Stat. Methods Appl. 2012, 21, 453–473. [Google Scholar] [CrossRef]
Martínez-Flórez, G.; Bolfarine, H.; Gómez, H.W. The alpha-power tobit model. Commun. Stat. Theory Methods 2013, 42, 633–643. [Google Scholar] [CrossRef]
Cragg, J. Some statistical models for limited dependent variables with application to the demand for durable goods. Econometrica 1971, 39, 829–844. [Google Scholar] [CrossRef]
Moulton, L.; Halsey, N. A mixture model with detection limits for regression analyses of antibody response to vaccine. Biometrics 1995, 51, 1550–1578. [Google Scholar] [CrossRef]
Moulton, L.; Halsey, N. A mixed gamma model for regression analyses of quantitative assay data. Vaccine 1996, 14, 1154–1158. [Google Scholar] [CrossRef]
Chai, H.; Bailey, K. Use of log-normal distribution in analysis of continuous data with a discrete component at zero. Stat. Med. 2008, 27, 3643–3655. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Owen, D.B. Tables for computing bi-variate normal probabilities. Ann. Math. Stat. 1956, 27, 1075–1090. [Google Scholar] [CrossRef]
Pewsey, A.; Gómez, H.; Bolfarine, H. Likelihood-based inference for power distributions. Test 2012, 21, 775–789. [Google Scholar] [CrossRef]
Azzalini, A.; Cappello, D.; Kotz, S. Log-skew-normal and log-skew-t distributions as models for family income data. J. Incone Distrib. 2002, 11, 12–20. [Google Scholar]
Rotnitzky, A.; Cox, D.R.; Bottai, M.; Robins, J. Likelihood-based inference with singular information matrix. Bernoulli 2000, 6, 243–284. [Google Scholar] [CrossRef]
DiCiccio, T.J.; Monti, A.C. Inferential aspects of the skew exponential power distribution. J. Am. Stat. Assoc. 2004, 99, 439–450. [Google Scholar] [CrossRef]
Gómez, H.W.; Elal-Olivero, D.; Salinas, H.; Bolfarine, H. Bimodal extension based on the skew-normal distribution with application to pollen data. Environmetrics 2011, 22, 50–62. [Google Scholar] [CrossRef]
Arellano-Valle, R.B.; Azzalini, A. The centred parametrization for the multivariate skew-normal distribution. J. Multivar. Anal. 2008, 99, 1362–1382. [Google Scholar] [CrossRef]
Job, J.; Halsey, N.; Boulos, R.; Holt, E.; Farrel, D.; Albrecht, P.; Brutus, J.; Adrien, M.; Andre, J.; Chan, E.; et al. The cite soleil/JHU project team. Successful immunization of infants at 6 months of age with high dose Edmonston-Zagreb measles vaccine. Pediatr. Infect. Dis. J. 1991, 30, 303–311. [Google Scholar] [CrossRef] [PubMed]
R Development Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2019; Available online: http://www.R-project.org (accessed on 22 February 2021).
Akaike, H. A new look at statistical model identification. IEEE Trans. Autom. Control 1974, AU–19, 716–722. [Google Scholar] [CrossRef]

Figure 1. (a)

{ELSN}_{c} (0, 1, 0.75, α

) for

α = 0.35

(solid line),

α = 0.5

(dashed line),

α = 0.75

(dotted line),

α = 2

(dotted–dashed line), (b)

{ELSN}_{c} (0, 1

,

γ_{1}, 2

) with

γ_{1} = - 0.50

(solid line),

γ_{1} = - 0.25

(dashed line),

γ_{1} = 0.50

(dotted line),

γ_{1} = 0.90

(dotted–dashed line).

Figure 1. (a)

{ELSN}_{c} (0, 1, 0.75, α

) for

α = 0.35

(solid line),

α = 0.5

(dashed line),

α = 0.75

(dotted line),

α = 2

(dotted–dashed line), (b)

{ELSN}_{c} (0, 1

,

γ_{1}, 2

) with

γ_{1} = - 0.50

(solid line),

γ_{1} = - 0.25

(dashed line),

γ_{1} = 0.50

(dotted line),

γ_{1} = 0.90

(dotted–dashed line).

Figure 2. Log-likelihood profiled for

γ_{1}

assuming

{ELSN}_{c}

distribution with samples sizes (a) 50, (b) 100 and (c) 150 from a simulated

LN (0, 1) \equiv {ELSN}_{c} (0, 1)

distribution.

Figure 2. Log-likelihood profiled for

γ_{1}

assuming

{ELSN}_{c}

distribution with samples sizes (a) 50, (b) 100 and (c) 150 from a simulated

LN (0, 1) \equiv {ELSN}_{c} (0, 1)

distribution.

Figure 3. Log-likelihood profiled for

α

assuming

{ELSN}_{c}

distribution with samples sizes (a) 50, (b) 100 and (c) 150 from a simulated

LN (0, 1) \equiv {ELSN}_{c} (0, 1)

distribution.

Figure 3. Log-likelihood profiled for

α

assuming

{ELSN}_{c}

distribution with samples sizes (a) 50, (b) 100 and (c) 150 from a simulated

LN (0, 1) \equiv {ELSN}_{c} (0, 1)

distribution.

Figure 4. Histogram of the scaled residuals

e_{i}

for (a) LN model, (b) LPN model, (c) ELSN

_{c}

model.

Figure 4. Histogram of the scaled residuals

e_{i}

for (a) LN model, (b) LPN model, (c) ELSN

_{c}

model.

Figure 5. QQ-plots of the scaled residuals

e_{i}

for (a) LN model, (b) LPN model, (c) ELSN

_{c}

model.

Figure 5. QQ-plots of the scaled residuals

e_{i}

for (a) LN model, (b) LPN model, (c) ELSN

_{c}

model.

Table 1. Simulation study with 1000 iterations for

α = 0.75, 1.5

,

γ = 0.25, 0.5, 0.75

,

β_{0} = 1.5

, and

β_{1} = 2.5

, with sample sizes of

n = 60, 70, 80

and 500.

Table 1. Simulation study with 1000 iterations for

α = 0.75, 1.5

,

γ = 0.25, 0.5, 0.75

,

β_{0} = 1.5

, and

β_{1} = 2.5

, with sample sizes of

n = 60, 70, 80

and 500.

	$β_{0} = 1.5$		$β_{1} = 2.5$		$α = 0.75$		$γ_{1} = 0.25$
n	Bias	RMSE	Bias	RMSE	Bias	RMSE	Bias	RMSE
60	−0.0307	0.5331	0.0019	0.1405	−0.0168	0.3787	0.1614	0.5477
70	−0.0296	0.5073	0.0015	0.1305	−0.0130	0.3581	0.1410	0.5616
80	−0.0170	0.4640	−0.0012	0.1255	−0.0204	0.3342	0.1083	0.4157
500	0.0080	0.1700	0.0011	0.0462	−0.0046	0.1199	0.0068	0.1224
	$β_{0} = 1.5$		$β_{1} = 2.5$		$α = 0.75$		$γ_{1} = 0.50$
n	Bias	RMSE	Bias	RMSE	Bias	RMSE	Bias	RMSE
60	−0.0568	0.6625	0.0060	0.1384	−0.0048	0.3726	0.2956	1.0945
70	−0.0536	0.6290	−0.0015	0.1276	−0.0205	0.3309	0.2515	0.8024
80	−0.0301	0.6118	0.0031	0.1189	−0.0091	0.3135	0.2018	0.6585
500	0.0209	0.2970	−0.0006	0.0448	0.0023	0.1156	0.0197	0.2412
	$β_{0} = 1.5$		$β_{1} = 2.5$		$α = 0.75$		$γ_{1} = 0.75$
n	Bias	RMSE	Bias	RMSE	Bias	RMSE	Bias	RMSE
60	−0.0508	0.6767	0.0072	0.1211	0.1848	1.9833	0.3443	1.2177
70	−0.0481	0.67	0.0045	0.1174	0.184	2.0431	0.3243	1.1201
80	−0.0368	0.6483	0.003	0.1047	0.1284	1.0011	0.2918	1.0173
500	−0.0287	0.3889	−0.0015	0.0387	0.0201	0.1639	0.1173	0.4708
n	Bias	RMSE	Bias	RMSE	Bias	RMSE	Bias	RMSE
60	−0.0973	0.5842	0.0500	0.1251	0.0519	0.4253	0.8683	3.2124
70	−0.0815	0.5433	0.0102	0.1165	0.0360	0.4137	0.7459	3.0357
80	−0.0716	0.4592	−0.0068	0.1073	0.0081	0.3963	0.5457	2.2340
500	−0.0151	0.1480	0.0013	0.0408	0.0001	0.1253	0.0564	0.2874
	$β_{0} = 1.5$		$β_{1} = 2.5$		$α = 1.5$		$γ_{1} = 0.5$
n	Bias	RMSE	Bias	RMSE	Bias	RMSE	Bias	RMSE
60	−0.0179	0.6981	−0.0092	0.1203	0.0774	0.6987	1.0013	5.0588
70	0.0173	0.6716	0.0085	0.1153	0.0441	0.4326	0.6727	3.2734
80	0.0154	0.6311	0.0065	0.1057	0.0427	0.3569	0.6311	3.1535
500	−0.0404	0.2858	0.0005	0.0395	0.0190	0.1496	0.1793	0.6559
	$β_{0} = 1.5$		$β_{1} = 2.5$		$α = 1.5$		$γ_{1} = 0.75$
n	Bias	RMSE	Bias	RMSE	Bias	RMSE	Bias	RMSE
60	0.0789	0.6885	0.0053	0.1218	0.1973	1.4280	0.4340	3.1930
70	0.0784	0.6413	0.0025	0.1134	0.1630	1.0152	0.4212	2.4414
80	0.0623	0.6046	0.0012	0.1059	0.1366	0.6108	0.3358	1.8002
500	0.0316	0.3572	−0.0005	0.0380	0.0723	0.3008	0.0890	0.6704

Table 2. Estimated parameters (standard error) of the fitted model.

Density	AIC	$β_{10}$	$β_{11}$	$β_{12}$	$γ_{1} / α$	$β_{20}$	$β_{23}$
Logit/LN	986.19	0.652	0.808	0.422		−0.401	0.264
		(0.220)	(0.304)	(0.288)		(0.112)	(0.155)
Logit/LSN $_{c}$	944.15	0.503	0.648	0.974	0.899	−0.284	0.108
		(0.203)	(0.277)	(0.303)	(0.537)	(0.059)	(0.079)
Logit/LPN	976.11	0.640	0.765	0.357	9.660	−3.030	0.221
		(0.209)	(0.280)	(0.269)	(4.306)	(0.607)	(0.138)

Table 3. Parameter estimation (standard error) and model fitting.

AIC	$β_{10}$	$β_{11}$	$β_{12}$	$β_{20}$	$β_{21}$	$β_{22}$	$β_{23}$	$γ_{1}$	$α$
985.21	1.106	–	–	−1.786	−0.166	0.115	0.176	0.291	4.281
	(0.134)	–	–	(0.633)	(0.136)	(0.138)	(0.137)	(0.432)	(1.459)
938.32	0.349	0.975	0.689	0.045	–	–	0.156	0.923	0.613
	(0.199)	(0.274)	(0.295)	(0.101)	–	–	(0.084)	(0.407)	(0.090)

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Martínez-Flórez, G.; Vergara-Cardozo, S.; Tovar-Falón, R. A Class of Exponentiated Regression Model for Non Negative Censored Data with an Application to Antibody Response to Vaccine. Symmetry 2021, 13, 1419. https://doi.org/10.3390/sym13081419

AMA Style

Martínez-Flórez G, Vergara-Cardozo S, Tovar-Falón R. A Class of Exponentiated Regression Model for Non Negative Censored Data with an Application to Antibody Response to Vaccine. Symmetry. 2021; 13(8):1419. https://doi.org/10.3390/sym13081419

Chicago/Turabian Style

Martínez-Flórez, Guillermo, Sandra Vergara-Cardozo, and Roger Tovar-Falón. 2021. "A Class of Exponentiated Regression Model for Non Negative Censored Data with an Application to Antibody Response to Vaccine" Symmetry 13, no. 8: 1419. https://doi.org/10.3390/sym13081419

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Class of Exponentiated Regression Model for Non Negative Censored Data with an Application to Antibody Response to Vaccine

Abstract

1. Introduction

2. Models for Asymmetric Data

The Centred Parametrization of the Skew-Normal Model

3. The Centred Exponentiated Log-Skew-Normal Family of Distribution for Censored Data

3.1. The ELSN $_{c}$ Regression Models

3.2. The Censored ELSN $_{c}$ Distribution

4. The Bernoulli/ Centred Log-Skew-Normal Alpha-Power Mixture Model

4.1. The Logit/Centred Log-Skew-Normal Alpha-Power Model

4.2. Fitting Model

5. An Application to Antibody Response to Vaccine

6. Final Discussion

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

A Class of Exponentiated Regression Model for Non Negative Censored Data with an Application to Antibody Response to Vaccine

Abstract

1. Introduction

2. Models for Asymmetric Data

The Centred Parametrization of the Skew-Normal Model

3. The Centred Exponentiated Log-Skew-Normal Family of Distribution for Censored Data

3.1. The ELSN c Regression Models

3.2. The Censored ELSN c Distribution

4. The Bernoulli/ Centred Log-Skew-Normal Alpha-Power Mixture Model

4.1. The Logit/Centred Log-Skew-Normal Alpha-Power Model

4.2. Fitting Model

5. An Application to Antibody Response to Vaccine

6. Final Discussion

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3.1. The ELSN $_{c}$ Regression Models

3.2. The Censored ELSN $_{c}$ Distribution