Multivariate Mixed Response Model with Pairwise Composite-Likelihood Method

Bai, Hao; Zhong, Yuan; Gao, Xin; Xu, Wei

doi:10.3390/stats3030016

Open AccessArticle

Multivariate Mixed Response Model with Pairwise Composite-Likelihood Method

¹

Department of Mathematics and Statistics, York University, Toronto, ON M3J 1P3, Canada

²

Department of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON M5S 1A1, Canada

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Stats 2020, 3(3), 203-220; https://doi.org/10.3390/stats3030016

Submission received: 30 June 2020 / Revised: 11 July 2020 / Accepted: 13 July 2020 / Published: 15 July 2020

Download

Browse Figure

Versions Notes

Abstract

:

In clinical research, study outcomes usually consist of various patients’ information corresponding to the treatment. To have a better understanding of the effects of different treatments, one often needs to analyze multiple clinical outcomes simultaneously, while the data are usually mixed with both continuous and discrete variables. We propose the multivariate mixed response model to implement statistical inference based on the conditional grouped continuous model through a pairwise composite-likelihood approach. It can simplify the multivariate model by dealing with three types of bivariate models and incorporating the asymptotical properties of the composite likelihood via the Godambe information. We demonstrate the validity and the statistic power of the multivariate mixed response model through simulation studies and clinical applications. This composite-likelihood method is advantageous for statistical inference on correlated multivariate mixed outcomes.

Keywords:

composite likelihood; multivariate analysis; mixed outcome; Godambe information

1. Introduction

Clinical research, such as toxicity studies and laboratory examinations, can provide relevant information for measuring the effect of various treatments or experiments on the patients. This type of research needs to jointly analyze different experimental outcomes, while the research outcomes collected during the treatment are correlated and mixed with both categorical and continuous variables. For example, we are studying the efficacy of treatments along with the toxicity and adverse drug reactions simultaneously. In this case, the severity level could be measured as discrete or ordinal data, while the clinical examination results such as the blood test measures are continuous. In the traditional approach, these multiple outcomes are analyzed by different linear models to estimate the effects of the treatments together with the relevant clinical and demographic information. However, this approach ignores the correlation between the outcomes and only provides marginal inferences. Thus, it is desirable to develop a multivariate approach, which can jointly model the multiple mixed-type responses with the treatment and clinical covariates.

In the recent literature, there are two main approaches to build a joint model for mixed response variables. The conditional Gaussian distribution model (CGDM) can decompose the joint distribution of mixed response variables into a combination of the conditional distribution and the marginal distribution. In the bivariate mixed case, the conditional Gaussian distribution model can produce a conditional distribution of the categorical variables given the continuous variables and marginal distribution for the continuous variables. In particular, Cox [1] provided the logistic conditional distribution for binary variables, and Cox and Wermuth [2] extended the model with a probit-type function and showed the potential connection to the latent variable model. Another conditional Gaussian distribution model, referred to as the general location model (GLOM), was proposed by Olkin and Tate [3]. They adopted the opposite factorization, which consists of a conditional normal distribution given the categorical variables and marginal multinomial distribution. Teixeira-Pinto and Normand [4] compared this approach with the models proposed by Sammel et al. [5,6] in a comprehensive review. Yang et al. [7] extended the model to mixed Poisson and continuous response variables through a likelihood-based approach.

The grouped continuous model (GCM) proposed by Anderson and Pemberton [8] and de Leon [9] provides another solution for this problem. The fundamental technique allows the categorical variables to be treated as partitioned continuous latent variables with different nonoverlapping intervals (Poon and Lee [10]; Skrondal and Rabe-Hesketh [11]). This type of transformation allows the latent variables to follow a multivariate Gaussian distribution. Poon and Lee [10] demonstrated maximum likelihood estimation for latent variables with polychoric correlations. As an extension of the grouped continuous model, de Leon [9] proposed the conditional grouped continuous model (CGCM) to build a joint model for the variables mixed with categorical and continuous outcomes. Catalano and Ryan [12], Catalano [13], and Najita et al. [14] applied the conditional grouped continuous model to the studies of fetal toxicity for longitudinal data. Gueorguieva and Agresti [15] proposed that the estimation of correlated mixed response variables can be obtained by the expectation–maximization (EM) algorithm. Zhang et al. [16] extended this to the parameter expanded EM algorithm under the full-likelihood approach. de Leon and Carriére [17,18] developed a general mixed-data model, which aggregates the CGDM and CGCM to jointly analyze correlated nominal, ordinal, and continuous data together.

It remains computationally challenging to estimate the joint distribution of multivariate mixed data. The composite-likelihood method is based on compounded lower-dimensional distributions, which offers an alternative solution to the estimation problem (Lindsay [19], Cox and Reid [20], Varin [21], Varin et al. [22], and Reid et al. [23]). Faes et al. [24] applied this method for longitudinal data with mixed outcomes. In their model, the correlation structure is induced by the random effect, which does not have a closed-form expression. We aim to follow the approach of the CGCM and use the composite-likelihood method to analyze the joint distribution of multivariate mixed-type response variables, where the categorical response variables are modeled by continuous latent variables. The parameters of the mean structure, as well as the correlation among different outcomes, can be estimated simultaneously through the numerical algorithm. The proposed composite-likelihood method consists of three types of bivariate joint densities: two continuous outcomes; two discrete outcomes modeled by two continuous latent variables; and two mixed outcomes with one continuous and one categorical variable.

We provide the numerical algorithm for composite-likelihood estimation and discuss the asymptotical properties of the composite-likelihood estimates. In addition, we derive three composite-likelihood test statistics for joint inference on the multivariate mixed response model. Simulation studies were conducted to examine the empirical performance of the proposed method in comparison with the conventional approaches. The algorithm was applied to the clinical data from a colorectal cancer study. We analyze the effect of the treatment and other clinical factors’ on multiple correlated responses of the patients.

2. Methodology

2.1. Model Setup

Suppose there are n observations

z_{1}, z_{2}, \dots, z_{i}, \dots, z_{n}

in a clinical dataset, and each observation contains q multiple outcomes

z_{i} = {(z_{i 1}, z_{i 2}, \dots, z_{i j}, \dots, z_{i q})}^{T}

, which are correlated and mixed with continuous and binary variables. Suppose we wish to model the effects of a collection of covariates, and the generalized linear model can be constructed for each outcome as

\begin{matrix} g_{j} (E (z_{i j})) = x_{i j}^{T} β_{j}, \end{matrix}

in which the covariate

x_{i j}

with

i = 1, 2, \dots, n

and

j = 1, 2, \dots, q

for different responses can be the same or different, and

g_{j}

denotes the link function used for the jth response. If we want to analyze these multivariate mixed-type outcomes simultaneously with the corresponding covariates, the conventional likelihood-based approach needs to identify the multivariate joint densities of all responses. Alternatively, we can set up the pairwise likelihood function for responses

z_{i j}

and

z_{i k}

as

\begin{matrix} L_{j k} (θ_{j k}) = \prod_{i = 1}^{n} f (z_{i j}, z_{i k}), \end{matrix}

(1)

where

θ_{j k}

denotes the parameters associated with the pairwise likelihood. The log-likelihood function is given by

l_{j k} (θ_{j k}) = log L_{j k} (θ_{j k})

, and the score function is given by

\begin{matrix} U_{j k} (θ_{j k}) = \sum_{i = 1}^{n} f {(z_{i j}, z_{i k})}^{- 1} \frac{\partial}{\partial θ_{j k}} f (z_{i j}, z_{i k}) . \end{matrix}

(2)

According to Lindsay [19] and Cox and Reid [20], the pairwise composite-likelihood function of these q response variables is the product of

(\binom{q}{2})

paired likelihood functions:

\begin{matrix} CL (θ) = \prod_{j = 1}^{q - 1} \prod_{k = j + 1}^{q} L_{j k} (θ_{j k}), \end{matrix}

and the score function is constructed by defferentiating the composite log-likelihood function:

\begin{matrix} U (θ) = \frac{\partial}{\partial θ} log CL (θ) . \end{matrix}

(3)

Our multivariate mixed response model estimates the parameters of interest

θ

by numerically solving the composite score Function (3) equal to 0 through the Newton–Raphson [25] method. Since the outcomes are mixed with continuous and categorical variables, the score Function (2) can be derived under three different bivariate structures: outcomes with both continuous response variables, outcomes with both binary response variables, and outcomes mixed with one continuous and one binary response variable.

2.1.1. Case 1: Continuous Outcomes

We first discuss the case that both responses

z_{i j}

and

z_{i k}

are continuous and assume they follow a bivariate normal distribution:

\begin{matrix} (\begin{matrix} z_{i j} \\ z_{i k} \end{matrix}) \sim N_{2} ((\begin{matrix} μ_{i j} \\ μ_{i k} \end{matrix}), (\begin{matrix} σ_{j}^{2}, & ρ_{j k} σ_{j} σ_{k} \\ ρ_{j k} σ_{j} σ_{k}, & σ_{k}^{2} \end{matrix})), \end{matrix}

where the mean structure of

z_{i j}

and

z_{i k}

is associated with the generalized linear models as

μ_{i j} = x_{i j}^{T} β_{j}

and

μ_{i k} = x_{i k}^{T} β_{k}

. Thus, the pairwise density of

z_{i j}

and

z_{i k}

is the same as a bivariate normal density based on Johnson and Wichern [26]:

\begin{matrix} f (z_{i j}, z_{i k}) = & \frac{1}{2 π σ_{j} σ_{k} \sqrt{1 - ρ_{j k}^{2}}} exp [- \frac{1}{2 (1 - ρ_{j k}^{2})} \times \\ (\frac{{(z_{i j} - μ_{i j})}^{2}}{σ_{j}^{2}} - \frac{2 ρ (z_{i j} - μ_{i j}) (z_{i k} - μ_{i k})}{σ_{j} σ_{k}} + \frac{{(z_{i k} - μ_{i k})}^{2}}{σ_{k}^{2}})] . \end{matrix}

The pairwise composite-likelihood function can be constructed by

\begin{matrix} L_{j k} (θ_{j k}) = \prod_{i = 1}^{n} f (z_{i j}, z_{i k}), \end{matrix}

with the parameters of interest

θ_{j k} = {β_{j}, β_{k}, σ_{j}, σ_{k}, ρ_{j k}}

, and the log-likelihood function

l_{j k} (θ_{j k}) = \sum_{i = 1}^{n} log f (z_{i j}, z_{i k})

, which gives the score function of both continuous responses used in the Equation (2):

\begin{matrix} U_{j k} (θ_{j k}) & = & \frac{\partial l_{j k} (θ_{j k})}{\partial θ_{j k}} . \end{matrix}

(4)

2.1.2. Case 2: Binary Outcomes

For binary outcomes, the transformation can be obtained using the threshold value t such that if the latent variable

z_{i j}^{*} \geq t

, then the response

z_{i j} = 1

, otherwise

z_{i j} = 0

. Without loss of generality, the value of t is set to 0, and we have

μ_{i j} = P (z_{i j} = 1) = P (z_{i j}^{*} \geq 0)

. We follow the grouped continuous model settings with the distributional assumptions

z_{i j} \sim B e r n o u l l i (μ_{i j})

and

z_{i j}^{*} \sim N (μ_{i j}^{*}, σ_{j}^{2})

. Thus, the model can be constructed with the covariates

x_{i j}

as

\begin{matrix} p r o b i t (μ_{i j}) = x_{i j}^{T} β_{j} . \end{matrix}

Since the mean

μ_{i j}^{*}

and the variance

σ_{j}^{2}

are not identifiable at the same time, we follow the rescaling method based on Dunson [27] by setting

σ_{j}^{2} = 1

. Therefore, the mean of the latent variable can be simplified as

μ_{i j}^{*} = x_{i j}^{T} β_{j}

.

Since we jointly analyze binary variables

z_{i j}

and

z_{i k}

, the paired latent variables

z_{i j}^{*}

and

z_{i k}^{*}

are generated from a model with the polychoric correlation. Thus, the joint function of

(z_{i j}, z_{i k})

can be derived by following composite-likelihood functions in four cases

\begin{matrix} P (z_{i j} = 1, z_{i k} = 1) & = & P (z_{i j}^{*} \geq 0, z_{i k}^{*} \geq 0) = Φ_{2} (μ_{i j}^{*}, μ_{i k}^{*}, ρ_{j k}), \\ P (z_{i j} = 1, z_{i k} = 0) & = & P (z_{i j}^{*} \geq 0, z_{i k}^{*} < 0) = Φ_{2} (μ_{i j}^{*}, - μ_{i k}^{*}, - ρ_{j k}), \\ P (z_{i j} = 0, z_{i k} = 1) & = & P (z_{i j}^{*} < 0, z_{i k}^{*} \geq 0) = Φ_{2} (- μ_{i j}^{*}, μ_{i k}^{*}, - ρ_{j k}), \\ P (z_{i j} = 0, z_{i k} = 0) & = & P (z_{i j}^{*} < 0, z_{i k}^{*} < 0) = Φ_{2} (- μ_{i j}^{*}, - μ_{i k}^{*}, ρ_{j k}), \end{matrix}

where

\pm ρ_{j k}

represents the polychoric correlation under different scenarios and

Φ_{2}

denotes the bivariate normal cumulative density function. The equations above can be rewritten as

\begin{matrix} P (z_{i j}, z_{i k}) & = & Φ_{2} ((2 z_{i j} - 1) μ_{i j}^{*}, (2 z_{i k} - 1) μ_{i k}^{*}, (2 z_{i j} - 1) (2 z_{i k} - 1) ρ_{j k}) \\ = & Φ_{2} (s_{i j} μ_{i j}^{*}, s_{i k} μ_{i k}^{*}, s_{i j} s_{i k} ρ_{j k}), \end{matrix}

with

s_{i j} = (2 z_{i j} - 1)

and

s_{i k} = (2 z_{i k} - 1)

. The log-likelihood function of paired binary responses is

\begin{matrix} l_{j k} (θ_{j k}) = \sum_{i = 1}^{n} log Φ_{2} (s_{i j} μ_{i j}^{*}, s_{i k} μ_{i k}^{*}, s_{i j} s_{i k} ρ_{j k}) . \end{matrix}

The score function can be derived as

\begin{matrix} U_{j k} (θ_{j k}) = \sum_{i = 1}^{n} P {(z_{i j}, z_{i k})}^{- 1} \frac{\partial}{\partial θ_{j k}} log P (z_{i j}, z_{i k}) . \end{matrix}

(5)

2.1.3. Case 3: Mixed Outcomes

When one response variable

z_{i j}

is binary, and another response variable

z_{i k}

is continuous, we can model the mixed outcomes together by adopting a latent normal variable

z_{i j}^{*}

as the conditional grouped continuous model. Thus, they follow a bivariate normal distribution given the polyserial correlation

ρ_{j k}

:

\begin{matrix} (\begin{matrix} z_{i j}^{*} \\ z_{i k} \end{matrix}) \sim N_{2} ((\begin{matrix} μ_{i j}^{*} \\ μ_{i k} \end{matrix}), (\begin{matrix} σ_{j}^{2}, & ρ_{j k} σ_{j} σ_{k} \\ ρ_{j k} σ_{j} σ_{k}, & σ_{k}^{2} \end{matrix})) . \end{matrix}

The factorization method can be applied for the joint bivariate normal distribution with a marginal density for the continuous response

P (z_{i k})

and a conditional density for the latent variable

P (z_{i j}^{*} | z_{i k})

given the continuous response.

$z_{i k}$ is the continuous response

$\begin{matrix} z_{i k} \sim N (μ_{i k}, σ_{k}^{2}); \end{matrix}$
$z_{i j}^{*}$ is the latent variable for the binary response $z_{i j}$ :

$\begin{matrix} z_{i j}^{*} | z_{i k} \sim N (μ_{i j | k} = μ_{i j}^{*} + ρ_{j k} \frac{σ_{j}}{σ_{k}} (z_{i k} - μ_{i k}), σ_{j | k}^{2} = σ_{j}^{2} (1 - ρ_{j k}^{2})) . \end{matrix}$

Let

d_{i j} = - \frac{μ_{i j | k}}{σ_{j | k}}

, the conditional probability

P (z_{i j}^{*} | z_{i k})

can be written as

\begin{matrix} P (z_{i j}^{*} | z_{i k}) = \{\begin{matrix} Φ (d_{i j}), & if z_{i j} = 0, \\ Φ (- d_{i j}), & if z_{i j} = 1 . \end{matrix} \end{matrix}

The pairwise likelihood function and log-likelihood function is given as follows:

\begin{matrix} L_{j k} = \prod_{i = 1}^{n} P (z_{i j}^{*} | z_{i k}) p (z_{i k}), \\ l_{j k} = \sum_{i = 1}^{n} log P (z_{i k}) + log 1 (z_{i j} = 0) Φ (d_{i j}) + log 1 (z_{i j} = 1) Φ (- d_{i j}) . \end{matrix}

The score function can be obtained as

\begin{matrix} U_{j k} = \sum_{i = 1}^{n} \frac{1}{P (z_{i k})} \frac{\partial P (z_{i k})}{\partial θ_{j k}} + [\frac{1 (z_{i j} = 0)}{Φ (d_{i j})} - \frac{1 (z_{i j} = 1)}{1 - Φ (d_{i j})}] \frac{\partial Φ (d_{i j})}{\partial θ_{j k}} . \end{matrix}

(6)

A similar formulation was extended to the longitudinal settings by Najita et al. [14]. More derivation details are included in Appendix A.

The model setting above can simplify the score Function (3) with three types of components as in Equations (4)–(6). We can obtain the maximum composite-likelihood estimate

\hat{θ}

(MCLE) for the parameters

θ

by solving the score function numerically. Our numerical algorithm is presented below (Algorithm 1):

Algorithm 1: Multivariate Mixed Response Model Algorithm.

1.: Select the initial values $θ^{(0)}$ of the parameters. Calculate the score function $U (θ^{(0)})$ in Equation (3) and the Hessian matrix $\nabla_{θ} U (θ^{(0)})$ as the derivative of the score function;
2.: Implement the Newton–Raphson method to obtain the updated parameters $θ^{(t)}$ with $t = 0, 1, \dots$ ;

$θ^{(t + 1)} = θ^{(t)} - {[\nabla_{θ} U (θ^{(t)})]}^{- 1} U (θ^{(t)});$
3.: Calculate the score function and Hessian matrix with the updated parameters $θ^{(t + 1)}$ ;
4.: Repeat step 2 and 3 until convergence.

2.2. Statistical Inference Using the Composite-Likelihood Method

Cox and Hinkley [28] and Kent [29] presented various hypothesis testing procedures using the full-likelihood function. The composite-likelihood function can be treated as the misspecified-likelihood function. Its asymptotic properties were reviewed and discussed by Varin et al. [22], Reid et al. [23], Jin [30], and Gao and Song [31]. Following this framework, the pairwise composite-likelihood function implemented in our proposed model produces the estimators, which are consistent and asymptotically normally distributed.

The Godambe information [32] G of the parameters

θ

for the log composite-likelihood function involves the sensitivity matrix H and the variability matrix J:

G (θ) = H (θ) J^{- 1} (θ) H (θ),

where the sensitivity matrix and variability matrix are defined as

\begin{matrix} H (θ) = E_{θ} {- \nabla_{θ} U (θ; z_{i})} and J (θ) = V a r_{θ} {U (θ; z_{i})}, \end{matrix}

where

U (θ; z_{i})

denotes the score function of the ith observation and the total score function

U (θ) = \sum_{i = 1}^{n} U (θ; z_{i})

.

Theorem 1.

Let

θ_{0} \in ℜ^{d}

denote the true parameter value to set up the multivariate mixed response model. Under regularity conditions,

n \to \infty

, the maximum composite-likelihood estimator

\hat{θ}

is asymptotically normally distributed as

\sqrt{n} (\hat{θ} - θ_{0}) \overset{d}{\to} N_{d} (0, G^{- 1}) .

Theorem 2.

Under regularity conditions,

n \to \infty

, the maximum composite-likelihood estimator

\hat{θ}

is consistent to

θ_{0}

satisfying

\sqrt{n} (\hat{θ} - θ_{0}) = O_{p} (1)

.

The sensitivity matrix H and the variability matrix J can be evaluated by the empirical estimates under the maximum composite-likelihood estimators:

\begin{matrix} H (\hat{θ}) = - \frac{1}{n} \sum_{i = 1}^{n} \nabla_{θ} U (θ; z_{i}) |_{\hat{θ}} and J (\hat{θ}) = \frac{1}{n} \sum_{i = 1}^{n} U (θ; z_{i}) U {(θ; z_{i})}^{T} . \end{matrix}

Furthermore, according to Theorem 1, the composite Wald statistic, the composite score statistic, and the composite-likelihood ratio statistic for testing the null hypothesis

H_{0}

:

θ = θ_{0}

are given respectively by

\begin{matrix} W_{e} = n (\hat{θ} - θ_{0}) G (\hat{θ} - θ_{0}) \\ W_{u} = n^{- 1} U (θ_{0}) J^{- 1} (\hat{θ}) U (θ_{0}), \\ W = 2 {log CL (\hat{θ}) - log CL (θ_{0})} . \end{matrix}

Testing with Nuisance Parameters

Suppose the parameters are partitioned as

θ = {ψ, λ}

with

ψ \in ℜ^{s}

,

λ \in ℜ^{p}

, and

d = s + p

. The parameter of interest is

ψ

, and

λ

is treated as the nuisance parameter for hypothesis testing. In this setting, the Godambe information matrix and its inverse can be partitioned as

\begin{matrix} G = [\begin{matrix} G_{ψ ψ} & G_{ψ λ} \\ G_{λ ψ} & G_{λ λ} \end{matrix}] and G^{- 1} = [\begin{matrix} G^{ψ ψ} & G^{ψ λ} \\ G^{λ ψ} & G^{λ λ} \end{matrix}], \end{matrix}

and the inverse of the submatrix pertaining to

ψ

is given by

G_{ψ ψ, λ} = {(G^{ψ ψ})}^{- 1} = G_{ψ ψ} - G_{ψ λ} G_{λ λ}^{- 1} G_{λ ψ}

. According to the asymptotic theorem, the composite Wald statistics under the null hypothesis

H_{0}

:

ψ = ψ_{0}

using

\hat{λ} (ψ_{0})

is given by

\begin{matrix} W_{e} (ψ_{0}) = n (\hat{ψ} - ψ_{0}) G_{ψ ψ, λ} (\hat{ψ} - ψ_{0}), \end{matrix}

which has an asymptotic

χ_{q}^{2}

distribution. Similarly, we define the composite score statistics:

\begin{matrix} W_{u} (ψ_{0}) = n^{- 1} U (ψ_{0}, \hat{λ} (ψ_{0})) H^{ψ ψ} G_{ψ ψ, λ} H^{ψ ψ} U (ψ_{0}, \hat{λ} (ψ_{0})), \end{matrix}

where the matrix

H^{ψ ψ}

can be obtained by partitioning the inverse sensitivity matrix H. Furthermore, the composite-likelihood ratio statistic can be obtained by

\begin{matrix} W (ψ_{0}) = 2 {log CL (\hat{ψ}, \hat{λ}) - log CL (ψ_{0}, \hat{λ} (ψ_{0}))}, \end{matrix}

with the unrestricted maximum composite-likelihood estimate

\hat{θ} = {\hat{ψ}, \hat{λ}}

. However, the asymptotic distribution of the composite-likelihood ratio under

H_{0}

is given by

\sum_{j = 1}^{q} λ_{j} χ_{1 (j)}^{2}

, where

χ_{1 (j)}^{2}

are independent

χ_{1}^{2}

variates and

λ_{1}, λ_{2} \dots, λ_{q}

are the eigenvalues of the matrix

H_{ψ ψ, λ} G^{ψ ψ}

with

H_{ψ ψ, λ} = H_{ψ ψ} - H_{ψ λ} H_{λ λ}^{- 1} H_{λ ψ}

. There are different adjustments to this nonstandard weighted chi-square distribution (Rotnitzky and Jewell [33], Geys et al. [34], and Pace et al. [35]). For example, we can apply the adjustment by introducing the scaling factor

\bar{λ} = \sum_{j = 1}^{q} λ_{j} / q

, then the adjusted composite-likelihood ratio has the same asymptotic distribution as

W_{e} (ψ_{0})

and

W_{u} (ψ_{0})

:

\begin{matrix} \frac{W}{\bar{λ}} \overset{d}{\to} χ_{s}^{2} . \end{matrix}

(7)

Therefore, the composite-likelihood method can simplify the modeling of correlated responses with multiple generalized linear models and allow users to conduct statistic inferences on parameters of interest from different generalized linear models. Moreover, we can select a subset of the parameters and conduct the further inferential assessment in the presence of nuisance parameters.

3. Simulation

Different simulation studies were implemented to show the validity of the multivariate mixed response model. The estimation results from the proposed model are compared with the full-likelihood and marginal approaches, respectively.

3.1. Comparison with Maximum Full-Likelihood Estimation

In the multivariate regression with correlated continuous outcomes, the full-likelihood estimation can be conducted without numerical integration. Thus, we can compare the maximum composite-likelihood estimates with the full-likelihood approach through the simulation study. The simulated samples contain four continuous response variables

z_{i c_{1}}, z_{i c_{2}}, z_{i c_{3}},

and

z_{i c_{4}}

, which are generated from Equation (8):

\begin{matrix} \begin{matrix} z_{i c_{1}} = α_{c_{1}} + β_{c_{1}} x_{i c_{1}} + γ_{c_{1}} y_{i c_{1}} + ε_{i c_{1}}, \\ z_{i c_{2}} = α_{c_{2}} + β_{c_{2}} x_{i c_{2}} + γ_{c_{2}} y_{i c_{2}} + ε_{i c_{2}}, \\ z_{i c_{3}} = α_{c_{1}} + β_{c_{1}} x_{i c_{1}} + γ_{c_{1}} y_{i c_{1}} + ε_{i c_{1}}, \\ z_{i c_{4}} = α_{c_{2}} + β_{c_{2}} x_{i c_{2}} + γ_{c_{2}} y_{i c_{2}} + ε_{i c_{2}} . \end{matrix} \end{matrix}

(8)

The covariates

{x_{i c_{1}}, x_{i c_{2}}, x_{i c_{1}}, x_{i c_{2}}} \sim N (0, 1)

and

{y_{i c_{1}}, y_{i c_{2}}, y_{i c_{1}}, y_{i c_{2}}} \sim N (0, 0.5)

are independently simulated. The errors are correlated and generated from a multivariate normal distribution

N_{4} (0, Σ)

, and the variance–covariance matrix

Σ

is given by

\begin{matrix} [\begin{matrix} σ_{c_{1}}^{2} & σ_{c_{1}} σ_{c_{2}} ρ_{c_{1} c_{2}} & σ_{c_{1}} σ_{c_{3}} ρ_{c_{1} c_{3}} & σ_{c_{1}} σ_{c_{4}} ρ_{c_{1} c_{4}} \\ σ_{c_{1}} σ_{c_{2}} ρ_{c_{1} c_{2}} & σ_{c_{2}}^{2} & σ_{c_{2}} σ_{c_{3}} ρ_{c_{2} c_{4}} & σ_{c_{2}} σ_{c_{4}} ρ_{c_{2} c_{4}} \\ σ_{c_{1}} σ_{c_{3}} ρ_{c_{1} c_{3}} & σ_{c_{2}} σ_{c_{3}} ρ_{c_{2} c_{3}} & σ_{c_{3}}^{2} & σ_{c_{3}} σ_{c_{4}} ρ_{c_{3} c_{4}} \\ σ_{c_{1}} σ_{c_{4}} ρ_{c_{1} c_{4}} & σ_{c_{2}} σ_{c_{4}} ρ_{c_{2} c_{4}} & σ_{c_{3}} σ_{c_{4}} ρ_{c_{3} c_{4}} & σ_{c_{4}}^{2} \end{matrix}] . \end{matrix}

In the simulation, the variances are designed as

σ_{c_{1}}^{2} = 1

,

σ_{c_{2}}^{2} = 1

,

σ_{c_{3}}^{2} = 2.25

, and

σ_{c_{3}}^{2} = 4

, and an identical correlation

ρ = 0.3

is applied between the errors in the data generating process.

The simulation results (Figure 1) were obtained through 1000 independent replications. The maximum composite-likelihood estimators demonstrate similar performance when compared with the full-likelihood approach. The simulated results also show that the estimates are close to each other, and the maximum likelihood estimators have slightly higher relative efficiency.

3.2. Comparison with the Marginal Approach

For the mixed outcome regression, the full-likelihood approach is computationally challenging, and marginal regression is often resorted to in order to conduct the analysis. We implemented simulation studies to evaluate the performance of our proposed method in comparison with marginal regression. We first tested the overall performance of the point estimates when the outcomes had different levels of dependency and covariates. Next, we focused on the test of the composite statistics. The multivariate mixed response model can provide the statistical inference with nuisance parameters and attains a higher statistical power in terms of dealing with joint inference.

3.2.1. Simulation Settings

We generated the sample data consisting of two binary responses

z_{i b_{1}}

and

z_{i b_{2}}

and two continuous responses

z_{i c_{1}}

and

z_{i c_{2}}

. The binary variables were obtained based on the corresponding latent normal variables

z_{i b_{1}}^{*}

and

z_{i b_{2}}^{*}

through the probit link function:

\begin{matrix} p r o b i t (μ_{i b_{1}}) = μ_{i b_{1}}^{*}, \\ p r o b i t (μ_{i b_{2}}) = μ_{i b_{2}}^{*} . \end{matrix}

The simulation studies of the responses are based on Equation (9) associated with covariates

x_{i} = {x_{i b_{1}}, x_{i b_{2}}, x_{i c_{1}}, x_{i c_{2}}}

and

y_{i} = {y_{i b_{1}}, y_{i b_{2}}, y_{i c_{1}}, y_{i c_{2}}}

, respectively:

\begin{matrix} \begin{matrix} z_{i b_{1}}^{*} = α_{b_{1}} + β_{b_{1}} x_{i b_{1}} + γ_{b_{1}} y_{i b_{1}} + ε_{i b_{1}}, \\ z_{i b_{2}}^{*} = α_{b_{2}} + β_{b_{2}} x_{i b_{2}} + γ_{b_{2}} y_{i b_{2}} + ε_{i b_{2}}, \\ z_{i c_{1}} = α_{c_{1}} + β_{c_{1}} x_{i c_{1}} + γ_{c_{1}} y_{i c_{1}} + ε_{i c_{1}}, \\ z_{i c_{2}} = α_{c_{2}} + β_{c_{2}} x_{i c_{2}} + γ_{c_{2}} y_{i c_{2}} + ε_{i c_{2}} . \end{matrix} \end{matrix}

(9)

We provided different simulation scenarios of the covariates values and three levels of correlation to analyze the response variables with the proposed model. The regression parameters were arbitrarily chosen and set to be fixed values in each simulation study. The errors in Equation (9) follow a multivariate normal distribution

N_{4} (0, Σ)

, and the variance-covariance matrix

Σ

is given by

\begin{matrix} [\begin{matrix} σ_{b_{1}}^{2} & σ_{b_{1}} σ_{b_{2}} ρ_{b_{1} b_{2}} & σ_{b_{1}} σ_{c_{1}} ρ_{b_{1} c_{1}} & σ_{b_{1}} σ_{c_{2}} ρ_{b_{1} c_{2}} \\ σ_{b_{1}} σ_{b_{2}} ρ_{b_{1} b_{2}} & σ_{b_{2}}^{2} & σ_{b_{2}} σ_{c_{1}} ρ_{b_{2} c_{1}} & σ_{b_{2}} σ_{c_{2}} ρ_{b_{2} c_{2}} \\ σ_{b_{1}} σ_{c_{1}} ρ_{b_{1} c_{1}} & σ_{b_{2}} σ_{c_{1}} ρ_{b_{2} c_{1}} & σ_{c_{1}}^{2} & σ_{c_{1}} σ_{c_{2}} ρ_{c_{1} c_{2}} \\ σ_{b_{1}} σ_{c_{2}} ρ_{b_{1} c_{2}} & σ_{b_{2}} σ_{c_{2}} ρ_{b_{2} c_{2}} & σ_{c_{1}} σ_{c_{2}} ρ_{c_{1} c_{2}} & σ_{c_{2}}^{2} \end{matrix}] . \end{matrix}

In the following data generating processes, the values of the variance-covariance parameters are set as

σ_{b_{1}}^{2} = 1

,

σ_{b_{2}}^{2} = 1

,

σ_{c_{1}}^{2} = 16

, and

σ_{c_{2}}^{2} = 25

, and the correlation is designed at the levels of low (all

ρ = 0.3

), medium (all

ρ = 0.5

), and high (all

ρ = 0.7

), respectively, to assess the underlying model. Since there is no constraint on the sign of the correlation, the negative correlation can be estimated through our algorithm without further assumptions. Our simulation studies focus on the overall performance of the multivariate mixed response model through independent replications.

3.2.2. Point Estimates

Different simulation scenarios were designed to assess the performance of the underlying model on the point estimates by 1000 independent replications. There are two different sets of simulations for the data generating process, and within each setting, we analyze three levels of correlation, respectively. As shown in Table 1, the values of the regression parameters and the standard deviation of the continuous response variables are given across all simulation studies. In the first simulation setting, we provide 300 samples, and the mixed response variables are associated with covariates of distinct values. The covariate sets of

x_{i}

and

y_{i}

are identically and independently simulated from a normal distribution

N (0, 1)

, respectively, in each linear model.

In the second simulation setting, we present the multivariate mixed response model dealing with the common covariate. Regarding the data generating process with 1000 samples, all responses share one common covariate, which was generated from a normal distribution

N (0, 1)

, such as

x_{i b_{1}} = x_{i b_{2}} = x_{i c_{1}} = x_{i c_{2}}

in Equation (9). The second covariates

y_{i}

are from a Bernoulli (0.5), and they are different for each response. This setting represents the scenario in practice when a common factor is included in all of the response models.

In Table 1, we provide the ratio of the mean squared error (MSE) of the proposed method to the marginal approaches. This ratio represents the relative efficiency of the proposed method in comparison with the marginal method under different settings. In most of the simulation settings, the ratio rates of the MSE are well below 1. When the responses are highly correlated and have different covariate sets, our method can reduce MSE by 50%, which indicates a large efficiency gain.

3.2.3. Statistical Test

The test of composite-likelihood statistics can jointly assess the parameters of interest across different generalized linear models, while the conventional methods cannot achieve this. The simulation studies were conducted to measure the type I error rate and the power in comparison with the marginal approaches.

This simulation study was conducted to perform the hypothesis test. The correlated responses were generated based on Equation (9) with all correlation

ρ = 0.3

. The parameters of interest are the regression coefficients

{β_{b_{1}}, β_{b_{2}}, β_{c_{1}}, β_{c_{2}}}

of the first covariates across four generalized linear models, and the first covariates

x_{i}

are independently simulated from

N (0, 1)

. The regression coefficients of the second covariates

y_{i}

and other parameters are nuisance parameters with

y_{i} \sim N (0, . 5)

. In the simulation study to assess the type 1 error rate, the regression parameters

{β_{b_{1}}, β_{b_{2}}, β_{c_{1}}, β_{c_{2}}}

are equal to zero in all generalized linear models, while other parameters have the same values as the previous simulation in Table 1. To assess the power, we fixed the values of the regression parameters as

β_{b_{1}} = β_{b_{2}} = 0.1

for the binary responses and

β_{c_{1}} = β_{c_{2}} = 0.3

for the continuous responses. In comparison, we combined the results of the marginal approaches by applying the Bonferroni adjustment.

Table 2 illustrates the results over 2000 independent replications. The proposed model analyzes all responses simultaneously, and the simulated type I error rates are valid and close to the

0.05

. Through the test of the joint effect of the covariate of interest on all responses, the simulated power is enhanced by our proposed model in comparison with the results from the Bonferroni test. As the sample size increases from 500 to 1000, the composite-likelihood statistics produce increased statistical power from approximately

0.800

to

0.989

. The overall performance demonstrates that the composite statistics are more powerful than the conventional approach.

4. Data Analysis

In this section, the multivariate mixed response model is applied to the clinical data from a colorectal cancer study. The data consist of clinical observations and demographic information on 743 patients, which are mixed with both categorical and continuous data. Our research interest was to evaluate the effect of treatment and other clinical factors on the toxicity outcomes. We focused on four common toxicity events that are related to colorectal cancer treatment. First, we chose nausea and diarrhea as two categorical responses. They are made up of ordinal data measuring the severity of the toxicity from grade 1 to 4. In our model setting, we only considered the occurrence of nausea and diarrhea for each patient. Therefore, these two responses were designed as binary variables, which are coded as 1 if they occurred and 0 if there is no record during the treatment. The continuous responses include two blood test measures, namely the hemoglobin (HGB) count and the white blood cell (WBC) count. Each patient had several blood examinations during the treatment, and we took the highest value for analysis. The explanatory variables contain the treatment effect, demographic information, tumor status, and genetic markers for each patient. There are two different treatment therapies in this colorectal cancer study. The patient demographic and clinical information, such as age, height, and weight, were collected as continuous variables. The tumor identified in either the colon or rectum was recorded as a binary variable. The study also includes the genetic markers, such as PERF1, PERF2, and KRAS, which are binary variables showing the occurrence of mutation. In total, we needed to jointly estimate 68 parameters for the coefficients of four linear models and the correlation between each outcome.

Table 3 shows the main result of the effect of the treatment, and the complete result is presented in Appendix B. We can observe that the statistical inference on the effect of the treatment through two models was in agreement. The second treatment therapy resulted in lower measures of the hemoglobin and indicates a negative association with the occurrence of nausea and diarrhea, whereas, the effect difference on the measures of white blood cells is insignificant. We can use the composite statistics to jointly assess the overall effect of this therapy on four responses. Table 4 provides the standard deviation and the correlation of the four clinical outcomes estimated based on the proposed model.

Using the conventional approach, we cannot make a statistic inference across different linear models. The proposed model is able to test the hypothesis

H_{0} : β_{b_{1}} = β_{b_{2}} = β_{c_{1}} = β_{c_{2}} = 0

based on the asymptotical properties of the composite-likelihood function. The test statistics of the composite Wald statistics under the

H_{0}

is approximately

138.5890

, the composite score statistics is

476.975

, and the adjusted composite-likelihood ratio is

264.3069

, which are all greater than the critical value of

χ_{4}^{2}

. Therefore, we can reject the null hypothesis and conclude that the two different treatments have a statistically significant difference in terms of patient toxicity response. More specifically, in our estimation results, we infer that there exists a significant difference in terms of the occurrence of nausea and diarrhea and a significant difference in HGB between the two treatments.

5. Discussion

The problem of mixed outcomes is widely discussed in health-related studies. As a result of the computational complexity, most existing methods mainly focus on the case of two outcomes mixed with one continuous variable and one categorical variable. As an extension of the conditional grouped continuous model, we present the multivariate mixed response model to solve high-dimensional mixed multivariate regressions. Our model is constructed using the pairwise composite-likelihood method, such that multiple outcomes are analyzed through different bivariate models simultaneously. Regarding data mixed with continuous and binary responses, our method simplifies the problem of multiple outcomes into three types of scenario, which is both methodologically flexible and analytic appealing. From the simulation studies, the estimators of the proposed model demonstrate a lower MSE than the marginal approaches. The composite statistics also provide increased statistical power for joint hypothesis testing across different generalized linear models, which could make this a favorable approach to analyze clinical data with multiple mixed-type responses.

In addition, the model can be generalized to deal with ordinal and continuous data simultaneously. Under the same setup, the latent variable

z_{i j}^{*}

is normally distributed with mean

μ_{i j}^{*} = x_{i j}^{T} β_{j}

and variance

σ_{j}^{2} = 1

. If the latent normal variable

z_{i j}^{*} \in [t_{l - 1}, t_{l})

with the threshold values

- \infty = t_{0} < t_{1} < \dots < t_{l} < \dots < t_{L - 1} < t_{L} = \infty

, the ordinal variable

z_{i j} = l

with

l = 1, 2, \dots, L

. Thus, the bivariate model of two ordinal outcomes can be formulated by the pairwise probability:

\begin{matrix} P (z_{i j} = l, z_{i k} = l^{^{'}}) = P (z_{i j}^{*} \in [t_{l - 1}, t_{l}), z_{i k}^{*} \in [t_{l^{^{'}} - 1}, t_{l^{^{'}}})) = \int_{t_{l - 1}}^{t_{l}} \int_{t_{l^{^{'}} - 1}}^{t_{l^{^{'}}}} ϕ_{2} (ω_{i j}, ω_{i k}; μ_{i j}^{*}, μ_{i k}^{*}, ρ_{j k}) d ω_{i j} d ω_{i k} . \end{matrix}

The score function can be obtained by taking the derivatives of

\sum_{i = 1}^{n} \sum_{j = 1}^{q - 1} \sum_{k = j + 1}^{q} log P (z_{i j} = l, z_{i k} = l^{^{'}})

, and the thresholds

t_{l}

’s need to be estimated with the monotone restriction. The maximum composite-likelihood estimation for mixed outcomes can be conducted through the same approaches as proposed in this paper. Further research will be considered to analyze multivariate outcomes with various distributions.

Author Contributions

Conceptualization, H.B. and X.G.; Methodology, H.B., Y.Z., and X.G.; Software, H.B. and Y.Z.; Validation, Y.Z., X.G. and W.X.; Formal Analysis, H.B., Y.Z., X.G., and W.X.; Investigation, Y.Z., X.G., and W.X.; Resources, W.X.; Data Curation, W.X.; Writing—Original Draft Preparation, H.B. and Y.Z.; Writing—Review & Editing, X.G. and W.X.; Supervision, X.G. and W.X.; Project Administration, X.G. and W.X.; Funding Acquisition, X.G. All authors have read and agreed to the published version of the manuscript.

Funding

Zhong and Gao’s research is supported by the Natural Sciences and Engineering Research Council of Canada (NSERC).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Derivatives for Mixed Response Variables

The score Function (6) contains the derivative of log marginal normal density and the log of the conditional probability:

\begin{matrix} U_{j k} & = & \frac{\partial}{\partial θ_{j k}} log P (z_{k}) + [\frac{1 (z_{j} = 0)}{Φ (d_{j})} - \frac{1 (z_{j} = 1)}{1 - Φ (d_{j})}] \frac{\partial Φ (d_{j})}{\partial θ_{j k}} . \end{matrix}

To illustrate the derivation, let

z_{j}

and

z_{k}

represent the

n \times 1

vector response variables with the design matrices

X_{j}

and

X_{k}

, respectively. Let

c_{k} = \frac{(z_{k} - μ_{k})}{σ_{k}}, and d_{j} = \frac{μ_{j}^{*} + ρ_{j k} \frac{σ_{j}}{σ_{k}} (z_{k} - μ_{k})}{σ_{j} \sqrt{1 - ρ_{j k}^{2}}} .

The normal CDF has the following properties:

\begin{matrix} Φ^{'} (d_{j}) = ϕ (d_{j}), ϕ^{'} (c_{k}) = - c_{k} ϕ (c_{k}), ϕ^{″} (c_{k}) = (c_{k}^{2} - 1) ϕ (c_{k}) . \end{matrix}

The first component in the score function can be simplified as

\begin{matrix} \frac{\partial}{\partial θ_{j k}} log P (z_{k}) & = \frac{\partial}{\partial θ_{j k}} log (\frac{1}{σ_{k}} ϕ (c_{k})) \\ = \frac{1}{ϕ (c_{k})} \frac{\partial ϕ (c_{k})}{\partial θ_{j k}} - \frac{1}{σ_{k}} \frac{\partial σ_{k}}{\partial θ_{j k}} \\ = - c_{k} \frac{\partial c_{k}}{\partial θ_{j k}} - \frac{1}{σ_{k}} \frac{\partial σ_{k}}{\partial θ_{j k}}, \\ \frac{\partial^{2} log P (z_{k})}{\partial θ_{j k} \partial θ_{j k}^{T}} & = \frac{1}{σ_{k}^{2}} \frac{\partial σ_{k}}{\partial θ_{j k}} \frac{\partial σ_{k}}{\partial θ_{j k}^{T}} - \frac{1}{σ_{k}} \frac{\partial^{2} σ_{k}}{\partial θ_{j k} \partial θ_{j k}^{T}} - \frac{\partial c_{k}}{\partial θ_{j k}} \frac{\partial c_{k}}{\partial θ_{j k}^{T}} - c_{k} \frac{\partial^{2} c_{k}}{\partial θ_{j k} \partial θ_{j k}^{T}} . \end{matrix}

The derivatives of the second component has a simple format with respect

d_{j}

:

\begin{matrix} \frac{\partial Φ (d_{j})}{\partial θ_{j k}} & = ϕ (d_{j}) \frac{\partial d_{j}}{\partial θ_{j k}}, \\ \frac{\partial^{2} Φ (d_{j})}{\partial θ_{j k} \partial θ_{j k}^{T}} & = - d_{j} ϕ (d_{j}) \frac{\partial d_{j}}{\partial θ_{j k}} \frac{\partial d_{j}}{\partial {θ_{j k}}^{T}} + ϕ (d_{j}) \frac{\partial^{2} d_{j}}{\partial {θ_{j k}}^{T} \partial {θ_{j k}}^{T}} . \end{matrix}

By adding the results above, the score function and its derivative can be given by

\begin{matrix} \frac{\partial l_{j k}}{\partial θ_{j k}} & = - \frac{1}{σ_{k}} \frac{\partial σ_{k}}{\partial θ_{j k}} - c_{k} \frac{\partial c_{k}}{\partial θ_{j k}} + [\frac{1 (z_{j} = 0)}{Φ (d_{j})} - \frac{1 (z_{j} = 1)}{1 - Φ (d_{j})}] ϕ (d_{j}) \frac{\partial d_{j}}{\partial θ_{j k}} \\ \frac{\partial^{2} l_{j k}}{\partial θ_{j k} \partial θ_{j k}^{T}} & = (\frac{1}{σ_{k}^{2}} \frac{\partial σ_{k}}{\partial θ_{j k}} \frac{\partial σ_{k}}{\partial θ^{T}} - \frac{1}{σ_{k}} \frac{\partial^{2} σ_{k}}{\partial θ_{j k} \partial θ_{j k}^{T}} - \frac{\partial c_{k}}{\partial θ_{j k}} \frac{\partial c_{k}}{\partial θ_{j k}^{T}} - c_{k} \frac{\partial^{2} c_{k}}{\partial θ_{j k} \partial θ_{j k}^{T}}) \\ - [\frac{1 (z_{j} = 0)}{{(Φ (d_{j}))}^{2}} + \frac{1 (z_{j} = 1)}{{(1 - Φ (d_{j}))}^{2}}] [ϕ {(d_{j})}^{2} \frac{\partial d_{j}}{\partial θ_{j k}} \frac{\partial d_{j}}{\partial {θ_{j k}}^{T}}] \\ + [\frac{1 (z_{j} = 0)}{Φ (d_{j})} - \frac{1 (z_{j} = 1)}{1 - Φ (d_{j})}] \times [- d_{j} ϕ (d_{j}) \frac{\partial d_{j}}{\partial θ_{j k}} \frac{\partial d_{j}}{\partial θ_{j k}^{T}} + ϕ (d_{j}) \frac{\partial^{2} d_{j}}{\partial θ_{j k} \partial {θ_{j k}}^{T}}] . \end{matrix}

Furthermore,

\begin{matrix} \frac{\partial c_{k}}{\partial β_{k}} = - \frac{1}{σ_{k}} X_{k}, \frac{\partial c_{k}}{\partial σ_{k}} = - \frac{1}{σ_{k}} c_{k}, \frac{\partial^{2} c_{k}}{\partial σ_{k}^{2}} = \frac{2}{σ_{k}^{2}} c_{k}, \frac{\partial^{2} c_{k}}{\partial β_{k} \partial σ_{k}} = \frac{1}{σ_{k}^{2}} X_{k} . \end{matrix}

For the term

d_{j}

, more derivation is given here:

\begin{matrix} \frac{\partial d_{j}}{\partial β_{k}} & = \frac{ρ_{j k} X_{k}}{σ_{k} \sqrt{1 - ρ_{j k}^{2}}}, \\ \frac{\partial d_{j}}{\partial β_{j}} & = - \frac{X_{j}}{σ_{j} \sqrt{1 - ρ_{j k}^{2}}}, \\ \frac{\partial d_{j}}{\partial σ_{k}} & = \frac{ρ_{j k} c_{k}}{σ_{k} \sqrt{1 - ρ_{j k}^{2}}}, \\ \frac{\partial d_{j}}{\partial σ_{j}} & = \frac{X_{j} β_{j}}{σ_{j}^{2} \sqrt{1 - ρ_{j k}^{2}}}, \\ \frac{\partial d_{j}}{\partial ρ_{j k}} & = - \frac{c_{k}}{\sqrt{1 - ρ_{j k}^{2}}} - \frac{X_{j} β_{j} ρ_{j k}}{σ_{j} {(1 - ρ_{j k}^{2})}^{\frac{3}{2}}} - \frac{ρ_{j k}^{2} c_{k}}{{(1 - ρ_{j k}^{2})}^{\frac{3}{2}}}, \\ \frac{\partial^{2} d_{j}}{\partial σ_{k}^{2}} & = - \frac{2 ρ_{j k} c_{k}}{σ_{k}^{2} \sqrt{1 - ρ_{j k}^{2}}}, \end{matrix}

\begin{matrix} \frac{\partial^{2} d_{j}}{\partial σ_{j}^{2}} & = - \frac{2 X_{j}^{T} β_{j}}{σ_{j}^{3} \sqrt{1 - ρ_{j k}^{2}}}, \\ \frac{\partial^{2} d_{j}}{\partial ρ_{j k}^{2}} & = - \frac{c_{k} ρ_{j k}}{{(1 - ρ_{j k}^{2})}^{\frac{3}{2}}} - \frac{X_{j}^{T} β_{j}}{σ_{j}} [\frac{1}{{(1 - ρ_{j k}^{2})}^{\frac{3}{2}}} + \frac{3 ρ_{j k}^{2}}{{(1 - ρ_{j k}^{2})}^{\frac{5}{2}}}] \\ - c_{k} [\frac{2 ρ_{j k}}{{(1 - ρ_{j k}^{2})}^{\frac{3}{2}}} + \frac{3 ρ_{j k}^{3}}{{(1 - ρ_{j k}^{2})}^{\frac{5}{2}}}], \\ \frac{\partial^{2} d_{j}}{\partial ρ_{j k} \partial σ_{k}} & = \frac{c_{k}}{σ_{k}} [\frac{1}{\sqrt{1 - ρ_{j k}^{2}}} + \frac{ρ_{j k}^{2}}{{(1 - ρ_{j k}^{2})}^{\frac{3}{2}}}], \\ \frac{\partial^{2} d_{j}}{\partial β_{j} \partial σ_{j}} & = \frac{X_{j}}{σ_{j}^{2} \sqrt{1 - ρ_{j k}^{2}}}, \\ \frac{\partial^{2} d_{j}}{\partial β_{j} \partial ρ_{j k}} & = - \frac{X_{j} ρ_{j k}}{σ_{j} {(1 - ρ_{j k}^{2})}^{\frac{3}{2}}}, \\ \frac{\partial^{2} d_{j}}{\partial β_{k} \partial ρ_{j k}} & = \frac{X_{k}}{σ_{k}} [\frac{1}{\sqrt{1 - ρ_{j k}^{2}}} + \frac{ρ_{j k}^{2}}{{(1 - ρ_{j k}^{2})}^{\frac{3}{2}}}], \\ \frac{\partial^{2} d_{j}}{\partial σ_{k} \partial β_{k}} & = - \frac{ρ_{j k} X_{k}}{σ_{k}^{2} \sqrt{1 - ρ_{j k}^{2}}} . \end{matrix}

In conclusion, the score function

U_{j k} (θ_{j k})

with respect to each parameter is given by

\begin{matrix} \frac{\partial l_{j k}}{\partial β_{k}} & = \frac{c_{k}}{σ_{k}} X_{k} + [\frac{1 (z_{j} = 0)}{Φ (d_{j})} - \frac{1 (z_{j} = 1)}{1 - Φ (d_{j})}] \frac{ϕ (d_{j}) ρ_{j k} X_{k}}{σ_{k} \sqrt{1 - ρ_{j k}^{2}}}, \\ \frac{\partial l_{j k}}{\partial β_{j}} & = [\frac{1 (z_{j} = 0)}{Φ (d_{j})} - \frac{1 (z_{j} = 1)}{1 - Φ (d_{j})}] [- \frac{ϕ (d_{j}) X_{j}}{σ_{j} \sqrt{1 - ρ_{j k}^{2}}}], \\ \frac{\partial l_{j k}}{\partial σ_{k}} & = - \frac{1}{σ_{k}} + \frac{c_{k}^{2}}{σ_{k}} + [\frac{1 (z_{j} = 0)}{Φ (d_{j})} - \frac{1 (z_{j} = 1)}{1 - Φ (d_{j})}] \frac{ϕ (d_{j}) ρ_{j k} c_{k}}{σ_{k} \sqrt{1 - ρ_{j k}^{2}}}, \\ \frac{\partial l_{j k}}{\partial σ_{j}} & = [\frac{1 (z_{j} = 0)}{Φ (d_{j})} - \frac{1 (z_{j} = 1)}{1 - Φ (d_{j})}] ϕ (d_{j}) \frac{X_{j} β_{j}}{σ_{j}^{2} \sqrt{1 - ρ_{j k}^{2}}}, \\ \frac{\partial l_{j k}}{\partial ρ_{j k}} & = [\frac{1 (z_{j} = 0)}{Φ (d_{j})} - \frac{1 (z_{j} = 1)}{1 - Φ (d_{j})}] ϕ (d_{j}) \\ \times [- \frac{c_{k}}{\sqrt{1 - ρ_{j k}^{2}}} - \frac{X_{j} β_{j} ρ_{j k}}{σ_{j} {(1 - ρ_{j k}^{2})}^{\frac{3}{2}}} - \frac{ρ_{j k}^{2} c_{k}}{{(1 - ρ_{j k}^{2})}^{\frac{3}{2}}}] . \end{matrix}

Appendix B. Estimation Results of the Clinical Data From a Colorectal Cancer Study

Table A1. The estimated regression coefficients

β

and the standard deviation (sd). GLM: the estimation via the generalized linear model; MRM: estimation via proposed multivariate mixed response model; the column of * lists the significant covariates.

Table A1. The estimated regression coefficients

β

and the standard deviation (sd). GLM: the estimation via the generalized linear model; MRM: estimation via proposed multivariate mixed response model; the column of * lists the significant covariates.

	1. Nausea		2. Diarrhea		3. HBG		4. WBC
Parameters	GLM	MMR	GLM	MMR	GLM	MMR	GLM	MMR	*
Intercept	−0.2685 (1.277)	−0.2793 (1.317)	0.6631 (1.304)	0.6741 (1.329)	160.3759 (16.831)	160.3775 (16.829)	12.2949 (4.761)	12.2946 (4.855)	3, 4
Treatment	−0.2644 (0.097)	−0.2724 (0.099)	−0.6231 (0.098)	−0.6422 (0.101)	−12.4921 (1.274)	−12.4957 (1.252)	−0.1591 (0.360)	−0.1597 (0.358)	1, 2, 3
OS	−0.0037 (0.009)	−0.0039 (0.009)	0.0219 (0.009)	0.0228 (0.009)	0.5674 (0.115)	0.5675 (0.117)	−0.1764 (0.032)	−0.1764 (0.029)	2, 3, 4
OS event	0.0650 (0.195)	0.0702 (0.194)	0.1173(0.200)	0.1208 (0.194)	1.5154 (2.592)	1.5159 (2.614)	0.1290 (0.733)	0.1295 (0.562)
PFS	0.0305 (0.020)	0.0315 (0.020)	0.0066 (0.021)	0.0062 (0.022)	−0.1628 (0.259)	−0.1627 (0.258)	0.1242 (0.073)	0.1243 (0.062)
IN	−0.0204 (0.188)	−0.0200 (0.190)	−0.8622 (0.203)	−0.8907 (0.212)	−11.1599 (2.486)	−11.1621 (2.932)	−0.0809 (0.703)	−0.0803 (0.810)	2, 3
PD	0.1263 (0.136)	0.1319 (0.139)	−0.1375 (0.136)	−0.1432 (0.142)	−7.1013 (1.782)	−7.1005 (1.667)	−0.3695 (0.504)	−0.3689 (0.528)	3
PR	−0.0213 (0.169)	−0.0207 (0.171)	0.0691 (0.176)	0.0707 (0.174)	3.6782 (2.234)	3.6781 (2.210)	0.4003 (0.632)	0.4005 (0.5277)
Age	−0.0142 (0.008)	−0.0091 (0.005)	0.0029 (0.008)	0.0018 (0.005)	−0.1463 (0.061)	−0.1464 (0.061)	−0.0382 (0.017)	−0.0382 (0.018)	3, 4
Gender	−0.0086 (0.005)	−0.0091 (0.005)	0.0016 (0.005)	0.0018 (0.005)	9.0894 (1.798)	9.0836 (1.825)	−0.1079 (0.509)	−0.1096 (0.534)	1, 3
Colon	0.2313 (0.165)	0.2378 (0.169)	0.1017 (0.170)	0.1059 (0.177)	−1.1366 (2.189)	−1.1338 (2.105)	0.5706 (0.619)	0.5713 (0.676)
Rectum	−0.0075 (0.154)	−0.0111 (0.157)	0.1760 (0.158)	0.1844 (0.166)	−1.6947 (2.032)	−1.6940 (1.913)	0.9096 (0.575)	0.9094 (0.649)
Height	0.0048 (0.007)	0.0050 (0.008)	−0.0050 (0.007)	−0.0051 (0.008)	−0.0820 (0.097)	−0.0820 (0.097)	0.0237 (0.028)	0.0237 (0.028)
Weight	0.0023 (0.003)	0.0024 (0.003)	0.0012 (0.003)	0.0012 (0.003)	0.0906 (0.044)	0.0907 (0.042)	−0.0189 (0.012)	−0.0189 (0.012)	3
PERF1	0.2046 (0.107)	0.2132 (0.110)	0.0898 (0.109)	0.0933 (0.110)	−4.8490 (1.414)	−4.8469 (1.352)	0.6028 (0.400)	0.6036 (0.380)	3
PERF2	0.5041 (0.187)	0.5244 (0.187)	0.2637 (0.189)	0.2692 (0.193)	−7.1494 (3.814)	−7.1455 (2.628)	1.8158 (0.688)	1.8176 (0.685)	1, 3, 4
KRAS	−0.0407 (0.292)	−0.0387 (0.295)	0.0080 (0.293)	0.0150 (0.294)	1.3456 (3.814)	1.3455 (3.285)	−0.7323 (1.079)	−0.7325 (0.936)

In Section 4, we illustrate some estimation results of the regression parameters and correlation between outcomes. The complete analysis of these data consists of 68 parameters.

References

Cox, D.R. The analysis of multivariate binary data. J. R. Stat. Soc. Ser. C Appl. Stat. 1972, 21, 113–120. [Google Scholar] [CrossRef]
Cox, D.R.; Wermuth, N. Response models for mixed binary and quantitative variable. Biometrika 1992, 79, 441–461. [Google Scholar] [CrossRef]
Olkin, L.; Tate, R.F. Multivariate correlation models with mixed discrete and continuous variables. Ann. Math. Stat. 1961, 32, 448–456. [Google Scholar] [CrossRef]
Teixeira-Pinto, A.; Normand, S.T. Correlated bivariate continuous and binary outcomes: Issues and applications. Stat. Med. 2009, 28, 1753–1773. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Sammel, M.D.; Ryan, L.M.; Legler, J.M. Latent variable models for mixed discrete and continuous outcomes. J. R. Stat. Soc. Ser. B Methodol. 1997, 59, 667–678. [Google Scholar] [CrossRef]
Sammel, M.D.; Lin, X.; Ryan, L.M. Multivariate linear mixed models for multiple outcomes. Stat. Med. 1999, 18, 2479–2492. [Google Scholar] [CrossRef]
Yang, Y.; Kang, J.; Mao, K.; Zhang, J. Regression models for mixed poisson and continuous longitudinal data. Stat. Med. 1961, 26, 3782–3800. [Google Scholar] [CrossRef]
Anderson, J.A.; Pemberton, J.D. The grouped continuous model for multivariate ordered categorical variables and covariate adjustment. Biometrics 1985, 41, 875–885. [Google Scholar] [CrossRef]
De Leon, A. Pairwise likelihood approach to grouped continuous model and its extension. Stat. Probab. Lett. 2005, 75, 49–57. [Google Scholar] [CrossRef]
Poon, W.Y.; Lee, S.Y. Maximum likelihood estimation of multivariate polyserial and polychoric correlation coefficients. Psychometrika 1987, 52, 409–430. [Google Scholar] [CrossRef]
Skrondal, A.; Rabe-Hesketh, S. Latent Variable Modelling: A Survey; Blackwell: Oxford, UK, 2007; Volume 34. [Google Scholar]
Catalano, P.; Ryan, L. Bivariate latent variable models for clustered discrete and continuous outcomes. J. Am. Stat. Assoc. 1992, 50, 1078–1095. [Google Scholar] [CrossRef]
Catalano, P.J. Bivariate modeling of clustered continuous and ordered categorical outcomes. Stat. Med. 1997, 16, 883–900. [Google Scholar] [CrossRef]
Najita, J.S.; Li, Y.; Catalano, P.J. A novel application of a bivariate regression model for binary and continuous outcomes to studies of fetal toxicity. J. R. Stat. Soc. Ser. C Appl. Stat. 2009, 58, 555–573. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Gueorguieva, R.V.; Agresti, A. A correlated probit model for joint modeling of clustered binary and continuous responses. J. Am. Stat. Assoc. 2001, 96, 1102–1112. [Google Scholar] [CrossRef]
Zhang, H.; Liu, D.; Zhao, J.; Bi, X. Modeling hybrid traits for comorbidity and genetic studies of alcohol and nicotine co-dependence. Ann. Appl. Stat. 2018, 12, 2359–2378. [Google Scholar] [CrossRef]
De Leon, A.R.; Carriégre, K.C. General mixed-data model: Extension of general location and grouped continuous models. Can. J. Stat. 2007, 35, 533–548. [Google Scholar] [CrossRef]
De Leon, A.R.; Carriégre, K.C. Analysis of Mixed Data; Chapman and Hall/CRC: New York, NY, USA, 2013. [Google Scholar]
Lindsay, B. Composite likelihood methods. Contemp. Math. 1988, 80, 220–239. [Google Scholar]
Cox, D.R.; Reid, N. A note on pseudolikelihood constructed from marginal densities. Biometrika 2004, 91, 729–737. [Google Scholar] [CrossRef]
Varin, C. On composite marginal likelihoods. Adv. Stat. Anal. 2008, 92, 1–28. [Google Scholar] [CrossRef]
Varin, C.; Reid, N.; Firth, D. An overview of composite likelihood methods. Stat. Sin. 2011, 21, 5–42. [Google Scholar]
Reid, N.; Lindsay, B.; Liang, K.Y. Composite likelihood methods introduction. Stat. Sin. 2011, 21, 1–3. [Google Scholar]
Faes, C.; Aerts, M.; Molenberghs, G.; Geys, H.; Teuns, G.; Bijnens, L. A high-dimensional joint model for longitudinal outcomes of different nature. Stat. Med. 2008, 27, 4408–4427. [Google Scholar] [CrossRef] [PubMed]
Atkinson, K.E. An Introduction to Numerical Analysis; John Wiley and Sons Inc.: New York, NY, USA, 1989. [Google Scholar]
Johnson, R.A.; Wichern, D.W. Applied Multivariate Statistical Analysis; Prentice Hall: Upper Saddle River, NJ, USA, 1998. [Google Scholar]
Dunson, D.B. Bayesian latent variable models for clustered mixed outcomes. J. R. Stat. Soc. 2000, 62, 355–366. [Google Scholar] [CrossRef]
Cox, D.R.; Hinkley, D.V. Theoretical Statistics; Chapman and Hall: London, UK, 1974. [Google Scholar]
Kent, J.T. Robust properties of likelihood ratio tests. Biometrika 1982, 69, 19–27. [Google Scholar]
Jin, Z. Aspects of Composite Likelihood Inference. Ph.D. Thesis, University of Toronto, Toronto, ON, Canada, 2010. [Google Scholar]
Gao, X.; Song, P.X.K. Composite likelihood bayesian information criteria for model selection in high-dimensional data. J. Am. Stat. Assoc. 2010, 105, 1531–1540. [Google Scholar] [CrossRef] [Green Version]
Godambe, V. An optimum property of regular maximum likelihood estimation. Ann. Math. Stat. 1960, 31, 1208–1211. [Google Scholar] [CrossRef]
Rotnitzky, A.; Jewell, N.P. Hypothesis testing of regression parameters in semiparametric generalized linear models for cluster correlated data. Biometrika 1990, 77, 485–497. [Google Scholar] [CrossRef]
Geys, H.; Molenberghs, G.; Ryan, L.M. Pseudolikelihood modeling of multivariate outcomes in developmental toxicology. J. Am. Stat. Assoc. 1999, 94, 734–745. [Google Scholar] [CrossRef]
Pace, L.; Salvan, A.; Sartori, N. Adjusting composite likelihood ratio statistics. Stat. Sin. 2011, 21, 129–148. [Google Scholar]

Figure 1. The comparison between the maximum full-likelihood estimation and maximum composite- likelihood estimation for the regression coefficients on the multivariate continuous outcomes. The ratio of the mean squared error (MSE) was computed using the MSE of the maximum composite-likelihood estimate (MCLE) over the MSE of the maximum likelihood estimate (MLE). .

Table 1. The ratio of the mean squared error (MSE) of the multivariate mixed response model (MMR) to the marginal model (GLM). Results based on 1000 independent simulation under two different scenarios and three different levels of correlation.

	Simulation I *			Simulation II ^†
	Low	Med	High	Low	Med	High
$α_{b_{1}} = 0.2$	1.00	0.99	0.98	0.97	0.92	0.85
$β_{b_{1}} = 0.3$	0.93	0.81	0.66	1.00	0.99	0.97
$γ_{b_{1}} = 0.3$	0.93	0.81	0.66	0.95	0.85	0.71
$α_{b_{2}} = 0.2$	1.00	0.98	0.97	0.97	0.92	0.86
$β_{b_{2}} = 0.3$	0.94	0.84	0.69	1.00	0.98	0.95
$γ_{b_{2}} = 0.5$	0.94	0.83	0.70	0.95	0.86	0.71
$α_{c_{1}} = 0.5$	1.00	1.00	1.00	0.96	0.89	0.79
$β_{c_{1}} =$ 8	0.89	0.73	0.50	1.00	1.00	1.00
$γ_{c_{1}} =$ 10	0.90	0.74	0.51	0.93	0.80	0.59
$σ_{c_{1}} =$ 4	1.01	1.01	1.01	1.01	1.01	1.01
$α_{c_{2}} = 0.4$	1.00	1.00	1.00	0.97	0.90	0.79
$β_{c_{2}} =$ 5	0.92	0.77	0.53	1.00	1.00	1.00
$γ_{c_{2}} =$ 8	0.92	0.75	0.50	0.94	0.80	0.57
$σ_{c_{2}} =$ 5	1.01	1.01	1.01	1.01	1.01	1.01

* Simulation I: n = 300, and the four responses have different covariates, e.g.,

x_{i}

and

y_{i}

generated from a normal distribution

N (0, 1)

; ^† Simulation II: n = 1000, the responses shared one common covariate

x_{i}

∼

N (0, 1)

and added a different covariate

y_{i}

∼ Bernoulli

(0.5)

.

Table 2. Type 1 error rate and power under different sample sizes (

N = 500

and

N = 1000

).

Table 2. Type 1 error rate and power under different sample sizes (

N = 500

and

N = 1000

).

	Type I Error		Power
	$N = 500$	$N = 1000$	$N = 500$	$N = 1000$
	Composite-Likelihood Method
	$H_{0} : β_{b_{1}} = β_{b_{2}} = β_{c_{1}} = β_{c_{2}} = 0$
Likelihood ratio	0.054	0.043	0.804	0.988
Wald statistics	0.058	0.043	0.800	0.989
Scoring statistics	0.058	0.042	0.798	0.989
	Multiple Test
Bonferroni test	0.051	0.040	0.569	0.902

The likelihood ratio statistic is adjusted as Equation (7), which approximates to a

χ_{4}^{2}

.

Table 3. The difference in the effect of treatment between the two treatment therapies. GLM: the generalized linear model; MMR: the multivariate mixed response model.

Regression Parameter	Models
Regression Parameter	GLM	MMR
$z_{b_{1}}$ : occurrence of nausea
Intercept $α_{b_{1}}$	$- 0.2685 \pm 2.502$	$- 0.2793 \pm 2.582$
(p value)	(0.833)	(0.832)
Treatment effect $β_{b_{1}}$	$- 0.2644 \pm 0.190$	$- 0.2724 \pm 0.193$
(p value)	(0.006)	(0.006)
$z_{b_{2}}$ : occurrence of diarrhea
Intercept $α_{b_{2}}$	$0.6631 \pm 2.557$	$0.6741 \pm 2.605$
(p value)	(0.611)	(0.612)
Treatment effect $β_{b_{2}}$	$- 0.6231 \pm 0.192$	$- 0.6422 \pm 0.198$
(p value)	(<0.001)	(<0.001)
$z_{c_{1}}$ : measures of hemoglobin
Intercept $α_{c_{1}}$	$160.3758 \pm 32.989$	$160.3775 \pm 32.984$
(p value)	(<0.001)	(<0.001)
Treatment effect $β_{c_{1}}$	$- 12.492 \pm 2.498$	$- 12.496 \pm 2.454$
(p value)	(<0.001)	(<0.001)
$z_{c_{2}}$ : measures of white blood cell
Intercept $α_{c_{2}}$	$12.295 \pm 9.331$	$12.2946 \pm 9.515$
(p value)	(0.010)	(0.011)
Treatment effect $β_{c_{2}}$	$- 0.1591 \pm 0.706$	$- 0.1597 \pm 0.702$
(p value)	(0.659)	(0.656)

Table 4. Estimation Results II: the estimated parameters contain second moments of each outcome.

	Esimated Correlation				Estimated Standard Deviation
	Nausea	Diarrhea	HGB	WBC	Estimated Standard Deviation
Nausea	1.0000	0.3954	0.0736	0.0899	-
Diarrhea		1.0000	0.0351	−0.0126	-
HGB			1.0000	0.0139	16.796
WBC				1.0000	4.7507

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bai, H.; Zhong, Y.; Gao, X.; Xu, W. Multivariate Mixed Response Model with Pairwise Composite-Likelihood Method. Stats 2020, 3, 203-220. https://doi.org/10.3390/stats3030016

AMA Style

Bai H, Zhong Y, Gao X, Xu W. Multivariate Mixed Response Model with Pairwise Composite-Likelihood Method. Stats. 2020; 3(3):203-220. https://doi.org/10.3390/stats3030016

Chicago/Turabian Style

Bai, Hao, Yuan Zhong, Xin Gao, and Wei Xu. 2020. "Multivariate Mixed Response Model with Pairwise Composite-Likelihood Method" Stats 3, no. 3: 203-220. https://doi.org/10.3390/stats3030016

Article Menu

Multivariate Mixed Response Model with Pairwise Composite-Likelihood Method

Abstract

1. Introduction

2. Methodology

2.1. Model Setup

2.1.1. Case 1: Continuous Outcomes

2.1.2. Case 2: Binary Outcomes

2.1.3. Case 3: Mixed Outcomes

2.2. Statistical Inference Using the Composite-Likelihood Method

Testing with Nuisance Parameters

3. Simulation

3.1. Comparison with Maximum Full-Likelihood Estimation

3.2. Comparison with the Marginal Approach

3.2.1. Simulation Settings

3.2.2. Point Estimates

3.2.3. Statistical Test

4. Data Analysis

5. Discussion

Author Contributions

Funding

Conflicts of Interest

Appendix A. Derivatives for Mixed Response Variables

Appendix B. Estimation Results of the Clinical Data From a Colorectal Cancer Study

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI