Next Article in Journal
Product Convolution of Generalized Subexponential Distributions
Next Article in Special Issue
Prediction of a Sensitive Feature under Indirect Questioning via Warner’s Randomized Response Technique and Latent Class Model
Previous Article in Journal
CNN-Based Temporal Video Segmentation Using a Nonlinear Hyperbolic PDE-Based Multi-Scale Analysis
Previous Article in Special Issue
Estimation Curve of Mixed Spline Truncated and Fourier Series Estimator for Geographically Weighted Nonparametric Regression
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Non-Parametric Non-Inferiority Assessment in a Three-Arm Trial with Non-Ignorable Missing Data

Yunnan Key Laboratory of Statistical Modeling and Data Analysis, Yunnan University, Kunming 650091, China
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(1), 246; https://doi.org/10.3390/math11010246
Submission received: 25 October 2022 / Revised: 27 December 2022 / Accepted: 28 December 2022 / Published: 3 January 2023
(This article belongs to the Special Issue Statistical Methods in Data Science and Applications)

Abstract

:
A three-arm non-inferiority trial including a placebo is usually utilized to assess the non-inferiority of an experimental treatment to a reference treatment. Existing methods for assessing non-inferiority mainly focus on the fully observed endpoints. However, in some clinical trials, treatment endpoints may be subject to missingness for various reasons, such as the refusal of subjects or their migration. To address this issue, this paper aims to develop a non-parametric approach to assess the non-inferiority of an experimental treatment to a reference treatment in a three-arm trial with non-ignorable missing endpoints. A logistic regression is adopted to specify a non-ignorable missingness data mechanism. A semi-parametric imputation method is proposed to estimate parameters in the considered logistic regression. Inverse probability weighting, augmented inverse probability weighting and non-parametric methods are developed to estimate treatment efficacy for known and unknown parameters in the considered logistic regression. Under some regularity conditions, we show asymptotic normality of the constructed estimators for treatment efficacy. A bootstrap resampling method is presented to estimate asymptotic variances of the estimated treatment efficacy. Three Wald-type statistics are constructed to test the non-inferiority based on the asymptotic properties of the estimated treatment efficacy. Empirical studies show that the proposed Wald-type test procedure is robust to the misspecified missingness data mechanism, and behaves better than the complete-case method in the sense that the type I error rates for the former are closer to the pre-given significance level than those for the latter.

1. Introduction

Non-inferiority (NI) trials are often performed to verify that the efficacy of an experimental treatment with low toxicity or small side-effects is non-inferior to that of a reference treatment by more than a pre-given small margin [1,2]. Many methods have been presented to assess the NI of an experimental treatment to a reference treatment via the efficacy in a two-arm NI trial. For example, see Tang et al. [3] for a score test via relative risk in a matched-pair NI trial; Tang et al. [4] for exact and approximate unconditional confidence intervals for rate difference based on a score test statistic in a small-sample paired NI trial; Wellek [5] for frequentist and Bayesian approaches to testing NI in a matched-pair design with binary endpoints; Freitag et al. [6] for a non-parametric approach to testing NI with censored data; Arboretti et al. [7] and Pesarin et al. [8] for a permutation test in a non-inferiority trial, and Gamalo et al. [9] for a Bayesian method for testing NI with normally distributed endpoints. However, it is widely recognized that there are two key problems for two-arm NI trials [10]. The first issue is the selection of the NI margin (i.e., the clinically acceptable amount or a combination of statistical reasoning and clinical judgement), and the second is the evaluation of assay sensitivity (i.e., the ability of a trial to distinguish an effective treatment from a less effective or ineffective treatment). To solve the aforementioned problems, if ethically acceptable and practically feasible, a three-arm trial including a placebo, which is called a three-arm NI trial, is usually conducted to assess the NI of an experimental treatment to the active reference treatment [11].
Many methods have been developed to draw statistical inferences based on a three-arm NI trial over the past years. For example, Pigeot et al. [12] studied an NI assessment problem via mean difference in a three-arm trial with normally distributed endpoints; Tang and Tang [10] proposed two asymptotic approaches to testing NI via a rate difference for binary outcomes; Mielke and Munk [13] considered the NI testing problem for Poisson-distributed endpoints; Lui and Chang [14] discussed the NI testing problem via a generalized odds ratio for ordinal data; Brannath et al. [15] considered an NI adaptive testing and sample size determination problem in a three-arm trial with normally distributed endpoints; Tang et al. [16] developed exact and approximate unconditional, and bootstrap-resampling-based approaches to testing NI for binary outcomes; Tang and Yu [17] presented a hybrid approach to constructing simultaneous confidence intervals for simultaneously assessing NI and assay sensitivity for binary endpoints; Tang and Yu [18] utilized two Bayesian approaches (i.e., posterior variance and Bayes factor approaches) to determine the sample size required in a three-arm NI trial with binary endpoints; Paul et al. [19] presented both frequentist and Bayesian procedures for testing NI via the risk difference in a three-arm trial with binary endpoints; Homma and Diamon [20] investigated the assay sensitivity hypothesis and the sample size calculation problem for gold-standard NI trials with two fixed margins and negative binomial endpoints; Ghosh et al. [21] presented a new method to test NI for Poisson-distributed endpoints; Ghosh et al. [22] considered a hierarchical testing procedure with two stages in three-arm NI trials; Scharpenberg and Brannath [23] discussed simultaneous confidence intervals of risk differences in three-arm non-inferiority trials; and Tang and Liang [24] constructed two simultaneous confidence intervals for assessing NI and assay sensitivity in a three-arm trial. However, when misspecifying the distributions of treatment endpoints, statistical inference obtained with the aforementioned methods may be misleading or unreasonable. To this end, a number of non-parametric methods were proposed to make statistical inference on three-arm NI trials under an unknown distribution assumption of endpoints. For example, see Munzel [25] for a rank-based NI test and Tseng and Hsu [26] for binomially distributed outcomes. The aforementioned methods were developed for the fully observed endpoints in a three-arm trial.
However, in some clinical trials, treatment endpoints may be subject to missingness occurring for various reasons, such as unwillingness of some respondents to answer sensitivity questions, loss of information caused by uncontrollable factors, or drop-out from the study in clinical trials [27]. For example, for a clinical trial associated with HIV patients in the AIDS Clinical Trial Group (ACTG) Study 193A, the primary endpoint was the CD4 cell count, which was scheduled to be observed at baseline and eight-week intervals during the follow-up period, potentially subject to missingness due to skipped visits and dropouts. In this study, 1309 patients were randomly assigned to one of the following four daily regimens: zidovudine alternating monthly with 400 mg didanosine (regarded as “Treatment 1”), zidovudine plus 2.25 mg of zalcitabine (regarded as “Treatment 2”), zidovudine plus 400 mg of didanosine (regarded as “Treatment 3”), zidovudine plus 400 mg of didanosine plus 400 mg of nevirapine (regarded as “Treatment 4”). As an illustration, we here take “Treatment 1”, “Treatment 2” and “Treatment 4” as the placebo, reference and experiment, respectively, let the log-transformed CD4 cell counts (i.e., log(1 + CD4 cell counts at time interval (4,12])) be the treatment endpoints, and regard log(baseline measurement + 1) as an instrument variable for skipped visits or dropouts. Because the baseline measurements were considered before the treatments were assigned, it was reasonable to assume that the dropouts were missing not at random (MNAR) or due to non-ignorable missing. The average missing proportions of endpoints for the placebo, reference and experiment treatments were 29.74%, 30.99% and 28.65%, respectively. Our main purpose is to test the NI of treatment 4 to treatment 2 in terms of the assay sensitivity and the internal validity of treatment 4 in a three-arm trial with unknown distributed endpoints in the presence of non-ignorable missing endpoints.
For the above described example, the simplest and most intuitive method for handling missing data is the well-known complete-case (‘CC’) method, i.e., deleting subjects with missing data. But the CC method may lead to a biased estimator of treatment efficacy when the missingness data mechanism does not involve missing completely at random. To this end, several alternative methods have been proposed to make statistical inferences in two-arm trials with missing endpoints. For example, Choi and Stablein [28] considered the problem of testing the equality of two treatments in a paired two-arm trial with missing at random (MAR) endpoints based on large sample theory, while Tang and Tang [29] developed unconditional exact procedures for testing the equality of two treatments in a paired two-arm trial with MAR endpoints. In addition, some permutation tests were proposed for endpoints with missing data in two-arm trials, for example, see Maritz [30], Yu et al. [31], Pesarin [32], and Pesarin et al. [33]. However, the aforementioned studies mainly focused on equivalence assessment in two-arm trials with a MAR assumption based on two independent binomial distributions for endpoints with non-ignorable missing and a multinomial distribution for the fully observed endpoints. Moreover, to our knowledge, there has been little work undertaken on NI assessment in three-arm trials with unknown distributed endpoints and non-ignorable missing data. Hence, this paper aims to develop a non-parametric approach to testing NI in a three-arm trial with a mixed unknown distribution of endpoints and an MNAR assumption of missing endpoints.
There are many approaches to handling non-ignorable missing data. For example, see Robins et al. [34] for an inverse probability weighting (IPW) method, Lee and Tang [35] and Wang and Tang [36] for Bayesian approaches combining the Gibbs sampler and Metropolis–Hastings algorithm, Kim and Yu [37] for a semi-parametric approach to estimating mean functions in the presence of non-ignorable missing responses, and Tang et al. [38] for an empirical likelihood method for generalized estimating equations with non-ignorable missing data due to certain merits of empirical likelihood, such as feasibly incorporating auxiliary information to improve the efficiency of parameter estimation [39]. Choi and Stablein [40] and Li et al. [41] investigated the equivalence test problem in a paired two-arm trial with non-ignorable missing endpoints under some known distribution assumptions for treatment endpoints. However, the aforementioned approaches cannot be directly used to test NI in a three-arm trial with non-ignorable missing endpoints due to the complexity of the considered test problem, including the imputation of missing endpoints, the estimation problem of treatment efficacy under unknown distribution assumptions of treatment endpoints, and the critical value determination of test statistics at some pre-given significance level.
The main contributions of this paper include: (i) presentation of a logistic regression to specify the propensity score function associated with respondent endpoints; (ii) proposal of IPW, augmented IPW (AIPW) and non-parametric imputation methods to estimate treatment efficacy in the presence of non-ignorable missing endpoints; (iii) development of a semi-parametric imputation method to estimate unknown parameters in the considered logistic regression by imputing mean score functions rather than missing endpoints using a kernel non-parametric regression method; (iv) establishment of some asymptotic properties of the estimated treatment efficacy; (v) refinement of a bootstrap-resampling method to consistently estimate asymptotic variances of the estimated treatment efficacy; (vi) construction of three Wald-type statistics to test the NI of an experimental treatment to a reference treatment in a three-arm trial with unknown distributed and non-ignorable missing endpoints.
The rest of this paper is organized as follows: Section 2 describes a three-arm NI trial with MNAR endpoints. Section 3 discusses the estimation problem of treatment efficacy and propensity score function. The asymptotic properties of the estimated treatment efficacy and the resultant Wald-type statistics for testing NI are given in Section 4. The simulation studies investigating the finite sample performance of the proposed test statistics are described in Section 5. A real example taken from the ACTG study is illustrated using the proposed method in Section 6. Some concluding remarks are given in Section 7. Technical details are presented in Appendix A.

2. A Three-Arm NI Trail with MNAR Endpoints

2.1. A Three-Arm NI Trial

For a three-arm randomized clinical trial with experimental (E), reference (R) and placebo (P) treatments, we assume that their corresponding clinical endpoints Y E , Y R and Y P independently follow unknown distributions f E ( y E | μ E ) , f R ( y R | μ R ) and f P ( y P | μ P ) , respectively, where μ E , μ R and μ P are their corresponding treatment efficacies, respectively. Generally, we assume that a larger value of treatment efficacy indicates a more favorable treatment.
Following Hida and Tango [42], to test the NI of the experimental treatment to the reference treatment in terms of assay sensitivity in a three-arm trial, we need to simultaneously demonstrate (i) the superiority of the experimental treatment to placebo, (ii) the NI of the experimental treatment to the reference treatment for a pre-specified maximal clinically irrelevant or NI margin δ > 0 , and (iii) the superiority of the reference treatment to placebo by more than δ . That is, μ E , μ R and μ P must satisfy the following inequalities: μ P < μ R δ < μ E , which leads to consideration of the following hypothesis-testing problem:
H 0 : μ E μ R δ versus H 1 : μ E > μ R δ , K 0 : μ R μ P + δ versus K 1 : μ R > μ P + δ .
Clearly, simultaneously rejecting H 0 and K 0 at some pre-given significance level yields the above desirable inequalities: μ P < μ R δ < μ E indicating the NI of the experimental treatment to the reference treatment and assay sensitivity. Generally, the selection of the NI margin δ should combine statistical reasoning and clinical judgement [17]. In a similar way to many three-arm trial studies, the fraction margin approach can be used to specify δ .
Following Kieser and Friede [43], δ can be mathematically expressed as a positive fraction f of the unknown efficacy difference between the reference treatment and placebo, i.e., δ = g ( μ R μ P ) , where g lies in the interval [ 0 , 1 ] . The NI margin δ defined above indicates that the condition of assay sensitivity holds, i.e., μ R μ P > 0 . Following the argument of Ghosh et al. [44], one can take g = 1 / 2 or 1/3. To explain the hypotheses considered above, we set a = 1 g ( 0 , 1 ) , whose different values have different statistical meanings [12]. Under the above assumption, we only need to test H 0 rather than hypothesis (1). That is, for the NI margin δ defined above, we only need to test the following hypothesis:
H 0 : μ E a μ R ( 1 a ) μ P 0 versus H 1 : μ E a μ R ( 1 a ) μ P > 0 .
Rejecting H 0 at some pre-given significance level indicates the NI of the experimental treatment to the reference treatment under the condition of assay sensitivity. For simplicity, we denote ψ ( μ ) = μ E a μ R ( 1 a ) μ P , where μ = { μ E , μ R , μ P } . In this case, the hypothesis (2) can re-expressed as
H ˜ 0 : ψ ( μ ) 0 versus H ˜ 1 : ψ ( μ ) > 0 .

2.2. Missingness Data Mechanism

Let { Y i : i = 1 , , n } be the clinical observations of Y for n subjects randomly assigned to treatment for = E , R , P . Here, we assume that Y i ’s may be subject to missingness, let D i be the indicator of non-missing observation Y i , i.e., D i = 1 if Y i is observed, and D i = 0 if Y i is missing, and define X i as a vector of covariates for = E , R , P and i = 1 , , n . It is also assumed that X i ’s are fully observed, D i 1 is independent of D i 2 for i 1 i 2 { 1 , , n } , and D i depends on the observed covariates X i and missing observation Y i , which indicates that the considered non-missingness data mechanism is non-ignorable. Under the above assumption, we consider the following non-missingness data mechanism model:
π i ( η ; X i , Y i ) = Pr ( D i = 1 | X i , Y i ; η ) , = E , R , P , i = 1 , , n ,
where η is a vector of unknown parameters to be estimated, and π i ( η ; X i , Y i ) is usually called the propensity score function in the missing data literature.
Many methods can be employed to specify the propensity score function π i ( η ; X i , Y i ) . For example, see Lee and Tang [35] for a logistic regression, Kim and Yu [37] and Tang et al. [38] for an exponential tilting model, and Wang and Tang [36] for a probit regression model. Here, similarly to Lee and Tang [35], we consider the following logistic regression model for π i ( η ; X i , Y i ) :
logit π i ( η ; X i , Y i ) = α 0 + α 1 X i + γ Y i , = E , R , P , i = 1 , , n ,
where logit ( c ) = log { c / ( 1 c ) } , and η = ( α 0 , α 1 , γ ) . It is well-known that, when γ is unknown, the above specified logistic regression model is unidentifiable. To address this issue, we decompose X i as X i = ( Z i , U i ) , where Z i may be associated with the propensity score function, and U i is a vector of instrumental variables that is not directly associated with the propensity score function but related to observations Y i . In this case, we can consider the following propensity score function
logit π i ( η ; Z i , Y i ) = α 0 + α 1 Z i + γ Y i , = E , R , P , i = 1 , , n .
Clearly, when γ = 0 , the above defined missingness data mechanism reduces to MAR.

3. Estimation of Treatment Efficacy

3.1. Estimating Treatment Efficacy

When the endpoints are completely observed, treatment efficacy μ can be consistently estimated by its corresponding sample mean, i.e., μ ^ = n 1 i = 1 n Y i for = E , R , P .
When Y i ’s are subject to missingness and the true propensity score function π i ( η ; Z i , Y i ) is known, the IPW method can be employed to estimate μ for = E , R , P . That is, μ can be estimated by
μ ^ H T = 1 n i = 1 n D i π i ( η ; Z i , Y i ) Y i , = E , R , P .
Note that the above defined estimator μ ^ H T may be sensitive to the misspecification of the propensity score function. To address this issue, an imputation technique is adopted to construct a consistent estimator of μ in the presence of MNAR. That is, let m i 0 ( γ ; Z i ) = E ( Y i | Z i , D i = 0 ) , an imputation-based estimator of μ has the form
μ ^ R I = 1 n i = 1 n D i Y i + ( 1 D i ) m i 0 ( γ ; Z i ) , = E , R , P .
The AIPW approach can also be utilized to estimate μ in the presence of MNAR. That is, an AIPW-based estimator of μ can be expressed as
μ ^ A I = 1 n i = 1 n D i π i ( η ; Z i , Y i ) Y i + 1 D i π i ( η ; Z i , Y i ) m i 0 ( γ ; Z i ) , = E , R , P .
In many clinical trials, the cumulative distribution functions of Y i ’s are usually unknown; thus, m i 0 ( γ ; Z i ) ’s are also unknown. On the other hand, π i ( η ; Z i , Y i ) is also unknown in the presence of MNAR. Hence, it is impossible to directly evaluate μ ^ H T , μ ^ R I and μ ^ A I using the above defined forms. In what follows, we consider the estimation problem of m i 0 ( γ ; Z i ) and π i ( η ; Z i , Y i ) .

3.2. Estimation of Conditional Mean m i 0 ( γ ; Z i )

Here a non-parametric method given in Tang et al. [38] is adopted to estimate m i 0 ( γ ; Z i ) in the presence of MNAR.
Let f 0 ( y i ) and f 1 ( y i ) be the conditional probability densities of Y i given D i = 0 and D i = 1 , respectively. Following the argument of Tang et al. [38], we have
f 0 ( y i ) = f 1 ( y i ) × O ( η ; Z i , Y i ) E { O ( η ; Z i , y i ) | Z i , D i = 1 } , = E , R , P ,
where O ( η ; Z i , y i ) = { 1 π i ( η ; Z i , Y i ) } / π i ( η ; Z i , Y i ) . Substituting π i ( η ; Z i , Y i ) defined in Equation (4) into (8) leads to
f 0 ( y i ) = f 1 ( y i ) × exp ( γ Y i ) E { exp ( γ Y i ) | Z i , D i = 1 } , = E , R , P ,
which shows that we can utilize the conditional distribution f 1 ( y i ) of the observed endpoints rather than that of missing endpoints (i.e., f 0 ( y i ) ) to make statistical inferences, where E ( · ) represents the expectation taken with respect to f 1 ( y i ) . Clearly, when γ = 0 , we obtain f 0 ( y i ) = f 1 ( y i ) .
Following the argument of Tang et al. [38], it follows from Equation (9) that
m i 0 ( γ ; Z i ) = E { D i Y i exp ( γ Y i ) | Z i } E { D i exp ( γ Y i ) | Z i } ,
which implies that a non-parametric regression estimator of m i 0 ( γ ; Z i ) can be expressed as
m ^ i 0 ( γ ; Z i ) = k = 1 n ω k 0 i ( γ ; Z i ) Y k ,
where ω k 0 i ( γ ; Z i ) ’s are the weights assigned to Y k , and have the form
ω k 0 i ( γ ; Z i ) = D k exp ( γ Y k ) K h ( Z i Z k ) j = 1 n D j exp ( γ Y j ) K h ( Z i Z j )
in which K h ( v ) = h 1 K ( v / h ) , K ( · ) is the multi-dimensional kernel function, and h = h n is the bandwidth.

3.3. Estimation of Propensity Score Function

Note that the above considered propensity score function has a parametric form indexed by the parameter vector η , which indicates that, if we can obtain the estimation of η (denoted as η ^ ), the estimation of the propensity score function is easily evaluated by π ^ i ( η ^ ; Z i , Y i ) . In the following, we discuss the estimation problem of η .
The mean score approach of Morikawa et al. [45] is employed here to estimate η based on the observed data D = { ( X i , Y i , D i ) : i = 1 , , n } for = E , R , P . For simplicity, we denote D obs = { X , Y obs , D } , where X = { X i : i = 1 , , n } , D = { D i : i = 1 , , n } and Y obs is the observed dataset of Y i ’s.
When the density function f ( Y i | μ ; X i ) of Y i is known, the maximum likelihood estimator (MLE) of η can be obtained by maximizing the following likelihood of the observed data D obs :
L obs ( η | D obs ) = i = 1 n π i ( η ; Z i , Y i ) f ( Y i | μ ; X i ) D i × 1 π i ( η ; Z i , Y i ) f ( Y i | μ ; X i ) d Y i 1 D i .
It follows from Morikawa et al. [45] and the mean score theorem that the MLE of η can be obtained by solving the following “mean score equation”:
1 n i = 1 n D i s ( η ; D i , Z i , y i ) + ( 1 D i ) E s ( η ; D i , Z i , Y i ) | Z i , D i = 0 = 0 ,
where s ( η ; D i , Z i , Y i ) has the form
s ( η ; D i , Z i , Y i ) = η log π i ( η ; Z i , Y i ) D i 1 π i ( η ; Z i , Y i ) 1 D i = D i π i ( η ; Z i , Y i ) π i ( η ; Z i , Y i ) 1 π i ( η ; Z i , Y i ) π ˙ i ( η ; Z i , Y i ) ,
and π ˙ i ( η ; Z i , Y i ) = π i ( η ; Z i , Y i ) / η .
Denote s 0 ( η ) = E { s ( η ; D i , Z i , Y i ) | Z i , D i = 0 } . Again, it follows from Tang et al. [38] and Equation (9) that a non-parametric estimator of s 0 ( η ) is given as
s ^ 0 ( η ; D i , Z i , Y i ) = k = 1 n ω k 0 i ( γ ; Z i ) s ( η ; D k , Z k , Y k ) ,
where ω k 0 i ( γ ; Z i ) is defined in Equation (10). Thus, the estimated “mean score equation" can be written as
1 n i = 1 n D i s ( η ; D i , Z i , y i ) + ( 1 D i ) s ^ 0 ( η ; D i , Z i , Y i ) = 0 ,
which shows that the MLE η ^ of η can be obtained by solving the non-linear equation (11) with respect to η .
Once we obtain MLE η ^ of η , substituting η ^ into Equations (4) and (10) leads to the estimated propensity score function π ^ i ( η ^ ; Z i , Y i ) and the estimated mean functions m ^ i 0 ( γ ^ , Z i ) . Thus, substituting π ^ i ( η ^ ; Z i , Y i ) and m ^ i 0 ( γ ^ , Z i ) into Equations (5)–(7) yields non-parametric estimators of treatment efficacy μ for = E , R , P .

3.4. Dimension Reduction

In some clinical trials, the number of covariates Z R d may be large. In this case, the kernel-based estimators of s ( η ; D i , Z i , Y i ) and m i 0 ( γ , Z i ) may suffer from the well-known curse of dimensionality. The dimension reduction technique of Tang et al. [38] is used to solve this problem.
Let G : R d R be a mapping function such that G i = G ( Z i ) is univariate. In particular, we assume that E { s ( η ; D i , Z i , Y i ) | G i , D i = 0 } and E ( Y i | G i , D i = 0 ) have the same structures as s 0 ( η ) = E { s ( η ; D i , Z i , Y i ) | Z i , D i = 0 } and m i 0 ( γ ; Z i ) = E ( Y i | Z i , D i = 0 ) , except that Z i is replaced by G i . Given the MLE η ^ of η obtained with the above introduced approach, we can obtain non-parametric dimension reduction estimators of treatment efficacy μ for = E , R , P .

4. Asymptotic Properties and Test Statistics

4.1. Asymptotic Properties

In the following, we investigate the consistency and asymptotic normality of the proposed estimators μ ^ H T , μ ^ R I , μ ^ A I with the known and estimated values of parameters η . The notation L represents convergence in distribution and N ( · , · ) denotes the normal distribution.
From Morikawa et al. [45], we obtain the following proposition.
Proposition 1.
Suppose that Assumptions A1–A3 given in the Appendix A hold. The MLE η ^ of η satisfies
n η ^ η 0 L N ( 0 , Σ η ) as n ,
where η 0 is the true value of η , Σ η = I 22 1 E { exp ( γ 0 Y i ) s 0 ( η 0 ) s 0 ( η 0 ) } I 22 , s 0 ( η 0 ) represents s 0 ( η ) evaluated at η = η 0 , I 22 = E s 0 ( η 0 ) π ˙ i ( η 0 ) / π i ( η 0 ) , and π ˙ i ( η ) = π ˙ i ( η ; Z i , Y i ) and π i ( η 0 ) = π i ( η 0 ; Z i , Y i ) ) for = E , R , P .
Proof of Proposition 1 can be found in Morikawa et al. [45]. To save space, we omit it. Proposition 1 shows that the MLE η ^ of η is consistent and asymptotically distributed as the multivariate normal distribution.
Theorem 1.
Suppose that Assumptions A1–A3 given in the Appendix A hold. For a known value η 0 of η , given the true value μ 0 of μ , the proposed estimators μ ^ H T , μ ^ R I and μ ^ A I satisfy
n ( μ ^ h μ 0 ) L N ( 0 , σ 2 ) as n
for h = H T , R I , A I and = E , R , P , where σ 2 = var ( τ i ) with τ i = m i 0 ( γ ; Z i ) + D i π i 1 ( η 0 ) { Y i m i 0 ( γ ; Z i ) } . In addition, σ 2 can be rewritten as σ 2 = var ( Y i ) + E { [ π i 1 ( η 0 ) 1 ] [ Y i m i 0 ( γ ; Z i ) ] 2 } .
Theorem 1 shows that the proposed estimators of μ are consistent and asymptotically distributed as the normal distribution with zero mean and the same variance.
Following the argument of Kim and Yu [37], σ 2 can be consistently estimated by
σ ^ 2 = 1 n i = 1 n τ ^ i 2 1 n i = 1 n τ ^ i 2 ,
where τ ^ i = m ^ i 0 ( γ 0 ; Z i ) + D i π ^ i 1 ( η 0 ) { Y i m ^ i 0 ( γ 0 ; Z i ) } .
When η is unknown, we replace η or γ in Equations (5)–(7) by their corresponding consistent estimators η ^ or γ ^ , respectively. Thus, we can obtain their corresponding plug-in estimators (denoted as μ ^ S H T , μ ^ S R I and μ ^ S A I , respectively) of μ .
Theorem 2.
Suppose that Assumptions A1–A3 given in the Appendix A hold, the propensity score function (4) is correctly specified, and Proposition 1 holds. The plug-in estimators μ ^ S H T , μ ^ S R I and μ ^ S A I of μ satisfy
n μ ^ h μ 0 L N ( 0 , σ , h 2 ) as n , h = S H T , S R I , S A I , = E , R , P ,
where σ , h 2 = var ( e , h i ) with e , h i = { D i π i 1 ( η ) { Y i m i 0 ( γ ; Z i ) } + m i 0 ( γ ; Z i ) μ 0 + I 22 1 s i ( η ) H , h } , s i ( η ) is the ith term in Equation (11), H , S H T = E { ( π i ( η ) 1 ) Y i ( 1 , Z i , Y ) } , H , S R I = E { ( 1 D i ) ( 0 , 0 p 1 , ( Y i m i 0 ( γ ; Z i ) ) 2 ) } , H , S A I = H , S H T + M , S A I , M , S A I = E { ( 1 π i ( η ) ) m i 0 ( γ ; Z i ) ( 1 , Z i , Y i ) } , 0 p 1 is a ( p 1 ) × 1 zero vector and p is the number of covariate vector Z i .
Note that the asymptotic variance σ , h 2 has a complicated form; thus, it is rather difficult to compute the estimate of σ , h 2 . To overcome this difficulty, we utilize a bootstrap-resampling method or empirical jack-knife method to evaluate the estimated asymptotic variances.

4.2. Wald-Type Statistics for Testing H ˜ 0

In what follows, we construct three Wald-type statistics for testing hypothesis H 0 : ψ ( μ ) 0 based on the asymptotic properties of three different estimators given in Theorem 2.
Based on the properties of estimator μ ^ h = ( μ ^ E h , μ ^ R h , μ ^ P h ) for μ = ( μ E , μ R , μ P ) , we obtain that (i) ψ ^ ( μ ^ h ) = μ ^ E h a μ ^ R h ( 1 a ) μ ^ P h is a consistent estimator of ψ ( μ ) , (ii) variance of ψ ^ ( μ ^ h ) is var { ψ ^ ( μ ^ h ) } = var ( μ ^ E h ) + a 2 var ( μ ^ R h ) + ( 1 a ) 2 var ( μ ^ P h ) , which can consistently be estimated by var ^ { ψ ^ ( μ ^ h ) } = σ ˜ E , h 2 / n E + a 2 σ ˜ R , h 2 / n R + ( 1 a ) 2 σ ˜ P , h 2 / n P , where σ ˜ E , h 2 , σ ˜ R , h 2 and σ ˜ P , h 2 defined in Theorem 2 are the consistent estimators of σ E , h 2 , σ R , h 2 and σ P , h 2 , respectively, for h = S H T , S R I , S A I ; (iii) ( ψ ^ ( μ ^ h ) ψ ( μ ) ) / var ^ { ψ ^ ( μ ^ h ) } L N ( 0 , 1 ) as min { n E , n R , n P } . Thus, the Wald-type statistic for testing H ˜ 0 : ψ ( μ ) 0 can be expressed as
T W h = ψ ^ ( μ ^ h ) var ^ { ψ ^ ( μ ^ h ) } = μ ^ E h a μ ^ R h ( 1 a ) μ ^ P h σ ˜ E , h 2 / n E + a 2 σ ˜ R , h 2 / n R + ( 1 a ) 2 σ ˜ P , h 2 / n P
for h = S H T , S R I , S A I , which are asymptotically distributed as the standard normal distribution under H ˜ 0 as min { n E , n R , n P } .
Note that the asymptotic properties of the parameter estimators and test statistics presented above only hold as n ( = E , R , P ). However, for the finite samples, before using asymptotic normality of the estimators μ ^ h ( = E , R , P ) and test statistics T W h ( h = S H T , S R I , S A I ), one should utilize the concept of goodness-of-fit tests [46,47] to check the plausibility of their normality assumption.

5. Simulation Study

In this section, simulation studies were conducted to assess the finite sample performance of the proposed test procedures in terms of empirical type I error rates and empirical powers under four missingness data mechanisms.
For = E , R , P , the data { X i : i = 1 , , n } were independently generated from the multivariate normal distribution, i.e., X i = ( Z i , U i ) i . i . d N ( ξ , Σ ) , and the data { Y i : i = 1 , , n } were independently generated by Y i = X i β + ε i , where ε i ’s were independently sampled from the following normal distributions (denoted as ‘scenario (A)’): ε E i N ( a μ R + ( 1 a ) μ P , 0.34 ) , ε R i N ( μ R , 0.37 ) and ε P i N ( μ P , 0.2 ) with a = 0.8 , which was the three-arm “gold threshold” recommended in the considered literature [18]. The true values of ξ , Σ , β E , β R , β P , μ R and μ P were taken as ξ = ( 0.0 , 0.0 ) , Σ = diag ( 0.25 , 0.25 ) , β E = ( 1.0 , 1.0 ) , β R = ( 1.0 , 1.1 ) , β P = ( 0.5 , 0.5 ) , μ R = 1.1 and μ P = 0.6 , respectively, which were only chosen as an illustration of the proposed methodologies. For comparison with the cases used widely or always justified, we considered the following two scenarios: (B) ε E i N ( a μ R + ( 1 a ) μ P , 0.8 ) , ε R i N ( μ R , 0.8 ) and ε P i N ( μ P , 0.5 ) ; (C) ε E i N ( a μ R + ( 1 a ) μ P , 1.0 ) , ε R i N ( μ R , 1.0 ) and ε P i N ( μ P , 0.5 ) . Under the above specified setting, we have ψ ( μ ) = 0 , where μ = { μ E , μ R , μ P } . That is, the data { ( X i , Y i ) : = E , R , P , i = 1 , , n } were independently generated from the null hypothesis H ˜ 0 , and were used to compute empirical type I error rates. To compute empirical powers, the data { ( X i , Y i ) : = E , R , P , i = 1 , , n } were independently generated with the above presented settings, except for a = ( μ E μ P ) / ( μ R μ P ) > 0.8 , which implied that the data { ( X i , Y i ) : = E , R , P , i = 1 , , n } were sampled from the alternative hypothesis H ˜ 1 : ψ ( μ ) > 0 .
To create missing data for Y i , we assumed that the missing indicators D i ’s were independently generated from the Bernoulli distribution with the respondent probability π i for = E , R , P . Here, we considered the following respondent probabilities for the reference treatment and placebo:
logit ( π R i ) = 1.6 + 0.2 Z R i 0.15 Y R i , logit ( π P i ) = 1.5 + 0.15 Z P i 0.18 Y P i ,
which indicated that missingness data mechanisms were non-ignorable, and the following four respondent probabilities for experimental treatment:
Case E1: logit ( π E i ) = α E 0 + α E 1 Z E i , which led to a MAR missingness data mechanism, where the true values of α E 0 and α E 1 were taken to be 1.3 and 0.1, respectively.
Case E2: logit ( π E i ) = α E 0 + α E 1 Z E i + γ E Y E i , which resulted in a non-ignorable missingness data mechanism, where the true values of α E 0 , α E 1 and γ E were taken as 1.3, 0.1 and −0.1, respectively.
Case E3: logit ( π E i ) = α E 0 + α E 1 sin ( Z E i ) + γ E Y E i , which yielded a non-linear non-ignorable missingness data mechanism with respect to Z E i , where the true values of α E 0 , α E 1 and γ E were set to be 1.3, 0.1 and 0.12, respectively.
Case E4: logit ( π E i ) = α E 0 + α E 1 Z E i + γ E Z E i Y E i , which implied a non-linear non-ignorable missingness data mechanism with an interaction of Z E i and Y E i , where the true values of α E 0 , α E 1 and γ E were set as 1.3, 0.1 and −0.1, respectively.
The titling parameters γ corresponding to Y i ( = E , R , P ) were set to be roughly −0.2 for showing a moderately negative effect on the probability of the data observed, and α 0 and α 1 were chosen so that the average missing rates were roughly 25%. Case E1 was MAR, which was a special case of the considered missingness data mechanism model (4) with γ E = 0 and was used to show that the proposed method can still capture missingness data characteristics even if the true missingness data mechanism was MAR; the other three missingness data mechanisms were non-ignorable and Case E2 satisfied the assumption of model (4), but Cases E3 and E4, which did not satisfy the assumed non-ignorable missingness data mechanism model (4), were used to show that the proposed test procedure was not sensitive to the assumed missingness data mechanism model (4). Here, we consider three balanced designs, i.e., n E = n R = n P = n with n = 50 , 100 , 150 for three scenarios, and the following unbalanced designs with the allocation ratios taken as 2:2:1, 3:3:1, 4:4:1, 2:1:1, 3:2:1, 4:3:1, 4:2:1, 3:1:1, 4:1:1 for Scenario (A) and 2:1:1, 2:2:1, 3:2:1 for Scenarios (B) and (C). The total sample sizes N = n E + n R + n P were set as 200 and 500 for Scenario (A) and 200, 300 and 400 for Scenarios (B) and (C), with a significance level α = 5 % for the three scenarios.
The average missing rates for the experimental, reference and placebo treatments among the 1000 replications were roughly 23.15%, 19.23% and 21.08%, respectively.
For each of the settings described above, we generated 1000 Monte Carlo samples. To evaluate the accuracy of the mean function estimates m ^ i 0 ( γ ; z i ) and the propensity score function estimates π ^ ( η ; z i , y i ) , we took the Gaussian kernel function with K ( Z ) = ( 2 π ) 1 / 2 exp ( Z 2 / 2 ) and set the bandwidths h as σ ^ Z n 1 / 3 , where σ ^ Z was the standard deviation of observations { Z i : i = 1 , , n } for = E , R , P . To compute the estimated asymptotic variances of μ ^ k , we conducted 100 bootstrap replications.
Empirical type I error rates for 1000 replications in Scenario (A) are given in Table 1 for balanced designs with the above considered four missingness data mechanisms and Table 2 for unbalanced designs with only the missingness data mechanism E2. To save space, we moved the corresponding results in Scenarios (B) and (C) to Table A1 and Table A2 in the Appendix A. Examination of Table 1, Table 2, Table A1 and Table A2 showed that (i) the proposed three statistics for testing H ˜ 0 have similar performance because their type I error rates are quite close to the pre-given significance level for all the considered cases, which is consistent with the theoretical properties presented in Theorems 1 and 2; (ii) the proposed three statistics for testing H ˜ 0 performed better than the CC method regardless of the sample sizes, missingness data mechanisms, balanced and unbalanced designs, and the variances of the treatment effects in that the type I error rates of the former were closer to the pre-given significance level than those for the latter; (iii) the type I error rate increased as the sample size increased for the CC method, which was consistent with the observations of Cook and Zea [48]; (iv) empirical type I error rates were not sensitive to the balanced or unbalanced designs.
We computed empirical powers against a = ( μ E μ P ) / ( μ R μ P ) , the sample sizes, the treatment effects, and the alpha and gamma parameters for missingness data mechanism models E1–E4 when the null hypothesis was not true. To save space, we only present empirical powers against the sample size in Figure 1 for balanced design and Figure 2 for unbalanced design (i.e., n E : n R : n P = 2:1:1) under the considered four missingness data mechanism models. Other results are given in Figure A1, Figure A2, Figure A3, Figure A4, Figure A5, Figure A6, Figure A7 and Figure A8 in the Appendix A. Inspection of these figures showed that (i) empirical power increases as a or the sample size n increases, regardless of the missingness data mechanisms and balanced/unbalanced designs and the considered four tests; (ii) empirical power slightly increased as α E 1 increased regardless of the missingness data mechanisms and balanced/unbalanced designs and the considered SHT, SRI and SAI tests, while the empirical power for the CC method showed an increasing tendency as α E 1 increased for missingness data mechanisms E3 and E4, which might be explained by non-linear non-ignorable missing data; (iii) the empirical powers with the proposed three test statistics were larger than those with the CC method for non-ignorable missing data (i.e., E2–E4), regardless of the sample sizes, a, treatment effects, α E 1 and γ ; (iv) the observation that the CC method had a slightly larger empirical power than the three tests considered might be explained by its inflated type I error.
To investigate the effect of the amount of departure from the MAR mechanism (i.e., the change in γ E ) on type I error rates under the missingness data mechanism model E2, with the same values of α E 0 and α E 1 (i.e., α E 0 = 1.3 and α E 1 = 0.1 ) as those given in the first simulation study, we conducted a second simulation study. In this simulation study, 1000 Monte Carlo datasets { ( X i , Y i , D i ) : i = 1 , , n } were independently generated, as in the first simulation study, with γ E = 0.2 , 0.1 , 0.05 , 0 , 0.05 , 0.1 . Empirical type I error rates for the balanced design with the sample sizes n = 50 and 150 are given in Table 3. Inspection of Table 3 yielded that (i) statistics with the IPW, regression imputation and AIPW methods behaved better than those with the CC method because the type I error rates for the former were closer to the pre-given significance level than those for the latter, regardless of the values of γ E and the sample sizes; and (ii) statistics with the IPW, regression imputation and AIPW methods were not sensitive to γ E .

6. An Example

In this section, a real example described in the Introduction is used to illustrate the proposed methodologies. In this dataset, we regarded zidovudine plus 400 mg of didanosine plus 400 mg of nevirapine as the experimental treatment with 330 patients, zidovudine plus 2.25 mg of zalcitabine as the reference treatment with 324 patients, and zidovudine alternating monthly with 400 mg didanosine as the placebo with 325 patients, respectively. CD4 counts were scheduled to be collected at baseline and eight-week intervals during the follow-up. Due to mistimed measurements, CD4 count data were subject to missingness, which led to unbalanced designs. There were 94 patient dropouts at the interval (4,12] among the 330 patients, 97 patient dropouts at the interval (4,12] and two patient dropouts at baseline among the 324 patients, and 95 patient dropouts at the interval (4,12] and five patient dropouts at baseline among the 325 patients. As an illustration, we took log(CD4 count at baseline + 1) as the dropout instrument variable Z, and only considered the data at the interval (4,12], i.e., treatment endpoints were CD4 counts at the interval (4,12], which led to n E = 328 , n R = 313 and n P = 316 , whose average missing rates were 29.74%, 30.99%, and 28.65%, respectively. The dataset was obtained from the R package “ALA”. Our main purpose was to test the NI of the experimental treatment to the reference treatment in the considered three-arm design. To this end, we took the fraction margin as δ = g ( μ R μ P ) with g = 0.2 , which led to a = 0.8 , i.e., the experimental treatment achieved more than 80 percent of the reference treatment compared with the placebo to be claimed as NI.
To compute m ^ i 0 ( γ ^ ; Z i ) and π ^ ( η ^ ; Z i , Y i ) , we took the Gaussian kernel function as K ( x ) = ( 2 π ) 1 / 2 exp ( x 2 / 2 ) and set the bandwidth h to be σ ^ X n 1 / 3 , where σ ^ X was the standard deviation of X i ’s. The p-values for testing H ˜ 0 were 0.0037, 0.0035, 0.0033 and 0.0136 for the Wald-type statistics with the IPW, regression imputation, AIPW and CC methods, respectively, which indicated the NI of the experimental treatment to the reference treatment was at the 5% significance level.

7. Conclusions

This paper considers the non-inferiority assessment problem of an experimental treatment to a reference treatment in a three-arm trial with non-ignorable missing data. A logistic regression was employed to specify the non-ignorable missing endpoint mechanism. Three methods, including the IPW, imputation regression and AIPW methods, were proposed to estimate the treatment efficacy for the known and unknown propensity score functions. The asymptotic properties of estimators for treatment efficacy were established under some regularity conditions. Based on these asymptotic properties, three Wald-type statistics for testing the NI of the experimental treatment to the reference treatment were constructed. Simulation studies indicated that the proposed test procedures behaved better than those with complete-case data in terms of type I error rates and powers, i.e., the type I error rates for the former were closer to the pre-given significance level than those for the latter and the powers for the former were larger than those for the latter; the proposed test procedures were not sensitive to misspecified missingness data mechanisms.

Author Contributions

W.L. carried out the simulation studies, the statistical analysis and drafted the manuscript; Y.Z. drafted the manuscript and conducted the statistical analysis; N.T. conceived the research questions and idea, developed the methods and revised the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

The authors are very grateful to the Editor, the Associate Editor and the three anonymous referees for their valuable comments that significantly improved this work. This work was supported by grants from the National Natural Science Foundation of China (No. 12271472), the National Key R&D Program of China (No. 2022YFA1003701), and the China Postdoctoral Science Foundation (No. 2021M702778).

Data Availability Statement

The datasets generated and analysed during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare no competing interest.

Appendix A

Appendix A.1. Regularity Conditions

To obtain asymptotic properties of μ ^ for = E , R , P , we need the following regularity conditions.
Assumption A1.
The true respondent model given in Equation (4) satisfies (i) that there exists a true value η 0 of η such that E { s ( η 0 ; Z , Y ) } = 0 ; (ii) for η , in a neighborhood of η 0 , E { | | s ( η 0 ; Z , Y ) | | 2 } < and E { s ( η 0 ; Z , Y ) / η T } exists and is nonsingular.
Assumption A2.
(i) The marginal probability density function f ( z ) of the random variable z is bounded away from ∞ in the support of z, and the second derivative of f ( z ) in z is continuous and bounded; (ii) The respondent probabilities π i ( η ; Z i , Y i ) satisfy min 1 i n π i ( η ; Z i , Y i ) c 0 a.s. for some positive constant c 0 .
Assumption A3.
The kernel function K ( · ) satisfies (i) it is bounded and has compact support; (ii) it is symmetric with ω 2 K ( ω ) d ω < ; (iii) K ( · ) D for D > 0 in some closed interval centered at zero; (iv) n h and n h 4 0 as n .
Remark A1.
Assumption A1 is used to establish asymptotic normality of η ^ . Assumption A2 is commonly adopted in the missing data literature. Assumption A3 is a standard assumption for the kernel regression method.
Proof of Theorem 1.
By the definition of μ ^ H T , we have the following decomposition
n ( μ ^ H T μ 0 ) = n 1 / 2 i = 1 n D i Y i π i ( η ; Z i , Y i ) μ 0 = n 1 / 2 i = 1 n D i Y i m i 0 ( γ ; Z i ) π i ( η ; Z i , Y i ) + n 1 / 2 i = 1 n m i 0 ( γ ; Z i ) μ 0 + n 1 / 2 i = 1 n D i π i ( η ; Z i , Y i ) 1 m i 0 ( γ ; Z i ) = H 1 n + H 2 n + H 3 n .
By the law of large numbers, it is easily shown that H 3 n = o p ( 1 ) . Thus, combining the above results leads to
n ( μ ^ H T μ 0 ) = n 1 / 2 i = 1 n D i Y i m i 0 ( γ ; Z i ) π i ( η ; Z i , Y i ) + n 1 / 2 i = 1 n m i 0 ( γ ; Z i ) μ 0 + o p ( 1 ) .
Let τ i = m i 0 ( γ ; Z i ) + D i π i 1 ( η 0 ) { y i m i 0 ( γ ; Z i ) } . Then, we have
n μ ^ H T μ 0 L N 0 , σ 2 a s n , for = E , R , P ,
where σ 2 = var ( τ i ) . It is easily shown that E D i π i ( η ; Z i , Y i ) 1 Y i m i 0 ( γ ; Z i ) = 0 , and E m i 0 ( γ ; Z i ) μ 0 = 0 . Since m i 0 ( γ ; Z i ) is independent of Y i m i 0 ( γ ; Z i ) , we have σ 2 = E { π i ( Z i , Y i ; η 0 ) 1 [ Y i m i 0 ( γ ; Z i ] 2 } + E m i 0 ( γ ; Z i ) 2 .
Next, we show the asymptotic property of μ ^ R I . By the definition of μ ^ R I , we obtain
n ( μ ^ R I μ 0 ) = n 1 / 2 i = 1 n D i Y i + ( 1 D i ) m ^ i 0 ( γ ; Z i ) μ 0
= n 1 / 2 i = 1 n D i Y i m i 0 ( γ ; Z i ) + n 1 / 2 i = 1 n ( 1 D i ) m ^ i 0 ( γ ; Z i ) m i 0 ( γ ; Z i )
+ n 1 / 2 i = 1 n m i 0 ( γ ; Z i ) μ 0 = R 1 n + R 2 n + H 2 n .
Using the similar arguments as given in Tang et al. [38], it is easily shown that
R 2 n = n 1 / 2 i = 1 n D i [ 1 π i ( η 0 ; Z i , Y i ) ] { Y i m i 0 ( γ ; Z i ) } / π i ( η 0 ; Z i , Y i ) + o p ( 1 ) .
Thus, we have
n ( μ ^ R I μ 0 ) = H 1 n + H 2 n + o p ( 1 ) = n ( μ ^ H T μ 0 ) + o p ( 1 ) .
By the Slutsky Theorem and the asymptotic property of μ ^ H T , we obtain
n μ ^ R I μ 0 L N 0 , σ 2 a s n , for = E , R , P ,
where σ 2 is defined in the proof of the asymptotic properties of μ ^ H T .
Now, we prove the asymptotic properties of the estimator μ ^ A I . By the definition of the μ ^ A I , we obtain
n ( μ ^ A I μ 0 ) = n 1 / 2 i = 1 n D i Y i π i ( η ; Z i , Y i ) + 1 D i π i ( η ; Z i , Y i ) m ^ i 0 ( γ ; Z i ) μ 0
= n 1 / 2 i = 1 n D i π i ( η ; Z i , Y i ) Y i m i 0 ( γ ; Z i ) + n 1 / 2 i = 1 n m i 0 ( γ ; Z i ) μ 0
+ n 1 / 2 i = 1 n 1 D i π i ( η ; Z i , Y i ) m ^ i 0 ( γ ; Z i ) m i 0 ( γ ; Z i ) = H 1 n + H 2 n + A n .
Following a similar argument as given in the proof of Theorem 4 in Zhao et al. [49], we have A n = o p ( 1 ) . Combining the above results yields
n ( μ ^ A I μ 0 ) = H 1 n + H 2 n + o p ( 1 ) = n ( μ ^ H T μ 0 ) + o p ( 1 ) .
Using the same arguments as given in the proof of the asymptotic properties for μ ^ R I , we obtain
n μ ^ A I μ 0 L N 0 , σ 2 a s n , for = E , R , P ,
where σ 2 is defined in the proof of the asymptotic properties of μ ^ H T . □
Proof of Theorem 2.
We first consider the asymptotic properties of μ ^ S H T based on the following form:
n ( μ ^ S H T μ 0 ) = n 1 / 2 i = 1 n D i Y i π i ( η ^ ; Z i , Y i ) μ 0 .
Taking the Taylor expansion of π i ( η ^ ; Z i , Y i ) at η yields
n 1 / 2 i = 1 n D i Y i π i ( η ^ ; Z i , Y i ) = n 1 / 2 i = 1 n D i Y i π i ( η ; Z i , Y i ) + n 1 / 2 ( η ^ η ) 1 n i = 1 n D i Y i η π i 1 ( η ; Z i , Y i ) | η = η ˜ + o p ( 1 )
where η ˜ lies in the line segment between η ^ and η , π i 1 ( η ; Z i , Y i ) / η | η = η ˜ = { 1 π i 1 ( η ; Z i , Y i ) } ( 1 , Z i , Y i ) . Following the arguments of the proof of Theorem 1, we get
n 1 / 2 i = 1 n D i Y i π i ( η ; Z i , Y i ) μ 0 = n 1 / 2 i = 1 n D i { Y i m i 0 ( γ ; Z i ) } π i ( η ; Z i , Y i ) + n 1 / 2 i = 1 n m i 0 ( γ ; Z i ) μ 0 + o p ( 1 ) .
Combining the above results yields
n ( μ ^ S H T μ 0 ) = n 1 / 2 i = 1 n D i { Y i m i 0 ( γ ; Z i ) } π i ( η ; Z i , Y i ) + n 1 / 2 i = 1 n m i 0 ( γ ; Z i ) μ 0 + n 1 / 2 ( η ^ η ) 1 n i = 1 n D i Y i ( 1 1 π i ( η ˜ ; Z i , Y i ) ) ( 1 , Z i , Y i ) + o p ( 1 ) = n 1 / 2 i = 1 n D i Y i m i 0 ( γ ; Z i ) π i ( η ; Z i , Y i ) + m i 0 ( γ ; Z i ) μ 0 + n 1 / 2 ( η ^ η ) H , S H T + o p ( 1 ) = n 1 / 2 i = 1 n e , S H T i + o p ( 1 ) ,
where
e , S H T i = { D i π i 1 ( η ; Z i , Y i ) { Y i m i 0 ( γ ; Z i ) } + m i 0 ( γ ; Z i ) μ 0 + I 22 1 s i ( η ) H , S H T } ,
s i ( η ) is the ith term in Equation (11), H , S H T = E [ { π ( η ; Z , Y ) 1 } Y ( 1 , Z , Y ) ] . By the Slutsky Theorem and the asymptotic property of μ ^ H T , it is easily shown that
n μ ^ S H T μ 0 L N 0 , σ , S H T 2 a s n , for = E , R , P ,
where σ , S H T 2 = Var ( e , S H T i ) .
Now, we show the asymptotic property of μ ^ S R I for unknown η . By the definition of μ ^ S R I , we have the following form:
n ( μ ^ S R I μ 0 ) = n 1 / 2 i = 1 n D i Y i + ( 1 D i ) m ^ i 0 ( γ ^ ; Z i ) μ 0 .
Taking the Taylor expansion of m ^ i 0 ( γ ^ ; Z i ) at η yields
i = 1 n ( 1 D i ) m ^ i 0 ( γ ^ ; Z i ) = i = 1 n ( 1 D i ) m ^ i 0 ( γ ; Z i ) + n 1 / 2 ( η ^ η ) 1 n i = 1 n ( 1 D i ) η m i 0 ( γ ; Z i ) η = η ˜ + o p ( 1 ) .
Using the conclusion given in the proof of Theorem 1, we can easily get n ( μ ^ R I μ 0 ) = n ( μ ^ H T μ 0 ) + o p ( 1 ) . Combining the above results yields
n ( μ ^ S R I μ 0 ) = n 1 / 2 i = 1 n D i Y i + ( 1 D i ) m ^ i 0 ( γ ; Z i ) μ 0 + n 1 / 2 ( η ^ η ) 1 n i = 1 n ( 1 D i ) Y i η m ^ i 0 ( γ ; Z i ) η = η ˜ = n 1 / 2 i = 1 n D i Y i m i 0 ( γ ; Z i ) π i ( η ; Z i , Y i ) + n 1 / 2 i = 1 n m i 0 ( γ ; Z i ) μ 0 + n 1 / 2 ( η ^ η ) I 22 1 s i ( η ) H , S R I + o p ( 1 ) = n 1 / 2 i = 1 n e , S R I i + o p ( 1 ) ,
where η ˜ lies in the line segment between η ^ and η , e , S R I i = D i π i 1 ( η ; Z i , Y i ) { Y i m i 0 ( γ ; Z i ) } + m i 0 ( γ ; Z i ) μ 0 + I 22 1 s i ( η ) H , S R I , H , S R I = E { ( 1 D i ) ( 0 , 0 p 1 , ( Y i m i 0 ( γ ; Z i ) ) 2 ) } and 0 p 1 is a ( p 1 ) × 1 zero vector and p is the dimension of covariate Z i . By the Slutsky Theorem and the asymptotic property of μ ^ S H T , it is easily shown that
n μ ^ S R I μ 0 L N 0 , σ , S R I 2 a s n , for = E , R , P ,
where σ , S R I 2 = Var ( e , S R I i ) .
Now, we prove the asymptotic properties of μ ^ S A I for unknown η . Combining the above results and taking the Taylor expansion of μ ^ S A I at η , we obtain
n ( μ ^ S A I μ 0 ) = n 1 / 2 i = 1 n D i Y i π i ( η ; Z i , Y i ) + 1 D i π i ( η ; Z i , Y i ) m ^ i 0 ( γ ; Z i ) μ 0 + n 1 / 2 ( η ^ η ) 1 n i = 1 n D i Y i η π i 1 ( η ; Z i , Y i ) | η = η ˜ + n 1 / 2 ( η ^ η ) 1 n i = 1 n m ^ i 0 ( γ ; Z i ) η 1 D i π i ( η ; Z i , Y i ) η = η ˜ + n 1 / 2 ( η ^ η ) 1 n i = 1 n 1 D i π i ( η ; Z i , Y i ) η m ^ i 0 ( γ ; Z i ) η = η ˜
where η ˜ lies in the line segment between η ^ and η . Using the conclusion given in the proof of Theorem 1, we have
n ( μ ^ A I μ 0 ) = n ( μ ^ H T μ 0 ) + o p ( 1 ) ,
1 n i = 1 n 1 D i π i ( η ; Z i , Y i ) η m ^ i 0 ( γ ; Z i ) | η = η ˜ = o p ( 1 ) ,
1 n i = 1 n m ^ i 0 ( γ ; Z i ) η 1 D i π i ( η ; Z i , Y i ) | η = η ˜ = M S A I + o p ( 1 ) ,
where M , S A I = E { ( 1 π i ( η ; Z i , Y i ) ) m i 0 ( γ ; Z i ) ( 1 , Z i , Y i ) } . Combining the above results leads to
n ( μ ^ S A I μ 0 ) = n 1 / 2 i = 1 n D i Y i m i 0 ( γ ; Z i ) π i ( η ; Z i , Y i ) + m i 0 ( γ ; Z i ) μ 0 + n 1 / 2 ( η ^ η ) I 22 1 s i ( η ) ( H , S H T + M , S A I ) + o p ( 1 ) = n 1 / 2 i = 1 n e , S A I i + o p ( 1 ) ,
where e , S A I i = D i π i 1 ( η ; Z i , Y i ) { Y i m i 0 ( γ ; Z i ) } + m i 0 ( γ ; Z i ) μ 0 + I 22 1 s i ( η ) H , S R I , H , S A I = H , S H T + M , S A I . By the Slutsky Theorem and the asymptotic property of μ ^ S H T , it is easily shown that
n μ ^ S R I μ 0 L N 0 , σ , S A I 2 a s n , for = E , R , P ,
where σ , S R I 2 = Var ( e , S A I i ) . □

Appendix A.2. Tables: Empirical Type I Error Rates for Scenarios (B) and (C) with Balanced and Unbalanced Designs

Table A1. Empirical Type I error rates for Scenarios (B) and (C) with balanced designs.
Table A1. Empirical Type I error rates for Scenarios (B) and (C) with balanced designs.
n = 50n = 100n = 150
ScenarioCaseSHTSRISAICCSHTSRISAICCSHTSRISAICC
(B)E10.0410.0550.0550.0680.0420.0530.0590.0820.0480.0580.0620.090
E20.0460.0490.0500.0520.0450.0510.0520.0600.0420.0540.0540.066
E30.0450.0450.0440.0510.0370.0530.0540.0600.0460.0560.0580.060
E40.0500.0480.0470.0520.0320.0550.0580.0600.0460.0520.0540.056
(C)E10.0400.0650.0640.0810.0310.0520.0560.0820.0460.0620.0640.082
E20.0300.0520.0430.0570.0460.0530.0520.0630.0440.0500.0560.068
E30.0340.0580.0550.0600.0340.0500.0530.0600.0360.0480.0540.060
E40.0310.0440.0430.0420.0410.0520.0560.0820.0480.0540.0540.075
Note: SHT, SRI, SAI and CC denote Wald-type test approaches based on IPW, regression imputation, AIPW and CC, respectively.
Table A2. Empirical Type I error rates for Scenarios (B) and (C) with unbalanced designs.
Table A2. Empirical Type I error rates for Scenarios (B) and (C) with unbalanced designs.
N = 200N = 300N = 400
Scen. n E : n R : n P CaseSHTSRISAICCSHTSRISAICCSHTSRISAICC
(B)2:1:1E10.0350.0580.0580.0680.0340.0530.0590.0820.0360.0620.0600.078
E20.0420.0520.0460.0620.0420.0530.0500.0640.0460.0520.0500.070
E30.0410.0660.0610.0560.0380.0620.0580.0420.0340.0510.0490.048
E40.0330.0540.0510.0500.0370.0560.0510.0490.0460.0520.0510.068
2:2:1E10.0260.0520.0500.0790.0490.0680.0660.0830.0300.0400.0420.080
E20.0310.0510.0490.0370.0410.0590.0570.0470.0460.0580.0560.062
E30.0260.0460.0450.0400.0400.0580.0520.0480.0490.0560.0520.060
E40.0290.0440.0420.0420.0310.0460.0450.0350.0300.0440.0380.044
3:2:1E10.0410.0600.0630.0720.0370.0500.0460.0690.0400.0460.0460.078
E20.0320.0550.0570.0540.0350.0460.0430.0490.0340.0440.0420.042
E30.0320.0510.0450.0550.0380.0560.0520.0500.0320.0490.0450.046
E40.0330.0470.0530.0500.0320.0590.0570.0520.0380.0500.0480.038
(C)2:1:1E10.0330.0670.0640.0770.0290.0540.0500.0690.0320.0620.0540.082
E20.0430.0590.0560.0510.0480.0470.0470.0660.0500.0580.0520.064
E30.0270.0500.0490.0500.0420.0620.0600.0420.0280.0500.0510.059
E40.0280.0510.0510.0430.0260.0440.0420.0420.0260.0460.0450.050
2:2:1E10.0210.0490.0470.0810.0440.0680.0670.0820.0300.0420.0440.090
E20.0260.0470.0430.0370.0390.0580.0560.0600.0240.0400.0420.054
E30.0260.0460.0450.0400.0580.0520.0480.0490.0490.0560.0520.060
E40.0290.0440.0420.0420.0310.0460.0450.0350.0300.0440.0380.044
3:2:1E10.0390.0690.0690.0820.0360.0510.0480.0740.0340.0460.0440.082
E20.0390.0580.0550.0650.0370.0450.0470.0520.0340.0440.0420.042
E30.0310.0510.0500.0460.0300.0560.0540.0500.0290.0500.0470.050
E40.0290.0500.0520.0480.0360.0580.0600.0580.0310.0440.0440.045
Note: SHT, SRI, SAI and CC denoteWald-type test approaches based on IPW, terline regression imputation, AIPW and CC, respectively.

Appendix A.3. Figures: Powers for Scenario (A) with a, n, Treatment Effects, Parameters α and γ

Figure A1 presents empirical powers against a = ( μ E μ P ) / ( μ R μ P ) for missingness data mechanism models E1–E4 under the balanced designs, with n = 50 , 100 and 150. Figure A2 presents empirical powers against a = ( μ E μ P ) / ( μ R μ P ) for missingness data mechanism models E1–E4 under the unbalanced designs with n E : n R : n P = 2:1:1.
Figure A1. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against a for missingness data mechanism model E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for n = 50 (the first row), 100 (middle row) and 150 (the last row) under the balanced designs.
Figure A1. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against a for missingness data mechanism model E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for n = 50 (the first row), 100 (middle row) and 150 (the last row) under the balanced designs.
Mathematics 11 00246 g0a1
Figure A2. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against a for missingness data mechanism models E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for N = 120 (the first row), 200 (middle row) and 280 (the last row) under the unbalanced designs with n E : n R : n P = 2:1:1.
Figure A2. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against a for missingness data mechanism models E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for N = 120 (the first row), 200 (middle row) and 280 (the last row) under the unbalanced designs with n E : n R : n P = 2:1:1.
Mathematics 11 00246 g0a2
Figure A3 presents empirical powers against the sample size n for missingness data mechanism models E1–E4 under balanced design for a = 0.4 and 0.6 . Figure A4 presents empirical powers against the sample size n for missingness data mechanism models E1–E4 under unbalanced design with n E : n R : n P = 2:1:1.
Figure A3. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against the sample size n for missingness data mechanism models E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for a = 0.4 (upper row) and 0.6 (lower row) under balanced design.
Figure A3. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against the sample size n for missingness data mechanism models E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for a = 0.4 (upper row) and 0.6 (lower row) under balanced design.
Mathematics 11 00246 g0a3
Figure A4. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against the sample size n for missingness data mechanism models E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for a = 0.4 (upper row) and 0.6 (lower row) under unbalanced design ( n E : n R : n P = 2:1:1).
Figure A4. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against the sample size n for missingness data mechanism models E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for a = 0.4 (upper row) and 0.6 (lower row) under unbalanced design ( n E : n R : n P = 2:1:1).
Mathematics 11 00246 g0a4
Figure A5 presents empirical powers against treatment effect α E 1 for four missingness data mechanism models E1–E4 under balanced design. Figure A6 presents empirical powers against treatment effects α E 1 for four missingness data mechanism models E1–E4 under unbalanced design with n E : n R : n P = 2:1:1.
Figure A5. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against treatment effect α E 1 for missingness data mechanism models E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for ( a , n ) = (0.2,80) (the first row), (0.2,120) (the second row), (0.6,80) (the third row) and (0.6,120) (the last row) under the balanced designs.
Figure A5. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against treatment effect α E 1 for missingness data mechanism models E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for ( a , n ) = (0.2,80) (the first row), (0.2,120) (the second row), (0.6,80) (the third row) and (0.6,120) (the last row) under the balanced designs.
Mathematics 11 00246 g0a5aMathematics 11 00246 g0a5b
Figure A6. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against treatment effect α E 1 for missingness data mechanism models E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for ( a , n ) = (0.2,200) (the first row), (0.2,320) (the second row), (0.6,200) (the third row), and (0.6,320) (the last row) under the unbalanced designs with n E : n R : n P = 2:1:1.
Figure A6. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against treatment effect α E 1 for missingness data mechanism models E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for ( a , n ) = (0.2,200) (the first row), (0.2,320) (the second row), (0.6,200) (the third row), and (0.6,320) (the last row) under the unbalanced designs with n E : n R : n P = 2:1:1.
Mathematics 11 00246 g0a6aMathematics 11 00246 g0a6b
Figure A7 presents empirical powers against the tilting parameter γ for three missingness data mechanism models E2–E4 under balanced design. Figure A8 presents empirical powers against the tilting parameter γ for three missingness data mechanism models E2–E4 under unbalanced design with n E : n R : n P = 2:1:1.
Figure A7. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against γ for missingness data mechanism models E2 (the first row), E3 (the middle row) and E4 (the last row) together with ( a , n ) = (0.2,80), (0.2,120), (0.6,80) and (0.6,120), respectively, under the balanced designs.
Figure A7. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against γ for missingness data mechanism models E2 (the first row), E3 (the middle row) and E4 (the last row) together with ( a , n ) = (0.2,80), (0.2,120), (0.6,80) and (0.6,120), respectively, under the balanced designs.
Mathematics 11 00246 g0a7
Figure A8. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against γ for missingness data mechanism models E2 (the first row), E3 (the middle row) and E4 (the last row) together with ( a , n ) = (0.2,200), (0.2,320), (0.6,200) and (0.6,320), respectively, under the unbalanced designs with n E : n R : n P = 2:1:1.
Figure A8. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against γ for missingness data mechanism models E2 (the first row), E3 (the middle row) and E4 (the last row) together with ( a , n ) = (0.2,200), (0.2,320), (0.6,200) and (0.6,320), respectively, under the unbalanced designs with n E : n R : n P = 2:1:1.
Mathematics 11 00246 g0a8

References

  1. Rothmann, M.D.; Wiens, B.L.; Chan, I.S.F. Design and Analysis of Non-Inferiority Trials; (Chapman & Hall/CRC Biostatistics Series, 43); Chapman & Hall/CRC Press: New York, NY, USA, 2011; pp. 149–166. [Google Scholar]
  2. Tang, M.L.; Tang, N.S.; Chan, I.S.; Chan, B.P. Sample size determination for establishing equivalence/noninferiority via ratio of two proportions in matched-pair design. Biometrics 2002, 58, 957–963. [Google Scholar] [CrossRef] [PubMed]
  3. Tang, N.S.; Tang, M.L.; Chan, S.F. On tests of equivalence via non-unity relative risk for matached-pair design. Stat. Med. 2003, 22, 1217–1233. [Google Scholar] [CrossRef] [PubMed]
  4. Tang, N.S.; Tang, M.L.; Chan, S.F. Confidence interval construction for proportion difference in small-sample paired studies. Stat. Med. 2010, 24, 3565–3579. [Google Scholar] [CrossRef]
  5. Wellek, S. Statistical methods for the analysis of two-arm non-inferiority trials with binary outcomes. Biom. J. 2005, 47, 48–61. [Google Scholar] [CrossRef] [PubMed]
  6. Freitag, G.; Lange, S.; Munk, A. Non-parametric assessment of non-inferiority with censored data. Stat. Med. 2006, 25, 1201–1217. [Google Scholar] [CrossRef] [PubMed]
  7. Arboretti, G.R.; Bonnini, S.; Solmi, F. Non-parametric two-stage active control testing method for non-inferiority tests. Quad. Stat. 2008, 10, 73–98. [Google Scholar]
  8. Pesarin, F.; Salmaso, L.; Carrozzo, E.; Arboretti, R. Union-intersection permutation solution for two-sample equivalence testing. Stat. Comput. 2016, 26, 693–701. [Google Scholar] [CrossRef]
  9. Gamalo, M.A.; Wu, R.; Tiwari, R.C. Bayesian approach to non-inferiority trials for normal means. Stat. Methods Med. Res. 2016, 25, 221–240. [Google Scholar] [CrossRef]
  10. Tang, M.L.; Tang, N.S. Tests of noninferiority via rate difference for three-arm clinical trials with placebo. J. Biopharm. Stat. 2004, 14, 337–347. [Google Scholar] [CrossRef]
  11. Koch, G.G.; Röhmel, J. Hypothesis testing in the gold standard design for proving the efficacy of an experimental treatment relative to placebo and a reference. J. Biopharm. Stat. 2004, 14, 315–325. [Google Scholar] [CrossRef]
  12. Pigeot, I.; Schafer, J.; Rohmel, J.; Hauschke, D. Assessing non-inferiority of a new treatment in a three-arm clinical trial including a placebo. Stat. Med. 2003, 22, 883–899. [Google Scholar] [CrossRef] [PubMed]
  13. Mielke, M.; Munk, A. The assessment and planning of non-inferiority trials for retention of effect hypotheses-towards a general approach. arXiv 2009, arXiv:0912.4169. [Google Scholar]
  14. Lui, K.; Chang, K. Notes on testing noninferiority in ordinal data under the parallel groups design. J. Biopharm. Stat. 2013, 3, 1294–1307. [Google Scholar] [CrossRef] [PubMed]
  15. Brannath, W.; Scharpenberg, M.; Schmidt, S. Single-stage, three-arm, adaptive test strategies for non-inferiority trials with an unstable reference. Stat. Med. 2022, 41, 5033–5045. [Google Scholar] [CrossRef]
  16. Tang, N.S.; Yu, B.; Tang, M.L. Testing non-inferiority of a new treatment in three-arm clinical trials with binary endpoints. BMC Med. Res. Methodol. 2014, 14, 134. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Tang, N.S.; Yu, B. Simultaneous confidence interval for assessing non-inferiority with assay sensitivity in a three-arm trial with binary endpoints. Pharm. Stat. 2020, 19, 518–531. [Google Scholar] [CrossRef] [PubMed]
  18. Tang, N.S.; Yu, B. Bayesian sample size determination in a three-arm non-inferiority trial with binary endpoints. J. Biopharm. Stat. 2022, 32, 768–788. [Google Scholar] [CrossRef]
  19. Paul, E.; Tiwari, R.C.; Chowdhury, S.; Ghosh, S. A more powerful test for three-arm non-inferiority via risk difference: Frequentist and Bayesian approaches. J. Appl. Stat. 2022, 12, 1–23. [Google Scholar] [CrossRef]
  20. Homma, G.; Diamon, T. Sample size calculation for “gold-standard” noninferiority trials with fixed margins and negative binomial endpoints. Stat. Biopharm. Res. 2021, 13, 435–447. [Google Scholar] [CrossRef]
  21. Ghosh, S.; Paul, E.; Chowdhury, S.; Tiwari, R.C. New approaches for testing non-inferiority for three-arm trials with Poisson distributed outcomes. Biostatistics 2022, 23, 136–156. [Google Scholar] [CrossRef]
  22. Ghosh, S.; Guo, W.; Ghosh, S. A hierarchical testing procedure for three arm non-inferiority trials. Comput. Stat. Data Anal. 2022, 174, 107521. [Google Scholar] [CrossRef]
  23. Scharpenberg, M.; Brannath, W. Simultaneous confidence intervals for an extended Koch-Röhmel design in three-arm non-inferiority trials. arXiv 2022, arXiv:2210.08931. [Google Scholar] [CrossRef]
  24. Tang, N.S.; Liang, F. Confidence intervals for assessing non-inferiority with assay sensitivity in a three-arm trial with normally distributed endpoints. Mathematics 2022, 10, 167. [Google Scholar] [CrossRef]
  25. Munzel, U. Nonparametric non-inferiority analyses in the three-arm design with active control and placebo. Stat. Med. 2009, 28, 3643–3656. [Google Scholar] [CrossRef] [PubMed]
  26. Tseng, Y.K.; Hsu, K.N. Study design for a three-arm equivalence clinical trial with binomially distributed outcomes. J. Biopharm. Stat. 2021, 31, 736–744. [Google Scholar] [CrossRef] [PubMed]
  27. Little, R.J.A.; Rubin, D.B. Statistical Analysis with Missing Data, 3rd ed.; (Wiley Series in Probability and Statistics); John Wiley & Sons: Hoboken, NJ, USA, 2020; pp. 45–87. [Google Scholar]
  28. Choi, S.C.; Stablein, D.M. Practical tests for comparing two proportions with incomplete data. J. R. Stat. Soc. Ser. C 1982, 31, 256–262. [Google Scholar] [CrossRef]
  29. Tang, M.L.; Tang, N.S. Exact tests for comparing two paired proportions with incomplete data. Biom. J. 2004, 14, 72–82. [Google Scholar] [CrossRef]
  30. Maritz, J.S. A permutation paired test allowing for missing values. Aust. N. Z. J. Stat. 1995, 37, 153–159. [Google Scholar] [CrossRef]
  31. Yu, D.; Lim, J.; Liang, F.; Kim, K.; Kim, B.S.; Jang, W. Permutation test for incomplete paired data with application to cDNA microarray data. Comput. Stat. Data Anal. 2013, 56, 510–521. [Google Scholar] [CrossRef]
  32. Pesarin, F. Multivariate Permutation Tests: With Applications in Biostatistics; (Wiley Series in Probability and Statistics); John Wiley & Sons: Chichester, UK, 2001; pp. 279–303. [Google Scholar]
  33. Pesarin, F.; Celant, G.; Salmaso, L. Two sample permutation tests for repeated measures with missing values. J. Appl. Stat. Sci. 2000, 9, 291–304. [Google Scholar]
  34. Robins, J.; Rotnitzky, A.; Zhao, L.P. Estimation of regression coefficients when some regressors are not always observed. J. Am. Stat. Assoc. 1994, 89, 846–866. [Google Scholar] [CrossRef]
  35. Lee, S.Y.; Tang, N.S. Bayesian analysis of nonlinear structural equation models with nonignorable missing data. Psychometrika 2006, 71, 541–564. [Google Scholar] [CrossRef]
  36. Wang, Z.Q.; Tang, N.S. Bayesian quantile regression with mixed discrete and nonignorable missing covariates. Bayesian Anal. 2020, 15, 579–604. [Google Scholar] [CrossRef]
  37. Kim, J.K.; Yu, C.L. A semiparametric estimation of mean functionals with nonignorable missing data. J. Am. Stat. Assoc. 2011, 106, 157–165. [Google Scholar] [CrossRef] [Green Version]
  38. Tang, N.S.; Zhao, P.Y.; Zhu, H.T. Empirical likelihood for estimating equations with nonignorable missing data. Stat. Sin. 2014, 24, 723–747. [Google Scholar] [PubMed] [Green Version]
  39. Qin, J.; Lawless, J. Empirical likelihood and general estimating equations. Ann. Stat. 1994, 22, 300–325. [Google Scholar] [CrossRef]
  40. Choi, S.C.; Stablein, D.M. Comparing incomplete paired binomial data under non-random mechanisms. Stat. Med. 1988, 7, 929–939. [Google Scholar] [CrossRef]
  41. Li, H.Q.; Tian, G.L.; Tang, N.S.; Cao, H.Y. Assessing non-inferiority for incomplete paired-data under non-ignorable missing mechanism. Comput. Stat. Data Anal. 2018, 127, 69–81. [Google Scholar] [CrossRef]
  42. Hida, E.; Tango, T. On the three-arm non-inferiority trial including a placebo with a prespecified margin. Stat. Med. 2011, 30, 224–231. [Google Scholar] [CrossRef]
  43. Kieser, M.; Friede, T. Planning and analysis of three-arm non-inferiority trials with binary endpoints. Stat. Med. 2007, 26, 253–273. [Google Scholar] [CrossRef]
  44. Ghosh, P.; Nathoo, F.; Gönen, M.; Tiwari, R.C. Assessing non-inferiority in a three-arm trial using the Bayesian approach. Stat. Med. 2011, 30, 1795–1808. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Morikawa, K.; Kim, J.K.; Kano, Y. Semiparametric maximum likelihood estimation with data missing not at random. Can. J. Stat. 2017, 45, 393–409. [Google Scholar] [CrossRef]
  46. Arnastauskaite, J.; Ruzgas, T.; Brazenas, M. A new goodness of fit test for multivariate normality and comparative simulation study. Mathematics 2021, 9, 3003. [Google Scholar] [CrossRef]
  47. Elbouch, S.E.; Michel, O.J.; Comon, P. A normality test for multivariate dependent samples. Signal Process. 2022, 201, 108705. [Google Scholar] [CrossRef]
  48. Cook, T.; Zea, R. Missing data and sensitivity analysis for binary data with implications for sample size and power of randomized clinical trials. Stat. Med. 2019, 39, 192–204. [Google Scholar] [CrossRef] [PubMed]
  49. Zhao, P.Y.; Tang, M.L.; Tang, N.S. Robust estimation of distribution functions and quantiles with non-ignorable missing data. Can. J. Stat. 2013, 129, 193–205. [Google Scholar] [CrossRef]
Figure 1. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against the sample size n under balanced design with missingness data mechanism models E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for a = 0.2 .
Figure 1. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against the sample size n under balanced design with missingness data mechanism models E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for a = 0.2 .
Mathematics 11 00246 g001
Figure 2. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against the sample size n under unbalanced design (i.e., n E : n R : n P = 2:1:1), with missingness data mechanism models E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for a = 0.2 .
Figure 2. SHT, SRI, SAI and CC represent empirical powers evaluated from IPW, regression imputation, AIPW and CC methods against the sample size n under unbalanced design (i.e., n E : n R : n P = 2:1:1), with missingness data mechanism models E1 (left panel), E2 (left second panel), E3 (right second panel) and E4 (right panel) for a = 0.2 .
Mathematics 11 00246 g002
Table 1. Empirical type I error rates for balanced designs in the first simulation study.
Table 1. Empirical type I error rates for balanced designs in the first simulation study.
n = 50n = 100n = 150
CaseSHTSRISAICCSHTSRISAICCSHTSRISAICC
E10.0480.0510.0530.0570.0480.0500.0520.0740.0500.0510.0500.083
E20.0530.0550.0550.0520.0480.0490.0500.0550.0540.0550.0550.092
E30.0490.0550.0540.0700.0530.0530.0580.0950.0520.0540.0540.117
E40.0510.0530.0520.0610.0480.0500.0530.0860.0470.0510.0490.088
Note: SHT, SRI, SAI and CC denote Wald-type test approaches based on IPW, regression imputation, AIPW and CC, respectively.
Table 2. Empirical type I error rates for unbalanced designs in the first simulation study.
Table 2. Empirical type I error rates for unbalanced designs in the first simulation study.
N = 200N = 500
n E : n R : n P SHTSRISAISHTSRISAI
2:2:10.0460.0480.0480.0490.0500.051
3:3:10.0530.0530.0510.0450.0460.046
4:4:10.0530.0540.0580.0450.0470.050
2:1:10.0480.0510.0500.0450.0440.045
3:2:10.0550.0590.0600.0470.0490.046
4:3:10.0380.0440.0470.0540.0550.055
4:2:10.0480.0510.0530.0630.0640.059
3:1:10.0550.0560.0570.0520.0550.056
4:1:10.0500.0570.0580.0510.0520.054
Table 3. Sensitivity analysis of the proposed test statistics in the second simulation study.
Table 3. Sensitivity analysis of the proposed test statistics in the second simulation study.
n = 50 n = 150
γ E SHTSRISAICCSHTSRISAICC
−0.20.0380.0460.0440.0330.0510.0500.0510.033
−0.10.0530.0550.0550.0520.0540.0550.0550.092
−0.050.0410.0450.0460.0660.0450.0480.0470.061
0.00.0480.0510.0530.0570.0500.0510.0500.083
0.050.0500.0510.0520.0650.0540.0550.0550.092
0.10.0470.0510.0560.0870.0530.0560.0550.111
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, W.; Zhang, Y.; Tang, N. Non-Parametric Non-Inferiority Assessment in a Three-Arm Trial with Non-Ignorable Missing Data. Mathematics 2023, 11, 246. https://doi.org/10.3390/math11010246

AMA Style

Li W, Zhang Y, Tang N. Non-Parametric Non-Inferiority Assessment in a Three-Arm Trial with Non-Ignorable Missing Data. Mathematics. 2023; 11(1):246. https://doi.org/10.3390/math11010246

Chicago/Turabian Style

Li, Wei, Yunqi Zhang, and Niansheng Tang. 2023. "Non-Parametric Non-Inferiority Assessment in a Three-Arm Trial with Non-Ignorable Missing Data" Mathematics 11, no. 1: 246. https://doi.org/10.3390/math11010246

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop