Next Article in Journal
Association between Laughter and Lifestyle Diseases after the Great East Japan Earthquake: The Fukushima Health Management Survey
Next Article in Special Issue
Literature Cases Summarized Based on Their Polysomnographic Findings in Rett Syndrome
Previous Article in Journal
The Dose–Response Relationships of Different Dimensions of Physical Activity with Daily Physical Function and Cognitive Function in Chinese Adults with Hypertension: A Cross-Sectional Study
Previous Article in Special Issue
Shifts in Sources of Food but Stable Nutritional Outcomes among Children in the Early Months of the COVID-19 Pandemic
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

A Critical Review of Statistical Methods for Twin Studies Relating Exposure to Early Life Health Conditions

1
Institute for Biomedical Research and Innovation, National Research Council, 90146 Palermo, Italy
2
Institute Desbrest of Epidemiology and Public Health, Inserm and University of Montpellier, 34093 Montpellier, France
3
Department of Surgical Sciences, Dentistry, Gynecology and Pediatrics, Pediatric Division, University of Verona, 37134 Verona, Italy
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2021, 18(23), 12696; https://doi.org/10.3390/ijerph182312696
Submission received: 25 October 2021 / Revised: 20 November 2021 / Accepted: 30 November 2021 / Published: 2 December 2021

Abstract

:
When investigating disease etiology, twin data provide a unique opportunity to control for confounding and disentangling the role of the human genome and exposome. However, using appropriate statistical methods is fundamental for exploiting such potential. We aimed to critically review the statistical approaches used in twin studies relating exposure to early life health conditions. We searched PubMed, Scopus, Web of Science, and Embase (2011–2021). We identified 32 studies and nine classes of methods. Five were conditional approaches (within-pair analyses): additive-common-erratic (ACE) models (11 studies), generalized linear mixed models (GLMMs, five studies), generalized linear models (GLMs) with fixed pair effects (four studies), within-pair difference analyses (three studies), and paired-sample tests (two studies). Four were marginal approaches (unpaired analyses): generalized estimating equations (GEE) models (five studies), GLMs with cluster-robust standard errors (six studies), GLMs (one study), and independent-sample tests (one study). ACE models are suitable for assessing heritability but require adaptations for binary outcomes and repeated measurements. Conditional models can adjust by design for shared confounders, and GLMMs are suitable for repeated measurements. Marginal models may lead to invalid inference. By highlighting the strengths and limitations of commonly applied statistical methods, this review may be helpful for researchers using twin designs.

1. Introduction

Investigating disease etiology is one of the major goals in the practice of epidemiology. However, controlling for the totality of confounding factors is a big challenge for investigators aiming to unveil the causal effect of a given exposure on a given disease [1].
Twins share a large number of genes, prenatal, neonatal, and postnatal factors, and provide a unique opportunity to account for the human genome and exposome [2,3]. In pediatric populations, twin data are particularly useful to shed light on the origin of associations between fetal and early life exposures and the risk of later diseases. In particular, if paired (conditional) analyses confirm the associations found through unpaired (marginal) analyses, there is evidence of a causal pathway; conversely, if paired analyses do not confirm the associations, they are to be ascribed to shared factors such as the genome or maternal variables [4,5].
Twin data also allow considering the genome as the exposure of interest [6]. Monozygotic (MZ) twins share 100% of their genes, while dizygotic (DZ) twins share about 50% of their genes. Therefore, if MZ twins are more similar than DZ twins with regard to the outcome of interest, this is indicative of an effect of the genome on that outcome. Conversely, non-genetic variability is to be ascribed to the totality of shared and individual environmental exposures (the exposome) [7].
Using appropriate statistical methods is fundamental for exploiting the potential of twin data and carrying out valid statistical inference [8]. In 2005, Carlin et al. [9] critically reviewed the regression approaches for the analysis of twin data, showing that transferring standard methods to twin data is not straightforward, because they need to account for the paired structure of the data that induces correlation between observations. Moreover, a clear awareness of the assumptions underlying each method and the consequent parameter interpretation is fundamental [9]. Recently, Tan et al. [10] reviewed twin methods used in epigenetic studies and found that simplistic approaches (disregarding within-pair dependence and potential confounding effects), such as the unpaired t-test, are rare but still present in the literature, possibly undermining the validity of the evidence base. Similar works have not been performed with regard to the methods applied in published twin studies relating exposure to diagnosable diseases in a broader sense.
This research was motivated by the earlyFOOD (“Long-term impact of gestational and early-life dietary habits on infant gut immunity and disease risk”) project (EU-H2020 Era-Net HDHL-INTIMIC cofounded call “Interrelation of the Intestinal Microbiome, Diet and Health”). Human microbial colonization is a crucial phase in the early development of a child. Indeed, a growing body of literature has associated early-life imbalances of the gut microbiota with several diseases, especially immune-mediated, metabolic, and neurodevelopmental disorders [11]. Colonization begins at birth when the newborn is exposed to microbes of primarily maternal origin. Then, it continues to develop under the influence of maternal antibodies transferred through breastfeeding, dietary-related metabolites, and environmental factors. In this regard, disentangling the role of different exposures in the etiology of the aforementioned conditions is of particular importance. Therefore, we focused on studies assessing the effects of diet or other exposures on the risk of obesity, allergic and neurodevelopmental disorders, aiming to review the statistical approaches used and to summarize their strengths and limitations.

2. Materials and Methods

On 12 April 2021, we searched PubMed, Scopus, Web of Science, and Embase, using combinations of Medical Subject Headings, explosion searches, and free keywords. The full search strategy is reported in the Supplementary Material S1.
The inclusion criteria were: (1) presence of children aged 0–12 years among the participants; (2) presence of statistical analyses carried out exclusively on twin pairs; (3) assessment of diet or other exposures; (4) obesity, asthma, eczema, rhinitis, and neurodevelopmental disorders were among the study outcomes; (5) publication year from 2011 to 2021; and (6) English language. The exclusion criteria were: (1) out of topic; and (2) case reports, reviews, or abstracts. The studies identified in the four databases were combined and duplicates were removed. Three reviewers (S.F., L.M., and S.L.G.) screened the articles for relevance based on titles and abstracts. The same reviewers screened the full texts of potentially eligible articles. The following information was extracted from the included articles: first author and publication year; outcomes; exposures and confounders; study design (including size and age of the twin sample); statistical methods; and main results.

3. Results

Figure 1 summarizes the study selection process. We identified 618 articles through the four electronic databases. After the exclusion of duplicates, 364 articles were screened based on titles and abstracts, and 52 articles were identified as potentially eligible. Of them, 20 were excluded following full-text evaluation (Figure 1). Therefore, 32 articles were included in this review.
Table S1 summarizes the characteristics of the included studies. We identified nine classes of statistical methods. Thereafter, we will present them from the most complex (the most informative) to the most simple (the least informative) one, based on the number of model parameters and underlying assumptions. Five were conditional approaches (within-pair analyses): additive-common-erratic (ACE) models [12,13,14,15,16,17,18,19,20,21,22], generalized linear mixed models (GLMMs) [23,24,25,26,27], generalized linear models (GLMs) with fixed pair effects [28,29,30,31], within-pair difference analyses [32,33,34], and paired-sample tests [33,35]. Four were marginal approaches (unpaired analyses): generalized estimating equations (GEE) models [26,30,36,37,38], GLMs with cluster-robust standard errors [24,29,31,39,40,41], GLMs [42], and independent-sample tests [43].

3.1. ACE Models

A univariate ACE model [44] can be presented as:
{ Y 1 j = μ + a A 1 j + c C 1 j + e E 1 j + β x 1 j + γ z 1 j Y 2 j = μ + a A 2 j + c C 2 j + e E 2 j + β x 2 j + γ z 2 j ,
where Y 1 j and Y 2 j are quantitative outcome values in twins 1 and 2, respectively, with j = 1 , 2 , , n and n equal to the number of twin pairs. Parameter μ is an intercept, x is an optional exposure of interest ( β quantifies its effect), and z is an optional confounder ( γ quantifies its effect); x and z may also be vectors.
The effect of genome and exposome is accounted for by means of unobserved random variables, encompassing additive genetic ( A 1 and A 2 ), common (or shared) environmental ( C 1 and C 2 ), and erratic (or unique) environmental ( E 1 and E 2 ) factors. The random variables are assumed mutually independent within the same twin, and to follow a standard normal distribution. Consequently, a 2 is the outcome variance explained by genetic factors, c 2 is the variance explained by shared environmental factors, and e 2 is the variance explained by unique environmental factors (including measurement error). The percentage genetic contribution (heritability) can be derived as a 2 / ( a 2 + c 2 + e 2 ) ; percentage environmental contributions can be derived similarly. The ACE model is identified due to the following assumptions: the correlation between A 1 and A 2 is 1 in MZ twins and 0.5 in DZ twins; the correlation between C 1 and C 2 is 1 ( C 1 = C 2 ) both in MZ and in DZ twins. All other pairs of latent variables are assumed independent between twins in the same pair. Model (1) can be estimated either from individual data or from observed sample sizes, means, and correlation matrices in MZ and DZ twins.
Only two studies applied model (1) in its plain form [12,14]. In such studies, the ACE model was applied to binary outcomes using tetrachoric correlations. In children aged 1 year, Bunyavanich et al. [12] highlighted a predominance of shared environmental contributions for asthma diagnosis and medication use (84 to 88% through different models) and a predominance of heritability for asthma hospitalizations (55 to 95%). In children aged 3 years, they highlighted a substantial genetic contribution for all three outcomes (34 to 65%). In children aged 3–9 years, Kahr et al. [14] highlighted an overall contribution of heritability equal to 94% for atopic dermatitis, 54% for asthma, and 43% for hay fever. Moreover, they did not find significant differences in the genetic contribution (gene–environment interactions) after stratification by age, sex, gestational age, delivery mode, and maternal smoking. In this regard, it appears that the authors applied a statistical test (e.g., a Wald test) to compare independent estimates obtained from the different sub-groups.
The moderated ACE model [45] offers a formal way of assessing gene–environment interactions by allowing parameters (including the intercept) to be linear functions of one or more moderators. Using this model in children aged 3 and 12 years, Robbers et al. [20] highlighted a lower heritability of internalizing/externalizing problems in children from divorced families (3 to 51%) than in children from non-divorced families (34 to 60%). In children aged 7, 10, and 12 years, Lamb et al. [16] found no shared environmental influences on internalizing/externalizing problems (quantitative scales). Moreover, they found lower heritability of internalizing/externalizing problems in twins exposed to different teachers (26 to 56%) than in twins exposed to the same teacher (67 to 82%). They also found higher heritability of externalizing problems in boys (47 to 82%) than in girls (55 to 75%). In children aged 4 years, Schrempft et al. [21] found that heritability of Body Mass Index (BMI) was significantly higher in higher-risk home environments (86%) than in lower-risk ones (39%). Moreover, they found a significantly lower effect of shared environmental factors on BMI in high obesogenic home environments (0%) than in low obesogenic home environments (34%). They also fit a scaled model allowing different sizes of variance components between moderator subgroups but fixing their percentage contributions to be equal. However, according to the likelihood-ratio tests performed, the scaled model generally led to a deterioration in the model fit.
When sex is the moderator, the gene–environment interaction is referred to as quantitative sex-limitation [46]. In this case, genetic and environmental effects on the outcome are assumed greater in one sex than in the other one. If, however, different subsets of genes (or shared environmental factors) are assumed to influence the outcome in boys and in girls, it is referred to as qualitative sex-limitation [46]. Clues of qualitative sex-limitation can be provided by a lower opposite-sex dizygotic (OSDZ) correlation. In this case, the correlation between A 1 and A 2 (or between C 1 and C 2 ) in the OSDZ group is expected to be <0.5 (or <1), and is estimated as a free parameter. Quantitative and qualitative sex-limitation models were applied by Silventoinen et al. [22], who found the lowest heritability of body mass index (BMI) at 4 years of age (40%) and increased heritability at 19 years of age (75%). Evidence of quantitative and qualitative (genetic) sex-limitation was found through likelihood-ratio tests carried out for each age (from 1 to 19 years): age-specific patterns of genetic/environmental contributions were similar in boys and girls, and genetic correlations within OSDZ pairs were generally <0.5.
An important extension of model (1) is the multivariate ACE model. Here we present the Cholesky decomposition representation of the bivariate case [47]:
{ Y 1 j = μ y + a y A 1 j y + c y C 1 j y + e y E 1 j y + β y x 1 j + γ y z 1 j Y 2 j = μ y + a y A 2 j y + c y C 2 j y + e y E 2 j y + β y x 2 j + γ y z 2 j W 1 j = μ w + a y w A 1 j y + c y w C 1 j y + e y w E 1 j y + a w A 1 j w + c w C 1 j w + e w E 1 j w + β w x 1 j + γ w z 1 j W 2 j = μ w + a y w A 2 j y + c y w C 2 j y + e y w E 2 j y + a w A 2 j w + c w C 2 j w + e w E 2 j w + β w x 2 j + γ w z 2 j
where Y and W are different quantitative outcomes, with specific intercepts and covariate effects. There are twice as many latent random variables as in model (1), and the assumptions about their variances and covariances remain similar. For W , the variance explained by heritable factors is now a y w 2 + a w 2 , and environmental contributions can be derived similarly. Two meaningful pieces of information can be derived from model (2). The first one is the fraction of the correlation ( r ) between Y and W that can be attributed to common genetic factors: F R a = a y a y w / ( a y a y w + c y c y w + e y e y w ) . The second one is the correlation between genetic components of Y and W (i.e., the “genetic correlation”): r a = a y a y w / a y 2 ( a y w 2 + a w 2 ) . Similar indicators can be derived for environmental factors.
In children aged 4–7 years, Faith et al. [13] estimated heritability of BMI, waist circumference (WC), and per cent body fat (BF) to be 89%, 73%, and 90%, respectively. Then, a bivariate ACE model highlighted that energy compensation ability (COMPX) was correlated with BF ( r = 0.24 ) due to a unique environmental correlation ( r e = 0.27 ). In twins aged 8–17 years, Ning et al. [19] found that heritability ranged from 56 to 71% for BMI and from 24 to 56% for WC, with substantial contributions of common genetic effects to correlations ( F R a ) between BMI and WC (37 to 91%). They also found higher genetic effects in older children and in girls, but this appears to have been assessed through univariate models. Indeed, multivariate analysis of gene–environment interaction and sex-limitation is not straightforward [46]. Interestingly, Llewellyn et al. [17] applied a bivariate ACE model to repeated measurements of BMI in children at 4 and 10 years. They found that heritability increased significantly from 43 (age 4) to 82% (age 10), and that genetic correlation between BMI at the two ages was r a = 0.58 .
The extension of model (2) to more than two outcomes requires including additional equation pairs with additional random variables (in a cumulative fashion), so that parameters like a y w , c y w and e y w are obtained for each outcome pair. Using a multivariate ACE model in children aged 3 months, Llewellyn et al. [18] found weak correlations between weight and slowness in eating ( r = 0.22 ), satiety responsiveness ( r = 0.23 ), and appetite size ( r = 0.30 ), and that common genetic effects explained about 40% of these correlations. Finally, in children aged 5 years, Kan et al. [15] found high heritability of BMI (79%) and weak correlations between BMI and food responsiveness ( r = 0.20 ) and external eating ( r = 0.10 ). Since the different contributions to correlation ( F R a , F R c , and F R e ) were in opposite directions, these were not reported by the authors.
In all the aforementioned studies, the whole genome (accounted for by random variables A ) and the exposome (accounted for by random variables C and E ) were themselves the exposures of interest (no study aimed to estimate parameter β for a specific exposure). The reported effects were in the form of a variance (or correlation) component and can be considered indicative of a causal pathway to the outcomes. This would be especially true as all the studies accounted for several confounders/effect modifiers by regressing them out prior to analyses, inclusion in the model, or stratification (Table S1).

3.2. GLMMs

A GLMM can be expressed as:
g [ E ( Y i j | b j ) ] = μ + b j + β x i j + γ z i j ,
where g is a known link function that relates the expected value of the outcome (belonging to the exponential class of distributions) to the linear predictor, Y i j is the outcome value in twin i ( i = 1 ,   2 ) of pair j , and b j is a pair-level random effect, generally assumed to be normally distributed with 0 mean and variance equal to σ b 2 . Interpretation of β and γ changes according to whether g is identity (mean difference), logarithmic (log-relative risk, RR), or logistic (log-odds ratio, OR). Using model (3), Pimpin et al. [27] found that daily % of total energy from proteins (%Epro) at 21 months was associated with overweight/obesity status at 36 months (OR = 1.10). Similarly, Pimpin et al. [25] found that % dairy protein intake at 21 months was associated with overweight/obesity status at 36 months (OR = 1.12). Ha et al. [23] found that postnatal exposures to O3 were significantly associated with developmental delays at 8, 12, 18, 24, 30, and 36 months (RR = 1.014 to 1.173). Moreover, they found that postnatal exposures to PM2.5 were significantly associated with personal-social developmental delays up to 18 months (RR = 1.052 to 1.120). Finally, model (3) was used by Jackson et al. [24] to confirm the results obtained through a different method.
An extension of model (3) suitable for repeated measurements (e.g., for growth models) can be obtained by adding an individual-level random intercept and, possibly, a random slope for the time indicator, with a structure of variance/covariance between them. Using this extension, Pimpin et al. [27] found that %Epro was associated with BMI ( β = 0.043 ) and weight ( β = 0.052 ) in children aged between 21 and 60 months. Similarly, Pimpin et al. [25] found that % dairy protein intake was associated with BMI ( β = 0.040 ) and weight ( β = 0.046 ). Yeung et al. [26] found that infertility treatments were not associated with developmental delays in children aged between 4 and 36 months. Finally, Ha et al. [23] found that prenatal exposure to O3 was significantly associated with communication delays (RR = 1.025) between 8 and 36 months.
Since GLMMs include pair effects, shared confounders are adjusted for by design. Therefore, the effects sizes reported in the aforementioned studies may be considered indicative of a causal pathway from the exposures of interest to the outcomes. This would be especially true as all the studies explicitly accounted for several confounders (both shared and individual-level ones) by inclusion in the model (Table S1).

3.3. GLMs with Fixed Pair Effects

A GLM with fixed pair effects ( b j ) can be defined as:
g [ E ( Y i j ) ] = b j + β x i j + γ z i j .
In this case, parameters b j are estimated through dummy variables. Using this model in twins aged 9–16 years, Örtqvist et al. [29] found no association between fetal growth and lung function. In children aged 4 years, Leong et al. [31] found no association between the number of antibiotic courses during early childhood (0–24 months) and BMI/obesity status. For binary outcomes, conditional logistic regression allows eliminating the need to estimate the strata parameters b j . Through this model, Örtqvist et al. [29] found a strong association between fetal growth and abnormal FEV1 (OR = 5.57) in twins aged 9–16 years. In children up to 18 years, Li et al. [28] found an increased risk of ever being obese associated with untreated infection in the first year of life compared with no infection (OR = 1.55). Finally, in children aged 3–10 years, Slob et al. [30] found that antibiotic use in the first 2 years of life was consistently associated with an increased risk of asthma in two cohorts (OR = 1.54 and OR = 2.00), while inconsistent results were found in the two cohorts for eczema (OR = 0.99 and OR = 1.67).
Since this class of models includes pair effects, the effects sizes reported in the aforementioned studies may be considered indicative of a causal pathway from the exposure to the outcome. This would be especially true in two of the four aforementioned studies where several individual-level confounders were included in the model [30,31]. However, it should be pointed out that in one study [31], model (4) was applied to a binary outcome using a standard logistic regression with dummy variables for the pair effects. In this case, the estimator of β is severely biased, and using conditional logistic regression is much more recommended (see Discussion).

3.4. Within-Pair Difference Analyses

When g is the identity function, a within-pair difference model can be derived from (and is formally equivalent to) model (4) by taking differences between twins 1 and 2:
E ( Y 1 j Y 2 j ) = β ( x 1 j x 2 j ) + γ ( z 1 j z 2 j ) .
This model was applied by Bogl et al. [32] in twins aged 11 years or older, who found that increasing energy intake was associated with increasing BMI ( β = 0.13 for a 100-kcal/d increase). Of note, other authors simply analyzed correlations between outcome and exposure differences. In children aged 3 months, Tripicchio et al. [34] used Pearson’s correlation and found that parental restriction to eat was positively associated with BMI ( r = 0.31 ), while it was negatively associated with an energy compensation ability scale (COMPX,   r = 0.27 ). Moreover, parental pressure to eat was negatively associated with BMI ( r = 0.40 ), BF ( r = 0.38 ), and WC ( r = 0.40 ). Dubois et al. [33] used Spearman’s correlation and found negative (positive) associations between intake of carbohydrates (fats) at 9 years and BMI at 12, 13, and 14 years.
As for model (4), parameter β in model (5) represents a within-pair effect of the exposure, and may be considered indicative of a causal pathway to the outcome. However, only one study [32] included an individual-level confounder (height) in the model.

3.5. Paired-Sample Tests

If all the twin pairs are discordant on a binary exposure ( x 1 j = 1 and x 2 j = 0 ), a paired-sample test can be applied. Particularly for a quantitative outcome, and in the absence of individual-level confounders ( γ = 0 ), performing a paired t-test is equivalent to testing β in model (5), which becomes:
E ( Y 1 j Y 2 j ) = β .
Similarly, a paired-sample test can be performed if all the twin pairs are discordant on a binary outcome. Through Wilcoxon signed-rank tests, Dubois et al. [33] found that heavier female twins consumed more grain products than their leaner female co-twin.
Moreover, in twins aged 4–18 years, Bilenberg et al. [35] found that twins scoring high on a scale of attention-deficit/hyperactivity disorders (ADHD) had significantly higher PnPs14 RFU-score (a maternal transplacentally-acquired antibody) than lower-scoring co-twins (mean difference was 0.08).
Although paired-sample tests account by design for shared confounders, individual-level confounders cannot be accounted for. Therefore, the associations reported in the aforementioned studies may not be indicative of a causal pathway from the exposure to the outcome if parameter γ is not 0 in model (5).

3.6. GEE Models

A GEE model can be expressed as:
g [ E ( Y i j ) ] = μ + β x i j + γ z i j ,
where Y 1 j and Y 2 j are assumed to be correlated for each j , and the unknown correlation is to be estimated. Since there are no pair effects, Equation (7) is a marginal rather than a conditional model, as all the previous ones were [48].
Using GEE models in children aged 9 or 12 years, Gong et al. [37] found that ADHD and autism spectrum disorders were not significantly associated with exposure to PM10 and NOx during pregnancy, and during the first/ninth year of life. In children aged 18–22 months, Boghossian et al. [36] found that antenatal corticosteroids (ANS) were associated with a lower risk of neurodevelopmental impairment (NDI) or death among non-small for gestational age infants (RR = 0.89), and a higher risk among small for gestational age infants (RR = 1.62). Moreover, ANS were associated with a higher risk of NDI/death among infants of mothers with diabetes (RR = 1.55). In children aged 3–7 years, Johnson et al. [38] found positive associations between BMI and the frequency of maternal non-verbal encouragements/discouragements and temporary discouragements during a laboratory meal. Using GEE models, Yeung et al. [26] and Slob et al. [30] found similar results than in their conditional analyses (GLMMs and conditional logistic regression, respectively).
Since Equation (7) is a marginal model, parameter β represents a population-averaged effect. However, GEE models yield essentially the same estimates of β as in GLMMs with identity link function and in log-linear models. Conversely, studies using logistic regression [26,30,37] may have provided attenuated estimates of the within-pair effects (see Discussion). When several confounders are included in the model (as in all the aforementioned studies), the degree of attenuation decreases, and the reported associations may be considered indicative of a causal pathway.

3.7. GLMs with Cluster-Robust Standard Errors

A GLM with cluster-robust standard errors can be expressed as model (7), but no correlation parameter is estimated. Since ignoring pairwise dependence induces a bias in the standard error estimates, these are corrected through sandwich estimators [49]. Using this approach in children aged 4 years, Petkovsek et al. [41] found that, in females, the effect of prenatal smoking on externalizing behavioral problems was stronger in children with higher genetic risk ( β for interaction = 0.36 ). In twins aged 12–23 years, Palmer et al. [40] found that prenatal smoking significantly increased the frequency of conduct disorders ( β = 0.43 ) and inattention ( β = 0.85 ). In children aged 4–7 years, Jackson et al. [24] found that the effect of a short duration of breastfeeding on conduct problems was stronger in children with higher genetic risk ( β for interaction = 0.095 ). Örtqvist et al. [29] found that fetal growth was associated with post-bronchodilator FEV1 ( β = 0.16 ) and abnormal FEV1 (OR = 1.27). In children aged 9–12 years, Castellheim et al. [39] found that lifetime general anesthesia was significantly associated with ADHD scores ( β = 1.02 ). Finally, Leong et al. [31] found that antibiotic use was associated with increasing BMI ( β = 0.018 ) and obesity (OR = 1.09) in children aged 4 years.
Since pair effects were not included in these models, shared confounders were not adjusted for by design. Therefore, despite the inclusion of several covariates, the effects sizes reported in the aforementioned studies may not be considered indicative of a causal pathway from the exposures of interest to the outcomes. Of note, Leong et al. [31] found that the reported associations disappeared after including pair effects in the model.

3.8. GLMs

A plain GLM can be expressed as model (7), but neither correlation parameters nor cluster-robust standard errors are provided. Using this approach in children aged 6 years, González-Valenzuela et al. [42] found that cesarean birth was associated with verbal (OR = 2.83), non-verbal (OR = 2.55), and global (OR = 3.69) delays, and with general intelligence difficulties (OR = 2.62). Although several covariates were included in the models, the reported associations may not be considered indicative of a causal pathway from the exposure to the outcome.

3.9. Independent-Sample Tests

Using independent-sample tests (Chi-square tests) in children aged 2–10 years, Zafman et al. [43] found that delivery order (length of exposure to the maternal vaginal–fecal microbiome) was not associated with asthma and obesity indicators. Since confounding effects were totally disregarded in this case, the reported associations should not be considered indicative of a causal pathway from the exposure to the outcome.

4. Discussion

We reviewed the statistical approaches used in twin studies relating exposure to childhood obesity, allergic, and neurodevelopmental disorders. We identified 32 studies published between 2011 and 2021, and nine classes of statistical approaches. Five were conditional approaches (within-pair analyses), while four were marginal approaches (unpaired analyses). We provided a description of the identified statistical methodologies, highlighting the differences and connections among them.
Twinning was disregarded in two studies [42,43], where independent-sample tests and a plain GLM were applied. In this regard, ignoring the correlation between observations leads to sub-optimal effect-size estimators (they are not the best linear unbiased estimators–BLUE–in linear regression) and biased standard error estimators, invalidating inference [50]. In particular, this leads to overestimating parameter uncertainty for individual-level covariates and to underestimating parameter uncertainty for shared covariates [51]. Moreover, unless many confounders are included, these models may not be appropriate to unveil causal relationships between the exposures and outcomes.
Six studies [24,29,31,39,40,41] adopted a “corrective” approach by adjusting the standard errors through sandwich estimators. In this case, however, the effect-size estimators remain sub-optimal. Conversely, five studies [26,30,36,37,38] adopted a “preventive” approach, i.e., the GEE model. In the absence of unmeasured or disregarded confounding, GEE models provide optimal effect-size estimators (they are BLUE in linear regression), and correct standard error estimators [9,48].
Whereas all the aforementioned approaches are “marginal”, all the other identified approaches are “conditional”, since their parameters represent outcome differences associated with an exposure difference within a twin pair [4,5]. Consequently, conditional models adjust by design for shared confounders, i.e., the genome and the twin-identical exposome, and may be more appropriate to unveil causal relationships.
Two studies [33,35] applied the simplest form of conditional analysis, i.e., paired-sample tests; however, this approach provides no way to adjust for individual-level confounders. Conversely, three studies [32,33,34] carried out within-pair difference analyses, which have the potential to adjust for individual-level confounders by the inclusion of additional difference terms in the model. However, only Bogl et al. [32] included the difference in an individual-level confounder (height) in their model, while simple correlation coefficients were used in the other two studies [33,34]. Within-pair difference models cannot be applied if the link function is different from the identity.
GLMs with fixed pair effects can be applied in more general settings. This model was applied to a quantitative outcome in two studies [29,31] as “a way to control for a greater amount of unmeasured confounding” [31] compared to linear regression with robust standard errors. Only in one study [31], model (4) was applied to a binary outcome (logistic regression) using dummy variables for the pair effect. However, in this case, the estimator of β is severely biased, and using conditional logistic regression is much more recommended [52]. Conditional logistic regression was, indeed, used in three studies [28,29,30].
Although GLMs with fixed pair effects adjust by design for shared confounders, the associated parameters are not identifiable if included in the model. Conversely, GLMMs can incorporate shared covariates, simply leading to a lower heterogeneity captured by the random intercept [48]. As a result, estimates of β from GLMMs are quite robust with respect to the omission of shared covariates [48]. Using a GEE rather than a GLMM approach leads essentially to the same estimate of β in models with identity link function and in log-linear models [9,48]. Conversely, logistic regression estimated through the GEE approach generally yields attenuated estimates of β , and the degree of attenuation increases when shared covariates are omitted [48]. Five studies applied a GLMM [23,24,25,26,27]; of note, all but one [17] of the studies including repeated measurements of the outcome applied a GLMM [23,25,26,27].
Finally, ACE models were applied by eleven studies [12,13,14,15,16,17,18,19,20,21,22]. These models keep the advantages of all the previous models, but in addition, they can disentangle the contribution of the genome and the exposome. However, such models may appear difficult to implement and to interpret, and their extension to longitudinal data through mixed-effect modelling is not straightforward [53]. In this review, only one study [17] applied an ACE model to repeated measurements, using its bivariate extension. In ACE models, genetics is often the exposure of interest, while shared and individual covariates are incorporated into C and E , respectively. In this review, covariates were included in the ACE model only in one study [12], while they were regressed out prior to analyses (or considered as potential moderators) in almost all the other studies. Table 1 summarizes the strengths and limitations of the aforementioned methods and dedicated libraries for implementation in the R software.
In practical applications, the best (or sufficient) approach to use may depend on the research needs, data structure, and desired parameter meaning. ACE models can accommodate for most of the research needs, including the assessment of heritability, but they require adaptations for binary outcomes and repeated measurements. GLMMs are a valid choice if assessing heritability is not a research need, and they would be the best choice to accommodate for binary outcomes and repeated measurements. GLMs with fixed pair effects and within-pair difference analyses are good (and substantially equivalent) approaches when the effects of shared confounders do not need to be explicitly quantified, but only the former can accommodate for binary outcomes (conditional logistic regression is recommended). If either the outcome or the exposure is binary, and confounding effects are assumed to be marginal (e.g., if the twins are substantially identical except that for the exposure of interest), paired-sample tests can be a valid alternative to more complex models. The use of GEE models (marginal models) for logistic regression should only be limited to situations in which the researchers are interested in estimating population-averaged effects rather than within-pair effects. In these cases, however, the twin design is not strictly needed, and is not more useful than a singleton study (except that for increasing the sample size). Finally, GLMs should never be used in a twin study, since they may lead to sub-optimal inference. Figure 2 provides a summary of recommendations regarding which types of models researchers should favor under different conditions (research needs, data structure, and desired parameter meaning).
Among other methods for twin data, DeFries–Fulker regression was only mentioned by Petkovsek et al. [41]. ADE models were only mentioned by Faith et al. [13]. ACE models accounting for correlated errors and “twin confusion” were only mentioned by Lamb et al. [16]. ACDE models [44] and ACE models for categorical outcomes [54] were never mentioned. Incomplete records were generally excluded, and the issue of missing value imputation appears to have been only addressed in one study [26]. Indeed, imputation methods for correlated data might be less popular than standard approaches; in general, likelihood approaches (such as GLMM) have shown to be robust to the “missing completely at random” assumption, while non-likelihood marginal models (such as GEE models) have not [55].
Twin studies also have disadvantages. First, generalizability to singleton populations may not always be possible [56]. In this regard, twins may differ from singletons in several aspects: twins tend to have older parents, to be born preterm, and to have lower birthweight and specific disorders. Moreover, as pointed out by Bilenberg et al. [35], the genetic control may reduce the exposure contrast, and larger sample sizes may be required. However, due to the relative rarity of twins, gathering a large sample size may not be easy. Finally, experimental trials involving twins are statistically inefficient if all the twins in the same pair are allocated to the same treatment arm (the treatment becomes a shared covariate, leading to large standard errors) [8].
This review has some limitations that should be acknowledged. Study outcomes in the search strategy were limited to obesity, asthma, eczema, rhinitis, and neurodevelopmental disorders. Therefore, the distribution of statistical methods captured in this review may not generalize to the broader twin literature. Moreover, although we have used the general keyword “expos*” in the search strategy, we limited to search this term in titles and abstracts; therefore, as with all reviews, we might not have identified all the relevant articles. Finally, the characteristics of the included studies were sometimes difficult to identify and to extract (e.g., the number of twin pairs by zygosity) and, in some cases, they were inferred by the reviewers (e.g., based on tables).

5. Conclusions

In conclusion, twin data provide a unique opportunity to control for confounding and disentangling the role of the human genome and exposome when investigating disease etiology. Although rare, sub-optimal (GLMs with fixed pair effects for binary outcomes) or simplistic (plain GLMs and independent-sample tests) approaches were still present in the twin-study literature, possibly undermining the validity of the evidence base. By highlighting the strengths and limitations of commonly applied statistical methods, this review may be helpful for researchers using twin designs.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/ijerph182312696/s1. Supplementary Material S1: Characteristics of the included studies. Search strategy: “A critical review of statistical methods for twin studies relating exposure to early life health conditions: Search Strategy”, Table S1: Characteristics of the included studies.

Author Contributions

Conceptualization, I.A.-M. and S.L.G.; methodology, S.F., L.M., G.C. and B.C.; software, S.F. and L.M.; validation, S.F., G.C. and B.C.; formal analysis, S.F.; investigation, S.F.; resources, I.A.-M. and S.L.G.; data curation, S.F. and L.M.; writing—original draft preparation, S.F.; writing—review and editing, S.F., L.M., G.C., B.C., V.M., G.F., I.A.-M. and S.L.G.; visualization, S.F. and L.M.; supervision, I.A.-M. and S.L.G.; project administration, I.A.-M. and S.L.G.; funding acquisition, I.A.-M. and S.L.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the European Commission and the Ministry of Education, Universities and Research (Project Id. INTIMIC-091: earlyFOOD).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Acknowledgments

We acknowledge the HEALS project (FP7-ENV-603946) that has supported the development of the methodology framework in this study by recruiting populations of twins.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Jager, K.; Zoccali, C.; MacLeod, A.; Dekker, F. Confounding: What it is and how to deal with it. Kidney Int. 2008, 73, 256–260. [Google Scholar] [CrossRef] [Green Version]
  2. Bell, J.; Saffery, R. The value of twins in epigenetic epidemiology. Int. J. Epidemiol. 2012, 41, 140–150. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Debord, D.G.; Carreón, T.; Lentz, T.J.; Middendorf, P.J.; Hoover, M.D.; Schulte, P.A. Use of the "Exposome" in the Practice of Epidemiology: A Primer on -Omic Technologies. Am. J. Epidemiol. 2016, 184, 302–314. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Dwyer, T.; Blizzard, L. A discussion of some statistical methods for separating within-pair associations from associations among all twins in research on fetal origins of disease. Paediatr. Périnat. Epidemiol. 2005, 19, 48–53. [Google Scholar] [CrossRef]
  5. Morley, R.; Dwyer, T. Studies of twins: What can they tell us about the fetal origins of adult disease? Paediatr. Périnat. Epidemiol. 2005, 19, 2–7. [Google Scholar] [CrossRef] [PubMed]
  6. Craig, J.M.; Calais-Ferreira, L.; Umstad, M.P.; Buchwald, D. The Value of Twins for Health and Medical Research: A Third of a Century of Progress. Twin Res. Hum. Genet. 2020, 23, 8–15. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Snieder, H.; Wang, X.; MacGregor, A.J. Twin Methodology. In Encyclopedia of Life Sciences (ELS); John Wiley & Sons, Ltd.: Chichester, UK, 2010. [Google Scholar]
  8. Hanley, J.A. The statistical joys-and added complications-of twin studies. Paediatr. Périnat. Epidemiol. 2018, 32, 388–389. [Google Scholar] [CrossRef]
  9. Carlin, J.B.; Gurrin, L.; Sterne, J.; Morley, R.; Dwyer, T. Regression models for twin studies: A critical review. Int. J. Epidemiol. 2005, 34, 1089–1099. [Google Scholar] [CrossRef] [Green Version]
  10. Tan, Q.; Christiansen, L.; Hjelmborg, J.; Christensen, K. Twin methodology in epigenetic studies. J. Exp. Biol. 2015, 218, 134–139. [Google Scholar] [CrossRef] [Green Version]
  11. Arrieta, M.-C.; Stiemsma, L.T.; Amenyogbe, N.; Brown, E.M.; Finlay, B. The Intestinal Microbiome in Early Life: Health and Disease. Front. Immunol. 2014, 5, 427. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Bunyavanich, S.; Silberg, J.L.; Lasky-Su, J.; Gillespie, N.A.; Lange, N.E.; Canino, G.; Celedόn, J.C. A Twin Study of Early-Childhood Asthma in Puerto Ricans. PLoS ONE 2013, 8, e68473. [Google Scholar] [CrossRef] [Green Version]
  13. Faith, M.S.; Pietrobelli, A.; Heo, M.; Johnson, S.L.; Keller, K.L.; Heymsfield, S.B.; Allison, D. A twin study of self-regulatory eating in early childhood: Estimates of genetic and environmental influence, and measurement considerations. Int. J. Obes. 2012, 36, 931–937. [Google Scholar] [CrossRef] [Green Version]
  14. Kahr, N.; Naeser, V.; Stensballe, L.G.; Kyvik, K.O.; Skytthe, A.; Backer, V.; Bønnelykke, K.; Thomsen, S.F. Gene-environment interaction in atopic diseases: A population-based twin study of early-life exposures. Clin. Respir. J. 2015, 9, 79–86. [Google Scholar] [CrossRef]
  15. Kan, C.; Herle, M.; Treasure, J.; Jones, A.; Rijsdijk, F.; Llewellyn, C. Common etiological architecture underlying reward responsiveness, externally driven eating behaviors, and BMI in childhood: Findings from the Gemini twin cohort. Int. J. Obes. 2020, 44, 2064–2074. [Google Scholar] [CrossRef] [PubMed]
  16. Lamb, D.J.; Middeldorp, C.M.; van Beijsterveldt, T.; Boomsma, R.I. Gene-environment interaction in teacher-rated internalizing and externalizing problem behavior in 7- to 12-year-old twins. J. Child Psychol. Psychiatry 2012, 53, 818–825. [Google Scholar] [CrossRef]
  17. Llewellyn, C.; Trzaskowski, M.; Plomin, R.; Wardle, J. From modeling to measurement: Developmental trends in genetic influence on adiposity in childhood. Obesity 2014, 22, 1756–1761. [Google Scholar] [CrossRef] [Green Version]
  18. Llewellyn, C.; Van Jaarsveld, C.H.; Plomin, R.; Fisher, A.; Wardle, J. Inherited behavioral susceptibility to adiposity in infancy: A multivariate genetic analysis of appetite and weight in the Gemini birth cohort. Am. J. Clin. Nutr. 2012, 95, 633–639. [Google Scholar] [CrossRef]
  19. Ning, F.; Silventoinen, K.; Pang, Z.C.; Kaprio, J.; Wang, S.J.; Zhang, D.; Duan, H.P.; Wu, W.F.; Qiao, Q. Genetic and Environmental Correlations Between Body Mass Index and Waist Circumference in China: The Qingdao Adolescent Twin Study. Behav. Genet. 2013, 43, 340–347. [Google Scholar] [CrossRef]
  20. Robbers, S.; van Oort, F.; Huizink, A.; Verhulst, F.; van Beijsterveldt, C.; Boomsma, D.; Bartels, M. Childhood problem behavior and parental divorce: Evidence for gene–environment interaction. Soc. Psychiatry Psychiatr. Epidemiol. 2012, 47, 1539–1548. [Google Scholar] [CrossRef] [Green Version]
  21. Schrempft, S.; Van Jaarsveld, C.H.M.; Fisher, A.; Herle, M.; Smith, A.; Fildes, A.; Llewellyn, C.H. Variation in the Heritability of Child Body Mass Index by Obesogenic Home Environment. JAMA Pediatr. 2018, 172, 1153–1160. [Google Scholar] [CrossRef] [Green Version]
  22. Silventoinen, K.; Jelenkovic, A.; Sund, R.; Hur, Y.-M.; Yokoyama, Y.; Honda, C.; Hjelmborg, J.; Möller, S.; Ooki, S.; Aaltonen, S.; et al. Genetic and environmental effects on body mass index from infancy to the onset of adulthood: An individual-based pooled analysis of 45 twin cohorts participating in the COllaborative project of Development of Anthropometrical measures in Twins (CODATwins) study. Am. J. Clin. Nutr. 2016, 104, 371–379. [Google Scholar] [CrossRef]
  23. Ha, S.; Yeung, E.; Bell, E.; Insaf, T.; Ghassabian, A.; Bell, G.; Muscatiello, N.; Mendola, P. Prenatal and early life exposures to ambient air pollution and development. Environ. Res. 2019, 174, 170–175. [Google Scholar] [CrossRef]
  24. Jackson, D.B. Breastfeeding duration and offspring conduct problems: The moderating role of genetic risk. Soc. Sci. Med. 2016, 166, 128–136. [Google Scholar] [CrossRef]
  25. Pimpin, L.; Jebb, S.A.; Johnson, L.; Llewellyn, C.; Ambrosini, G.L. Sources and pattern of protein intake and risk of overweight or obesity in young UK twins. Br. J. Nutr. 2018, 120, 820–829. [Google Scholar] [CrossRef]
  26. Yeung, E.H.; Sundaram, R.; Bell, E.M.; Druschel, C.; Kus, C.; Ghassabian, A.; Bello, S.; Xie, Y.; Louis, G.B. Examining Infertility Treatment and Early Childhood Development in the Upstate KIDS Study. JAMA Pediatr. 2016, 170, 251–258. [Google Scholar] [CrossRef] [Green Version]
  27. Pimpin, L.; Jebb, S.; Johnson, L.; Wardle, J.; Ambrosini, G.L. Dietary protein intake is associated with body mass index and weight up to 5 y of age in a prospective cohort of twins1. Am. J. Clin. Nutr. 2015, 103, 389–397. [Google Scholar] [CrossRef]
  28. Li, D.-K.; Chen, H.; Ferber, J.; Odouli, R. Infection and antibiotic use in infancy and risk of childhood obesity: A longitudinal birth cohort study. Lancet Diabetes Endocrinol. 2017, 5, 18–25. [Google Scholar] [CrossRef]
  29. Örtqvist, A.K.; Ullemar, V.; Lundholm, C.; Kuja-Halkola, R.; Magnusson, P.K.; Lichtenstein, P.; Hallberg, J.; Almqvist, C. Fetal Growth and Childhood Lung Function in the STOPPA Twin Study. Ann. Am. Thorac. Soc. 2017, 14, 1147–1153. [Google Scholar] [CrossRef]
  30. Slob, E.M.; Brew, B.; Vijverberg, S.J.; Kats, C.J.; Longo, C.; Pijnenburg, M.W.; Van Beijsterveldt, T.C.; Dolan, C.V.; Bartels, M.; Magnusson, P.; et al. Early-life antibiotic use and risk of asthma and eczema: Results of a discordant twin study. Eur. Respir. J. 2020, 55, 1902021. [Google Scholar] [CrossRef]
  31. Leong, K.S.W.; McLay, J.; Derraik, J.G.B.; Gibb, S.; Shackleton, N.; Taylor, R.W.; Glover, M.; Audas, R.; Taylor, B.; Milne, B.J.; et al. Associations of Prenatal and Childhood Antibiotic Exposure with Obesity at Age 4 Years. JAMA Netw. Open 2020, 3, e1919681. [Google Scholar] [CrossRef]
  32. Bogl, L.; Mehlig, K.; Intemann, T.; Masip, G.; Keski-Rahkonen, A.; Russo, P.; Michels, N.; Reisch, L.; Pala, V.; Johnson, L.; et al. A within-sibling pair analysis of lifestyle behaviours and BMI z-score in the multi-centre I.Family study. Nutr. Metab. Cardiovasc. Dis. 2019, 29, 580–589. [Google Scholar] [CrossRef] [Green Version]
  33. Dubois, L.; Diasparra, M.; Bogl, L.-H.; Fontaine-Bisson, B.; Bédard, B.; Tremblay, R.E.; Kaprio, J.; Boivin, M. Dietary Intake at 9 Years and Subsequent Body Mass Index in Adolescent Boys and Girls: A Study of Monozygotic Twin Pairs. Twin Res. Hum. Genet. 2016, 19, 47–59. [Google Scholar] [CrossRef] [Green Version]
  34. Tripicchio, G.L.; Keller, K.L.; Johnson, C.; Pietrobelli, A.; Heo, M.; Faith, M.S. Differential Maternal Feeding Practices, Eating Self-Regulation, and Adiposity in Young Twins. Pediatrics 2014, 134, e1399–e1404. [Google Scholar] [CrossRef] [Green Version]
  35. Bilenberg, N.; Hougaard, D.; Norgaard-Pedersen, B.; Nordenbæk, C.M.; Olsen, J. Twin study on transplacental-acquired antibodies and attention deficit/hyperactivity disorder—A pilot study. J. Neuroimmunol. 2011, 236, 72–75. [Google Scholar] [CrossRef] [PubMed]
  36. Boghossian, N.S.; McDonald, S.; Bell, E.F.; Carlo, W.A.; Brumbaugh, J.E.; Stoll, B.J.; Laptook, A.R.; Shankaran, S.; Walsh, M.C.; Das, A.; et al. Association of Antenatal Corticosteroids With Mortality, Morbidity, and Neurodevelopmental Outcomes in Extremely Preterm Multiple Gestation Infants. JAMA Pediatr. 2016, 170, 593–601. [Google Scholar] [CrossRef] [Green Version]
  37. Gong, T.; Almqvist, C.; Bölte, S.; Lichtenstein, P.; Anckarsäter, H.; Lind, T.; Lundholm, C.; Pershagen, G. Exposure to Air Pollution From Traffic and Neurodevelopmental Disorders in Swedish Twins. Twin Res. Hum. Genet. 2014, 17, 553–562. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Johnson, C.M.; Henderson, M.S.; Tripicchio, G.; Rozin, P.; Heo, M.; Pietrobelli, A.; Berkowitz, R.I.; Keller, K.L.; Faith, M.S. Observed parent-child feeding dynamics in relation to child body mass index and adiposity. Pediatr. Obes. 2017, 13, 222–231. [Google Scholar] [CrossRef]
  39. Castellheim, A.; Lundström, S.; Molin, M.; Kuja-Halkola, R.; Gillberg, C.; Gillberg, C. The role of general anesthesia on traits of neurodevelopmental disorders in a Swedish cohort of twins. J. Child. Psychol. Psychiatry 2018, 59, 966–972. [Google Scholar] [CrossRef]
  40. Palmer, R.H.C.; Bidwell, L.C.; Heath, A.C.; Brick, L.A.; Madden, P.A.F.; Knopik, V. Effects of Maternal Smoking during Pregnancy on Offspring Externalizing Problems: Contextual Effects in a Sample of Female Twins. Behav. Genet. 2016, 46, 403–415. [Google Scholar] [CrossRef] [Green Version]
  41. Petkovsek, M.A.; Boutwell, B.B.; Beaver, K.M.; Barnes, J. Prenatal smoking and genetic risk: Examining the childhood origins of externalizing behavioral problems. Soc. Sci. Med. 2014, 111, 17–24. [Google Scholar] [CrossRef]
  42. González-Valenzuela, M.-J.; González-Mesa, E.; Cazorla-Granados, O.; López-Montiel, D. Type of Delivery, Neuropsychological Development and Intelligence in Twin Births. Front. Psychol. 2019, 10, 972. [Google Scholar] [CrossRef]
  43. Zafman, K.B.; Bergh, E.P.; Cohen, N.; Odom, E.; Fox, N.S. The Effect of Microbiome Exposure at Birth on Pediatric Outcomes Using a Twin Cohort Discordant for Microbiome Exposure at Birth [Published Online Ahead of Print November 18, 2019. J. Matern.-Fetal Neonatal Med. 2019, 34, 3355–3361. [Google Scholar] [CrossRef] [PubMed]
  44. Jöreskog, K.G. Classical Models for Twin Data. Struct. Equ. Model. A Multidiscip. J. 2021, 28, 121–126. [Google Scholar] [CrossRef]
  45. Purcell, S. Variance Components Models for Gene–Environment Interaction in Twin Analysis. Twin Res. Hum. Genet. 2002, 5, 554–571. [Google Scholar] [CrossRef]
  46. Neale, M.C.; Røysamb, E.; Jacobson, K. Multivariate Genetic Analysis of Sex Limitation and GxE Interaction. Twin Res. Hum. Genet. 2006, 9, 481–489. [Google Scholar] [CrossRef]
  47. Kohler, H.-P.; Behrman, J.R.; Schnittker, J. Social Science Methods for Twins Data: Integrating Causality, Endowments, and Heritability. Biodemogr. Soc. Biol. 2011, 57, 88–141. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Muff, S.; Held, L.; Keller, L.F. Marginal or conditional regression models for correlated non-normal data? Methods Ecol. Evol. 2016, 7, 1514–1524. [Google Scholar] [CrossRef] [Green Version]
  49. Zeileis, A. Object-Oriented Computation of Sandwich Estimators. J. Stat. Softw. 2006, 16, 1–16. [Google Scholar] [CrossRef] [Green Version]
  50. Cannon, M.J.; Warner, L.; Taddei, J.A.; Kleinbaum, D.G. What can go wrong when you assume that correlated data are independent: An illustration from the evaluation of a childhood health intervention in Brazil. Stat. Med. 2001, 20, 1461–1467. [Google Scholar] [CrossRef] [PubMed]
  51. Sainani, K. The Importance of Accounting for Correlated Observations. PM&R 2010, 2, 858–861. [Google Scholar] [CrossRef]
  52. Coupé, T. Bias in Conditional and Unconditional Fixed Effects Logit Estimation: A Correction. Politi- Anal. 2005, 13, 292–295. [Google Scholar] [CrossRef] [Green Version]
  53. McArdle, J.J. Latent Curve Analyses of Longitudinal Twin Data Using a Mixed-Effects Biometric Approach. Twin Res. Hum. Genet. 2006, 9, 343–359. [Google Scholar] [CrossRef] [PubMed]
  54. Prescott, C.A. Using the Mplus Computer Program to Estimate Models for Continuous and Categorical Data from Twins. Behav. Genet. 2004, 34, 17–40. [Google Scholar] [CrossRef] [PubMed]
  55. Satty, A.; Mwambi, H.; Molenberghs, G. Different methods for handling incomplete longitudinal binary outcome due to missing at random dropout. Stat. Methodol. 2015, 24, 12–27. [Google Scholar] [CrossRef] [Green Version]
  56. Morley, R. Can we generalise from findings in twins? Paediatr. Périnat. Epidemiol. 2005, 19, 54–59. [Google Scholar] [CrossRef]
Figure 1. Flow diagram showing the study selection process. * Reasons for exclusion: reviews (n = 2), abstracts (n = 11), statistical analyses included singletons and twins together (n = 1), different outcomes (n = 3), twin mothers (n = 1), adult twins (n = 1), and no statistical analyses (n = 1).
Figure 1. Flow diagram showing the study selection process. * Reasons for exclusion: reviews (n = 2), abstracts (n = 11), statistical analyses included singletons and twins together (n = 1), different outcomes (n = 3), twin mothers (n = 1), adult twins (n = 1), and no statistical analyses (n = 1).
Ijerph 18 12696 g001
Figure 2. Summary of recommendations regarding which types of models researchers should favor under different conditions (research needs, data structure, and desired parameter meaning). 1 Conditional logistic regression is recommended. 2 Accommodated for through the use of dummy variables. 3 Either the outcome or the exposure mustable t be binary. 4 Only in models with identity link function and in log-linear models.
Figure 2. Summary of recommendations regarding which types of models researchers should favor under different conditions (research needs, data structure, and desired parameter meaning). 1 Conditional logistic regression is recommended. 2 Accommodated for through the use of dummy variables. 3 Either the outcome or the exposure mustable t be binary. 4 Only in models with identity link function and in log-linear models.
Ijerph 18 12696 g002
Table 1. Classes of statistical methods used in the reviewed studies.
Table 1. Classes of statistical methods used in the reviewed studies.
Class of MethodsStrengthsLimitationsR Libraries
ACE models
-
Confounders can be included
-
Optimal inference
-
Shared confounders are adjusted for by design
-
Genetic contribution can be estimated
-
Require adaptations for binary outcomes and repeated measurements
umx
lavaan
OpenMx
Generalized linear mixed models (GLMMs)
-
Suitable for binary outcomes
-
Confounders can be included
-
Optimal inference
-
Shared confounders are adjusted for by design
-
Suitable for repeated measurements
-
Genetic contribution cannot be estimated
lme4
nlme
MASS
GLMs with fixed pair effects
-
Suitable for binary outcomes
-
Individual-level confounders can be included
-
Shared confounders are adjusted for by design
-
Shared confounders cannot be included
-
Estimators may be sub-optimal
-
Unsuitable for repeated measurements
-
Genetic contribution cannot be estimated
stats
Within-pair difference analyses
-
Individual-level confounders can be included
-
Optimal inference
-
Shared confounders are adjusted for by design
-
Unsuitable for binary outcomes
-
Shared confounders cannot be included
-
Unsuitable for repeated measurements
-
Genetic contribution cannot be estimated
stats
Paired-sample tests
-
Optimal inference
-
Shared confounders are adjusted for by design
-
Require a binary exposure
-
Require adaptations for binary outcomes
-
Confounders cannot be included
-
Unsuitable for repeated measurements
-
Genetic contribution cannot be estimated
stats
Generalized estimating equations (GEE) models
-
Suitable for binary outcomes
-
Confounders can be included
-
Optimal inference
-
Shared confounders are not adjusted for by design
-
Require adaptations for repeated measurements
-
Genetic contribution cannot be estimated
gee
geepack
Generalized linear models (GLMs) with cluster-robust standard errors
-
Suitable for binary outcomes
-
Confounders can be included
-
Optimal standard error estimators
-
Sub-optimal effect-size estimators
-
Shared confounders are not adjusted for by design
-
Unsuitable for repeated measurements
-
Genetic contribution cannot be estimated
sandwich
Generalized linear models (GLMs)
-
Suitable for binary outcomes
-
Confounders can be included
-
Sub-optimal inference
-
Shared confounders are not adjusted for by design
-
Unsuitable for repeated measurements
-
Genetic contribution cannot be estimated
stats
Independent-sample tests
-
Suitable for binary outcomes
-
Require a binary exposure
-
Confounders cannot be included
-
Sub-optimal inference
-
Shared confounders are not adjusted for by design
-
Unsuitable for repeated measurements
-
Genetic contribution cannot be estimated
stats
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Fasola, S.; Montalbano, L.; Cilluffo, G.; Cuer, B.; Malizia, V.; Ferrante, G.; Annesi-Maesano, I.; La Grutta, S. A Critical Review of Statistical Methods for Twin Studies Relating Exposure to Early Life Health Conditions. Int. J. Environ. Res. Public Health 2021, 18, 12696. https://doi.org/10.3390/ijerph182312696

AMA Style

Fasola S, Montalbano L, Cilluffo G, Cuer B, Malizia V, Ferrante G, Annesi-Maesano I, La Grutta S. A Critical Review of Statistical Methods for Twin Studies Relating Exposure to Early Life Health Conditions. International Journal of Environmental Research and Public Health. 2021; 18(23):12696. https://doi.org/10.3390/ijerph182312696

Chicago/Turabian Style

Fasola, Salvatore, Laura Montalbano, Giovanna Cilluffo, Benjamin Cuer, Velia Malizia, Giuliana Ferrante, Isabella Annesi-Maesano, and Stefania La Grutta. 2021. "A Critical Review of Statistical Methods for Twin Studies Relating Exposure to Early Life Health Conditions" International Journal of Environmental Research and Public Health 18, no. 23: 12696. https://doi.org/10.3390/ijerph182312696

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop