New Equivalence Tests for Hardy–Weinberg Equilibrium and Multiple Alleles

Ostrovski, Vladimir

doi:10.3390/stats3010004

Open AccessArticle

New Equivalence Tests for Hardy–Weinberg Equilibrium and Multiple Alleles

by

Vladimir Ostrovski

ERGO Group AG, ERGO-Platz 1, 40198 Düsseldorf, Germany

Stats 2020, 3(1), 34-39; https://doi.org/10.3390/stats3010004

Submission received: 18 January 2020 / Revised: 29 January 2020 / Accepted: 31 January 2020 / Published: 5 February 2020

Download Versions Notes

Abstract

:

We consider testing equivalence to Hardy–Weinberg Equilibrium in case of multiple alleles. Two different test statistics are proposed for this test problem. The asymptotic distribution of the test statistics is derived. The corresponding tests can be carried out using asymptotic approximation. Alternatively, the variance of the test statistics can be estimated by the bootstrap method. The proposed tests are applied to three real data sets. The finite sample performance of the tests is studied by simulations, which are inspired by the real data sets.

Keywords:

test; testing; equivalence; Hardy; Weinberg; Equilibrium; asymptotic; bootstrap; simulation study

1. Introduction

Hardy–Weinberg Equilibrium (HWE) plays an important role in the field of the population genetics and related scientific domains. HWE is a common assumption in many areas of research so that assessing the compatibility of observed genotype frequencies with HWE is a basic step of a complete statistical analysis. There are two main approaches to this undertaking: goodness of fit tests and equivalence tests.

A vast amount of literature exists on the goodness of fit tests for HWE, which includes application of the asymptotic

χ^{2}

and likelihood ratio tests. The specific exact goodness of fit tests for HWE are developed in [1,2,3,4,5] among others. The null hypothesis of all these tests is that the underlying population is exactly in HWE. Hence, the goodness of fit tests are tailored to establish lack of compatibility with HWE.

The equivalence tests are appropriate to establish sufficiently good agreement of the observed genotype frequencies with HWE. The exact and approximate equivalence tests for the biallelic case are developed recently in [6,7,8]. To our best knowledge, there are not any equivalence tests for HWE and multiple alleles. Two different equivalence tests are developed in this paper for the case of multiple alleles. The tests can be carried out using the asymptotic approximation or bootstrap method.

A distribution of diploid genotypes at a k-allele locus can be represented as a lower triangular matrix p, where

p (i, j)

is the probability of the genotypes with alleles i and j. Let

a (p)

denote the allele distribution under p. The probability of the allele i under p can be calculated as

a (p, i) = \frac{1}{2} \sum_{j = 1}^{k} (p (i, j) + p (j, i))

. If the population is in HWE, then the genotype distribution fulfills the conditions

p (i, j) = 2 a (p, i) a (p, j)

for

i < j

and

p (i, i) = a {(p, i)}^{2}

. Let

e (p)

denote the genotype distribution under the assumption of HWE, which is implied by the allele distribution

a (p)

.

Euclidean distance

l_{2} (p, e (p))

can be considered a conditional distance between the genotype distributions p and

e (p)

under the joint allele distribution

a (p)

. The equivalence test problem is then defined by

\begin{matrix} H_{0} = \{l_{2} (p, e (p)) \geq ε\} & and & H_{1} = \{l_{2} (p, e (p)) < ε\} \end{matrix}

(1)

where

ε

is a tolerance parameter.

Let

M

denote the family of all possible genotype distributions at HWE. The minimum distance between p and

M

is defined by

d (p, M) = {min}_{q \in M} l_{2} (p, q)

. The corresponding equivalence test problem is given by

\begin{matrix} H_{0} = \{d (p, M) \geq ε\} & and & H_{1} = \{d (p, M) < ε\} \end{matrix}

(2)

We observe the genotype frequencies

p_{n}

of the sample size n. The natural test statistic for (1) is

T_{c} (p_{n}) = \sqrt{n} (l_{2}^{2} (p_{n}, e (p_{n})) - ε^{2}),

(3)

which can be easily computed. The appropriate test statistic for (2) is

T_{m} (p_{n}) = \sqrt{n} (d^{2} (p_{n}, M) - ε^{2}),

(4)

which requires optimization for the calculation of

d (p_{n}, M)

. The test statistic

T_{c} (p_{n})

can be considered a numerically efficient approximation to

T_{m} (p_{n})

because of

l_{2} (p_{n}, e (p_{n})) \geq d (p_{n}, M)

. The subscript * will be used instead of c and m in the reminder of the paper, if statements are appropriate for both cases.

If Hypothesis (1) or (2) of the non-equivalence can be rejected for some appropriate value of

ε

then the true underlying genotype distribution is close to HWE with the probability greater than

1 - α

, where

α

is the nominal level of the test. The appropriate value of

ε

depends on the application and the available sample size. The value of the parameter

ε

can be found by simulation as shown in Section 3.2. Alternatively, the minimum tolerance parameter

ε

, for which

H_{0}

can be rejected, can be computed and reported, see Section 2 for details.

2. Equivalence Tests

In this section, we derive the asymptotic distributions of the test statistics

T_{c} (p_{n})

and

T_{m} (p_{n})

. We provide also an algorithm for the asymptotic and bootstrap-based tests.

Let v be the usual bijective mapping of the matrix p to the vector

(p (1, 1), p (1, 2), \dots, p (k, k))

. Let

{\overset{˚}{d}}_{c}

denote the derivative of the function

q \mapsto l_{2}^{2} (v^{- 1} (q), e (v^{- 1} (q)))

, where q is a vector of length

k^{2}

. The derivative

{\overset{˚}{d}}_{c}

can be derived using the chain rule. Let

p_{0} \in H_{0}

fulfill the boundary condition

l_{2} (p_{0}, e (p_{0})) = ε

and let

q_{0} = v (p_{0})

. Then the asymptotic distribution of

T_{c} (p_{n})

under

p_{0}

is Gaussian with mean zero and variance

σ_{c}^{2} (p_{0}) = {\overset{˚}{d}}_{c} (q_{0}) Σ (q_{0}) {\overset{˚}{d}}_{c} {(q_{0})}^{t}

, where

Σ (q_{0}) = D_{q} - q q^{t}

is a covariance matrix and

D_{q}

is a square diagonal matrix, whose diagonal entries are elements of q. The proof of the statement can be found in [9].

The test statistic

T_{m} (p_{n})

converges weakly under the assumption that there exists a continuous function h on an open neighborhood of

p_{0}

such that

h (p) \in M

and

d (p, M) = l_{2} (p, h (p))

. The existence of a continuous minimizer h is also an important requirement for the numerical computation of

d (p, M)

. We assume the existence of a continuous minimizer h on an open neighborhood of

p_{0}

for the reminder of the paper. Let

{\overset{˚}{d}}_{c}

denote the derivative of the function

q \mapsto l_{2}^{2} (v^{- 1} (q), h (p_{0}))

. Then the asymptotic distribution of

T_{m} (p_{n})

under

p_{0}

is Gaussian with mean zero and variance

σ_{m}^{2} (p_{0}) = {\overset{˚}{d}}_{m} (q_{0}) Σ (q_{0}) {\overset{˚}{d}}_{m} {(q_{0})}^{t}

, see [10] for details.

The asymptotic variance

σ_{*}^{2} (p_{0})

is unknown and can be estimated by

σ_{*}^{2} (p_{n})

. The asymptotic test can be carried out as follows:

(1): Given are the genotype frequencies $p_{n}$ , the tolerance parameter $ε$ and the significance level $α$ .
(2): Compute the tests statistic $T_{*} (p_{n})$ .
(3): Estimate the asymptotic variance by $σ_{*}^{2} (p_{n})$ .
(4): Reject $H_{0}$ if $T_{*} (p_{n}) \leq c_{α} σ_{*} (p_{n})$ , where $c_{α}$ is the lower $α$ -quantile of the normal distribution.

The minimum tolerance parameter

ε

, for which the asymptotic test can reject

H_{0}

, can be computed as

\sqrt{l_{2}^{2} (p_{n}, e (p_{n})) - n^{- \frac{1}{2}} c_{α} σ_{c} (p_{n})}

or

\sqrt{d (p_{n}, M) - n^{- \frac{1}{2}} c_{α} σ_{m} (p_{n})}

correspondingly.

To improve the finite sample performance of the proposed tests, the bootstrap method is applied to estimate the variance of

T_{*} (p_{n})

, see [11], Section 6 for details. The estimator

σ_{*} (p_{n})

is then replaced by the bootstrap estimator of the variance. Otherwise, everything stays the same.

3. Simulation Study

The proposed tests are implemented in R and are freely available on GitHub under https://github.com/TestingEquivalence/HardyWeinbergEquilibriumR. All simulations are performed in R-Studio on a usual scientific workstation.

3.1. Real Data Sets

The equivalence tests are applied to the following data sets, which are already analyzed in the literature on the goodness of fit tests: 1. from rheumatoid arthritis study [12]; 2. from the documentation included with the GENEPOP software package [13]; 3. genotype frequency data at Rhesus locus [14]. The genotype distributions of the data sets are given in Table 1.

The minimum tolerance parameters

ε

, for which the tests can reject the corresponding

H_{0}

, are displayed in Table 2. The distances

d (p_{n}, M)

and

l_{2} (p_{n}, e (p_{n}))

are close to each other in all cases so that

l_{2} (p_{n}, e (p_{n}))

provides a good approximation to

d (p_{n}, M)

. The test results are also similar for

T_{c}

and

T_{m}

. The bootstrap tests are slightly more conservative than the asymptotic tests in all cases.

It could not be shown that data sets 1 and 2 are close to HWE. All goodness of fit tests in [15] reject also the null hypothesis of HWE for data set 2 at the nominal level 0.05. Data set 3 is very close to HWE. This observation corresponds to the results of the goodness of fit tests in [5,15].

3.2. Test Power

In this subsection we study the test power at HWE. We restrict yourself to the genotype distributions at HWE, which are implied by the real data sets from Section 3.1, because the family

M

is very large. To shed some light on the appropriate values of the tolerance parameter

ε

, the test power is computed for different values of

ε

, see Table 3. The value of

ε

may be considered appropriate if the test power is approximately 0.9. Hence, the appropriate value of

ε

is 0.1 for data set 1, 0.1 for data set 2 and 0.018 for data set 3.

The observed genotype frequencies

p_{n}

are subjected to the sampling error. It is important for the test efficiency that the sampling error has a small influence on the test power at HWE. The test power is computed at 100 random genotype frequencies, where the corresponding random samples of size n are drawn from the implied genotype distribution

e (p_{n})

. The simulation results are summarized in Table 4. The power of all considered tests varies little from point to point. Hence, the impact of the sampling error on the test power at HWE is very small.

3.3. Type I error

We study the type I error rates of the proposed tests in this subsection. The boundary of

H_{0}

is so complex that it is very difficult to find boundary points, which have the largest rejection probability. We consider therefore randomly selected boundary points of

H_{0}

, which are based on the three real data sets from Section 3.1. The boundary points are generated using the following algorithm:

Given are $p_{n}$ and $ε$ .
Draw a sample of size n from $p_{n}$ and compute the sample genotype frequency ${\tilde{p}}_{n}$ .
If $T_{*} ({\tilde{p}}_{n}) < 0$ then reject ${\tilde{p}}_{n}$ and repeat step 2. Otherwise accept ${\tilde{p}}_{n}$ .
Consider the linear combination $a {\tilde{p}}_{n} + (1 - a) e (p_{n})$ for $a \in [0, 1]$ . Find $a_{n} \in [0, 1]$ such that $T_{*} (a_{n} {\tilde{p}}_{n} + (1 - a_{n}) e (p_{n})) = 0$ . The value of $a_{n}$ can be found using any line search method.
Return $a_{n} {\tilde{p}}_{n} + (1 - a_{n}) e (p_{n})$ , which is a random boundary point of $H_{0}$ .

The tolerance parameter

ε

is close to

l_{2} (p_{n}, e (p_{n}))

for each data set under consideration so that

a_{n}

is usually not far from 1. The corresponding random boundary point is then close to

p_{n}

. Hence, we explore the boundary of

H_{0}

in the neighborhood of the given data set. The test power at 100 random boundary points is summarized in Table 5. The test power varies considerable from point to point. The asymptotic test based on

T_{c}

is not conservative for all three data sets. The asymptotic test based on

T_{m}

shows some anti-conservative tendencies for data sets 2 and 3. The bootstrap test based on

T_{c}

is conservative for all three data sets. The bootstrap test based on

T_{m}

shows slight non conservative tendencies.

The power at the boundary points is larger than the nominal level due to the following reasons. If the number of observations

n p_{n} (i, j)

is too small for some i and j then the distribution of

T_{*} (p_{n})

may be far away from the normal approximation and also may have considerable jumps. The critical values of the asymptotic and bootstrap tests are then incorrect. If the vector

{\overset{˚}{d}}_{*} (v (p_{n}))

contains zero elements then the power of the asymptotic tests tends to be above the nominal level

α

. The power of the bootstrap tests is closer to

α

in this case because the vector

{\overset{˚}{d}}_{*} (v (p_{n}))

is not used for the variance estimation by the bootstrap method.

4. Summary

Two different test statistics are proposed to establish equivalence of the genotype distributions to HWE. The critical values of the tests are calculated using the asymptotic approximation by the normal distribution. The variance of the test statistic is estimated asymptotically or by the bootstrap method. The minimum tolerance parameter

ε

, for which

H_{0}

can be rejected, is derived. The tests are successfully applied to three real data sets, which are frequently considered in the literature. The test power at HWE and the type I error rates are studied at a large number of points, which are inspired by the real data sets. The asymptotic tests have anti-conservative tendencies and should be used with caution. The bootstrap-based tests are sufficiently conservative for the most practical situations. If more conservative tests are required then the nominal level may be halved or the tolerance parameter

ε

may be reduced. We recommend to perform all proposed tests in any case and compare the results. The appropriate value of

ε

depends on the application and the available sample size. The reasonable values of the parameter

ε

can be found by simulation as shown in Section 3.2. Additionally, the rejection probabilities at the close random boundary points may be studied as shown in Section 3.3.

Funding

This research received no external funding.

Acknowledgments

The author would like to thank the editors and reviewers for the excellent work.

Conflicts of Interest

The author declares no conflict of interest.

References

Levene, H. On a matching problem arising in genetic. Ann. Math. Stat. 1949, 20, 91–94. [Google Scholar] [CrossRef]
Haldane, J.B.S. An exact test for randomness of mating. J. Genet. 1954, 52, 631–635. [Google Scholar] [CrossRef]
Chapco, W. An exact test of the Hardy-Weinberg law. Biometrics 1976, 32, 183–189. [Google Scholar] [CrossRef]
Louis, E.J.; Dempster, E.R. An exact test for Hardy-Weinberg and multiple alleles. Biometrics 1987, 43, 805–811. [Google Scholar] [CrossRef] [PubMed]
Guo, S.; Thompson, E. Performing the exact test of Hardy-Weinberg proportion for multiple alleles. Biometrics 1992, 48, 361–372. [Google Scholar] [CrossRef]
Wellek, S. Tests for establishing compatibility of an observed genotype distribution with Hardy Weinberg equilibrium in the case of biallelic locus. Biometrics 2004, 60, 694–703. [Google Scholar] [CrossRef]
Wellek, S. Testing Statistical Hypotheses of Equivalence and Noninferiority, 2nd ed.; Chapman and Hall/CRC: Boca Raton, FL, USA, 2010. [Google Scholar]
Wellek, S.; Goddard, K.; Ziegler, A. A confidence-limit-based approach to the assessment of Hardy Weinberg equilibrium. Biom. J. 2010, 52, 253–270. [Google Scholar] [CrossRef]
Ostrovski, V. New equivalence tests for approximate independence in contingency tables. Stats 2019, 2, 239–246. [Google Scholar] [CrossRef] [Green Version]
Ostrovski, V. Testing equivalence to families of multinomial distributions with application to the independence model. Stat. Probab. Lett. 2018, 139, 61–66. [Google Scholar] [CrossRef]
Efron, B.; Tibshirani, R.J. An Introduction to the Bootstrap, 1st ed.; Chapman & Hall: New York, NY, USA, 1993. [Google Scholar]
Wordsworth, P.; Pile, K.; Buckley, J.; Lanchbury, J.; Ollier, B.; Lathrop, M.; Bell, J. HLA heterozygosity contributes to susceptibility to rheumatoid arthritis. Am. J. Hum. Genet. 1992, 51, 585–591. [Google Scholar] [PubMed]
Rousset, F. A complete re-implementation of the genepop software for Windows and Linux. Mol. Ecol. Res. 2008, 8, 103–106. [Google Scholar] [CrossRef] [PubMed]
Cavalli-Sforza, L.; Bodmer, W. The Genetics of Human Populations; W. H. Freeman: San Francisco, CA, USA, 1971. [Google Scholar]
Engels, W.R. Exact tests for Hardy-Weinberg proportions. Genetics 2009, 183, 1431–1441. [Google Scholar] [CrossRef] [PubMed] [Green Version]

Table 1. The data sets: (1) from rheumatoid arthritis study, [12]; (2) from the documentation included with the GENEPOP software package, [13]; (3) genotype frequency data at Rhesus locus, [14].

(1)

5
40	12
6	32	2
30	55	15	33

(2)

2
12	24
30	34	54
22	21	20	10

(3)

1236
120	3
18	0	0
982	55	7	249
32	1	0	12	0
2582	132	20	1162	29	1312
6	0	0	4	0	4	0
2	0	0	0	0	0	0	0
115	5	2	53	1	149	0	0	4

Table 2. Minimum tolerance parameter

ε

, for which

H_{0}

can be rejected at the nominal level

0.05

. A stands for the asymptotic test and B stands for the bootstrap test.

Table 2. Minimum tolerance parameter

ε

, for which

H_{0}

can be rejected at the nominal level

0.05

. A stands for the asymptotic test and B stands for the bootstrap test.

Data Set	n	$l_{2} (p_{n}, e (p_{n}))$	$d (p_{n}, M)$	$T_{c}$ A	$T_{c}$ B	$T_{m}$ A	$T_{m}$ B
1	230	0.102	0.101	0.130	0.134	0.130	0.132
2	229	0.126	0.118	0.159	0.164	0.149	0.153
3	8295	0.013	0.013	0.017	0.019	0.018	0.018

Table 3. Simulated rejection probability of the equivalence tests at the nominal level

0.05

. The rejection probability is simulated for different values of the tolerance parameter

ε

at the HWE distributions

e (p_{n})

, which are implied by data sets 1, 2 and 3. The sample size equals the size of the corresponding data set. The number of replications is 1000 for each experiment. A stands for the asymptotic test and B stands for the bootstrap test.

Table 3. Simulated rejection probability of the equivalence tests at the nominal level

0.05

. The rejection probability is simulated for different values of the tolerance parameter

ε

at the HWE distributions

e (p_{n})

, which are implied by data sets 1, 2 and 3. The sample size equals the size of the corresponding data set. The number of replications is 1000 for each experiment. A stands for the asymptotic test and B stands for the bootstrap test.

	Data set 1				Data set 2				Data set 3
$ε$	0.07	0.08	0.09	0.10	0.07	0.08	0.09	0.10	0.012	0.014	0.016	0.018
$T_{c}$ , A	0.56	0.75	0.87	0.95	0.53	0.74	0.87	0.94	0.67	0.82	0.91	0.96
$T_{c}$ , B	0.40	0.63	0.79	0.90	0.40	0.62	0.80	0.90	0.50	0.71	0.83	0.91
$T_{m}$ , A	0.54	0.74	0.87	0.94	0.58	0.78	0.89	0.96	0.61	0.77	0.88	0.95
$T_{m}$ , B	0.49	0.70	0.85	0.93	0.53	0.74	0.86	0.94	0.58	0.75	0.86	0.94

Table 4. Summary of the simulated rejection probabilities at the nominal level

0.05

. The rejection probabilities are simulated at the 100 random samples from the HWE distributions

e (p_{n})

, which are implied by data sets 1, 2 and 3. The sample size equals the size of the corresponding data set. The number of replications is 1000 for each experiment. A stands for the asymptotic test and B stands for the bootstrap test.

Table 4. Summary of the simulated rejection probabilities at the nominal level

0.05

. The rejection probabilities are simulated at the 100 random samples from the HWE distributions

e (p_{n})

, which are implied by data sets 1, 2 and 3. The sample size equals the size of the corresponding data set. The number of replications is 1000 for each experiment. A stands for the asymptotic test and B stands for the bootstrap test.

	Data set 1, $ε = 0.1$				Data set 2, $ε = 0.1$				Data set 3, $ε = 0.018$
	min	max	mean	dev	min	max	mean	dev	min	max	mean	dev
$T_{c}$ , A	0.93	0.97	0.95	0.008	0.93	0.97	0.94	0.008	0.94	0.98	0.97	0.007
$T_{c}$ , B	0.87	0.92	0.90	0.010	0.88	0.94	0.90	0.012	0.89	0.94	0.91	0.010
$T_{m}$ , A	0.92	0.96	0.95	0.009	0.93	0.99	0.96	0.012	0.92	0.97	0.95	0.007
$T_{m}$ , B	0.90	0.95	0.93	0.010	0.91	0.99	0.95	0.015	0.91	0.96	0.94	0.009

Table 5. Summary of the simulated rejection probabilities at the nominal level

0.05

. The rejection probabilities are simulated at the 100 randomly selected boundary points of

H_{0}

. The sample size equals the size of the corresponding data set. The number of replications is 1000 for each experiment. A stands for the asymptotic test and B stands for the bootstrap test.

Table 5. Summary of the simulated rejection probabilities at the nominal level

0.05

. The rejection probabilities are simulated at the 100 randomly selected boundary points of

H_{0}

. The sample size equals the size of the corresponding data set. The number of replications is 1000 for each experiment. A stands for the asymptotic test and B stands for the bootstrap test.

	Data set 1, $ε = 0.1$				Data set 2, $ε = 0.1$				Data set 3, $ε = 0.018$
	min	max	mean	dev	min	max	mean	dev	min	max	mean	dev
$T_{c}$ , A	0.017	0.060	0.035	0.009	0.023	0.065	0.044	0.009	0.051	0.114	0.090	0.011
$T_{c}$ , B	0.005	0.036	0.018	0.006	0.013	0.048	0.029	0.006	0.012	0.052	0.033	0.008
$T_{m}$ , A	0.019	0.049	0.031	0.006	0.025	0.077	0.046	0.009	0.034	0.075	0.051	0.008
$T_{m}$ , B	0.011	0.042	0.022	0.006	0.016	0.064	0.034	0.008	0.025	0.064	0.038	0.008

© 2020 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ostrovski, V. New Equivalence Tests for Hardy–Weinberg Equilibrium and Multiple Alleles. Stats 2020, 3, 34-39. https://doi.org/10.3390/stats3010004

AMA Style

Ostrovski V. New Equivalence Tests for Hardy–Weinberg Equilibrium and Multiple Alleles. Stats. 2020; 3(1):34-39. https://doi.org/10.3390/stats3010004

Chicago/Turabian Style

Ostrovski, Vladimir. 2020. "New Equivalence Tests for Hardy–Weinberg Equilibrium and Multiple Alleles" Stats 3, no. 1: 34-39. https://doi.org/10.3390/stats3010004

Article Menu

New Equivalence Tests for Hardy–Weinberg Equilibrium and Multiple Alleles

Abstract

1. Introduction

2. Equivalence Tests

3. Simulation Study

3.1. Real Data Sets

3.2. Test Power

3.3. Type I error

4. Summary

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI