Next Article in Journal
Altered Retrograde Signaling Patterns in Breast Cancer Cells Cybrids with H and J Mitochondrial DNA Haplogroups
Next Article in Special Issue
The Clinical Spectrum and Disease Course of DRAM2 Retinopathy
Previous Article in Journal
Galunisertib Exerts Antifibrotic Effects on TGF-β-Induced Fibroproliferative Dermal Fibroblasts
Previous Article in Special Issue
Next-Generation Sequencing Screening of 43 Families with Non-Syndromic Early-Onset High Myopia: A Clinical and Genetic Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Relative Frequencies of PAX6 Mutational Events in a Russian Cohort of Aniridia Patients in Comparison with the World’s Population and the Human Genome

by
Tatyana A. Vasilyeva
1,
Andrey V. Marakhonov
1,*,
Sergey I. Kutsev
1 and
Rena A. Zinchenko
1,2
1
Research Centre for Medical Genetics, 115522 Moscow, Russia
2
N.A. Semashko National Research Institute of Public Health, 105064 Moscow, Russia
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2022, 23(12), 6690; https://doi.org/10.3390/ijms23126690
Submission received: 14 May 2022 / Revised: 6 June 2022 / Accepted: 13 June 2022 / Published: 15 June 2022
(This article belongs to the Special Issue Genetics of Eye Disease 2.0)

Abstract

:
Genome-wide sequencing metadata allows researchers to infer bias in the relative frequencies of mutational events and to predict putative mutagenic models. In addition, much less data could be useful in the evaluation of the mutational frequency spectrum and the prevalent local mutagenic process. Here we analyzed the PAX6 gene locus for mutational spectra obtained in our own and previous studies and compared them with data on other genes as well as the whole human genome. MLPA and Sanger sequencing were used for mutation searching in a cohort of 199 index patients from Russia with aniridia and aniridia-related phenotypes. The relative frequencies of different categories of PAX6 mutations were consistent with those previously reported by other researchers. The ratio between substitutions, small indels, and chromosome deletions in the 11p13 locus was within the interval previously published for 20 disease associated genomic loci, but corresponded to a higher end due to very high frequencies of small indels and chromosome deletions. The ratio between substitutions, small indels, and chromosome deletions for disease associated genes, including the PAX6 gene as well as the share of PAX6 missense mutations, differed considerably from those typical for the whole genome.

1. Introduction

Different genomic loci may be subject to specific mutagenic mechanisms. Replication errors and/or DNA damage leave their own footprints in the sequence leading to different spectra of mutations occurring at different genomic loci [1]. Heterozygous PAX6 pathogenic variants or chromosome 11p13 rearrangements cause congenital aniridia, related phenotypes, and, on rare occasions, other phenotypes such as isolated optic nerve malformations, foveal hypoplasia, and achromatopsia. The main underlying mechanism consists in the haploinsufficiency of PAX6 function. On the one hand, the very first PAX6 researchers pointed to specificities of 11p13 genome locus mutational events frequencies [2]. On the other hand, the spectra of PAX6 gene mutational events in different cohorts have recently become more explored, though the degree of their similarity and the relative frequencies of mutations remain unclear. In both of these respects, a comparative study of PAX6 mutations spectrum and its consideration in the world population context, as well as in the context of the whole genome, could be useful, as it could help to elucidate spectrum peculiarities, which could point to putative mechanisms of local 11p13 mutations.

2. Results

2.1. PAX6 Mutation Type Proportions and Their Consistency with Mutational Spectra from Three Large Studies of Aniridia Patients

The current study revealed 138 disease-causing variants in the PAX6 gene and 61 chromosome 11p13 rearrangements. Defined chromosome deletions varied from 0.35 to 7.5 Mb in size. The 138 small intragenic PAX6 variants included 53 nonsense pathogenic variants, 30 splicing changes, 34 frame shifts, 8 CTE (C-terminal extension), 2 start codon losses, 7 missense pathogenic variants, and 4 pathogenic variants in 5′-UTR sequence (Supplementary Tables S1 and S2) [3,4,5,6].
The PAX6 mutations ratio obtained in the current study was characterized by high proportions of de novo mutations in the sample (147 sporadic cases out of 199, or 73.9%) as well as of chromosome deletions (61 out of 199, or 30.7%), frequent 3′-cis-regulatory region deletions (15 out of 199, or 7.5%), and a rarity of nonsynonymous substitutions (7 out of 199, or 3.5%), while the great majority of sequence PAX6 variants were premature stop codon formation mutations (117 out of 199, or 58.8%).
The comparison of PAX6 mutation types was carried out for (i) four studies with the number of examined families more than 50 (from Great Britain [7], the USA [8], China [9], and Russia (current study)) as well as for (ii) five other studies with smaller numbers of patients (13 < N < 30) (Table 1). The analysis demonstrated a common pattern of PAX6 mutation types in different populations around the world, even considering small cohorts (Kruskal–Wallis test, H = 15.05, approximate p-value = 0.0582).
Highlighting the similarities and differences between the cohorts, the proportion of 11p13 chromosome deletions presumably was lower in other studies, but, in some cases, it could be higher (30/77, or 39% [10]); the same was true for the fraction of 3′-cis-regulatory region deletion which could account for 11/141, or 7.8% [11]. An extremely high frequency of indels in the PAX6 gene was observed in every study and seemed to be typical. That last observation was also noted by Bobilev et al. in their PAX6 mutation study [8]. Very few missense pathogenic variants were identified in seven Russian families with congenital aniridia or related phenotypes (7/199, 3.5%),but a similar fraction was also typical and previously observed (6/84, 7.1%) [2].

2.2. Relative Frequencies of PAX6 Mutations vs. Data on Other AD Diseases Associated Genes

To compare data from the current study with the pattern of spontaneous mutations in other genes and the whole human genome, we counted all our variants as de novo events, which was permissible for congenital aniridia since even familial mutations arose as de novo. For the Russian cohort, we have 91 single nucleotide substitutions, 47 indels of <50 bp in length, and 61 de novo chromosome events.
Firstly, we compared the PAX6 ratio obtained in the current study with data for 20 genes associated with dominantly inherited fully penetrant diseases reviewed by Kondrashov [17]. Mutations in twenty diseases associated genes were defined by targeted sequencing and copy number variation analysis as they were in our study.
We received a higher proportion between substitutions and small intragenic indels (less than 50 bp), at a ratio of 91:47 (~1.9:1). The ratio could vary across different gene loci, but the proportion 1.9:1 was close to the highest edge of the interval published by Kondrashov. The ratio among 20 loci varied from 0.6:1 for the APC gene to 15.6:1 for the F9 gene [17]. While comparing the incidence of small indels in two well characterized genes, PAX6 and F9, Prosser and van Heyningen also noted that in the PAX6 locus they occurred several times more frequently than in the F9 [2]. This observation appeared not to be a result of sampling bias, as was suggested by Proser and van Heyningen [2], since the PAX6 gene locus usually showed the same high proportion of small indels (Table 1) and the PAX6 and F9 genes were located at the opposite sides of distribution of the proportions (Table 2).
The occurrence of 11p13 chromosome large rearrangements seemed to be too frequent in our study as well as in other PAX6 studies, and evidently was typical for the locus (Table 1). In our cohort, 11p13 chromosome deletions were responsible for 30.7% of the aniridia cases. The ratio between substitutions and chromosome events counted for 91:61 (1.5:1) in the current study. Compared with 20 disease-associated genes in Kondrashov’s review, the share of 11p13 deletions was higher than for almost all reviewed loci; and it was as large as for the VHL and APC genes (1.6:1 and 2.1:1, respectively), and less only than F8 and DMD genes (0.7:1 and 0.5:1, respectively) (Table 2) [17]. Furthermore, according to our data, 11p13 CNVs occurred more frequently (the ratio between substitutions and CNVs was 1.5:1) than it was estimated in the Kondrashov’s review (7.4:1). The discrepancy could be related to the improvement of the molecular diagnosis methods and routine usage of CNVs analysis in the modern aniridia DNA diagnosis protocols.

2.3. Relative Frequencies of PAX6 Mutations vs. the Whole Genome De Novo Mutations

To assess relative frequencies of PAX6 mutations in the background of the whole genome, we compared the ratio between substitutions, micro indels, and large chromosome events revealed in this study with that obtained a sequencing project for the whole genome in 250 Dutch families (Table 3) [18,19]. The comparison demonstrated a distinguished PAX6 mutation pattern with elevated proportions of small indels and chromosome deletions. The ratio between de novo substitutions, indels, and large deletions in the current study was 91:47:61 (1.5:0.8:1), and differed from the whole genome proportion of 11020:291:41 (268.8:7.1:1) (Freeman–Halton extension of Fisher’s exact test p-value = 0.0036).
Indeed, defined in healthy samples, de novo large CNVs seemed to be relatively rare compared to indels and SNVs (one de novo CNV per 7 births occurs, while on average 9 de novo indels and 73 de novo SNVs per birth occur) [20].
As mentioned above in our study, PAX6 missense variants were identified in few probands (7/199, or 3.5%), and such a proportion appeared to be typical for the PAX6 gene. We tried to infer whether the fraction of missense variants in the PAX6 gene indeed distinguished the locus from the whole human genome.
Previously, Hanson et al. pointed to the highly biased PAX6 mutations spectrum in aniridia patients with only 2% of missense substitutions, as the missense rarity was proposed to be one more feature of the locus [21]. In a retrospective review of PAX6 molecular diagnosis in a large cohort of patients with several different clinical diagnoses, the share of missense pathogenic variants could reach 12.2% (31/254 [11]). Thus, PAX6 missense pathogenic variants could be observed in extended samples of the patients, although in a much lower proportion than expected based on the whole genome data. In the whole genome databases of 10 930 small-scale de novo mutations for monogenic disorders, 37% of mutations were missense [22]. The ratio between de novo nonsynonymous and nonsense pathogenic variants in the coding region established in the 78 Icelandic Trios sequencing project was ~30:1 [23]. While even considering all known PAX6 missense variants, including benign variants met in healthy samples (and registered in gnomAD datasets) and pathogenic missense variants identified in affected people (gathered in the HGMD dataset), the proportion between PAX6 nonsynonymous and nonsense substitutions would be only ~2:1 (367:179) (https://gnomad.broadinstitute.org/ (accessed on 14 May 2021) and http://www.hgmd.cf.ac.uk/ (accessed on 14 May 2021)). The number 367 is formed by 133 known PAX6 missense pathogenic variants from HGMD and 234 ones known from gnomAD; the number 179 is formed by 79 nonsense pathogenic variants from HGMD database plus at least 100 different cases of PAX6 hot spot nonsense mutations (20% of all known PAX6 mutations according to Tzoulaki’s statistics), counting every case of recurrent nonsense mutation in the hotspots as a mutation [24]. The proportion ~2:1 between PAX6 nonsynonymous and nonsense substitutions demonstrated an extremely lower share of PAX6 missense mutations.
The rarity of missense mutations in the PAX6 gene is possibly explained by strong negative selection pressure towards them, as some PAX6 nonsynonymous substitutions could cause severe consequences. It has been proposed that they have a dominant negative effect and lead to severe or nonviable phenotypes [25].

2.4. Comparison of the Sequence Context of PAX6 Mutations with Data of the PAX6 Mutational Review by Prosser and Van Heyningen

Next, we tried to assess the sequence context of PAX6 substitutions. To explore the contexts of pathogenic PAX6 variants and to characterize the mutation process in the cohort, we compared our study with previous ones. In 1998, Prosser and van Heyningen [2] reviewed the spectrum and sequence context of known PAX6 pathogenic variants (N = 84) and discussed their putative mechanisms. In our cohort, we have a comparable number of intragenic PAX6 pathogenic variants (N = 138) (Table 4). A comparison of mutational spectrum in our cohort of patients with those analyzed by Prosser and van Heyningen revealed no significant difference (Mann–Whitney U = 10, exact p-value = 0.2403). Thus, benchmarking could be relevant.
Proser published a 24:16 ratio of transitions at CpG to non-CpG dinucleotides, while we reported 31:20. Therefore, in both studies it was about 3:2 [2].
In our study, C-to-T substitutions occurred in 6 CpG PAX6 sites, in codons 44 (once), 103 (3 times), 203 (8 times), 240 (10 times), 261 (2 times), and 317 (3 times). In addition, the difference with Prosser’s analysis accounted for two points. We did not meet changes in CpG in 128 and 208 positions, but met them in two other ones: in 44 (once) and in known hotspot 261 (twice).
These eight CpG sites are highly mutable and present de novo PAX6 mutation hot spots in every study in different populations. All in all, there are 44 CpG dinucleotides in the coding sequence of the PAX6 gene [2]. Prosser proposed that mutation hotspots should be sites of methylation. They could not, at least, be the positions of these eight hot spots not constantly methylated (UCSC methylation map, https://genome.ucsc.edu/ (accessed on 14 May 2021)). It is currently generally accepted that methylation on its own does not entirely explain the mutability of CpG sites [26].
Other hot spot mutations identified in our cohort were localized at positions c.109, c.265, c.300, c.357+1, c.467, c.916+1, c.1183, c.1183+2, and c.1268. Probands from 18 unrelated families were found to have known changes at those points.
Hot spot mutations defined in the Russian cohort counted for 24% of all mutations (48/199). That corresponded with Kondrashov’s conclusion that mutations in hot spots (those sites where independent mutations were observed three or more times were regarded as hot spots) might make only a minor contribution to spontaneous substitutions [17], though the share of 24% did not seem literally “minor.” Recurrent mutations could explain why all PAX6 mutations arose in the same proportions in different populations, according to same rules possibly dictated by local 11p13 chromosome context and architecture.

2.5. Comparison of the Sequence Context of PAX6 Mutations with the Whole Genome Data

We assessed the differences in frequencies of context-specific mutational events in the PAX6 locus and in the whole genome. The ratio between PAX6 transitions and transversions obtained here, 61:31 (2:1), was in consistent with a reported ratio for the whole genome [27]. Transitions to a transversions ratio appeared to be 2:1 across the whole genome, 3:1 in the coding part of the genes, and 2:1 for nonsynonymous substitutions in the coding sequence [28]. The ratio between de novo transitions and transversions was defined as 2:1, both in the UK10K Project and in the 78 Islandic Trio analysis [23,29].
The ratio of transitions at CpG to non-CpG dinucleotides as defined for the PAX6 coding sequence in the Russian cohort was 31:20. Similar transition distribution between CpG and non-CpG dinucleotides could be observed among nonsense pathogenic variants in 20 disease-associated loci [17]. The ratio between the number of C-to-T substitutions (as well as complementary G-to-A substitutions) to the number of changes in the opposite direction (T-to-C or A-to-G) obtained here was 25:12 (2.1:1), which was consistent with a previously published ratio (2.15:1) The ratio was established based on direct approaches to genome wide de novo substitution estimations in families [23].
It was interesting to compare the ratio of novel and recurrent mutations for the PAX6 locus and for the whole genome. In our study, recurrent PAX6 sequence variants were defined in 75 probands, while novel were defined in 63 probands (the ratio was approximately 1.2:1). That ratio was also typical for other PAX6 studies (61:70, ≈1:1.1 [11]), as well as for the novelty rate of the whole genome mutations 1:1 [30].

3. Discussion

A large cohort of 199 patients from Russia with aniridia or aniridia-related phenotypes was studied for the PAX6 mutation spectrum, considering the background of the PAX6 mutation spectrum in the world’s population and the ratio of de novo mutations in the whole genome.
We compared our results with studies of three large cohorts of aniridia patients (each with more than 50 patients): Robinson et al. [7], Bobilev et al. [8], and Bing You et al. [9], who screened their patients by searching for the PAX6 gene point pathogenic variants and 11p13 chromosome deletions. The similarity of methodology considering the high frequency of 11p13 chromosome events allowed for a comparison of results obtained in all three studies. Robinson et al. [7] analyzed a number of patients with both syndromic aniridia and WAGR syndrome. Based on our experience, WAGR patients are rarer in the cohorts of patients with isolated aniridia, as they receive karyotyping diagnostics very early. That possibly explains a higher share of chromosome events in Robinson’s study, although the discrepancy was not relevant.
Then we compared our data with that obtained earlier for other disease-associated loci. A comparison with Kondrashov’s study [17] of rates of different types of mutations in 20 disease-associated loci was acceptable, as in both analyses data were obtained (i) for loci where loss-of-function mutations produce clear-cut phenotypes, (ii) about 100 independent patients with mutations in the locus were analyzed, and (iii) a fraction of patients with no pathogenic mutations detected at the locus was well below ½. Kondrashov also considered only loci associated with diseases with dominant inheritance, full penetrance, and haploinsufficiency as the pathogenic mechanism, for which all coding sequence and chromosome events were routinely analyzed [17].
In addition, for the comparison with the whole genome data, we gathered relative frequencies of different types of mutations estimated in two analyses. Both analyses were based on the same project of the whole genome NGS sequencing of 250 Dutch parent offspring families [18,19]. One analyzed de novo SNVs and indels, while the other analyzed structure variants.
In our study, we defined 138 PAX6 intragenic variants, including 53 nonsense mutations, 30 changes of splicing, 34 frame shifts, 8 CTE, 2 start codon losses, 7 missense mutations, and 4 pathogenic variants in 5′-UTR sequence and 61 chromosome 11p13 rearrangements.
Despite the high share of de novo mutations, which were four times more frequent than inherited ones, the PAX6 mutational events relative frequencies in Russian aniridia patients’ cohort remained the same as those obtained in earlier studies of different populations. The similarity between different populations might provide an insight into the common mechanism of PAX6 locus mutations constantly arising at a specific rate for each type of mutation. The resolving condition and perhaps direct reason for this might be specific base composition, repeat content, chromatin state and structure, nucleosome position, methylation, and other epigenetics marks, as well as repair efficiency and other specific related properties of this genome locus [1,31]. The contribution of the hot spot events in the PAX6 mutation spectrum (which was defined as 24%) seemed not to be so modest as was previously established by Kondrashov for other 20 disease-associated genes [17]. Peculiar for the PAX6 locus mutations spectrum is a higher proportion between substitutions and chromosome deletions (91:61, approximately 1.5:1). The frequency could be typical for the locus, as 11p13 deletions appeared to be characterized by some common features. Previously, we showed that the 11p13 locus had a propensity to be de novo broken, presumably on the paternal 11p13 chromosome [32]. Another peculiarity among the PAX6 gene features is a higher proportion between substitutions and small (<50 bp) indels, which was defined as 91:47 (approx. 2:1). Ratios obtained here were within the interval reported by Kondrashov for 20 disease-associated loci, but corresponded to its higher end. The rarity of the PAX6 gene missense mutations accounts for one more spectrum peculiarity. We could observe “levelling out the differences” between PAX6 mutations types. Several possible factors could play role, such as high conservativeness of the PAX6, the deleterious effect of most PAX6 substitutions, and higher frequencies of the indels and CNVs typical for the region.
On the putative mechanism of the PAX6 gene mutations formation, Prosser has suggested DNA damage is influenced by environment factors as well as endogenous processes. But a collation of the data of different population seems to demonstrate no differences in the PAX6 mutational spectra, which implies that PAX6 mutations arise through the same mechanisms. Indeed, endogenous processes as a principal factor lead to the same rates of different kinds of mutations in the same locus. The rates strongly correlate inside numerous genome loci depending on, e.g., sequence context and recombination rate [33]. The relative frequency of high PAX6 indels has been previously reported [8]. Along the human genome, small indels frequency varies widely. About 40% of small indels occur in 4% of the genome, and they arise 16 times rarer than SNPs in the remaining 96% of the genome sequence [34]. The frequency of indels observed here in the PAX6 locus could be explained, e.g., by DNA strand slippage and/or breaks in sites of high chromatin rigidity due to low local cleavage intensity, as indels distribution was shown to be strongly associated with low DNA cleavage intensity [35]. Moreover, the same chromatin features could correspond to the breaks’ formation and a high frequency of large chromosome rearrangements in the 11p13 locus [35].
Interestingly, the proportions between SNVs, indels, and CNVs for 20 disease-associated loci, including the PAX6 gene, differed considerably from de novo genome-wide mutations shares. Evidently such a shift of proportions between de novo SNVs, indels, and CNVs from 44.3:11.4:1 [1] for the whole genome to 5.1:1.3:1 (total sums) for 20 disease-associated loci [17], and to 1.5:0.8:1 for 11p13 locus obtained here, could be explained by a conservativeness of the disease-associated coding sequence and selection impact [36]. Functional consequences of frame shifting indels are profoundly serious, and they obviously are strongly influenced by purifying selection [34]. Selection pressure towards substitutions along the genome could vary and be weaker, possibly explaining a lower portion of indels among the genome (11.4 ÷ 16:1) [1,34]. The lower ratio between indels and substitutions indicates less selection pressure on substitutions. That proportion is changing for conservative protein coding sequences. Considering PAX6 along with the conservativeness of other disease-associated genes and the deleterious effect of substitutions, a shift in the proportion between SNVs and indels for disease associated loci seems to be expected. However, the overrepresentation of indels among disease-causative variants was previously shown [37]. In addition, the proportion could vary depending on the mutation rate variance in functionally different genome loci. Therefore, the de novo mutation rate is proposed to be higher in functionally active regions, in coding and regulatory ones, partially due to abundant CpG content [1,38].
Although we obtained a similar characteristic proportion between transitions and transversions in the PAX6 locus and in a healthy genome, we need to keep in mind that we considered a sample of affected patients, and that might and should influence the proportions between sequence lesion types. Accordingly, the comparison of proportions of different lesion types with other disease-associated genes was undertaken.

4. Materials and Methods

The sample consisted of patients with aniridia clinical features and identified molecular genetic cause of the phenotype, which was either 11p13 deletion or intragenic pathogenic/likely pathogenic PAX6 variant. All such patients were referred to the Research Centre for Molecular Genetic for genetic testing.
A total of 248 patients from 199 unrelated families were included in the study. Of these, 118 (47.6%) are male and 130 (52.4%) are female. The mean age at examination was 16.8, median 9.0 years (25%–75%, range 3–28, minimum 0.2, maximum 65.0). Of the 199 probands, 147 (73.9%) were sporadic and 52 (26.1%) had a family history. Of the 13 patients were diagnosed with the WAGR syndrome, 1 was diagnosed with WAGRO, 1 family was diagnosed with achromatopsia, 1 more was diagnosed with optic nerve hypoplasia, and the remaining 183 families received a diagnosis of congenital aniridia. The patients were examined by ophthalmologists from four clinics, the Research Centre for Medical Genetics, Moscow; the Cheboksary Branch of the S. Fyodorov Eye Microsurgery Federal State Institution; the Moscow Helmholtz Research Institute of Eye Diseases; and the National Medical Research Centre for Children’s Health, Moscow.
The majority of the patients (181/248, or 73%) considered themselves to be Russian, while the rest of the patients considered themselves as belonging to several nationalities: Tatar (17/248, 6.9%), Ukrainian (6/248, 2.4%), Chuvash (7/248, 2.8%), Kazakh (6/248, 2.4%), Kirgiz (6/248, 2.4%), Belarusian (N = 3), Moldovan (N = 3), Ashkenazi (N = 3), Chechen (N = 4), Udmurts (N = 3), Armenian (N = 2), Avar (N = 2), Dargin (N = 2), Karachai (N = 2), and Bulgarian (N = 1).
Screening for the 11p13 rearrangements was carried out by Multiplex Ligation-dependent Probe Amplification (MLPA analysis) by using SALSA MLPA Probemix P219 according to manufacture protocol (www.mlpa.com) Screening for pathogenic variants in the PAX6 gene was carried out by a Sanger sequencing analysis of 12 PAX6 exons (3–13) and their exon-intron boundaries.
Chromosome 11p13 deletions were designated according to the MLPA probes with reduced signal coordinates according to the NCBI36/hg18 assembly of the human genome. Intragenic pathogenic variants were named based on PAX6 transcript variant 1 (NM_000280.4) according to the HGVS nomenclature. The pathogenic status of single nucleotide variation for interpretation of sequence variants was established using the consensus recommendations of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology [39]. Only pathogenic and likely pathogenic genetic changes were reviewed.
Mutations identified in 139 index patients were described in our previous publications [3,4,5,6], while for 60 families. 46 mutations were newly reported. They were 31 PAX6 intragenic mutations identified in 36 families and 15 different deletions defined in 24 families. Mutations of new index patients are listed in Supplementary Tables S1 and S2. In total, the proportion of familial to sporadic probands counted for 52/147. DNA analysis was carried out by using a combination of multiplex ligation-dependent probe amplification (MLPA) analysis for searching for copy number variations (CNVs) in 11p13 chromosome region, followed by Sanger sequencing for PAX6 intragenic variants searching [3].
We compared our proportions with those observed in three large studies of aniridia patients from the USA, Great Britain, and China [7,8,9]. We also included five smaller cohorts in the comparative analysis (from France [13], Australia [14], Korea [15], India [16], and another smaller cohort from Great Britain [12]). To compare the 11p13 locus with other genome loci we used data for 20 disease-associated genes reviewed by Kondrashov [17]. Kondrashov analyzed only data for diseases with dominant inheritance, nearly full penetrance, and haploinsufficiency as molecular mechanisms of pathogenesis, so it was of relevance to compare PAX6 spectrum with those 20 reviewed loci. The spectrum of genome-wide de novo mutations was taken from a sequencing project of 250 Dutch families for the whole genome [18,19]. Kruskal–Wallis H test, Freeman–Halton’s extension of Fisher’s exact test, and the Mann–Whitney U test were used in GraphPad Prism 8.0.1 (GraphPad Software, CA, USA) to compare proportions of different types of PAX6 mutations among several cohort studies, as well as with estimation of relative frequencies of de novo mutational events in human genome based on the data from the sequencing projects listed above.

5. Conclusions

Compared to worldwide aniridia patients’ samples, the relative frequencies of PAX6 mutations obtained in our study retained the same rate, and could characterize PAX6 spectrum. Compared to the whole genome, the relative frequencies or de novo mutations typical for the PAX6 mutational spectrum stands out from the typical for the healthy genome, but is part of the interval for other disease-associated genes spectra. This is the case, firstly, due to shifting ratios between the mutations of different types in disease-associated genes, and, secondly, due to intrinsic peculiarities of the PAX6 locus with a higher frequency of small indels and chromosome deletions and a lower frequency of missense mutations.

Supplementary Materials

The following supporting information can be downloaded at: www.mdpi.com/article/10.3390/ijms23126690/s1.

Author Contributions

Conceptualization, T.A.V. and A.V.M.; methodology, A.V.M.; software, A.V.M.; validation, T.A.V.; formal analysis A.V.M.; investigation, T.A.V.; resources, R.A.Z.; data curation, T.A.V.; writing—original draft preparation, T.A.V.; writing—review and editing, A.V.M.; visualization, T.A.V.; supervision, R.A.Z.; project administration, R.A.Z.; funding acquisition, A.V.M., R.A.Z. and S.I.K. All authors have read and agreed to the published version of the manuscript.

Funding

This study was partly supported by the Russian Science Foundation, grant no. 22-25-00824.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board of the Research Centre for Medical Genetics, Moscow, Russia (protocol no. 2017-4/1 dated 4 May 2017).

Informed Consent Statement

Informed consent was obtained from all subjects and/or their legal representative, as appropriate, involved in the study to publish this paper.

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Acknowledgments

We are grateful to the doctors of our and other ophthalmological centers who took part in the cohort examination, they were Olga Khlebnikova, Vitaly Kadyshev, Natella Sukhanova from Research Centre for Medical Genetics (Moscow, Russian Federation), Anna Voskresenskaya and Nadezhda Pozdeeva from S. Fyodorov Eye Microsurgery Federal State Institution (Cheboksary, Russian Federation), and Barbara Käsmann-Kellner from Universitätsklinikum des Saarlandes und Medizinische Fakultät der Universität des Saarlandes (Homburg, Saarland, Germany). Here we also acknowledge a great support of the patients’ families, Aniridia patients supporting organization “Iris”, and its president Galina Gening.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Francioli, L.C.; Polak, P.P.; Koren, A.; Menelaou, A.; Chun, S.; Renkens, I.; Genome of the Netherlands Consortium; Van Duijn, C.M.; Swertz, M.; Wijmenga, C.; et al. Genome-wide patterns and properties of de novo mutations in humans. Nat. Genet. 2015, 47, 822–826. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Prosser, J.; van Heyningen, V. Pax6 mutations reviewed. Hum. Mutat. 1998, 11, 93–108. [Google Scholar] [CrossRef]
  3. Vasilyeva, T.A.; Voskresenskaya, A.A.; Käsmann-Kellner, B.; Khlebnikova, O.V.; Pozdeyeva, N.A.; Bayazutdinova, G.M.; Kutsev, S.I.; Ginter, E.K.; Semina, E.V.; Marakhonov, A.V.; et al. Molecular analysis of patients with aniridia in Russian Federation broadens the spectrum of PAX6 mutations. Clin. Genet. 2017, 92, 639–644. [Google Scholar] [CrossRef] [PubMed]
  4. Vasilyeva, T.A.; Marakhonov, A.V.; Voskresenskaya, A.A.; Kadyshev, V.V.; Kasmann-Kellner, B.; Sukhanova, N.V.; Katargina, L.A.; Kutsev, S.I.; Zinchenko, R.A. Analysis of genotype-phenotype correlations in PAX6-associated aniridia. J. Med. Genet. 2021, 58, 270–274. [Google Scholar] [CrossRef] [PubMed]
  5. Filatova, A.Y.; Vasilyeva, T.A.; Marakhonov, A.V.; Voskresenskaya, A.A.; Zinchenko, R.A.; Skoblov, M.Y. Functional reassessment of PAX6 single nucleotide variants by in vitro splicing assay. Eur. J. Hum. Genet. 2019, 27, 488–493. [Google Scholar] [CrossRef] [Green Version]
  6. Filatova, A.Y.; Vasilyeva, T.A.; Marakhonov, A.V.; Sukhanova, N.V.; Voskresenskaya, A.A.; Zinchenko, R.A.; Skoblov, M.Y. Upstream ORF frameshift variants in the PAX6 5′UTR cause congenital aniridia. Hum. Mutat. 2021, 42, 1053–1065. [Google Scholar] [CrossRef]
  7. Robinson, D.O.; Howarth, R.J.; Williamson, K.A.; van Heyningen, V.; Beal, S.J.; Crolla, J.A. Genetic analysis of chromosome 11p13 and the PAX6 gene in a series of 125 cases referred with aniridia. Am. J. Med. Genet. Part A 2008, 146A, 558–569. [Google Scholar] [CrossRef]
  8. Bobilev, A.M.; McDougal, M.E.; Taylor, W.L.; Geisert, E.E.; Netland, P.A.; Lauderdale, J.D. Assessment of PAX6 alleles in 66 families with aniridia. Clin. Genet. 2016, 89, 669–677. [Google Scholar] [CrossRef] [Green Version]
  9. You, B.; Zhang, X.; Xu, K.; Xie, Y.; Ye, H.; Li, Y. Mutation spectrum of PAX6 and clinical findings in 95 Chinese patients with aniridia. Mol. Vis. 2020, 26, 226–234. [Google Scholar]
  10. Crolla, J.A.; van Heyningen, V. Frequent chromosome aberrations revealed by molecular cytogenetic studies in patients with aniridia. Am. J. Hum. Genet. 2002, 71, 1138–1149. [Google Scholar] [CrossRef] [Green Version]
  11. Cross, E.; Duncan-Flavell, P.J.; Howarth, R.J.; Crooks, R.O.; Thomas, N.S.; Bunyan, D.J. Screening of a large PAX6 cohort identified many novel variants and emphasises the importance of the paired and homeobox domains. Eur. J. Med. Genet. 2020, 63, 103940. [Google Scholar] [CrossRef] [PubMed]
  12. Hingorani, M.; Williamson, K.A.; Moore, A.T.; van Heyningen, V. Detailed ophthalmologic evaluation of 43 individuals with PAX6 mutations. Investig. Ophthalmol. Vis. Sci. 2009, 50, 2581–2590. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Vincent, M.C.; Pujo, A.L.; Olivier, D.; Calvas, P. Screening for PAX6 gene mutations is consistent with haploinsufficiency as the main mechanism leading to various ocular defects. Eur. J. Hum. Genet. 2003, 11, 163–169. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Souzeau, E.; Rudkin, A.K.; Dubowsky, A.; Casson, R.J.; Muecke, J.S.; Mancel, E.; Whiting, M.; Mills, R.A.D.; Burdon, K.P.; Craig, J.E. PAX6 molecular analysis and genotype-phenotype correlations in families with aniridia from Australasia and Southeast Asia. Mol. Vis. 2018, 24, 261–273. [Google Scholar]
  15. Park, S.H.; Kim, M.S.; Chae, H.; Kim, Y.; Kim, M. Molecular analysis of the PAX6 gene for congenital aniridia in the Korean population: Identification of four novel mutations. Mol. Vis. 2012, 18, 488–494. [Google Scholar]
  16. Dubey, S.K.; Mahalaxmi, N.; Vijayalakshmi, P.; Sundaresan, P. Mutational analysis and genotype-phenotype correlations in Southern Indian patients with sporadic and familial aniridia. Mol. Vis. 2015, 21, 88–97. [Google Scholar]
  17. Kondrashov, A.S. Direct estimates of human per nucleotide mutation rates at 20 loci causing Mendelian diseases. Hum. Mutat. 2003, 21, 12–27. [Google Scholar] [CrossRef]
  18. Genome of the Netherlands Consortium. Whole-genome sequence variation, population structure and demographic history of the Dutch population. Nat. Genet. 2014, 46, 818–825. [Google Scholar] [CrossRef]
  19. Kloosterman, W.P.; Francioli, L.C.; Hormozdiari, F.; Marschall, T.; Hehir-Kwa, J.Y.; Abdellaoui, A.; Lameijer, E.W.; Moed, M.H.; Koval, V.; Renkens, I.; et al. Characteristics of de novo structural changes in the human genome. Genome Res. 2015, 25, 792–801. [Google Scholar] [CrossRef] [Green Version]
  20. Besenbacher, S.; Liu, S.; Izarzugaza, J.M.; Grove, J.; Belling, K.; Bork-Jensen, J.; Huang, S.; Als, T.D.; Li, S.; Yadav, R.; et al. Novel variation and de novo mutation rates in population-wide de novo assembled Danish trios. Nat. Commun. 2015, 6, 5969. [Google Scholar] [CrossRef] [Green Version]
  21. Besenbacher, S.; Sulem, P.; Helgason, A.; Helgason, H.; Kristjansson, H.; Jonasdottir, A.; Jonasdottir, A.; Magnusson, O.T.; Thorsteinsdottir, U.; Masson, G.; et al. Multi-nucleotide de novo mutations in humans. PLoS Genet. 2016, 12, e1006315. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Lynch, M. Rate, molecular spectrum, and consequences of human mutation. Proc. Natl. Acad. Sci. USA 2010, 107, 961–968. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Kong, A.; Frigge, M.L.; Masson, G.; Besenbacher, S.; Sulem, P.; Magnusson, G.; Gudjonsson, S.A.; Sigurdsson, A.; Jonasdottir, A.; Jonasdottir, A.; et al. Rate of de novo mutations and the importance of father’s age to disease risk. Nature 2012, 488, 471–475. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Tzoulaki, I.; White, I.M.; Hanson, I.M. PAX6 mutations: Genotype-phenotype correlations. BMC Genet. 2005, 6, 27. [Google Scholar] [CrossRef] [Green Version]
  25. Williamson, K.A.; Hall, H.N.; Owen, L.J.; Livesey, B.J.; Hanson, I.M.; Adams, G.G.W.; Bodek, S.; Calvas, P.; Castle, B.; Clarke, M.; et al. Recurrent heterozygous PAX6 missense variants cause severe bilateral microphthalmia via predictable effects on DNA-protein interaction. Genet. Med. Off. J. Am. Coll. Med. Genet. 2020, 22, 598–609. [Google Scholar] [CrossRef] [Green Version]
  26. Aggarwala, V.; Voight, B.F. An expanded sequence context model broadly explains variability in polymorphism levels across the human genome. Nat. Genet. 2016, 48, 349–355. [Google Scholar] [CrossRef] [Green Version]
  27. Wang, J.; Raskin, L.; Samuels, D.C.; Shyr, Y.; Guo, Y. Genome measures used for quality control are dependent on gene function and ancestry. Bioinformatics 2015, 31, 318–323. [Google Scholar] [CrossRef] [Green Version]
  28. Zhang, Y.; Li, B.; Li, C.; Cai, Q.; Zheng, W.; Long, J. Improved variant calling accuracy by merging replicates in whole-exome sequencing studies. BioMed Res. Int. 2014, 2014, 319534. [Google Scholar] [CrossRef] [Green Version]
  29. Consortium, U.K.; Walter, K.; Min, J.L.; Huang, J.; Crooks, L.; Memari, Y.; McCarthy, S.; Perry, J.R.; Xu, C.; Futema, M.; et al. The UK10K project identifies rare variants in health and disease. Nature 2015, 526, 82–90. [Google Scholar] [CrossRef] [Green Version]
  30. Genomes Project Consortium; Abecasis, G.R.; Auton, A.; Brooks, L.D.; DePristo, M.A.; Durbin, R.M.; Handsaker, R.E.; Kang, H.M.; Marth, G.T.; McVean, G.A.; et al. An integrated map of genetic variation from 1092 human genomes. Nature 2012, 491, 56–65. [Google Scholar]
  31. Nishant, K.T.; Singh, N.D.; Alani, E. Genomic mutation rates: What high-throughput methods can tell us. Bioessays 2009, 31, 912–920. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Vasilyeva, T.A.; Marakhonov, A.V.; Sukhanova, N.V.; Kutsev, S.I.; Zinchenko, R.A. Preferentially paternal origin of de novo 11p13 chromosome deletions revealed in patients with congenital aniridia and WAGR syndrome. Genes 2020, 11, 812. [Google Scholar] [CrossRef] [PubMed]
  33. Simon, H.; Huttley, G. Quantifying influences on intragenomic mutation rate. G3 (Bethesda) 2020, 10, 2641–2652. [Google Scholar] [CrossRef] [PubMed]
  34. Montgomery, S.B.; Goode, D.L.; Kvikstad, E.; Albers, C.A.; Zhang, Z.D.; Mu, X.J.; Ananda, G.; Howie, B.; Karczewski, K.J.; Smith, K.S.; et al. The origin, evolution, and functional impact of short insertion-deletion variants identified in 179 human genomes. Genome Res. 2013, 23, 749–761. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Chen, W.; Zhang, L. The pattern of DNA cleavage intensity around indels. Sci. Rep. 2015, 5, 8333. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Schmidt, S.; Gerasimova, A.; Kondrashov, F.A.; Adzhubei, I.A.; Adzuhbei, I.A.; Kondrashov, A.S.; Sunyaev, S. Hypermutable non-synonymous sites are under stronger negative selection. PLoS Genet. 2008, 4, e1000281. [Google Scholar] [CrossRef]
  37. Stenson, P.D.; Ball, E.V.; Howells, K.; Phillips, A.D.; Mort, M.; Cooper, D.N. The Human Gene Mutation Database: Providing a comprehensive central mutation database for molecular diagnostics and personalized genomics. Hum. Genom. 2009, 4, 69–72. [Google Scholar] [CrossRef]
  38. Subramanian, S.; Kumar, S. Neutral substitutions occur at a faster rate in exons than in noncoding DNA in primate genomes. Genome Res. 2003, 13, 838–844. [Google Scholar] [CrossRef] [Green Version]
  39. Richards, S.; Aziz, N.; Bale, S.; Bick, D.; Das, S.; Gastier-Foster, J.; Grody, W.W.; Hegde, M.; Lyon, E.; Spector, E.; et al. Standards and guidelines for the interpretation of sequence variants: A joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 2015, 17, 405–424. [Google Scholar] [CrossRef] [Green Version]
Table 1. Comparison of PAX6 mutation types in world-wide cohorts of aniridia patients.
Table 1. Comparison of PAX6 mutation types in world-wide cohorts of aniridia patients.
Title 1Russia 2022 (This Study)UK 2008 [7]USA 2016 [8]China 2020 [9]UK 2009 [12]France 2003 [13]Australia 2018 [14]South Korea 2012 [15]India 2015 [16]
Large chromosome deletions61341011126
Nonsense53131812414591
Frame shift348161649237
Missense7241111
Splicing308141354133
Start codon loss21
CTE8434411
5′UTR43
In frame del2
Total1996761632230171613
Table 2. The ratio of mutation events from Kondrashov’s review and the current study (highlighted in bold).
Table 2. The ratio of mutation events from Kondrashov’s review and the current study (highlighted in bold).
LocusNumber of All MutationsSubstitutions/
Small Indels Ratio
Substitutions/
Major Deletions Ratio
F9201814.718
F810102.30.7
APC8580.62.1
VHL6173.61.6
CYBB50836.4
NF13681.67.2
TSC23502.23.5
ABCD13363.613.8
RB12801.94.3
AR2697.516.1
IDS2663.43.5
AVPR22502.719.6
JAG124917.1
IL2RG2243.542.8
PAX6
(current study)
1991.91.5
BTK1953.112.6
EMD1911.25
PAX61762.97.4
OTC16010.823.5
DMD1233.40.5
PKD1822.418.7
Table 3. Comparison of proportions between SNVs, indels, and large chromosome deletions observed in the PAX6 gene in Russian cohort of aniridia patients with the whole genome ratio.
Table 3. Comparison of proportions between SNVs, indels, and large chromosome deletions observed in the PAX6 gene in Russian cohort of aniridia patients with the whole genome ratio.
Mutation TypeNovel Mutations in This AnalysisNovel Mutations in 250
Dutch Families [18,19]
All substitutions9111020
Small indels 47291
CNVs6141
Table 4. Comparison of numbers of PAX6 substitutions in different contexts identified in two studies.
Table 4. Comparison of numbers of PAX6 substitutions in different contexts identified in two studies.
Mutation TypeProsser J. and Van Heyningen V. [2]%This Analysis%
Transitions at CpG2428.63115.6
Transitions at non-CpG1619.02915.1
Transversions at CpG33.652.5
Transversions at non-CpG1315.52613.1
All substitutions5667.09146.2
Small indels (<50 bp)2732.14723.1
Total84100199100
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Vasilyeva, T.A.; Marakhonov, A.V.; Kutsev, S.I.; Zinchenko, R.A. Relative Frequencies of PAX6 Mutational Events in a Russian Cohort of Aniridia Patients in Comparison with the World’s Population and the Human Genome. Int. J. Mol. Sci. 2022, 23, 6690. https://doi.org/10.3390/ijms23126690

AMA Style

Vasilyeva TA, Marakhonov AV, Kutsev SI, Zinchenko RA. Relative Frequencies of PAX6 Mutational Events in a Russian Cohort of Aniridia Patients in Comparison with the World’s Population and the Human Genome. International Journal of Molecular Sciences. 2022; 23(12):6690. https://doi.org/10.3390/ijms23126690

Chicago/Turabian Style

Vasilyeva, Tatyana A., Andrey V. Marakhonov, Sergey I. Kutsev, and Rena A. Zinchenko. 2022. "Relative Frequencies of PAX6 Mutational Events in a Russian Cohort of Aniridia Patients in Comparison with the World’s Population and the Human Genome" International Journal of Molecular Sciences 23, no. 12: 6690. https://doi.org/10.3390/ijms23126690

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop