Next Article in Journal
Novel Solution for Using Neural Networks for Kidney Boundary Extraction in 2D Ultrasound Data
Next Article in Special Issue
Optical Genome Mapping for the Molecular Diagnosis of Facioscapulohumeral Muscular Dystrophy: Advancement and Challenges
Previous Article in Journal
NanoFIRE: A NanoLuciferase and Fluorescent Integrated Reporter Element for Robust and Sensitive Investigation of HIF and Other Signalling Pathways
Previous Article in Special Issue
Deep Intronic LINE-1 Insertions in NF1: Expanding the Spectrum of Neurofibromatosis Type 1-Associated Rearrangements
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optical Genome Mapping Enables Detection and Accurate Sizing of RFC1 Repeat Expansions

by
Stefano Facchini
1,2,*,†,
Natalia Dominik
2,†,
Arianna Manini
2,3,4,
Stephanie Efthymiou
2,
Riccardo Currò
2,5,
Bianca Rugginini
2,5,
Elisa Vegezzi
1,
Ilaria Quartesan
1,5,
Benedetta Perrone
2,
Shahedah Koya Kutty
6,
Valentina Galassi Deforie
2,
Ricardo P. Schnekenberg
2,5,
Elena Abati
2,3,
Anna Pichiecchio
1,5,
Enza Maria Valente
1,7,
Cristina Tassorelli
1,5,
Mary M. Reilly
2,
Henry Houlden
2,
Enrico Bugiardini
2 and
Andrea Cortese
2,5,*
1
IRCCS Mondino Foundation, 27100 Pavia, Italy
2
Department of Neuromuscular Diseases, University College London, London WC1N 3BG, UK
3
Department of Pathophysiology and Transplantation, University of Milan, 20122 Milan, Italy
4
Department of Neurology and Laboratory of Neuroscience, IRCCS Istituto Auxologico Italiano, 20149 Milan, Italy
5
Department of Brain and Behavioral Sciences, University of Pavia, 27100 Pavia, Italy
6
Department of Internal Medicine, Kulliyah of Medicine, International Islamic University Malaysia (IIUM), Pahang 53100, Malaysia
7
Department of Molecular Medicine, University of Pavia, 27100 Pavia, Italy
*
Authors to whom correspondence should be addressed.
These authors equally contributed to this work.
Biomolecules 2023, 13(10), 1546; https://doi.org/10.3390/biom13101546
Submission received: 14 August 2023 / Revised: 13 September 2023 / Accepted: 9 October 2023 / Published: 19 October 2023

Abstract

:
A recessive Short Tandem Repeat expansion in RFC1 has been found to be associated with cerebellar ataxia, neuropathy and vestibular areflexia syndrome (CANVAS), and to be a frequent cause of late onset ataxia and sensory neuropathy. The usual procedure for sizing these expansions is based on Southern Blotting (SB), a time-consuming and a relatively imprecise technique. In this paper, we compare SB with Optical Genome Mapping (OGM), a method for detecting Structural Variants (SVs) based on the measurement of distances between fluorescently labelled probes, for the diagnosis of RFC1 CANVAS and disease spectrum. The two methods are applied to 17 CANVAS patients’ blood samples and resulting sizes compared, showing a good agreement. Further, long-read sequencing is used for two patients to investigate the agreement of sizes with either SB or OGM. Our study concludes that OGM represents a viable alternative to SB, allowing for a simpler technique, a more precise sizing of the expansion and ability to expand analysis of SV in the entire genome as opposed to SB which is a locus specific method.

1. Introduction

Cerebellar ataxia, neuropathy and vestibular areflexia syndrome (CANVAS) is caused by recessive short tandem repeat expansions in the second intronic region of RFC1 [1,2,3]. Subsequent studies have confirmed that biallelic RFC1 repeat expansions were shown to represent a common cause of late onset ataxia and sensory neuropathy, explaining 22% and 34% of genetically unconfirmed cases, respectively [1,4].
The underlying genetic cause of CANVAS is quite complex. Although most patients carry biallelic AAGGG expansions, additional pathogenic repeats were identified including ACAGG in Asians and AGGGC in Europeans [3,5,6,7]. Notably, all pathogenic repeats, independently from exact repeat motif, are large, ranging from around 250 to over 4000 repeats, while non-pathogenic expansions are typically < 100 repeats. Furthermore, although this has not been demonstrated yet in RFC1 CANVAS, in other repeat expansions disorders, the length of the repeat expansions has an important prognostic role as larger expansions typically lead to an earlier onset and more severe phenotype. Biallelic RFC1 repeat expansions were shown to represent a common cause of late onset ataxia and sensory neuropathy, explaining 22% and 34% of genetically unconfirmed cases, respectively [1]. Therefore, detection of the presence and measurement of the size of RFC1 repeat expansion is of paramount diagnostic importance. Moreover, a small number of patients with typical CANVAS has been found to carry one AAGGG expanded allele and a second, truncating variant, in trans, warranting additional investigations in heterozygous carriers with suggestive clinical features [8,9,10].
Current diagnostic strategy for RFC1 testing relies on polymerase chain reaction (PCR), including flanking PCR and a repeat-primed PCR (RP-PCR) [1]. However, given the large size and high GC content of the pathogenic AAGGG motif, PCR-based techniques fail to amplify the full expanded repeat. Therefore, demonstration of the presence of two expanded alleles and measurement of their size was only possible with traditional Southern Blotting (SB). SB utilises a pre-designed probe that only binds to a specific locus flanking the RFC1 repeat, and the expansion sizing is based on the visual comparison between the sample track and a reference ladder track [1]. Despite being clinically very useful, SB is a time-consuming technique which requires considerable amount of work and a dedicated laboratory setup.
Southern Blotting has been a gold standard technique for measuring allele sizes in various conditions such as C9orf72 repeat expansion disorders [11], myotonic dystrophy type 1 (DM1) [12], and fragile X syndrome [13]. SB is a cumbersome method and studies have been carried out whether more convenient and high-throughput methods, so far mainly limited to PCR, can replace or minimise the need for SB [14,15]. However, PCR cannot amplify large repetitive sequences; therefore, it is not possible to use it for sizing of repeat expansions.
Optical Genome Mapping (OGM) is a new technology which enables accurate detection of large (>500 nucleotides) Structural Variants, based on the measurement of the distance between fluorophore-labelled probes which tag ultra-high molecular weight DNA molecules. The advantages of this technique include the following: (1) a more streamlined laboratory protocol; (2) the possibility of mapping the entire genome for each sample, instead of a single locus; (3) the possibility of automatizing the data analysis. The main commercial implementation of OGM is currently provided by Bionano Genomics, which is the technology used for this paper. Bionano OGM was able to reliably detect the presence of repeat expansion in DM1 and SCA10 [16,17]. In addition, OGM was successfully used to confirm the presence of biallelic RFC1 expansions in seven Dutch patients carrying RFC1 expansions [18]. However, a systematic comparison between OGM and SB was never performed.
In the present study, we compared OGM with SB for the detection and sizing of RFC1 repeat expansion and showed that OGM can be a viable high-throughput alternative to SB for RFC1 expansion testing, as part of its ability to assess the presence of structural variants and large repeat expansion at genome-wide level.

2. Materials and Methods

2.1. Samples

A total of 17 CANVAS patients carrying biallelic AAGGG (n = 15) or ACAGG (n = 1) or compound heterozygous AAGGG and AGGGC (n = 1) repeat expansions were enrolled at the Institute of Neurology (IoN), University College London, Mondino IRCCS Foundation and the International Islamic University of Malaysia. DNA was extracted from fresh blood collected in EDTA blood tubes. All samples were subjected to SB and OGM and 2 samples (patients 7 and 10) were sequenced using long-read sequencing. Two control samples without CANVAS disease were also included in the OGM analysis.

2.2. Southern Blotting

Five ug of genomic DNA were enzymatically digested with EcoRI for 3 h and subsequently electrophoresed on 1.2% agarose for 15 h. The gel was washed for 45 min in depurination, denaturation and neutralising solutions. Subsequently, the fragmented DNA was transferred to a positively charged membrane using upward transfer method for 15 h. The DNA was then UV cross-linked on the membrane, and it was hybridised with a mixture of salmon sperm and RFC1-specific probe in digoxigenin granules solution (DIG Easy Hyb™ Granules, Roche, Basel, Switzerland) overnight at 49 °C. The membrane was washed and blocked. Anti-DIG antibody was added and incubated for 30 min at room temperature. Band visualisation followed incubation with detection buffer and chemiluminescent CDP-STAR substrate. The repeat sizing was estimated against DIG-labelled DNA molecular weight marker II after subtracting the wild-type allele size. For each individual, the sizes of detected alleles were recorded as number of pentanucleotide repeats.
Densitometric profiles for SB lanes were obtained with ImageJ v1.54f (https://imagej.nih.gov/ij/ accessed on 13 August 2023).

2.3. Optical Genome Mapping

Ultra-high molecular weight DNA was extracted using “Blood and cell culture DNA isolation kit” according to the manufacturer’s protocol. Following Qubit Fluorometer (Invitrogen, Waltham, MA, USA) quantification, 750 ng DNA per sample was labelled using Bionano Prep DLS Labelling Kit. All samples were loaded onto Saphyr chips for linearisation and imagining and processed on a Bionano Saphyr machine (Bionano Genomics, San Diego, CA, USA). Molecules were aligned to the hg38 reference using the align_mol_to_ref.py script available in Bionano Solve 3.6 software package (https://bionano.com/software-downloads/ accessed on 13 August 2023).
RFC1 repeat expansion is located between markers 7723 and 7724 of chromosome 4 (hg38; chr4:39343732-39350590; reference intermarker distance = 6858 bp).
Instead of relying on the single size estimate provided by the Bionano Access platform, we decided to use all the mapping information available by collecting the intermarker distances over all molecules overlapping both markers. A histogram of the distances reveals the presence of one or two alleles, visible as peaks in the plot. The distribution of distances is decomposed in Gaussian components, using the Gaussian Mixture model from Scikit-learn python package (https://scikit-learn.org/stable/ accessed on 13 August 2023). The mean of each component provides an estimate for the allele size. The standard deviation accounts for technical errors in the optical reading and possibly for somatic instability. For each individual, the sizes of detected alleles were recorded as number of pentanucleotide repeats.
The method comprises two steps:
  • For each sample, we apply the clustering DBSCAN algorithm from Scikit-learn, with epsilon parameter obtained using the KneeLocator method (https://github.com/arvkevi/kneed accessed on 13 August 2023): all observations not belonging to a cluster are removed as outliers;
  • We fit two Gaussian Mixture Models to the remaining observations, one with a single component and one with two components. The two-component model is preferred if both these conditions are satisfied: (a) the Bayesian Information Content (BIC) improves; (b) both components have weights > 25%
We refer to Supplementary Materials for an implementation of the algorithm.

2.4. Targeted RFC1 Long-Read Sequencing

Targeted long-read sequencing (LR) was performed on patient 10 with Oxford Nanopore technology, and on patient 7 with PacBio technology. Target enrichment was performed with clustered regularly interspaced short palindromic repeats CRISPR/CRISPR-associated protein-9 nuclease (Cas9) system.
High molecular weight DNA was extracted from blood using the Qiagen (Venlo, The Netherlands) MagAttract HMW DNA Kit, and quality control was performed with Thermo Scientific NanoDrop. Subsequently, Agilent Femto Pulse Genomic DNA 165 kb kit was used to distinguish samples with majority of fragments over 25 kb that were used for CRISPR/Cas9 targeted sequencing as previously described [3]. Briefly, libraries were prepared from 5 µg of input DNA and sequenced using Nakamura et al. [19] CRISPR-Cas9 guides RFC1-F1: 5′-GACAGTAACTGTACCACAATGGG-3′, RFC1-R1: 5′-CTATATTCGTGGAACTATCTTGG-3′, RFC1-F2: 5′-ACACTCTTTGAAGGAATAACAGG-3′ and RFC1-R2: 5′-TGAGGTATGAAT CATCCTGAGGG-3′ for patient 10 and guides RFC1-F3: 5′-GAAACTAAATAGAACCAGCC-3′ RFC1-R3: 5′-GACTATGGCTTACCTGAGTG-3′ for patient 7. Minimap2 was used for alignment of sequences to the hg38 reference.

2.5. Correlation

The statistical analysis of correlation between SB and OGM results was conducted with the standard Python package “statsmodel”. A linear regression model with constant was fitted using the Ordinary Least Square method (OLS) from the same Python package.

3. Results

3.1. Clinical and Demographic Features

Demographic and clinical features of the patients included in our analysis (n = 17) are presented in Table 1. Except for one patient coming from Malaysia, the remaining cohort were of Caucasian origin, mainly from the United Kingdom and Italy. Median age at onset was 56 years (IQR, 39–61), and median disease duration at examination was 9 years (IQR, 4–15.5). Six patients showed all the three core clinical features of CANVAS, namely sensory neuropathy, cerebellar syndrome and vestibular dysfunction. Six patients had a complex neuropathy, including one case with sensory neuropathy and vestibular areflexia and five with sensory neuropathy and cerebellar ataxia, while five patients had an isolated sensory neuropathy. In three patients, the vestibular system was not examined. Thirteen patients exhibited chronic cough. Six patients showed dysautonomia, which was represented by erectile dysfunction in three of them (Pt 3, 5 and 6). One patient exhibited generalised, intermittent fasciculations (Pt 7).

3.2. Technical Considerations of Southern Blotting and Optical Genome Mapping

SB relies on large quantities (5 µg) of high-quality and purity DNA. SB is compatible with most DNA extraction methods, thus facilitating sample processing and shipping of extracted DNA from collaborators across the globe. In comparison, OGM can only be performed on very high molecular weight DNA fragments (>150 Kbp), which requires a bespoke extraction method using the Bionano extraction kit from fresh or snap frozen blood or cell pellets. Hands-on processing time at the bench is 4 working days for SB and 2 working days for OGM, followed by Saphyr imaging and automatic data collection.
SB size estimation relies on comparison to a ladder tract. OGM relies on fluorescent labels which bind to specific 6 bp DNA motifs (CTTAAG) present in the genome at an average of 20 times per 100 Kbp.
In addition to good technical skills, necessary for both methods, OGM requires computer literacy for size estimation in the online Bionano Access analysis platform, or to perform custom analysis.
Representative examples of OGM and SB are shown in Figure 1.

3.3. Southern Blotting and Optical Genome Mapping Show Good Sizing Agreement

All CANVAS samples were confirmed to carry biallelic RFC1 repeat expansions with both methods (see Table 2). Control samples were subjected to OGM analysis, confirming the absence of biallelic expansions (Control 1 has one expanded allele; Control 2 has two unexpanded alleles).
We observed an excellent linear correlation between the two methods (Figure 2), with r2 = 0.97. However, the linear coefficient is 0.62 [0.58–0.66] at 95% C.I., and the intercept is 232 [181–226] at 95% C.I.
We show in Figure 3A the OGM molecules size distribution for all samples, with the estimated Gaussian components. We did not detect evidence of significant somatic instability, as suggested by a standard deviation of ~5% of the repeat size in all tested samples, independently from the repeat length. In Figure 3B, we show the SB images for all patients.

3.4. Targeted Long-Read Sequencing

We also performed a targeted long-read sequencing of the RFC1 locus, as described in Methods, for two patients (Pt 7 and Pt 10).
For Pt 7, we obtained 2 PacBio CCS reads with expansion of 1160 and 1224 repeats; median = 1192.
For Pt 10, we obtained 20 Nanopore reads for the allele AGGGC, with expansions ranging from 233 to 5049 repeats; median = 3363 (IQR 3261–3532). No reads were obtained for the AAGGG allele.
Notably, the size of the expanded alleles as measured using LRS matches more closely the OGM data, while Southern Blotting tended to overestimate the size of large expanded alleles by 14–29% (Table 3).

4. Discussion

Biallelic RFC1 expansions represent a common cause of late-onset ataxia and sensory neuropathy [1,2,3,4,5,6]. Unfortunately, it is still difficult to implement a diagnostic test. Indeed, the molecular diagnosis of RFC1 CANVAS is challenging and currently relies on the combination of flanking PCR, repeat-primed PCR for different pathogenic (AAGGG, ACAGG, AGGGC, AAGGC and AGAGG) and non-pathogenic (AAAAG) repeat motifs, and SB as confirmation test, which is often not available in diagnostic labs. Therefore, the diagnostic implementation of RFC1 genetic testing remains challenging.
In this paper, we validated the OGM technology on 17 blood samples from patients carrying biallelic RFC1 repeat expansions. We compared the repeat sizing between SB and OGM and showed a very good linear correlation of the two techniques. We noticed a deviation from the expected identity function in the regression, which is accounted by a systematic error either in the SB or in the OGM method, particularly for the expanded alleles over ~1000 repeats. This could either be due to overestimation of repeat size with SB or underestimation with OGM. SB relies on gel electrophoresis to resolve large fragments of genomic DNA. Possible formation of secondary structures by the repeats, slowing down the migration during electrophoresis, could lead to an overestimation of the repeat lengths. Moreover, due to the necessity of a visual comparison with a logarithmic scale, estimation of the allele size is increasingly imprecise for larger fragments, and it often cannot resolve similarly sized alleles resulting in a single band.
On the other hand, OGM may underestimate expansion size by taking into account kinked DNA molecules during imaging, leading to the underestimation of expansion size.
To better assess the relative accuracy of the two methods, we compared the sizing estimates of SB and OGM with the size measured using targeted long-read sequencing of the expansion for Pt 7 and Pt 10. In both cases, we found a much better agreement with the OGM estimate (see Table 3). These results suggest that the discrepancy between OGM and SB is probably accounted for by a systematic error in the SB, possibly due to electrophoretic or analytic procedure as discussed above.
Moreover, OGM, unlike SB, was able to distinguish two alleles of similar size in 3 out of the 17 patients (Pt 6, Pt 9, Pt 17; see Table 2 and Figure 1B,C), while in one case (Pt 15), the presence of two distinct alleles was suggested via SB, but only one component was detected with OGM. Overall, OGM improved the allele sizing resolution in 4/17 (24%) samples.
In addition, we showed that OGM is able to detect heterozygous carriers of RFC1 repeat expansions. This is of clinical relevance since detection of heterozygous expansion in patients with typical CANVAS symptoms should prompt additional testing including full sequencing of RFC1 gene looking for a second nonsense variant [3,4].
An additional advantage of OGM is the possibility to screen for SV as well as large expansions (>500 nt) in the entire patient’s genome in parallel to RFC1 testing.
Both techniques require good technical skills, specific laboratory setups and special sample storage and transport considerations. However, advantages of OGM include a short response time (in ideal conditions, approximately 10 h hands-on time for DNA isolation and DNA labelling, overnight homogenisation of ultra-high molecular DNA, 8 h of run time at 100X coverage and 24 h for automated data collection), higher accuracy and high-throughput output.
As part of this study, we have developed a bespoke algorithm to accurately capture all the available information on RFC1 repeat size data and their dispersion compared to the standard DeNovo variant calling available on the Bionano Access software (currently at version 3.7), which only provide one value for each expanded allele. In particular, the in-house algorithm, based on the analysis of the Gaussian distribution of all available molecules spanning the repeat expansion, enabled accurate sizing of the RFC1 repeat, as well as the assessment of its somatic instability, as indicated by the mean and the standard deviation of the Gaussian, respectively. Notably, based on the data generated in this study, OGM analysis did not support the presence of significant somatic instability of RFC1 repeat in blood.
A known limitation of both OGM and SB is that they do not provide any information on the repeat sequence and need to be complemented with PCR, short or long-read sequencing. This is particularly true in cases with typical CANVAS symptoms but only heterozygous expansion where a truncating variant could be present in trans with the expansion, or in cases with suspected configuration motifs different to canonical pathogenic AAGGG.
In conclusion, OGM appears as a valid alternative to SB for the detection and sizing of RFC1 expansions, along with genome-wide assessment of structural variants and other large repeat expansions, which could support its use in a diagnostic setting.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biom13101546/s1, Figure S1: original image of Figure 1 and Figure 3. An implementation of the sizing algorithm is available at https://github.com/stfacc/extract_gaussian_alleles.

Author Contributions

Conceptualization, S.F. and A.C.; methodology, S.F. and N.D.; software, S.F.; investigation, N.D., R.C., S.E., B.P., S.K.K., E.V., I.Q., B.P., R.P.S., E.A., B.R., A.P., E.M.V., C.T. and V.G.D.; writing—original draft preparation, S.F., N.D. and A.M.; writing—review and editing, A.C.; supervision, A.C.; project administration, E.B., M.M.R., A.C. and H.H.; funding acquisition, A.C. All authors have read and agreed to the published version of the manuscript.

Funding

The study was supported by Medical Research Council (MR/T001712/1), Fondazione Cariplo (grant n. 2019-1836), the Inherited Neuropathy Consortium, National Ministry of Health (Ricerca Corrente 2021–2022), Italian Ministry for Universities and Research (MUR, 20229MMHXP), National Brain Appeal’s Innovation Fund and Rosetrees Trust grant.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Human Research Ethics Committee of the Royal Children Hospital (HREC 28097).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cortese, A.; Simone, R.; Sullivan, R.; Vandrovcova, J.; Tariq, H.; Yau, W.Y.; Humphrey, J.; Jaunmuktane, Z.; Sivakuma, P.; Polke, J.; et al. Biallelic expansion of an intronic repeat in RFC1 is a common cause of late-onset ataxia. Nat. Genet. 2019, 51, 649–658. [Google Scholar] [CrossRef]
  2. Rafehi, H.; Szmulewicz, D.J.; Bennett, M.F.; Sobreira, N.L.; Pope, K.; Smith, K.R.; Gillies, G.; Diakumis, P.; Dolzhenko, E.; Eberle, M.A.; et al. Bioinformatics-Based Identification of Expanded Repeats: A Non-reference Intronic Pentamer Expansion in RFC1 Causes CANVAS. Am. J. Hum. Genet. 2019, 105, 151–165. [Google Scholar] [CrossRef]
  3. Dominik, N.; Magri, S.; Currò, R.; Abati, E.; Facchini, S.; Corbetta, M.; Hannah, M.; Daniela, D.B.; Elisa, S.; Igor, S.; et al. Normal and pathogenic variation of RFC1 repeat expansions: Implications for clinical diagnosis. Brain 2023, in press. [Google Scholar] [CrossRef]
  4. Traschütz, A.; Heindl, F.; Bilal, M.; Hartmann, A.M.; Dufke, C.; Riess, O.; Zwergal, A.; Rujescu, D.; Haack, T.; Synofzik, M.; et al. Frequency and Phenotype of RFC1 Repeat Expansions in Bilateral Vestibulopathy. Neurology 2023, 101, e1001–e1013. [Google Scholar] [CrossRef]
  5. Scriba, C.K.; Beecroft, S.J.; Clayton, J.S.; Cortese, A.; Sullivan, R.; Yau, W.Y.; Dominik, N.; Rodrigues, M.; Walker, E.; Dyer, Z.; et al. A novel RFC1 repeat motif (ACAGG) in two Asia-Pacific CANVAS families. Brain 2020, 143, 2904–2910. [Google Scholar] [CrossRef]
  6. Watanabe, K.; Nakashima, M.; Wakatsuki, R.; Bunai, T.; Ouchi, Y.; Nakamura, T.; Miyajima, H.; Saitsu, H. Cognitive Impairment in a Complex Family with AAGGG and ACAGG Repeat Expansions in RFC1 Detected by ExpansionHunter Denovo. Neurol. Genet. 2022, 8, e682. [Google Scholar] [CrossRef]
  7. Miyatake, S.; Yoshida, K.; Koshimizu, E.; Doi, H.; Yamada, M.; Miyaji, Y.; Ueda, N.; Tsuyuzaki, J.; Kodaira, M.; Onoue, H.; et al. Repeat conformation heterogeneity in cerebellar ataxia, neuropathy, vestibular areflexia syndrome. Brain 2022, 145, 1139–1150. [Google Scholar] [CrossRef] [PubMed]
  8. Benkirane, M.; Da Cunha, D.; Marelli, C.; Larrieu, L.; Renaud, M.; Varilh, J.; Pointaux, M.; Baux, D.; Ardouin, O.; Vangoethem, C.; et al. RFC1 nonsense and frameshift variants cause CANVAS: Clues for an unsolved pathophysiology. Brain 2022, 145, 3770–3775. [Google Scholar] [CrossRef] [PubMed]
  9. Weber, S.; Coarelli, G.; Heinzmann, A.; Monin, M.L.; Richard, N.; Gerard, M.; Durr, A.; Huin, V. Two RFC1 splicing variants in CANVAS. Brain 2023, 146, e14–e16. [Google Scholar] [CrossRef] [PubMed]
  10. Arteche-López, A.; Avila-Fernandez, A.; Damian, A.; Soengas-Gonda, E.; de la Fuente, R.P.; Gómez, P.R.; Merlo, J.G.; Burgos, L.H.; Fernández, C.C.; Rosales, J.M.L.; et al. New Cerebellar Ataxia, Neuropathy, Vestibular Areflexia Syndrome cases are caused by the presence of a nonsense variant in compound heterozygosity with the pathogenic repeat expansion in the RFC1 gene. Clin. Genet. 2023, 103, 236–241. [Google Scholar] [CrossRef] [PubMed]
  11. DeJesus-Hernandez, M.; Mackenzie, I.R.; Boeve, B.F.; Boxer, A.L.; Baker, M.; Rutherford, N.J.; Nicholson, A.M.; Finch, N.A.; Flynn, H.; Adamson, J.; et al. Expanded GGGGCC hexanucleotide repeat in noncoding region of C9ORF72 causes chromosome 9p-linked FTD and ALS. Neuron 2011, 72, 245–256. [Google Scholar] [CrossRef] [PubMed]
  12. Joosten, I.B.; Hellebrekers, D.M.; de Greef, B.T.; Smeets, H.J.; de Die-Smulders, C.E.; Faber, C.G.; Gerrits, M.M. Parental repeat length instability in myotonic dystrophy type 1 pre- and protomutations. Eur. J. Hum. Genet. 2020, 28, 956–962. [Google Scholar] [CrossRef]
  13. Curtis-Cioffi, K.M.C.; Rodrigueiro, D.A.; Rodrigues, V.C.; Cicarelli, R.M.B.; Scarel-Caminaga, R.M. Comparison between the polymerase chain reaction-based screening and the Southern blot methods for identification of fragile X syndrome. Genet. Test. Mol. Biomark. 2012, 16, 1303–1308. [Google Scholar] [CrossRef]
  14. Grasso, M.; Boon, E.M.; Filipovic-Sadic, S.; Van Bunderen, P.A.; Gennaro, E.; Cao, R.; Latham, G.J.; Hadd, A.G.; Coviello, D.A. A novel methylation PCR that offers standardized determination of FMR1 methylation and CGG repeat length without Southern blot analysis. J. Mol. Diagn. 2014, 16, 23–31. [Google Scholar] [CrossRef] [PubMed]
  15. Chen, L.; Hadd, A.; Sah, S.; Filipovic-Sadic, S.; Krosting, J.; Sekinger, E.; Pan, R.; Hagerman, P.J.; Stenzel, T.T.; Tassone, F.; et al. An information-rich CGG repeat primed PCR that detects the full range of fragile X expanded alleles and minimizes the need for Southern blot analysis. J. Mol. Diagn. 2010, 12, 589–600. [Google Scholar] [CrossRef] [PubMed]
  16. Otero, B.A.; Poukalov, K.; Hildebrandt, R.P.; Thornton, C.A.; Jinnai, K.; Fujimura, H.; Kimura, T.; Hagerman, K.A.; Sampson, J.B.; Day, J.W.; et al. Transcriptome alterations in myotonic dystrophy frontal cortex. Cell Rep. 2021, 34, 108634. [Google Scholar] [CrossRef] [PubMed]
  17. Torres, C.A.M.; Zafar, F.; Tsai, Y.C.; Vazquez, J.P.; Gallagher, M.D.; McLaughlin, I.; Hong, K.; Lai, J.; Lee, J.; Chirino-Perez, A.; et al. ATTCT and ATTCC repeat expansions in the ATXN10 gene affect disease penetrance of spinocerebellar ataxia type 10. HGG Adv. 2022, 3, 100137. [Google Scholar]
  18. Ghorbani, F.; de Boer-Bergsma, J.; Verschuuren-Bemelmans, C.C.; Pennings, M.; de Boer, E.N.; Kremer, B.; Vanhoutte, E.K.; de Vries, J.J.; van de Berg, R.; Kamsteeg, E.-J.; et al. Prevalence of intronic repeat expansions in RFC1 in Dutch patients with CANVAS and adult-onset ataxia. J. Neurol. 2022, 269, 6086–6093. [Google Scholar] [CrossRef] [PubMed]
  19. Nakamura, H.; Doi, H.; Mitsuhashi, S.; Miyatake, S.; Katoh, K.; Frith, M.C.; Asano, T.; Kud, Y.; Ikeda, T.; Kubota, S.; et al. Long-read sequencing identifies the pathogenic nucleotide repeat expansion in RFC1 in a Japanese case of CANVAS. J. Hum. Genet. 2020, 65, 475–480. [Google Scholar] [CrossRef] [PubMed]
Figure 1. (A) Bionano OGM markers used for the analysis. The red triangle area indicates the position of the repeat expansion inside the second intron of RFC1 (blue arrows point in the coding direction of the gene). The blue triangles indicate the position of the markers flanking the repeat (markers 7723 and 7724) (B) Optical genome mapping for Pt 6. Two alleles are observed as Gaussian components of size 664 and 730 repeats (3322bp and 3648bp, respectively) (C) Representative example of Southern Blotting plot. For Pt 6 (indicated by the red triangle), only one band is visible, corresponding to an expansion of 917 repeats (4585 bp) (original images can be found in Figure S1). (D) Densitometric profile for the SB band. Only one peak is visible.
Figure 1. (A) Bionano OGM markers used for the analysis. The red triangle area indicates the position of the repeat expansion inside the second intron of RFC1 (blue arrows point in the coding direction of the gene). The blue triangles indicate the position of the markers flanking the repeat (markers 7723 and 7724) (B) Optical genome mapping for Pt 6. Two alleles are observed as Gaussian components of size 664 and 730 repeats (3322bp and 3648bp, respectively) (C) Representative example of Southern Blotting plot. For Pt 6 (indicated by the red triangle), only one band is visible, corresponding to an expansion of 917 repeats (4585 bp) (original images can be found in Figure S1). (D) Densitometric profile for the SB band. Only one peak is visible.
Biomolecules 13 01546 g001
Figure 2. Linear regression showing the correlation between Optical Genome Mapping (OGM) and Southern Blotting (SB) size estimates. The grey dashed line represents the expected identity function in case of perfect correlation.
Figure 2. Linear regression showing the correlation between Optical Genome Mapping (OGM) and Southern Blotting (SB) size estimates. The grey dashed line represents the expected identity function in case of perfect correlation.
Biomolecules 13 01546 g002
Figure 3. (A) OGM molecule size distribution for all samples, with estimated Gaussian components. On the vertical axis the molecule count is reported. The vertical dotted red line corresponds to a non-expanded allele. For each sample, we report the total number of observed molecules in parenthesis. (B) SB images for all patients. Arrows point to the alleles visible on Southern Blot; yellow and green when two alleles of distinct sizes are seen and red when two alleles of the same size are seen (original images can be found in Figure S1).
Figure 3. (A) OGM molecule size distribution for all samples, with estimated Gaussian components. On the vertical axis the molecule count is reported. The vertical dotted red line corresponds to a non-expanded allele. For each sample, we report the total number of observed molecules in parenthesis. (B) SB images for all patients. Arrows point to the alleles visible on Southern Blot; yellow and green when two alleles of distinct sizes are seen and red when two alleles of the same size are seen (original images can be found in Figure S1).
Biomolecules 13 01546 g003
Table 1. Demographic and clinical features of the cohort. Pt: patient; F: female; M: male; AAO: age at onset; Y: yes; N: no; NA: not available.
Table 1. Demographic and clinical features of the cohort. Pt: patient; F: female; M: male; AAO: age at onset; Y: yes; N: no; NA: not available.
PatientSexEthnicityAAODisease Duration at Examination (Years)PhenotypeChronic CoughCerebellar SyndromeSensory NeuropathyBilateral Vestibular AreflexiaDysautonomiaUse of Walking Aids (Age)Additional Features
Pt 1FCaucasian (British)394Isolated sensory neuropathyYNYNANNo/
Pt 2FCaucasian (British)762Complex neuropathyYYYNANStick (78)/
Pt 3MCaucasian (British)3543CANVASYYYYYStick (75)Erectile dysfunction
Pt 4FCaucasian (British)5720Complex neuropathyYYYNANStick (74); Wheelchair (77)/
Pt 5MCaucasian (British)3517Isolated sensory neuropathyYNYNYNoErectile dysfunction
Pt 6MCaucasian (Italian)596CANVASYYYYYStickErectile dysfunction
Pt 7MAsian (Malaysian)3829CANVASYYYYNStick (63); Wheelchair (64)Generalised, intermittent fasciculations
Pt 8FCaucasian (British)5914CANVASNYYYYStick (72)/
Pt 9FCaucasian (British)2540CANVASYYYYYNoThoracic syrinx
Pt 10MCaucasian (British)7111Complex neuropathyYYYNYWheelchair (81)
Pt 11FCaucasian (Italian)743Isolated sensory neuropathyYNYNNNo/
Pt 12FCaucasian (Italian)752CANVASYYYYNStick (76)/
Pt 13MCaucasian (Italian)619Isolated sensory neuropathyNNYNNNoDiabetes
Pt 14MCaucasian (Italian)519Complex neuropathyNYYNNNo/
Pt 15MCaucasian (Italian)564Isolated sensory neuropathyNNYNNStick (58)/
Pt 16MCaucasian (Italian)414Complex neuropathyYNYYNNo/
Pt 17FCaucasian (Italian)501Complex neuropathyYYYNNNo/
Control 1FCaucasian (British)//////////
Control 2FCaucasian (Italian)//////////
Table 2. Estimated sizes of the repeat expansions (number of pentanucleotide repeats). In OGM, repeat size is indicated as mean ± standard deviation of the Gaussian. Highlighted in grey are the patients where OGM, unlike SB, could better discriminate the size of the two expanded alleles.
Table 2. Estimated sizes of the repeat expansions (number of pentanucleotide repeats). In OGM, repeat size is indicated as mean ± standard deviation of the Gaussian. Highlighted in grey are the patients where OGM, unlike SB, could better discriminate the size of the two expanded alleles.
PatientSB Allele 1SB Allele 2OGM Allele 1OGM Allele 2
Pt 17651242677 ± 41955 ± 45
Pt 25981035622 ± 34841 ± 36
Pt 3989 (Homozygous)894 ± 29 (Homozygous)
Pt 411271593866 ± 481182 ± 70
Pt 5144718381017 ± 571180 ±40
Pt 6917 (Homozygous)664 ± 24730 ± 22
Pt 71400 (Homozygous)1223 ± 36 (Homozygous)
Pt 8991 (Homozygous)829 ± 53 (Homozygous)
Pt 91185 (Homozygous)880 ± 46943 ± 29
Pt 10125647461055 ± 793226 ± 163
Pt 11249810333 ± 20831 ± 35
Pt 12724 (Homozygous)792 ± 63 (Homozygous)
Pt 13294 (Homozygous)406 ± 32 (Homozygous)
Pt 14640794652 ± 40759 ± 24
Pt 15605714640 ± 51 (Homozygous)
Pt 167942386745 ± 511646 ± 97
Pt 17810 (Homozygous)582 ± 35654 ± 24
Control 1/−4 ± 26450 ± 22
Control 2/−6 ± 30 (Homozygous)
Table 3. Comparison of SB and OGM size estimates with long-read sequencing (number of pentanucleotides repeats). Pt 7 is homozygous, while for Pt 10, we report the estimates for the largest allele (with motif AGGGC). In both cases, we notice a much better agreement of LR with OGM.
Table 3. Comparison of SB and OGM size estimates with long-read sequencing (number of pentanucleotides repeats). Pt 7 is homozygous, while for Pt 10, we report the estimates for the largest allele (with motif AGGGC). In both cases, we notice a much better agreement of LR with OGM.
PatientSBOGMLR|LR-SB|/SB|LR-OGM|/OGM
Pt 714001223119214%3%
Pt 1047463226336329%4%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Facchini, S.; Dominik, N.; Manini, A.; Efthymiou, S.; Currò, R.; Rugginini, B.; Vegezzi, E.; Quartesan, I.; Perrone, B.; Kutty, S.K.; et al. Optical Genome Mapping Enables Detection and Accurate Sizing of RFC1 Repeat Expansions. Biomolecules 2023, 13, 1546. https://doi.org/10.3390/biom13101546

AMA Style

Facchini S, Dominik N, Manini A, Efthymiou S, Currò R, Rugginini B, Vegezzi E, Quartesan I, Perrone B, Kutty SK, et al. Optical Genome Mapping Enables Detection and Accurate Sizing of RFC1 Repeat Expansions. Biomolecules. 2023; 13(10):1546. https://doi.org/10.3390/biom13101546

Chicago/Turabian Style

Facchini, Stefano, Natalia Dominik, Arianna Manini, Stephanie Efthymiou, Riccardo Currò, Bianca Rugginini, Elisa Vegezzi, Ilaria Quartesan, Benedetta Perrone, Shahedah Koya Kutty, and et al. 2023. "Optical Genome Mapping Enables Detection and Accurate Sizing of RFC1 Repeat Expansions" Biomolecules 13, no. 10: 1546. https://doi.org/10.3390/biom13101546

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop