Next Article in Journal
A Novel Aeromonas popoffii Phage AerP_220 Proposed to Be a Member of a New Tolavirus Genus in the Autographiviridae Family
Previous Article in Journal
Taxifolin Inhibits WSSV Infection and Transmission by Increasing the Innate Immune Response in Litopenaeus vannamei
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multiple Novel Human Norovirus Recombinants Identified in Wastewater in Pretoria, South Africa by Next-Generation Sequencing

1
Department of Medical Virology, Faculty of Health Sciences, University of Pretoria, Private Bag X323, Gezina, Pretoria 0031, South Africa
2
National Health Laboratory Service, Tshwane Academic Division, Pretoria 0002, South Africa
3
Sequencing Core Facility, National Institute for Communicable Diseases, National Health Laboratory Service, Johannesburg 2192, South Africa
4
Department of Biochemistry and Microbiology, Faculty of Science, Engineering and Agriculture, University of Venda, Thohoyandou 0950, South Africa
5
Department of Genetics and Genomics, College of Medicine and Health Sciences, United Arab Emirates University, Al Ain 15551, United Arab Emirates
*
Author to whom correspondence should be addressed.
Viruses 2022, 14(12), 2732; https://doi.org/10.3390/v14122732
Submission received: 1 November 2022 / Revised: 25 November 2022 / Accepted: 2 December 2022 / Published: 7 December 2022
(This article belongs to the Topic Wastewater-Based Epidemiology)

Abstract

:
The genogroup II genotype 4 (GII.4) noroviruses are a major cause of viral gastroenteritis. Since the emergence of the Sydney_2012 variant, no novel norovirus GII.4 variants have been reported. The high diversity of noroviruses and periodic emergence of novel strains necessitates continuous global surveillance. The aim of this study was to assess the diversity of noroviruses in selected wastewater samples from Pretoria, South Africa (SA) using amplicon-based next-generation sequencing (NGS). Between June 2018 and August 2020, 200 raw sewage and final effluent samples were collected fortnightly from two wastewater treatment plants in Pretoria. Viruses were recovered using skimmed milk flocculation and glass wool adsorption-elution virus recovery methods and screened for noroviruses using a one-step real-time reverse-transcription PCR (RT-PCR). The norovirus BC genotyping region (570–579 bp) was amplified from detected norovirus strains and subjected to Illumina MiSeq NGS. Noroviruses were detected in 81% (162/200) of samples. The majority (89%, 89/100) of raw sewage samples were positive for at least one norovirus, compared with 73% (73/100) of final effluent samples. Overall, a total of 89 different GI and GII RdRp-capsid combinations were identified, including 51 putative novel recombinants, 34 previously reported RdRp-capsid combinations, one emerging novel recombinant and three Sanger-sequencing confirmed novel recombinants.

1. Introduction

Human noroviruses are the most common cause of epidemic and sporadic acute gastroenteritis in all age groups globally [1]. An estimated 212,489 deaths occur every year worldwide due to norovirus-associated gastroenteritis [2] and many of these deaths are in lower- to middle-income countries (LMICs) [3]. Although individuals of all ages are affected, children under the age of five [4], the elderly [5], and the immunocompromised are prone to severe illness [6]. These highly infectious viruses are transmitted faecal-orally through person-to-person contact and the consumption of contaminated food or water [7,8]. Hence, most norovirus outbreaks are found in closed settings such as nursing homes or linked to shared food and water sources at sporting and entertainment events, cruise ships, and airlines [9]. There is no licensed vaccine nor specific antiviral drug to prevent and treat disease caused by human noroviruses [10,11].
Noroviruses are non-enveloped and have a linear, non-segmented, single-stranded, positive-sense RNA genome of ~7.7 kb [12]. They are classified under the genus Norovirus in the Caliciviridae family together with ten other genera that infect a wide range of hosts [13]. The Norovirus genus is divided into ten genogroups (GI-GX), which are subdivided into 60 P-types and 49 genotypes based on the nucleotide diversity of the RNA-dependent RNA polymerase (RdRp) and the amino acid diversity of the complete major capsid (VP1) gene, respectively [14]. Genogroups I, II, IV, VIII, and IX comprise at least 49 P-types and 35 genotypes that are known to cause disease in humans [14]. Despite the vast diversity within these genogroups, genotype GII.4 viruses have been the most prevalent cause of norovirus-associated gastroenteritis [15]. Their success is largely driven by the periodic emergence (every 2 to 5 years) of novel strains and intergenotype recombination that often results in a surge of infections globally [16]. However, there has not been a pandemic caused by a novel GII.4 variant since the emergence of Sydney_2012 [17]. The GII.4 strains that are currently circulating have a Sydney_2012 capsid with different P-types due to intergenotype recombination [18]. The emergence and dominance of novel GII.17 strains between 2014 and 2016 in different parts of the world has placed an emphasis on the importance of continuous surveillance for human noroviruses at population level [19]. Norovirus surveillance from raw sewage has proved to be feasible in determining the diversity of viruses circulating in a given population [19,20,21]. Unlike clinical surveillance, environmental surveillance provides a broad insight into the epidemiology and distribution of enteric viruses as it includes strains from asymptomatic individuals and those with mild disease that shed the virus but do not seek medical help [22]. Next-generation sequencing (NGS) is a versatile tool utilised in virus genomic diversity studies as large amounts of genetic material can be sequenced simultaneously, at a low cost [23]. In this study, Illumina MiSeq was used to determine the diversity of human noroviruses from raw sewage and final effluent water samples in selected wastewater treatment works (WWTWs) in Pretoria, South Africa.

2. Materials and Methods

2.1. Sample Collection and Virus Recovery

Between June 2018 and August 2020, raw sewage (1 L) and final effluent (10 L) water samples were collected bi-weekly from two WWTWs in Pretoria, in the City of Tshwane Metropolitan Municipality, South Africa. All samples were collected between 7 and 9 am during the peak morning flow. The WWTWs receive sewage from communities of approximately 1.42 million people and use activated sludge and chlorine treatment [24]. A modified pre-flocculated skimmed milk (PSM) methodology was used to recover viruses from the raw sewage samples [25]. Briefly, the pH of raw sewage samples was adjusted to 3.5 when 1 N HCl (Merck KGaA, Darmstadt, Germany) or 1 M NaOH (Merck) was added. The conductivity of water was increased by adding 33.3 g of sea salt per sample. This was performed to ensure the formation of flocculated material. A 1% (w/v) PSM solution was prepared by dissolving 1 g skim milk powder (Oxoid Ltd., Basingstoke, United Kingdom) in 100 mL autoclaved distilled water and the pH was adjusted to 3.5 with 1 N HCl (Merck). The prepared PSM was immediately added to the conditioned raw sewage sample and left to stir at room temperature for 12 hours to allow viruses to adsorb to the flocculated material. The mixture was centrifuged (Sorvall® Super T20, du Pont, Wilmington, DE, USA) at 10,000× g for 30 min at 4 °C. The supernatant was discarded, and the pellet was resuspended in 10 mL phosphate buffered saline (PBS, pH 7.5 Sigma-Aldrich, St. Louis, MO, USA), treated with 0.2 volumes of chloroform (Merck) and centrifuged at 10,000× g for 5 min at 4 °C. The final effluent samples were processed using the modified glass wool adsorption-elution technique as described previously (Mabasa et al., 2018). All virus concentrates were stored as 1 mL aliquots at −20 °C.

2.2. Nucleic Acid Extraction and Virus Detection

Before nucleic acid extraction, all virus concentrates were re-treated with chloroform to further remove PCR inhibitors. Briefly, 200 µL chloroform (Merck) was added to 1 mL virus concentrate and vortexed vigorously for 1 min followed by centrifugation at 5000× g for 5 min at room temperature. The aqueous phase was transferred to a clean 2 mL microcentrifuge tube and spiked with a known concentration of murine mengovirus strain MCo, which served as an extraction efficiency control [26]. Total nucleic acid was extracted using a QIAamp UltraSens Virus kit (Qiagen, Valencia, CA, USA) according to manufacturer’s instructions. The nucleic acid was eluted in 100 µL nuclease-free water and stored as 10 µL aliquots at −80 °C. A one-step real-time RT-PCR was used for the detection of norovirus GI, GII, and mengovirus genomic material using a QuantiFast Pathogen RT-PCR + IC Kit (Qiagen) with published Taqman probes and primers (Thermo Scientific, Waltham, MA, USA) (Table 1). All assays were performed on the QuantStudio™ 5 real-time system (Applied Biosystems, Foster City, CA, USA). The kit’s internal control was used to monitor the efficiency of target amplification. A cycle threshold (Ct) value of 40 was used as a cut-off for norovirus-positive samples and ≥1% extraction efficiency was regarded as a successful extraction of nucleic acid from the samples as described in ISO 15216-1 [27].

2.3. Molecular Typing of Noroviruses

A two-step RT-PCR was used to determine P-types and genotypes of detected noroviruses using EmeraldAmp® Max HS PCR Master Mix (Takara, Shiga, Japan) with published primers (Table 1). These target the 579 bp and 570 bp fragments representing the BC region of GI and GII genomes, respectively [28]. Complementary DNA was synthesised using 10 µL of the extracted nucleic acids, 30 µM random hexamers (Thermo Scientific), 0.5 mM dNTPs (New England Biolabs Inc., Ipswich, MA, USA), 1 × ProtoScript® II reverse transcriptase reaction buffer (New England Biolabs Inc.), 0.1 M DTT (New England Biolabs Inc.), 20 U RNase inhibitor (New England Biolabs Inc.), and 200 U ProtoScript® II reverse transcriptase (New England Biolabs Inc.). Subsequently, 5 µL of the generated cDNA was used as template for first-round amplification of the BC region. A second-round semi-nested PCR was performed using 1 µL of first-round PCR as a template. This was performed to increase the amplicon yield and to incorporate Illumina MiSeq overhang adapters (Table 1) to the amplicons. The same amplification conditions were used for the first- and second-round PCR (Table 1). The resulting amplicon fragments were analysed on 1.5% agarose gels, purified using Agencourt AMPure XP beads (Beckman Coulter, Brea, CA, USA) following the manufacturer’s instructions and sent to the National Institute for Communicable Diseases (NICD), Johannesburg, SA for NGS.
Table 1. Real-time RT-PCR and typing PCR assay primers and probes.
Table 1. Real-time RT-PCR and typing PCR assay primers and probes.
VirusPrimer/ProbeSequence (5′–3′) *PolarityLocationAmplification Conditions
Norovirus GIQNIF4 aCGCTGGATGCGNTTCCAT+5291–5308 †50 °C, 20 min
95 °C, 5 min
×45 cycles
95 °C, 15 s
55 °C, 30 s
65 °C, 30 s
NV1LCR bCCTTAGACGCCATCATCATTTAC5354–5376 †
NVGG1 bFAM-TGGACAGGAGAYCGCRATCT-TAMRA+5321–5340 †
Norovirus GIIQNIF2 cATGTTCAGRTGGATGAGRTTCTCWGA+5012–5037 *50 °C, 20 min
95 °C, 5 min
×45 cycles
95 °C, 15 s
60 °C, 30 s
65 °C, 30 s
COG2R dTCGACGCCATCTTCATTCACA5080–5100 *
QNIFS cFAM -AGCACGTGGGAGGGCGATCG-TAMRA+5042–5061 *
MengovirusMengo110F eGCGGGTCCTGCCGAAAGT+110–127 ❄50 °C, 20 min
95 °C, 5 min
×45 cycles
95 °C, 15 s
60 °C, 30 s
65 °C, 30 s
Mengo209R eGAAGTAACATATAGACAGACGCACAC245–270 ❄
Mengo147 eMGB-ATCACATTACTGGCCGAAGC-TAMRA+208–227 ❄
TargetPrimerSequence (5′–3′) *PolarityLocationAmplification conditions
1st round2nd round
Norovirus GI BC region JV12Y fATACCACTATGATGCAGAYTA+4552–4572 †×40 cycles
95 °C, 15 s
50 °C, 30 s
72 °C, 1 min
72 °C, 5 min
×40 cycles
95 °C, 15 s
50 °C, 30 s
72 °C, 45 s
72 °C, 5 min
MON432Tx g TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGTGGACICGYGGICCYAAYCA+5093–5112 †
G1SKRTx hGTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGCCAACCCARCCATTRTACA5653–5671 †
Norovirus GII BC regionJV12Y fATACCACTATGATGCAGAYTA+4279–4299 *×40 cycles
95 °C, 15 s
55 °C, 30 s
72 °C, 1 min
72 °C, 5 min
×40 cycles
95 °C, 15 s
55 °C, 30 s
72 °C, 45 s
72 °C, 5 min
MON431Tx gTCGTCGGCAGCGTCAGATGTGTATAAGAGACAGTGGACIAGRGGICCYAAYCA+4821–4840 *
G2SKRTx hGTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGCCRCCNGCATRHCCRTTRTACAT5367–5389 *
Norovirus GI
BC region for viruses with P7 P-type
JV12Y fATACCACTATGATGCAGAYTA+4552–4572 †×40 cycles
95 °C, 15 s
50 °C, 30 s
72 °C, 1 min
72 °C, 5 min
×40 cycles
95 °C, 15 s
50 °C, 30 s
72 °C, 1 min
72 °C, 5 min
GIP7F iCTGATGTGGAATTTGACCCAATAAAG+4890–4915 †
G1SKR hCCAACCCARCCATTRTACA5653–5671 †
a [29], b [30], c [31], d [32], e [33], f [34], g [35], h [36], i (in-house designed). * Degeneracy code: Y = C/T; R = A/G; W = A/T; I = Inosine; N = any base. Bold bases = MiSeq adapter. Probe labels: 6-carboxy fluorescein (FAM), minor groove binder (MGB), and 6-carboxy-tetramethylrhodamine (TAMRA). Location based on corresponding nucleotide position of M87661 (†), X86557 (*) and L22089 (❄).

2.4. DNA Library Preparation and Illumina Sequencing

Amplicons were pooled equimolar based on wastewater sample type and month of collection. For the amplicon library preparations, amplicons were indexed using IDT for Illumina Nextera DNA Unique Dual Indexes (Illumina Inc., San Diego, CA, USA) according to manufacturer’s recommendation. The amplicon library concentration was measured on the Qubit® 3.0 Fluorometer (Invitrogen) and the size of the amplicons were then visualised using the 4200 TapeStation (Agilent Technologies, Santa Clara, CA, USA). A negative control was used in the amplicon preparation step. No amplification was observed in the negative control. The purified amplicon libraries were normalised to 2 nM and then sequenced on the Illumina MiSeq platform using the Illumina MiSeq v3 kit (Illumina Inc.), to obtain 2 × 300 bp paired-end sequences.

2.5. NGS Data Analysis

The paired-end Illumina MiSeq reads were received as FASTQ files, which were merged using default settings and format changed to FASTA using BBMap [37]. The merged reads were length-filtered to eliminate sequences that were below 570 bases using Galaxy [38]. The reads were clustered based on a 90% sequence similarity threshold using CD-HIT-EST [39] and a FASTA file of representative sequences was generated. The proportion representation of each genotype was then calculated by dividing the number of merged reads attributed to the genotype over the total number of merged reads. The number of sequences clustered to each representative genotype was used to quantify the abundance of each genotype within wastewater samples.

2.6. Sanger Sequencing

A two-step RT-PCR was performed for the amplification and characterisation of putative novel recombinants using EmeraldAmp® Max HS PCR Master Mix (Takara) with in-house designed and published primers [36] (Inqaba Biotech™, Pretoria, South Africa). A total of 100 norovirus GI.P7 sequences were retrieved from NCBI GenBank and aligned using BioEdit [40] in order to identify conserved regions for primer design. Complementary DNA was synthesised as described above. Two rounds of PCR were performed using the same reaction components and amplification conditions to increase amplicon yield (Table 1). Amplicons were analysed on 1.5% agarose gels and purified with Agencourt AMPure XP beads. Subsequently, the amplicons were cloned using the ClonJETTM PCR cloning kit (Thermo Scientific) and transformed into Lucigen E. cloni 10G competent cells (Lucigen Corp., Middleton, WI, USA). Ten randomly selected clones were subjected to colony PCR to confirm the presence of the insert fragment. A 20 µl reaction mixture comprised of 1 × OneTaq® Quick-load® Master Mix (New England Biolabs Inc.), 0.2 µM forward and reverse pJET1.2 primers (Thermo Scientific) and nuclease-free water. The PCR products were analysed on 1.5% agarose gels and clones with the correct insert fragment size were purified with a DNA Clean and Concentrator™-25 Kit (Zymo Research, Irvine, CA, USA). The clones were sequenced using pJET1.2 primers and the ABI PRISM BigDye1 Terminator version 3.1 Cycle Sequencing kit on an ABI 3130 automated analyser (Applied Biosystems).

2.7. Norovirus Phylogeny and Recombination Analysis

Phylogenetic analyses of the detected strains were performed on the partial polymerase (265 bp) and capsid (314 bp) sequences to determine norovirus P-types and genotypes, respectively. Representative sequences were aligned using MAFFT version 7 [41] and headers edited with FaBox version 1.5 [42] to match our study identification codes. The sequences were then analysed with the online Norovirus Typing Tool version 2.0 [43] to assign polymerase and capsid types. BLAST-n (http://blast.ncbi.nlm.nih.gov/Blast.cgi - accessed between October 2018 and July 2022) was used to compare the study sequences to norovirus sequences in GenBank and putative recombination breakpoint analysis was performed using SimPlot version 3.5.1 [44]. The analysis was carried out using a Kimura 2-parameter distance model [45] and a window size of 200 bp and a step of 20 bp. Phylogenetic trees were constructed using the Maximum Likelihood method in MEGA X supported by 1000 bootstrap replicates and the evolutionary distances were determined using the Kimura 2-parameter model. Nucleotide sequences obtained from the samples were submitted to NCBI GenBank (accession numbers: OP218002-OP218007, OP236435-OP236446, OP245964-OP245995, OP246120-OP246202, OP246242-OP246373, OP289691-OP289693, OP289784-OP289798, OP295607-OP295694, OP297036-OP297041, OP297403-OP297417, OP303353-OP303356, OP303495-OP303519, OP307860-OP310020, and OP746072-OP747040). The sequences were labelled according to month and year of detection, wastewater type (R = raw sewage, F = final effluent), genotype and P-type, e.g., 618RGII.2[P2] = GII.2[P2] virus detected in raw sewage collected in June 2018.

2.8. Statistical Analyses

Descriptive statistics were used to summarise norovirus detection rates and proportions of genotypes. The detection rates of norovirus GI and GII in raw sewage and final effluent were compared using the Mantel-Haenszel chi square test. (http://www.openepi.com/TwobyTwo/TwobyTwo.html, accessed on 17 November 2022).

3. Results

3.1. Detection of Noroviruses

A total of 200 wastewater samples were collected during the study period: 50 raw sewage and 50 final effluent samples from each WWTW. Samples could not be collected between the 2nd week of March 2020 and the end of June 2020 due to the COVID-19 pandemic lockdown restrictions. Overall, noroviruses were detected in 81% (162/200) of wastewater samples, comprised of 89/100 raw sewage and 73/100 final effluent. Genogroup I viruses were detected in 40% (80/200) and GII dominated being detected in 78% (156/200) of the samples. Both GI and GII viruses were detected more frequently in raw sewage (GI—p < 0.001; GII—p < 0.02) compared with final effluent. The majority (80%; 64/80) of GI viruses were detected in raw sewage and 20% (16/80) in final effluent, whereas GII was more evenly distributed between raw sewage (54%; 85/156) and final effluent (46%; 71/156) (Figure 1). The kit’s internal positive control (QuantiFast Pathogen RT-PCR) was detected in 100% of the wastewater samples tested at the required Ct value; therefore, no re-testing was required. Mengovirus was detected in all samples and concentrations varied between samples; however, the extraction efficiencies were above 1%.

3.2. Molecular Typing of Noroviruses

The BC region of norovirus genomes was successfully amplified from viruses in 56.3% (45/80) and 56.4% (88/156) of norovirus GI and GII positive samples, respectively. This resulted in 40 (23 raw sewage and 17 final effluent) NGS amplicon pools. Thirteen GI P-types were identified in wastewater and these included GI.P7 (43%; 17/40), GI.P13 (38%; 15/40), GI.P3 (28%; 11/40), GI.P4 (23%; 9/40), GI.P10 (23%; 9/40), GI.P11 (15%; 6/40), GI.P9 (8%; 3/40), GI.P1, GI.P2, GI.P5, GI.P6, GI.P8, and GI.P12 (5%; 2/40 each). These P-types circulated together with nine genotypes, which included GI.3 (50%; 20/40), GI.7 (48%; 19/40), GI.4 (23%; 9/40), GI.6 (18%; 7/40), GI.8 (10%; 4/40), GI.5 (8%; 3/40), GI.9 (8%; 3/40), GI.1 (5%; 2/40), and GI.2 (5%; 2/40). Overall, a total of 44 different GI RdRp-capsid combinations were identified and these included 28 putative novel recombinants, 13 previously reported RdRp-capsid combinations and three Sanger-sequencing confirmed novel recombinants (Figure 2A,B, Supplementary Table S1). The top five frequently detected GI viruses were GI.7[P7] (48%; 19/40), GI.3[P13] (40%; 16/40), GI.3[P3] (28%; 11/40), GI.3[P7] (25%; 10/40), and GI.4[P4] (23%; 9/40). Although GI.7[P7] was detected at least once in every season, there was a noticeable peak of these viruses in the second half of the year with most of them identified in winter and spring (Figure 2A,B). The GI.3[P13] viruses were identified in small proportions in raw sewage between 2018 and 2019. However, in 2020 they dominated both in raw sewage and final effluent samples (Figure 2A,B). The in-house-designed forward primer (GIP7F) successfully amplified a 770 bp BC region fragment of GI viruses with a P7 RdRp. The viruses identified included GI.7[P7] and three (GI.1[P7], GI.3[P7], GI.4[P7]) of the putative novel recombinants. For GII, nine P-types were identified and these included GII.P7 (65%; 26/40), GII.P16 (60%; 24/40), GII.PNA (30%; 12/40), GII.P17 (20%; 8/40), GII.P31 (18%; 7/40), GII.P33 (10%; 4/40), GII.PNA7 (10%; 4/40), GII.P8 (5%; 2/40), and GII.P21 (3%; 1/40). These P-types circulated together with 13 genotypes, which included GII.2 (55%; 22/40), GII.9 (48%; 19/40), GII.4 (40%; 16/40), GII.6 (35%; 14/40), GII.17 (20%; 8/40), GII.3 (13%; 5/40), GII.12 (13%; 5/40), GII.7 (10%; 4/40), GII.1 (8%; 3/40), GII.8 (5%; 2/40), GII.10 (5%; 2/40), GII.13 (3%; 1/40), and GII.16 (3%; 1/40). Overall, 45 different GII RdRp-capsid combinations were found in circulation, and these included 23 putative novel recombinants, 21 previously reported RdRp-capsid combinations and one emerging novel recombinant (GII.12[P16]) (Figure 3A,B, Supplementary Table S1). The predominant GII viruses identified were GII.2[P16] (55%; 22/40), GII.9[P7] (48%; 19/40), GII.6[P7] (35%; 14/40), GII.2[P7] (30%; 12/40), and GII.9[P16] (20%; 8/40). The majority of the GII.2[P16] viruses were identified in samples collected in 2019 (Figure 3A,B). Almost all (95%; 20/21) of the GII.4 viruses were identified in samples collected between June and December. As many as 24 GI and 19 GII viruses were characterised from a single raw sewage sample in February 2020 and September 2019, respectively. The majority of untypeable viruses were in final effluent samples as shown by the blank spaces in Figure 2B and Figure 3B.

3.3. Recombination Analysis

The breakpoints of putative and confirmed novel recombinant noroviruses were identified from 106 representative GI (50) and GII (56) norovirus sequences (Supplementary Table S2). Single capsid/RdRp combinations were identified for 4 viruses, with the remaining 102 viruses exhibiting capsid/RdRp combinations that ranged from one capsid combined with 2 RdRps to 7 RdRps. Differences in breakpoints within the same cap-sid/RdRp combination further increased the diversity (up to seven strains with different breakpoints in the GI.3[P7] recombinant). In predominance order, the GI capsids included GI.3, GI.4, GI.6, GI.7, GI.1, GI.9, GI.5, GI.8, and GI.2. The GI.3 viruses recombined with seven different RdRps resulting in 19 strains making it the most diverse group of recombinant GI viruses. The least diverse recombinant groups were GI.5 and GI.8, with two different RdRps each, and GI.2 with one RdRp. The most common GI P-types identified in recombination were GI.P7 and GI.P13 accounting for almost a quarter (12/50) of strains, each. The most common GII recombinant strains were GII.2 and GII.9 with three different RdRps each and this resulted in 13 and 12 strains, respectively. The least diverse recombinant viruses harboured GII.13 and GII.16 capsids with one P-type resulting in one strain, each. The predominant GII P-type was GII.P7 followed by GII.P16, both accountable for 71% (40/56) of GII recombinant strains. The GII.P7 recombined with six different capsids resulting in 26 strains whereas GII.P16 was found in combination with four different capsids resulting in 14 strains. The ORF2 starting point was at nucleotide 266 for all GI and GII strains except for strain 1018RGI.1[P3] that started at position 270. The majority (62%; 66/106) of breakpoints were identified within the capsid gene and the remaining 38% were within the RdRp region. In addition, 76% of all breakpoints were between nucleotides 246 and 289.

3.4. Phylogenetic Analysis

The Maximum Likelihood phylogenetic trees of the GI partial RdRp and capsid were constructed to determine their relatedness to previously reported GI strains with sequences submitted to GenBank (Figure 4 and Figure 5). The analyses revealed relatedness between strains of the same P-type associated with different genotypes (Figure 4); however, there is a significant diversity within the capsids (Figure 5). Identified GI.3 viruses form two main clusters that further divide into several sub-clusters. Interestingly, the analysis also showed that the GI.3 Cluster I is more closely related to GI.8 than it is to GI.3 Cluster II. Likewise, GI.5 and GI.6 viruses form distinctive clusters and sub-clusters (Figure 5). The GI.9 viruses also form different clusters, but they are not closely related to their reference strains (Figure 5). Figure 6 and Figure 7 show the relationship between identified GII viruses and sequences available in GenBank. P-types GII.P7 and GII.P16 recombined with more genotypes compared with other identified P-types. In addition, the GII.PNA7 sequences from the study form two distinctive clusters, which group apart from the reference strain LC342059. The GII.3, GII.7, GII.9, GII.12, and GII.16 viruses clustered separately from their reference and closely related strains (Figure 7). Figure 8 displays the genetic diversity of the dominant GII.2, the well-documented GII.4, and previously dominant GII.17.

4. Discussion

To the best of our knowledge, this is the first study utilising NGS to understand the diversity and distribution of human norovirus circulating in wastewater samples in Africa. The use of NGS in norovirus surveillance provides an in-depth understanding of strains circulating at population level, at any given time. In addition, environmental NGS data reveal rare and emerging strains that might be missed with the cloning and Sanger sequencing approach. In this study, Illumina MiSeq was used to assess the norovirus GI and GII diversity in wastewater from Pretoria, SA. Overall, noroviruses were detected in 81% of the wastewater samples tested. As expected, the majority of these samples were raw sewage, which represented pooled stools from millions of individuals from the city. The treated final effluent samples also had detectable norovirus genomic material but less prevalent compared with raw sewage samples. No real-time RT-PCR inhibition was observed as the internal positive control was successfully detected at the required Ct value in all samples. A recent study from Botswana reported comparable results with norovirus GI detected in 67% and GII in 78% of raw sewage samples [47].
The data show that GII (78%) was more prevalent in the environment compared with GI (40%) and these findings mirror previous reports from Africa [19,22,48], Brazil [20], Thailand [49], and Sweden [50]. In addition, Figure 1B shows a decline in detection rate of GI virus genomes in final effluent samples. However, the norovirus GII detection rate was comparable in raw sewage and final effluent (Figure 1A,B). This suggests that GI viruses might be less stable in the environment compared with GII and thus more susceptible to wastewater treatment methods used. This is in contrast to findings reported by Sano and colleagues that the estimated log reductions in norovirus GI (1.48; 95% CI, 0.96–2.00) and GII (1.35; 95% CI, 0.52–2.18) were comparable when the conventional activated sludge treatment method was used [51]. However, differences in the stability of GI and GII have been described by Cromeans and co-workers, who reported that norovirus GI was more sensitive to reduction in RNA levels due to treatment with alcohols, chlorine, and high hydrostatic pressure than GII noroviruses [52]. The reduction in virions with intact capsids can lead to a lack of adequate, good quality nucleic acids. This can explain the lower numbers of merged reads (Supplementary Table S1) and fewer GI genotypes from final effluent samples (Figure 2B) in 2018 and 2019 compared with higher numbers of GII merged reads and increased virus diversity (Figure 3B). The overall molecular typing success rate was 56.4% for both norovirus GI and GII. The majority of the untypeable strains were detected in final effluent samples and this places emphasis on the point that lack of adequate good quality RNA template can be one of the reasons for low typing success rate. Despite this, NGS resulted in excellent data, revealing at least 89 different RdRp-capsid norovirus combinations of which the majority were putative novel recombinants. This is the first study to report such diverse data on circulating noroviruses and putative novel recombinants from environmental surveillance. Although these putative recombinants were identified in up to ten different amplicon pools, their merged reads proportion was less compared with that attributable to well-documented strains (Supplementary Table S1). This is possibly due to low virion concentration in the wastewater or primer-mismatch during PCR resulting in a poor representation of a strain in a pool of amplicons. Therefore, it would be rather challenging to identify such strains using the classical cloning and Sanger sequencing approach. Similar to other reports where norovirus diversity was studied in wastewater, this study proved the advantages of using NGS over Sanger sequencing. The GI.P7 RdRp dominated amongst other GI P-types and this correlates with recent data that show its prevalence as one the most reported norovirus P-types in public databases [53]. In this study, it was linked to the majority of the putative novel recombinants, and the viruses that carried it varied greatly (Figure 5). This prompted the design of GIP7F that was used together with a G1SKR reverse primer to amplify genomic material of viruses with GI.P7 RdRp for further phylogenetic analysis. This PCR confirmed the presence of the well-documented GI.7[P7] and three novel recombinants; GI.1[P7], GI.3[P7], and GI.4[P7] in circulation. Another interesting trend observed was the noticeable peak and dominance of GI.7[P7] viruses in winter and spring months. The GI.P7 strains associated with GI.4 capsid, and strains 1118RGI.7[P7], 219RGI.3[P7], and 619RGI.3[P7] cluster separately from each other and away from the remaining majority of GI.P7 strains associated with GI.3 and GI.7 capsids. The latter cluster includes GI.7 reference strain and closely related (93–99% nucleotide similarity) strains from paediatric patients in Brazil [54]. The GI.3[P13] viruses were first identified as a minority in raw sewage samples in July 2018 and this continued until their significant peak in October 2019 and continued this trend into 2020, where they were detected both in raw sewage and final effluent (Figure 2A,B). The GI.3 capsid was the most prevalent (50%; 20/40) among GI viruses and phylogenetically they are closely related (97–99% nucleotide similarity) to predominant GI.3 viruses that caused outbreaks in Taiwan [55]. Furthermore, these viruses formed two distinct clusters; GI.3 Cluster I and Cluster II (Figure 5) and the former was closer to GI.8 viruses than it was to GI.3 Cluster II by evolutionary distance. The GI.3 Cluster I was further divided into three sub-clusters and this vast diversity within GI.3 viruses is well-documented [56,57,58]. The GI.3 viruses have been reported amongst most prevalent noroviruses causing outbreaks and sporadic acute infections in Spain, Australia [59,60], Taiwan [55] and New Zealand [60]. This strengthens its place in the top 10 most frequently reported norovirus genotypes in the last 27 years [53]. In addition, NoroSurv, which is a global paediatric norovirus strain surveillance network for children <5 years of age hospitalised due to acute gastroenteritis reported GI.3 as the most prevalent GI genotype [61]. The dominance of clinically relevant, putative, and confirmed novel recombinant GI.3 viruses in our wastewater systems can be a mirror of viruses circulating in SA communities or it can be an early indicator of possible outbreaks.
The predominance of non-GII.4 noroviruses in SA wastewater was previously described. In that study, GII.2 viruses were predominant in wastewater from the Free State and Gauteng provinces between 2015 and 2016 [19]. Between 2016 and 2017, their dominance was seen in wastewater and a sewage-polluted river in Pretoria and Johannesburg, respectively [62]. In this study, GII.2 was the most frequently detected GII virus found in more than 50% of the amplicon pools; however, this trend is not unique to SA as data from GenBank also shows GII.2 strains to be among the top three norovirus genotypes reported globally between 1995 and 2019 [53]. An observation of concern made is that these viruses were found harbouring five different P-types, which include GII.P31 and co-dominant GII.P7 and GII.P16. The remaining two P-types, GII.P17, and GII.PNA together with GII.P7 were identified as putative novel GII.2 recombinants. Acquisition of these novel P-types by GII.2 is likely to escalate their capsid evolution, which might significantly increase their infectivity and prevalence potential [63]. Without clinical surveillance data, no conclusion can be drawn on the medical importance of these viruses. Norovirus GII.17’s prevalence in wastewater and sewage-polluted surface waters increased with the global emergence of the novel Kawasaki_2014 sub-variants 308 and 323. These viruses replaced GII.4 Sydney_2012 as the dominant strain causing norovirus-associated gastroenteritis in East Asian regions [64,65,66]. Other countries such as the USA [67], Italy [68], and Romania [69] only experienced the increase in GII.17-associated outbreaks and sporadic cases but not replacement of GII.4 Sydney_2012 as a dominant strain. Interestingly, GII.4 (median age = 1 year) and GII.17 (median age = 49 years) viruses have different age distributions and therefore, the age group targeted by clinical surveillance impacts the estimated genotype prevalence in a population. In SA, the shift in dominance was only observed in the wastewaters [19] and this is likely due to exclusion of diarrhoeal cases from individuals > 5 years by the majority of clinical surveillance studies. In addition, SA does not have a norovirus outbreak reporting system and therefore, GII.17 infections are likely missed and therefore underreported. This study revealed the circulation of the novel Kawasaki and original GII.17 variants in Pretoria. The latter formed a distinct cluster of strains that share a common ancestor with strain KC495674 that was previously identified in SA, and they are not closely related to any strain reported from other regions of the world (Figure 8). Norovirus GII.17 is not the only genotype with strains unique to SA; GII.4 Sydney_2012 strains clustered separately from four strains that could not be assigned to an existing and named variant. Analysis of complete VP1 nucleotide sequences of norovirus strains, especially those that are unique to specific parts of the world would assist in assigning them to a variant. The emerging GII.12[P16] recombinant was detected once in a raw sewage sample from January 2019 and during the same period, it was linked to norovirus-associated gastroenteritis outbreaks in Canada [70]. In October 2020, it caused a gastroenteritis outbreak in Brazil where the majority of cases were linked to the consumption of a local brand of ice pop [71]. Related strains were detected in the USA (MK754447), Spain (MT501819), and Japan (LC579431) between 2018 and 2020.
Clinical norovirus surveillance in SA is focused on children <5 years of age, hospitalised with gastroenteritis, which represents severe disease. During the study period the dominant genotype detected in this population was GII.4[P31] Sydney_2012 (unpublished data) which shared 98.8% nucleotide identity with the GII.4 strains detected in raw sewage. In a previous 5-year clinical norovirus surveillance study, GII genotype detection peaked from August to November every year [72,73]. In contrast, in this study norovirus GI and GII strains were detected in wastewater samples throughout the year with no distinct seasonality. The amplicon-based NGS approach was a limitation of the study, since this method can be biased to viruses that match the primers used; therefore, novel strains could be missed. With more resources, deeper sequencing using cDNA on more advanced Illumina platforms can provide more valuable data. Another limitation was disruption of sample collection by protests and COVID-19 pandemic lockdown restrictions.

5. Conclusions

Our analyses revealed a vast diversity of noroviruses circulating Pretoria, of which the majority were putative novel recombinants. Phylogenetic analyses showed the presence of uncommon strains that have not been detected worldwide and detected for the first time in SA. Furthermore, this study confirmed the emergence of four novel recombinants, including GII.12[P16]. Overall, this study contributed to the growing norovirus diversity data in SA and proved the usefulness of NGS in environmental surveillance. These data will assist the on-going vaccine and drug development studies, which rely on the continuous submission of norovirus sequences to public platforms such as GenBank.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/v14122732/s1, Table S1: Norovirus GI and GII viruses and their respective merged reads from sequencing pools; Table S2: Summary of putative novel GI and GII recombination breakpoints from strains identified in this study.

Author Contributions

Literature review and data analysis, V.V.M.; designed the study and wrote the manuscript, V.V.M. and J.M.; contributed to scientific discussion and manuscript editing, V.V.M., W.B.v.Z., A.I., M.A., M.B.T. and J.M. All authors have read and agreed to the published version of the manuscript.

Funding

The study was funded by the National Health Laboratory Service Research Trust Grant (Grant Number 94673).

Institutional Review Board Statement

The study was conducted under ethical clearance 164/2017 and 499/2019 from the Faculty of Health Sciences at the University of Pretoria.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available in the article and Tables S1 and S2.

Acknowledgments

The authors thank the City of Tshwane, South Africa for permission to collect wastewater samples.

Conflicts of Interest

Authors of this work do not have a conflict of interest or competing interest with regard to the work contained herein.

References

  1. Lopman, B.A.; Steele, D.; Kirkwood, C.D.; Parashar, U.D. The vast and varied global burden of norovirus: Prospects for prevention and control. PLoS Med. 2016, 13, e1001999. [Google Scholar] [CrossRef]
  2. Burke, R.M.; Hall, A.J. Global Burden of Norovirus. In Norovirus; Melhem, N., Ed.; Springer: Cham, Switzerland, 2019; pp. 1–29. [Google Scholar]
  3. Huys, A.; Grau, K.R.; Karst, S.M. Development of oral rotavirus and norovirus vaccines. In Mucosal Vaccines, 2nd ed.; Hiroshi, K., David, W.P., Eds.; Academic Press: Cambridge, MA, USA, 2020; pp. 699–712. [Google Scholar]
  4. Riera-Montes, M.; O’ryan, M.; Verstraeten, T. Norovirus and rotavirus disease severity in children: Systematic review and meta-analysis. Pediatr. Infect. Dis. J. 2018, 37, 501–505. [Google Scholar] [CrossRef]
  5. Lindsay, L.; Wolter, J.; De Coster, I.; Van Damme, P.; Verstraeten, T. A decade of norovirus disease risk among older adults in upper-middle- and high-income countries: A systematic review. BMC Infect. Dis. 2015, 15, 425. [Google Scholar] [CrossRef] [Green Version]
  6. Brown, J.R.; Shah, D.; Breuer, J. Viral gastrointestinal infections and norovirus genotypes in a paediatric UK hospital, 2014–2015. J. Clin. Virol. 2016, 84, 1–6. [Google Scholar] [CrossRef]
  7. Kampmeier, S.; Pettke, A.; Kossow, A.; Willems, S.; Mellmann, A. Norovirus infections in a tertiary care centre−individual cases do not necessarily lead to an outbreak. J. Clin. Virol. 2016, 84, 39–41. [Google Scholar] [CrossRef]
  8. Lopman, B.; Gastanaduy, P.; Park, G.W.; Hall, A.J.; Parashar, U.D.; Vinjé, J. Environmental transmission of norovirus gastroenteritis. Curr. Opin. Virol. 2012, 2, 96–102. [Google Scholar] [CrossRef]
  9. Marsh, Z.; Shah, M.P.; Wikswo, M.E.; Barclay, L.; Kisselburgh, H.; Kambhampati, A.; Cannon, J.L.; Parashar, U.D.; Vinjé, J.; Hall, A.J. Epidemiology of foodborne norovirus outbreaks–United States, 2009–2015. Food Safety 2018, 6, 58–66. [Google Scholar] [CrossRef] [Green Version]
  10. Ghosh, S.; Malik, Y.S.; Kobayashi, N. Therapeutics and immunoprophylaxis against noroviruses and rotaviruses: The past, present, and future. Curr. Drug Metab. 2018, 19, 170–191. [Google Scholar] [CrossRef]
  11. Riddle, M.S.; Walker, R.I. Status of vaccine research and development for norovirus. Vaccine 2016, 34, 2895–2899. [Google Scholar] [CrossRef] [Green Version]
  12. Salim, O.; Lambden, P.R.; Clarke, I.N. Norovirus. In The Springer Index of Viruses; Tidona, C., Darai, G., Eds.; Springer: New York, NY, USA, 2011; pp. 245–250. [Google Scholar]
  13. Vinjé, J.; Estes, M.K.; Esteves, P.; Green, K.Y.; Katayama, K.; Knowles, N.J.; L’Homme, Y.; Martella, V.; Vennema, H.; White, P.A. ICTV Virus Taxonomy Profile: Caliciviridae. J. Gen. Virol. 2019, 100, 1469–1470. [Google Scholar] [CrossRef]
  14. Chhabra, P.; de Graaf, M.; Parra, G.I.; Chan, M.C.-W.; Green, K.; Martella, V.; Wang, Q.; White, P.A.; Katayama, K.; Vennema, H. Updated classification of norovirus genogroups and genotypes. J. Gen. Virol. 2019, 100, 1393. [Google Scholar] [CrossRef] [PubMed]
  15. Hernandez, J.M.; Silva, L.D.; Junior, E.C.S.; Bandeira, R.S.; Rodrigues, E.A.M.; Lucena, M.S.S.; Costa ST, P.; Gabbay, Y.B. Molecular epidemiology and temporal evolution of norovirus associated with acute gastroenteritis in Amazonas state, Brazil. BMC Infect. Dis. 2018, 18, 147. [Google Scholar] [CrossRef] [PubMed]
  16. Parra, G.I.; Squires, R.B.; Karangwa, C.K.; Johnson, J.A.; Lepore, C.; Sosnovtsev, S.V.; Green, K.Y. Static and evolving norovirus genotypes: Implications for epidemiology and immunity. PLoS Pathog. 2017, 13, e1006136. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Choi, Y.S.; Koo, E.S.; Kim, M.S.; Choi, J.D.; Shin, Y.; Jeong, Y.S. Re-emergence of a GII.4 norovirus Sydney 2012 variant equipped with GII.P16 RdRp and its predominance over novel variants of GII.17 in South Korea in 2016. Food Environ. Virol. 2017, 9, 168–178. [Google Scholar] [CrossRef]
  18. Barreira, D.M.P.G.; Fumian, T.M.; Tonini, M.A.L.; Volpini, L.P.B.; Santos, R.P.; Ribeiro, A.L.C.; Leite, J.P.G.; Souza, M.T.B.D.M.E.; Brasil, P.; da Cunha, D.C.; et al. Detection and molecular characterization of the novel recombinant norovirus GII.P16-GII.4 Sydney in southeastern Brazil in 2016. PLoS ONE 2017, 12, e0189504. [Google Scholar] [CrossRef] [Green Version]
  19. Mabasa, V.V.; Meno, K.D.; Taylor, M.B.; Mans, J. Environmental surveillance for noroviruses in selected South African wastewaters 2015–2016: Emergence of the novel GII.17. Food Environ. Virol. 2018, 10, 16–28. [Google Scholar] [CrossRef]
  20. Fumian, T.M.; Fioretti, J.M.; Lun, J.H.; Dos Santos, I.A.; White, P.A.; Miagostovich, M.P. Detection of norovirus epidemic genotypes in raw sewage using next generation sequencing. Environ. Int. 2019, 123, 282–291. [Google Scholar] [CrossRef]
  21. La Rosa, G.; Della Libera, S.; Iaconelli, M.; Proroga, Y.; De Medici, D.; Martella, V.; Suffredini, E. Detection of norovirus GII.17 Kawasaki 2014 in shellfish, marine water and underwater sewage discharges in Italy. Food Environ. Virol. 2017, 9, 326–333. [Google Scholar] [CrossRef]
  22. Murray, T.Y.; Mans, J.; Taylor, M.B. Human calicivirus diversity in wastewater in South Africa. J. Appl. Microbiol. 2013, 114, 1843–1853. [Google Scholar] [CrossRef] [Green Version]
  23. Cantalupo, P.G.; Pipas, J.M. Detecting viral sequences in NGS data. Curr. Opin. Virol. 2019, 39, 41–48. [Google Scholar] [CrossRef]
  24. Makhura, D. Profile: City of Tshwane. Provincial Gazette. Available online: https://www.cogta.gov.za/ddm/wp-content/uploads/2020/08/2nd-Take_Final_DistrictProfile_TSHWANE2306-1-002 (accessed on 7 January 2020).
  25. Bofill-Mas, S.; Hundesa, A.; Calgua, B.; Rusiñol, M.; de Motes, C.M.; Girones, R. Cost-effective method for microbial source tracking using specific human and animal viruses. J. Vis. Exp. 2011, 3, e2820. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Rachida, S.; Taylor, M.B. Potentially infectious novel hepatitis A virus strains detected in selected treated wastewater discharge sources, South Africa. Viruses 2020, 12, 1468. [Google Scholar] [CrossRef] [PubMed]
  27. ISO 15216-1:2017; Microbiology of the Dood Chain—Horizontal Method for Determination of Hepatitis A Virus and Norovirus Using Real-Time RT-PCR—Part 1: Method for Quantification. ISO: Bengaluru, Bengaluru, 2017.
  28. Stals, A.; Mathijs, E.; Baert, L.; Botteldoorn, N.; Denayer, S.; Mauroy, A.; Scipioni, A.; Daube, G.; Dierick, K.; Herman, L. Molecular detection and genotyping of noroviruses. Food Environ. Virol. 2012, 4, 153–167. [Google Scholar] [CrossRef] [PubMed]
  29. da Silva, A.K.; Le Saux, J.C.; Parnaudeau, S.; Pommepuy, M.; Elimelech, M.; Le Guyader, F.S. Evaluation of removal of noroviruses during wastewater treatment, using real-time reverse transcription-PCR: Different behaviors of genogroups I and II. Appl. Environ. Microbiol. 2007, 73, 7891–7897. [Google Scholar] [CrossRef] [Green Version]
  30. Svraka, S.; Duizer, E.; Vennema, H.; de Bruin, E.; van der Veer, B.; Dorresteijn, B.; Koopmans, M. Etiological role of viruses in outbreaks of acute gastroenteritis in The Netherlands from 1994 through 2005. J. Clin. Microbiol. 2007, 1389–1394. [Google Scholar] [CrossRef] [Green Version]
  31. Loisy, F.; Atmar, R.L.; Guillon, P.; Le Cann, P.; Pommepuy, M.; Le Guyader, F.S. Real-time RT-PCR for norovirus screening in shellfish. J. Virol. Methods 2005, 123, 1–7. [Google Scholar] [CrossRef]
  32. Kageyama, T.; Kojima, S.; Shinohara, M.; Uchida, K.; Fukushi, S.; Hoshino, F.B.; Takeda, N.; Katayama, K. Broadly reactive and highly sensitive assay for Norwalk-like viruses based on real-time quantitative reverse transcription-PCR. J. Clin. Microbiol. 2003, 41, 1548–1557. [Google Scholar] [CrossRef] [Green Version]
  33. Pintó, R.M.; Costafreda, M.I.; Bosch, A. Risk assessment in shellfish-borne outbreaks of hepatitis A. Appl. Environ. Microbiol. 2009, 75, 7350–7355. [Google Scholar] [CrossRef] [Green Version]
  34. Vennema, H.; De Bruin, E.; Koopmans, M. Rational optimization of generic primers used for Norwalk-like virus detection by reverse transcriptase polymerase chain reaction. J. Clin. Virol. 2002, 25, 233–235. [Google Scholar] [CrossRef]
  35. Beuret, C. A simple method for isolation of enteric viruses (noroviruses and enteroviruses) in water. J. Virol. Methods 2003, 107, 1–8. [Google Scholar] [CrossRef]
  36. Kojima, S.; Kageyama, T.; Fukushi, S.; Hoshino, F.B.; Shinohara, M.; Uchida, K.; Natori, K.; Takeda, N.; Katayama, K. Genogroup-specific PCR primers for detection of Norwalk-like viruses. J. Virol. Methods 2002, 100, 107–114. [Google Scholar] [CrossRef] [PubMed]
  37. Bushnell, B. Map: A fast, Accurate, Splice-Aware Aligner (No. LBNL-7065E); Lawrence Berkeley National Lab. (LBNL): Berkeley, CA, USA, 2014. [Google Scholar]
  38. Afgan, E.; Baker, D.; Batut, B.; Van Den Beek, M.; Bouvier, D.; Čech, M.; Chilton, J.; Clements, D.; Coraor, N.; Grüning, B.A. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Res. 2018, 46, W537–W544. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  39. Li, W.; Godzik, A. Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 2006, 22, 1658–1659. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  40. Hall, T.A. BioEdit: A user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. In Nucleic Acids Symposium Series; Information Retrieval Ltd.: London, UK, 1999; Volume 41, pp. 95–98. [Google Scholar]
  41. Katoh, K.; Standley, D.M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013, 30, 772–780. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  42. Villesen, P. FaBox: An online toolbox for fasta sequences. Mol. Ecol. Notes 2007, 7, 965–968. [Google Scholar] [CrossRef]
  43. Kroneman, A.; Vennema, H.; Deforche, K.; Avoort, H.; Penaranda, S.; Oberste, M.; Vinjé, J.; Koopmans, M. An automated genotyping tool for enteroviruses and noroviruses. J. Clin. Virol. 2011, 51, 121–125. [Google Scholar] [CrossRef]
  44. Lole, K.S.; Bollinger, R.C.; Paranjape, R.S.; Gadkari, D.; Kulkarni, S.S.; Novak, N.G.; Ingersoll, R.; Sheppard, H.W.; Ray, S.C. Full-length human immunodeficiency virus type 1 genomes from subtype c-infected seroconverters in India, with evidence of intersubtype recombination. J. Virol. 1999, 73, 152–160. [Google Scholar] [CrossRef] [Green Version]
  45. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef]
  46. Tamura, K.; Stecher, G.; Peterson, D.; Filipski, A.; Kumar, S. MEGA 6: Molecular evolutionary genetics analysis version 6.0. Mol. Biol. Evol. 2013, 30, 2725–2729. [Google Scholar] [CrossRef] [Green Version]
  47. Tubatsi, G.; Kebaabetswe, L.P. Detection of enteric viruses from wastewater and river water in Botswana. Food Environ. Virol. 2022, 14, 157–169. [Google Scholar] [CrossRef]
  48. Kiulia, N.; Mans, J.; Mwenda, J.; Taylor, M. Norovirus GII.17 predominates in selected surface water sources in Kenya. Food Environ. Virol. 2014, 6, 221–231. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  49. Kittigul, L.; Rupprom, K.; Che-Arsae, M.; Pombubpa, K.; Thongprachum, A.; Hayakawa, S.; Ushijima, H. Occurrence of noroviruses in recycled water and sewage sludge: Emergence of recombinant norovirus strains. J. Appl. Microbiol. 2019, 126, 1290–1301. [Google Scholar] [CrossRef] [PubMed]
  50. Wang, H.; Neyvaldt, J.; Enache, L.; Sikora, P.; Mattsson, A.; Johansson, A.; Lindh, M.; Bergstedt, O.; Norder, H. Variations among viruses in influent water and effluent water at a wastewater plant over one year as assessed by quantitative PCR and metagenomics. Appl. Environ. Microbiol. 2020, 86, e02020–e02073. [Google Scholar] [CrossRef] [PubMed]
  51. Sano, D.; Amarasiri, M.; Hata, A.; Watanabe, T.; Katayama, H. Risk management of viral infectious diseases in wastewater reclamation and reuse. Environ. Int. 2016, 91, 220–229. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  52. Cromeans, T.; Park, G.W.; Costantini, V.; Lee, D.; Wang, Q.; Farkas, T.; Lee, A.; Vinjé, J. Comprehensive comparison of cultivable norovirus surrogates in response to different inactivation and disinfection treatments. Appl. Environ. Microbiol. 2014, 80, 5743–5751. [Google Scholar] [CrossRef] [Green Version]
  53. Kendra, J.A.; Tohma, K.; Parra, G.I. Global and regional circulation trends of norovirus genotypes and recombinants, 1995–2019: A comprehensive review of sequences from public databases. Rev. Med. Virol. 2022, 32, e2354. [Google Scholar] [CrossRef]
  54. Cantelli, C.P.; da Silva, M.F.M.; Fumian, T.M.; da Cunha, D.C.; de Andrade, J.d.S.R.; Malta, F.C.; Fialho, A.M.; de Moraes, M.T.B.; Brasil, P.; Miagostovich, M.P.; et al. High genetic diversity of noroviruses in children from a community-based study in Rio de Janeiro, Brazil, 2014–2018. Arch. Virol. 2019, 164, 1427–1432. [Google Scholar] [CrossRef] [PubMed]
  55. Chiu, S.-C.; Hsu, J.-K.; Hu, S.-C.; Wu, C.-Y.; Wang, Y.-C.; Lin, J.-H. Molecular epidemiology of GI.3 norovirus outbreaks from acute gastroenteritis surveillance system in Taiwan, 2015–2019. BioMed Res. Int. 2020, 2020, 4707538. [Google Scholar] [CrossRef]
  56. Esteves, A.; Nordgren, J.; Tavares, C.; Fortes, F.; Dimbu, R.; Saraiva, N.; Istrate, C. Genetic diversity of norovirus in children under 5 years of age with acute gastroenteritis from Angola. Epidemiol. Infect. 2018, 146, 551–557. [Google Scholar] [CrossRef] [Green Version]
  57. Lysén, M.; Thorhagen, M.; Brytting, M.; Hjertqvist, M.; Andersson, Y.; Hedlund, K.-O. Genetic diversity among food-borne and waterborne norovirus strains causing outbreaks in Sweden. J. Clin. Microbiol. 2009, 47, 2411–2418. [Google Scholar] [CrossRef]
  58. Nenonen, N.P.; Hannoun, C.; Larsson, C.U.; Bergström, T. Marked genomic diversity of norovirus genogroup I strains in a waterborne outbreak. Appl. Environ. Microbiol. 2012, 78, 1846–1852. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  59. Bruggink, L.; Catton, M.; Marshall, J. A norovirus intervariant GII.4 recombinant in Victoria, Australia, June 2016: The next epidemic variant? Eurosurveillance 2016, 21, 30353. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  60. Lun, J.H.; Hewitt, J.; Sitabkhan, A.; Eden, J.-S.; Tuipulotu, D.E.; Netzler, N.E.; Morrell, L.; Merif, J.; Jones, R.; Huang, B. Emerging recombinant noroviruses identified by clinical and wastewater screening. Emerg. Microbes Infect. 2018, 7, 50. [Google Scholar] [CrossRef]
  61. Cannon, J.L.; Bonifacio, J.; Bucardo, F.; Buesa, J.; Bruggink, L.; Chan, M.C.-W.; Fumian, T.M.; Giri, S.; Gonzalez, M.D.; Hewitt, J. Global trends in norovirus genotype distribution among children with acute gastroenteritis. Emerg. Infect. Dis. 2021, 27, 1438. [Google Scholar] [CrossRef] [PubMed]
  62. Mabasa, V.V. Characterisation and histo-blood group antigen binding profiles of South African norovirus genotype II strains. Master’s Thesis, University of Pretoria, Pretoria, South Africa, August 2017. [Google Scholar]
  63. Parra, G.I. Emergence of norovirus strains: A tale of two genes. Virus Evol. 2019, 5, vez048. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  64. Chan, M.C.; Lee, N.; Hung, T.-N.; Kwok, K.; Cheung, K.; Tin, E.K.; Lai, R.W.; Nelson EA, S.; Leung, T.F.; Chan, P.K. Rapid emergence and predominance of a broadly recognizing and fast-evolving norovirus GII.17 variant in late 2014. Nat. Commun. 2015, 6, 10061. [Google Scholar] [CrossRef] [Green Version]
  65. Lu, J.; Sun, L.; Fang, L.; Yang, F.; Mo, Y.; Lao, J.; Zheng, H.; Tan, X.; Lin, H.; Rutherford, S. Gastroenteritis outbreaks caused by norovirus GII.17, Guangdong Province, China, 2014–2015. Emerg. Infect. Dis. 2015, 21, 1240. [Google Scholar] [CrossRef]
  66. Matsushima, Y.; Ishikawa, M.; Shimizu, T.; Komane, A.; Kasuo, S.; Shinohara, M.; Nagasawa, K.; Kimura, H.; Ryo, A.; Okabe, N. Genetic analyses of GII. 17 norovirus strains in diarrheal disease outbreaks from December 2014 to March 2015 in Japan reveal a novel polymerase sequence and amino acid substitutions in the capsid region. Eurosurveillance 2015, 20, 21173. [Google Scholar] [CrossRef] [Green Version]
  67. Parra, G.I.; Green, K.Y. Genome of emerging norovirus GII.17, United States, 2014. Emerg. Infect. Dis. 2015, 21, 1477. [Google Scholar] [CrossRef]
  68. Medici, M.; Tummolo, F.; Calderaro, A.; Chironna, M.; Giammanco, G.; De Grazia, S.; Arcangeletti, M.; De Conto, F.; Chezzi, C.; Martella, V. Identification of the novel Kawasaki 2014 GII.17 human norovirus strain in Italy, 2015. Eurosurveillance 2015, 20, 30010. [Google Scholar] [CrossRef]
  69. Dinu, S.; Nagy, M.; Negru, D.G.; Popovici, E.D.; Zota, L.; Oprișan, G. Molecular identification of emergent GII.P17-GII.17 norovirus genotype, Romania, 2015. Eurosurveillance 2016, 21, 30141. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  70. Pabbaraju, K.; Wong, A.A.; Tipples, G.A.; Pang, X.-L. Emergence of a novel recombinant norovirus GII.P16-GII.12 strain causing gastroenteritis, Alberta, Canada. Emerg. Infect. Dis. 2019, 25, 1556. [Google Scholar] [CrossRef] [PubMed]
  71. Fumian, T.M.; Ferreira, F.C.; de Andrade, J.d.S.R.; Canal, N.; Silva Gomes, G.; Teixeira, L.B.; Miagostovich, M.P. Norovirus foodborne outbreak associated with the consumption of ice pop, Southern Brazil, 2020. Food Environ. Virol. 2021, 13, 553–559. [Google Scholar] [CrossRef]
  72. Mans, J.; Murray, T.Y.; Nadan, S.; Netshikweta, R.; Page, N.A.; Taylor, M.B. Norovirus diversity in children with gastroenteritis in South Africa from 2009 to 2013: GII. 4 variants and recombinant strains predominate. Epidemiol. Infect. 2016, 144, 907–916. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  73. Page, N.A.; Groome, M.J.; Nadan, S.; Netshikweta, R.; Keddy, K.H.; Poonsamy, B.; Cohen, C. Norovirus epidemiology in South African children <5 years hospitalised for diarrhoeal illness between 2009 and 2013. Epidemiol. Infect. 2017, 145, 1942–1952. [Google Scholar] [PubMed]
Figure 1. Distribution and prevalence of norovirus GI and GII in wastewater samples. Norovirus GI and GII detected in raw sewage (A) and final effluent (B) samples collected between June 2018 and August 2020. The numbers in parentheses indicate the number of samples collected in each month.
Figure 1. Distribution and prevalence of norovirus GI and GII in wastewater samples. Norovirus GI and GII detected in raw sewage (A) and final effluent (B) samples collected between June 2018 and August 2020. The numbers in parentheses indicate the number of samples collected in each month.
Viruses 14 02732 g001
Figure 2. Norovirus GI molecular typing. Distribution and abundance of norovirus GI strains identified from (A) raw sewage and final effluent (B) samples between June 2018 and August 2020. The amplicons from different sampling sites were pooled equimolar based on wastewater sample type and month of collection. The numbers in parentheses indicate the number of merged reads for each month. The key shows known RdRp-capsid combinations, putative novel recombinants (?) and Sanger-sequencing confirmed novel recombinants (*).
Figure 2. Norovirus GI molecular typing. Distribution and abundance of norovirus GI strains identified from (A) raw sewage and final effluent (B) samples between June 2018 and August 2020. The amplicons from different sampling sites were pooled equimolar based on wastewater sample type and month of collection. The numbers in parentheses indicate the number of merged reads for each month. The key shows known RdRp-capsid combinations, putative novel recombinants (?) and Sanger-sequencing confirmed novel recombinants (*).
Viruses 14 02732 g002
Figure 3. Norovirus GII molecular typing. Distribution and abundance of norovirus GII strains identified from (A) raw sewage and final effluent (B) samples between June 2018 and August 2020. The amplicons from different sampling sites were pooled equimolar based on wastewater sample type and month of collection. The numbers in parentheses indicate the number of merged reads for each month. The key shows known RdRp-capsid combinations and putative novel recombinants (?).
Figure 3. Norovirus GII molecular typing. Distribution and abundance of norovirus GII strains identified from (A) raw sewage and final effluent (B) samples between June 2018 and August 2020. The amplicons from different sampling sites were pooled equimolar based on wastewater sample type and month of collection. The numbers in parentheses indicate the number of merged reads for each month. The key shows known RdRp-capsid combinations and putative novel recombinants (?).
Viruses 14 02732 g003
Figure 4. Phylogenetic analysis of the 265 bp at 3′-end of norovirus GI ORF1. Representative sequences identified in this study are indicated by black circles (●). All strain sequence names contain specific ID, genotype and P-type. Reference strains and the most closely related strains were downloaded from GenBank. The Maximum Likelihood phylogenetic trees were constructed using MEGA X software [46] and 1000 bootstrap replicates based on the Kimura 2-parameter model. The sequences were labelled according to month and year of detection, wastewater type, P-type and genotype, e.g., 618RGI.2[P2] = GI.2[P2] virus detected in raw sewage collected in June 2018.
Figure 4. Phylogenetic analysis of the 265 bp at 3′-end of norovirus GI ORF1. Representative sequences identified in this study are indicated by black circles (●). All strain sequence names contain specific ID, genotype and P-type. Reference strains and the most closely related strains were downloaded from GenBank. The Maximum Likelihood phylogenetic trees were constructed using MEGA X software [46] and 1000 bootstrap replicates based on the Kimura 2-parameter model. The sequences were labelled according to month and year of detection, wastewater type, P-type and genotype, e.g., 618RGI.2[P2] = GI.2[P2] virus detected in raw sewage collected in June 2018.
Viruses 14 02732 g004
Figure 5. Phylogenetic analysis of the 314 bp at 5′-end of norovirus GI ORF2. Representative sequences identified in this study are indicated by black circles (●). All strain sequence names contain specific ID, genotype and P-type. Reference strains and the most closely related strains were downloaded from GenBank. The Maximum Likelihood phylogenetic trees were constructed using MEGA X software [46] and 1000 bootstrap replicates based on the Kimura 2-parameter model. The sequences were labelled according to month and year of detection, wastewater type, P-type and genotype, e.g., 618RGI.2[P2] = GI.2[P2] virus detected in raw sewage collected in June 2018.
Figure 5. Phylogenetic analysis of the 314 bp at 5′-end of norovirus GI ORF2. Representative sequences identified in this study are indicated by black circles (●). All strain sequence names contain specific ID, genotype and P-type. Reference strains and the most closely related strains were downloaded from GenBank. The Maximum Likelihood phylogenetic trees were constructed using MEGA X software [46] and 1000 bootstrap replicates based on the Kimura 2-parameter model. The sequences were labelled according to month and year of detection, wastewater type, P-type and genotype, e.g., 618RGI.2[P2] = GI.2[P2] virus detected in raw sewage collected in June 2018.
Viruses 14 02732 g005
Figure 6. Phylogenetic analysis of the 265 bp at 3′-end of norovirus GII ORF1. Representative sequences identified in this study are indicated by black circles (●). All strain sequence names contain specific ID, genotype and P-type. Reference strains and the most closely related strains were downloaded from GenBank. The Maximum Likelihood phylogenetic trees were constructed using MEGA X software [46] and 1000 bootstrap replicates based on the Kimura 2-parameter model. The sequences were labelled according to month and year of detection, wastewater type, P-type and genotype, e.g., 618RGII.2[P2] = GII.2[P2] virus detected in raw sewage collected in June 2018.
Figure 6. Phylogenetic analysis of the 265 bp at 3′-end of norovirus GII ORF1. Representative sequences identified in this study are indicated by black circles (●). All strain sequence names contain specific ID, genotype and P-type. Reference strains and the most closely related strains were downloaded from GenBank. The Maximum Likelihood phylogenetic trees were constructed using MEGA X software [46] and 1000 bootstrap replicates based on the Kimura 2-parameter model. The sequences were labelled according to month and year of detection, wastewater type, P-type and genotype, e.g., 618RGII.2[P2] = GII.2[P2] virus detected in raw sewage collected in June 2018.
Viruses 14 02732 g006
Figure 7. Phylogenetic analysis of the 305 bp at 5′-end of norovirus GII ORF2. Representative sequences identified in this study are indicated by black circles (●). All strain sequence names contain specific ID, genotype and P-type. Reference strains and the most closely related strains were downloaded from GenBank. The Maximum Likelihood phylogenetic trees were constructed using MEGA X software [46] and 1000 bootstrap replicates based on the Kimura 2-parameter model. The sequences were labelled according to month and year of detection, wastewater type, P-type and genotype, e.g., 618RGII.2[P2] = GII.2[P2] virus detected in raw sewage collected in June 2018.
Figure 7. Phylogenetic analysis of the 305 bp at 5′-end of norovirus GII ORF2. Representative sequences identified in this study are indicated by black circles (●). All strain sequence names contain specific ID, genotype and P-type. Reference strains and the most closely related strains were downloaded from GenBank. The Maximum Likelihood phylogenetic trees were constructed using MEGA X software [46] and 1000 bootstrap replicates based on the Kimura 2-parameter model. The sequences were labelled according to month and year of detection, wastewater type, P-type and genotype, e.g., 618RGII.2[P2] = GII.2[P2] virus detected in raw sewage collected in June 2018.
Viruses 14 02732 g007
Figure 8. Phylogenetic analysis of the 302 bp at 5′-end of norovirus GII.2, GII.4 and GII.17 ORF2. Representative sequences identified in this study are indicated by black circles (●). All strain sequence names contain specific ID, genotype and P-type. Reference strains and the most closely related strains were downloaded from GenBank. The Maximum Likelihood phylogenetic tree was constructed using MEGA X software [46] and 1000 bootstrap replicates based on the Kimura 2-parameter model. The sequences were labelled according to month and year of detection, wastewater type, P-type and genotype, e.g., 1018RGII.2[P2] = GII.2[P2] virus detected in raw sewage collected in October 2018.
Figure 8. Phylogenetic analysis of the 302 bp at 5′-end of norovirus GII.2, GII.4 and GII.17 ORF2. Representative sequences identified in this study are indicated by black circles (●). All strain sequence names contain specific ID, genotype and P-type. Reference strains and the most closely related strains were downloaded from GenBank. The Maximum Likelihood phylogenetic tree was constructed using MEGA X software [46] and 1000 bootstrap replicates based on the Kimura 2-parameter model. The sequences were labelled according to month and year of detection, wastewater type, P-type and genotype, e.g., 1018RGII.2[P2] = GII.2[P2] virus detected in raw sewage collected in October 2018.
Viruses 14 02732 g008
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Mabasa, V.V.; van Zyl, W.B.; Ismail, A.; Allam, M.; Taylor, M.B.; Mans, J. Multiple Novel Human Norovirus Recombinants Identified in Wastewater in Pretoria, South Africa by Next-Generation Sequencing. Viruses 2022, 14, 2732. https://doi.org/10.3390/v14122732

AMA Style

Mabasa VV, van Zyl WB, Ismail A, Allam M, Taylor MB, Mans J. Multiple Novel Human Norovirus Recombinants Identified in Wastewater in Pretoria, South Africa by Next-Generation Sequencing. Viruses. 2022; 14(12):2732. https://doi.org/10.3390/v14122732

Chicago/Turabian Style

Mabasa, Victor Vusi, Walda Brenda van Zyl, Arshad Ismail, Mushal Allam, Maureen Beatrice Taylor, and Janet Mans. 2022. "Multiple Novel Human Norovirus Recombinants Identified in Wastewater in Pretoria, South Africa by Next-Generation Sequencing" Viruses 14, no. 12: 2732. https://doi.org/10.3390/v14122732

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop