Article

Origin and Deep Evolution of Human Endogenous Retroviruses in Pan-Primates

1
CAS Key Laboratory of Molecular Virology & Immunology, Institute Pasteur of Shanghai, Center for Biosafety Mega-Science, Chinese Academy of Sciences, Shanghai 200031, China
2
University of Chinese Academy of Sciences, Beijing 100049, China
3
Section for Ecology and Evolution, Department of Biology, University of Copenhagen, DK-1353 Copenhagen, Denmark
4
State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650201, China
*
Author to whom correspondence should be addressed.
Viruses 2022, 14(7), 1370; https://doi.org/10.3390/v14071370
Submission received: 25 May 2022 / Revised: 18 June 2022 / Accepted: 22 June 2022 / Published: 23 June 2022
(This article belongs to the Special Issue Endogenous Retroviruses)

Abstract

Human endogenous retroviruses (HERVs) are viral “fossils” in the human genome that originated from the ancient integration of exogenous retroviruses. Although HERVs have sporadically been reported in nonhuman primate genomes, their deep origination in pan-primates remains to be explored. Hence, based on the in silico genomic mining of full-length HERVs in 49 primates, we performed the largest systematic survey to date of the distribution, phylogeny, and functional predictions of HERVs. Most importantly, we obtained conclusive evidence of nonhuman origin for most contemporary HERVs. We found that various supergroups, including HERVW9, HUERSP, HSERVIII, HERVIPADP, HERVK, and HERVHF, were widely distributed in Strepsirrhini, Platyrrhini (New World monkeys) and Catarrhini (Old World monkeys and apes). We found that numerous HERVHFs are spread by vertical transmission within Catarrhini and one HERVHF was traced in 17 species, indicating its ancient nature. We also discovered that 164 HERVs were likely involved in genomic rearrangement and 107 HERVs were potentially coopted in the form of noncoding RNAs (ncRNAs) in humans. In summary, we provided comprehensive data on the deep origination of modern HERVs in pan-primates.

1. Introduction

Human endogenous retroviruses (HERVs) are retroviral remnants in the human genome derived from the ancient integration of exogenous retroviruses and occupy approximately 8% of the human genomic DNA [1,2]. HERVs are mainly formed via two major mechanisms: (1) horizontal transmission, in which exogenous retroviral RNA is reverse-transcribed and integrated into the host genome, thus becoming a provirus that can produce infectious virus; and (2) vertical transmission, in which the past retroviral infection of germline cells results in a provirus with Mendelian heritability [1,3].
A typical HERV contains two long terminal repeats (LTRs) and four major genes: gag (containing matrix (MA), capsid (CA) and nucleocapsid (NC) domains), pro (containing protease (PR) and dUTPase (DU) domains), pol (containing reverse transcriptase (RT), RNase H (RH) and integrase (IN) domains) and env (containing surface (SU) and transmembrane (TM) domains) [4,5]. Since the first HERV was identified in the 1980s [6], over 3000 classifiable HERVs have been identified, and they can be divided into 3 classes (Class I, Class II and Class III) and 11 supergroups based on the phylogeny of pol genes: Class I: MLLV, HERVERI, HERVFRDLIKE, HEPSI, HUERSP, HERVW9, HERVIPADP, MER50like, and HERVHF; Class II: HERVK/HML; and Class III: HSERVIII [7,8].
Some evidence has shown that HERVs also appear in nonhuman primates [9,10,11]. Furthermore, LTR dating indicated that some HERVs could have entered the genomes of primate ancestors 25 million years ago (MYA) [12], although such dating results should be considered cautiously [9]. To decode the deep origination of HERVs, we performed a systematic in silico mining survey of HERVs in 49 primate genomes covering all major families of the order Primates. Based on these extensive data, we present the full landscape of evolutionary pathways leading to the generation of modern HERVs.

2. Materials and Methods

2.1. Genome Screening and Identification of HERVs

For the strict identification of full-length HERVs, we first used the RT (reverse transcriptase) sequences from the human genome available in the gEVE database [13] and performed TBLASTN searches [14] to screen the 49 primate genomes with a cutoff e-value of 0.00001. All genome assemblies were accessed from the National Center for Biotechnology Information (NCBI) Assembly Database (https://www.ncbi.nlm.nih.gov/assembly/ accessed on 30 November 2021) and the Sequence Read Archive (SRA) Database (https://www.ncbi.nlm.nih.gov/sra/ accessed on 30 November 2021) under the BioProject accession code PRJNA785018. Then, the data generated from the TBLASTN analysis were transformed into gene transfer format (GTF), and duplicate records were removed according to the contig names, start positions, and end positions. BEDTools [15] was employed to merge locations with distances of less than 1000 base pairs (bp).
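The deduplication and merging step can be illustrated with a minimal Python sketch. It is not the authors' BEDTools command, which is not given in the text; the function name `merge_hits`, the tuple layout, and the strict "less than 1000 bp" boundary are assumptions made here for illustration, and strand is ignored.

```python
from collections import defaultdict


def merge_hits(hits, max_gap=1000):
    """Deduplicate hits and merge neighbours separated by less than max_gap bp.

    hits: iterable of (contig, start, end) tuples with start <= end.
    Returns a list of merged (contig, start, end) tuples sorted by contig and start.
    The tuple layout and the strict "<" boundary are assumptions of this sketch.
    """
    by_contig = defaultdict(set)
    for contig, start, end in hits:
        # A set removes records with identical contig, start, and end.
        by_contig[contig].add((start, end))

    merged = []
    for contig in sorted(by_contig):
        current_start, current_end = None, None
        for start, end in sorted(by_contig[contig]):
            if current_start is None:
                current_start, current_end = start, end
            elif start - current_end < max_gap:
                # Close enough to the running interval: extend it.
                current_end = max(current_end, end)
            else:
                merged.append((contig, current_start, current_end))
                current_start, current_end = start, end
        merged.append((contig, current_start, current_end))
    return merged


# Hypothetical coordinates, for illustration only.
example_hits = [
    ("contig1", 100, 200),
    ("contig1", 100, 200),    # exact duplicate
    ("contig1", 900, 1500),   # 700 bp after the first hit
    ("contig1", 3000, 3100),  # 1500 bp after the merged interval
    ("contig2", 5, 50),
]
merged_hits = merge_hits(example_hits)
```

In this example the first three records collapse into one interval on `contig1`, while the distant hit and the hit on `contig2` stay separate.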
To reduce false positive results, the merged sequences were extracted, and DIAMOND BLASTX searches [16] were performed against all primate (taxonomy ID 9443) and virus (taxonomy ID 10239) sequences in the Reference Sequence (RefSeq) database [17] with an e-value of 0.00001. The best alignment results were extracted and screened based on query names and bit scores, and phylogenetic analysis (see below for details) was performed with reference RT sequences of HERVs from a previous publication [18] to confirm whether the screened results represented true RT domains of HERVs. The recognized sequences were extended to a length of 10,000 bp from both ends for further LTR (long terminal repeat) identification.
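Selecting the best alignment per query by bit score can be sketched as follows. The sketch assumes the common 12-column tab-separated BLAST tabular layout (query, subject, percent identity, alignment length, mismatches, gap openings, query start, query end, subject start, subject end, e-value, bit score); the function name `best_hits` and the example rows are hypothetical and do not come from the study.

```python
def best_hits(tabular_lines, max_evalue=1e-5):
    """Return {query: (subject, bit_score)} keeping the highest bit score per query.

    Assumes 12 tab-separated columns with the e-value in column 11 and the
    bit score in column 12 (an assumption of this sketch).
    """
    best = {}
    for line in tabular_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = line.split("\t")
        query, subject = fields[0], fields[1]
        evalue, bit_score = float(fields[10]), float(fields[11])
        if evalue > max_evalue:
            continue  # weaker than the e-value cutoff used in the text
        if query not in best or bit_score > best[query][1]:
            best[query] = (subject, bit_score)
    return best


# Hypothetical alignment rows, for illustration only.
example_rows = [
    "locus1\thost_protein\t40.0\t200\t120\t0\t1\t600\t1\t200\t1e-20\t90.5",
    "locus1\tretroviral_pol\t55.0\t210\t94\t0\t1\t630\t1\t210\t1e-60\t230.0",
    "locus2\thost_protein\t35.0\t150\t97\t0\t1\t450\t1\t150\t0.5\t30.1",
]
top = best_hits(example_rows)
```

Here `locus1` is assigned to its higher-scoring retroviral hit, and `locus2` is dropped because its only alignment fails the e-value cutoff.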
LTRharvest [19] with the default parameters was utilized to determine the boundaries of each LTR of HERVs. The internal sequences were searched against HERV proteins in the gEVE database by using DIAMOND BLASTX with an e-value of 0.00001. The results were transformed into browser extensible data (BED) files, merged, and extracted as previously described. Another round of DIAMOND BLASTX searches was performed to ascertain the nature of HERV genes. The results were manually checked, genomic fragments were merged based on their order and location in the primate genomes to reduce redundancy, and only HERV gene translations of at least 100 amino acids in length were retained. For classification, phylogenetic reconstruction was performed using pol sequences aligned by MAFFT [20] with the parameter “--auto”, and the alignments were trimmed by trimAl [21] with the parameter “-gt 0.1” or “-gt 0.5”. IQ-TREE2 [22] was applied to construct the maximum likelihood (ML) trees with the parameters “-B 1000 -alrt 1000”.

2.2. Vertical Transmission Identification

To identify vertical transmission events, BLASTN searches [14] were performed with the full-length HERVs with both flanking regions (~2000 bp in length). Hits that met the following three conditions were extracted: (1) two HERV sequences showed 90% coverage and identity; (2) two HERV LTR-flanking sequences were matched with an identity over 90%; and (3) the BLASTN results of both LTR-flanking sequences showed over 25% coverage, with at least one result showing 80% coverage. Only candidate HERVs that simultaneously met all three conditions were selected for further analysis. Vertical transmission-associated paired HERVs were analyzed with the igraph package [23], and vertical transmission events were estimated based on the species that contained paired HERVs. The species tree of primates was generated using TimeTree [24], and the bubble pie chart was created for visualization with the scatterpie package [25].
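The three conditions and the grouping of paired HERVs can be sketched in Python. The original analysis used the igraph package; this sketch substitutes a plain union-find to find connected groups of paired HERVs. The `PairHit` record, its field names, and the choice of inclusive versus exclusive thresholds are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PairHit:
    """One BLASTN comparison between two HERVs (hypothetical record layout)."""
    herv_a: str
    herv_b: str
    herv_identity: float     # percent identity of the HERV alignment
    herv_coverage: float     # percent coverage of the HERV alignment
    flank_identities: tuple  # (5' flank, 3' flank) percent identity
    flank_coverages: tuple   # (5' flank, 3' flank) percent coverage


def passes_criteria(hit):
    """Apply the three conditions; threshold inclusiveness is an assumption."""
    cond1 = hit.herv_identity >= 90 and hit.herv_coverage >= 90
    cond2 = all(identity > 90 for identity in hit.flank_identities)
    cond3 = (all(cov > 25 for cov in hit.flank_coverages)
             and any(cov >= 80 for cov in hit.flank_coverages))
    return cond1 and cond2 and cond3


def group_pairs(hits):
    """Group HERVs linked by passing pairs into connected components."""
    parent = {}

    def find(node):
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]  # path compression
            node = parent[node]
        return node

    for hit in hits:
        if passes_criteria(hit):
            root_a, root_b = find(hit.herv_a), find(hit.herv_b)
            if root_a != root_b:
                parent[root_a] = root_b

    groups = {}
    for node in list(parent):
        groups.setdefault(find(node), set()).add(node)
    return sorted(sorted(group) for group in groups.values())


# Hypothetical comparisons, for illustration only.
example_pairs = [
    PairHit("human_1", "chimp_1", 96.0, 98.0, (95.0, 94.0), (85.0, 40.0)),
    PairHit("chimp_1", "gorilla_1", 94.0, 95.0, (93.0, 92.0), (30.0, 90.0)),
    PairHit("human_2", "macaque_7", 91.0, 92.0, (95.0, 96.0), (30.0, 50.0)),
]
groups = group_pairs(example_pairs)
```

The third pair fails condition (3) because neither flank reaches 80% coverage, so only one group of three orthologous loci is returned.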

2.3. Genomic Rearrangement Analysis

Homologous recombination (i.e., between two similar HERVs in different genomic locations in a given species) leading to genomic rearrangement might have occurred during primate genome evolution [26]. We first attempted to detect the rearrangement signals by performing phylogenetic LTR reconstructions according to different HERV categories and primate species based on the 5′ and 3′LTR sequences of full-length HERVs. We collected sequences that did not cluster in pairs (i.e., the 5′LTR and 3′LTR of a single HERV) in the phylogenetic trees. We next tested their coverage and identity by performing BLASTN searches with the same query and subject. We selected the paired LTRs with higher bit scores from different HERV sources (e.g., the bit score of 5′ LTR-1 vs. 5′ LTR-2 was higher than that of 5′ LTR-1 vs. 3′ LTR-1). Then, we checked whether these paired LTRs matched other LTRs (e.g., 5′LTR-1 matched 5′/3′LTR-2 and 3′LTR-1 matched 3′/5′LTR-2). We reasoned that the HERVs with matching LTRs may be subjected to genomic rearrangement.
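The bit-score comparison in this procedure can be sketched as follows. The sketch flags an LTR when it aligns with a higher bit score to an LTR of another HERV than to its own partner LTR. The data layout (a dictionary keyed by pairs of `(herv, end)` labels), the function name, and the scores are hypothetical, and missing comparisons are treated as a score of 0.

```python
def find_mismatched_ltrs(bit_scores):
    """Return {ltr: [better-matching LTRs from other HERVs]}.

    bit_scores maps ((herv, end), (herv, end)) -> BLASTN bit score,
    where end is "5" or "3". This layout is an assumption of the sketch.
    """
    # Make the lookup symmetric so each pair can be queried in either order.
    symmetric = {}
    for (ltr_a, ltr_b), score in bit_scores.items():
        symmetric[(ltr_a, ltr_b)] = score
        symmetric[(ltr_b, ltr_a)] = score

    ltrs = {ltr for pair in symmetric for ltr in pair}
    candidates = {}
    for ltr in sorted(ltrs):
        herv, end = ltr
        partner = (herv, "3" if end == "5" else "5")
        own_score = symmetric.get((ltr, partner), 0.0)
        better = sorted(
            other for other in ltrs
            if other[0] != herv and symmetric.get((ltr, other), 0.0) > own_score
        )
        if better:
            candidates[ltr] = better
    return candidates


# Hypothetical bit scores, for illustration only.
example_scores = {
    (("H1", "5"), ("H1", "3")): 300.0,
    (("H2", "5"), ("H2", "3")): 310.0,
    (("H1", "5"), ("H2", "5")): 500.0,
    (("H1", "3"), ("H2", "3")): 520.0,
    (("H1", "5"), ("H2", "3")): 100.0,
    (("H3", "5"), ("H3", "3")): 400.0,
    (("H3", "5"), ("H1", "5")): 50.0,
}
mismatches = find_mismatched_ltrs(example_scores)
```

In this toy example the LTRs of H1 and H2 match each other better than their own partners, which is the pattern interpreted in the text as a possible rearrangement signal, while H3 is not flagged.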

2.4. HERVs-Derived ncRNA Verification in the Human Genome

We employed the coordinates of all known human transcripts from Ensembl database version 104 [27] and ncRNAs from the NONCODE database [28] in Genome Reference Consortium Human Build 38 (GRCh38/hg38) and retained those that intersected with the coordinates of HERVs in the human genome by using BEDTools with the parameter “intersect -wo -s”. We only selected the results for which the coverage of at least one feature in a pair of features was equal to 100% and predicted the possible HERV-derived ncRNA molecules based on these results.
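The selection rule can be sketched in Python: a HERV and a transcript are paired only if they lie on the same chromosome and strand and their overlap spans 100% of at least one of the two features. The function name, the half-open BED-style coordinates, and all example features are assumptions made here for illustration, not data from the study.

```python
def fully_covered_pairs(hervs, ncrnas):
    """Return (herv, ncrna, overlap_bp) for same-strand pairs where the
    overlap covers all of the HERV or all of the ncRNA.

    Both inputs map a name to (chrom, start, end, strand) with half-open
    BED-style coordinates (an assumption of this sketch).
    """
    results = []
    for h_name, (h_chrom, h_start, h_end, h_strand) in hervs.items():
        for n_name, (n_chrom, n_start, n_end, n_strand) in ncrnas.items():
            if h_chrom != n_chrom or h_strand != n_strand:
                continue  # different chromosome or opposite orientation
            overlap = min(h_end, n_end) - max(h_start, n_start)
            if overlap <= 0:
                continue  # no shared bases
            if overlap == h_end - h_start or overlap == n_end - n_start:
                results.append((h_name, n_name, overlap))
    return sorted(results)


# Hypothetical features, for illustration only.
example_hervs = {
    "HERV_a": ("chr1", 1000, 9000, "+"),
    "HERV_b": ("chr2", 500, 6000, "-"),
}
example_ncrnas = {
    "nc1": ("chr1", 2000, 3000, "+"),   # lies entirely inside HERV_a
    "nc2": ("chr1", 8000, 12000, "+"),  # partial overlap only
    "nc3": ("chr2", 100, 7000, "+"),    # contains HERV_b but opposite strand
    "nc4": ("chr2", 100, 7000, "-"),    # contains HERV_b on the same strand
}
pairs = fully_covered_pairs(example_hervs, example_ncrnas)
```

The nested loop is quadratic and is only meant to make the rule explicit; interval-based tools such as BEDTools are the practical choice at genome scale.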

3. Results

3.1. HERVs Are Widely Dispersed in the Genomes of Old World Monkeys and Apes

To identify HERVs, we first analyzed the genomes of 49 species of primates to identify the RT domains of reference HERVs because the RT domains of HERVs are often used to distinguish HERVs and other retroviruses [18,29,30]. Briefly, we performed a first round of TBLASTN analysis to search the HERV RT domains of primate genomes, and a second round of DIAMOND BLASTX analysis was then performed to exclude those RT domains that were better aligned with host proteins or other viral proteins. Next, we performed phylogenetic analysis to verify whether these RT sequences belonged to HERVs. We extended the length of the verified sequences and identified their LTR boundaries. We subsequently estimated the internal sequences between two LTRs with DIAMOND BLASTX, and only sequences that contained RT domains and showed the correct ordering of other genes (e.g., gag-pro-pol-env) were identified as classifiable HERVs, and hence defined as “full-length HERVs” (F-HERVs). We reconstructed the pol gene of each HERV and performed phylogenetic analysis to classify the HERVs (Figure 1A). The HERVs were classified according to their phylogenetic relationships with reference sequences and the similarity of their RT domains with reference RT sequences.
In total, we identified 2301 classifiable F-HERV copies (Figure 1A,B & Supplementary Data S2), with most of them found in Catarrhini (Figure 1B). The limited numbers identified in this study reflected our rigorous search methodology and the limitations of using full-length HERVs. The greatest number of the identified F-HERVs belonged to the HERVHF supergroup, whose members reportedly integrated into Catarrhini genomes at least 30–45 million years ago [31,32,33,34]. It is worth noting that we found HERVH sequences not only in the species in which they have been reported previously (Homo sapiens, Gorilla gorilla gorilla, Pongo abelii, Papio anubis, Chlorocebus aethiops, Callithrix jacchus, Pan troglodytes, Nomascus siki and Aotus nancymaae) but also in some new genera of Catarrhini, such as Mandrillus, Rhinopithecus, and Colobus (Figure 1B), further confirming the widespread and ancient nature of HERVHF. We also found other types of F-HERVs, such as HERVW9, HERVIPADP, HERVK, and HSERVIII members (Figure 1B), in primates, which was consistent with previous studies [35,36,37,38] but with an expanded range of hosts. Together, these results demonstrated that F-HERVs are ancient, and humans inherited such elements via vertical transmission from nonhuman primates.

3.2. Numerous HERVHFs Are Spread by Vertical Transmission within Catarrhini

If vertical transmission events occurred, the sequences of the two viruses and their flanking sequences should be the same in different primate genomes. However, over a long evolutionary history, many mutations accumulate in HERVs, which makes it difficult to identify vertical transmission events. Therefore, we set the following strict criteria for identifying possible vertical transmission events: (1) two HERVs must show high identity and coverage; (2) the flanking sequences of the two HERVs must show high identity; and (3) at least one of the flanking sequences must show high coverage (Figure 1C).
In total, we discovered 1226 F-HERVs that may participate in vertical transmission and identified 408 vertical transmission events (Figure 1D). All of the vertical transmission events were identified within Catarrhini, and more than half of these (222 of 408) were found in apes. According to HERV classification, most of the vertically transmitted F-HERVs belonged to the HERVHF group, which was consistent with the distribution of HERVs (Figure 1B). Interestingly, we found that several F-HERVs may have infiltrated the common ancestor of Old-World monkeys and apes, including 10 HERVHF, 2 HERVK, 1 HERVIPADP, 1 HSERVIII, and 1 HUERSP members (Figure 1D). Strikingly, one HERVHF was vertically transmitted from Old World monkeys to apes, and the pathway of its vertical transmission was traced in 17 species (Figure 1E & Supplementary Data S4–S6). In addition, we estimated the time of F-HERV integration based on the time tree of these 49 primates and the vertical transmission events detected within them (Figure 1D). We speculated that detectable vertical transmission of F-HERVs occurred from 0 to 29.4 MYA and that nearly 25% of vertical transmission events (118 in 468) occurred at 9.1 MYA, when Gorilla gorilla gorilla separated from Homo sapiens.

3.3. Some F-HERVs May Be Involved in Genomic Rearrangement

HERVs are not only molecular ‘fossils’ of ancient retroviruses but are also functional in host genomes under certain circumstances [39,40,41]. One of the functions of HERVs is mediating host genomic recombination, leading to potential genomic rearrangement [26,42,43]. When a HERV is integrated into a host genome, the two LTRs of that one element should be more similar to each other than to the LTRs of any other element, although they accumulate mutations after integration and residence in the germ line [26]. Therefore, if a HERV has two similar but different LTRs, genomic recombination may occur within that HERV.
We predicted F-HERVs that may be involved in host genomic recombination based on this hypothesis and found that 25.5% (586 of 2301) of F-HERVs had a pair of nonclustered LTRs, indicating that these F-HERVs may be related to host genomic recombination (Figure 2A). We performed BLASTN searches to identify the “mismatches” of LTRs (LTRs showing better alignment with other HERV LTRs) (Figure 2B,C) and counted the number of different types of recombination-related HERVs in each species (Figure 2D & Supplementary Data S7–S9). The results showed that most of the recombination-related F-HERVs (147 of 164) located in the genomes of apes belonged to the HERVHF group (Figure 2D), which was consistent with the total distribution of F-HERVs (Figure 1B). Overall, our results suggested that some of the F-HERVs that we identified were associated with the recombination of primate genomes.

3.4. Some F-HERVs in Human Genomes Are Likely Transcribed into ncRNAs

Another function of HERVs may involve their transcription into ncRNAs that then regulate host genes [44,45,46]. Because the location information of human ncRNAs has been well annotated in human genomes [28], we intersected our HERV coordinates with those of the known human ncRNAs. We attempted to identify ncRNAs derived from F-HERVs or ncRNAs containing F-HERVs. We finally identified 107 F-HERVs and ncRNAs that showed the same locations and orientations (Table 1 and Supplementary Data S10). We calculated the statistics of the F-HERV distribution and classification on each human chromosome and found that most of the ncRNA-correlated F-HERVs belonged to the HERVHF group (104 of 107), and the three human chromosomes that possessed the greatest numbers of ncRNA-correlated F-HERVs were chromosomes 6, 2 and 1, with 12, 11 and 10 of these F-HERVs, respectively (Figure 2E). In short, these data showed that some F-HERVs in human genomes may be involved in evolutionary co-option with primates and function in the form of ncRNAs.

4. Discussion

In the past 30 years, many HERVs have been identified in human genomes, but there has been little systematic research on HERVs in nonhuman primates. In this study, we used all known HERV sequences to determine the classifiable HERVs in 49 species of primates and annotated their specific loci in the host genomes (Supplementary Data S2). We discovered only 292 F-HERVs in humans, which was much lower than the number indicated by previous research [8,47]. One major reason for this difference was that we used only the RT sequences of HERVs in the human genomes available from gEVE, rather than using exogenous retroviral pol sequences to search HERVs, because we were focused on tracing the origin of different types of HERVs in human genomes and the RT domain is the most conserved domain that can be used to distinguish retroviruses [29,48]. Indeed, many HERVs accumulate mutations or are even lost during long-term evolution, and phylogenetic analysis based on these proteins sometimes cannot rebuild their phylogenetic relationships. Although we identified fewer F-HERVs in the human genome through our pipeline, these F-HERVs showed a relatively intact genomic structure and covered 5 supergroups of HERVs (Figure 1A,B). In addition, further investigation revealed that some of these F-HERVs were involved in vertical transmission. Thus, these F-HERVs could help us to effectively pursue the origin of HERVs.
Vertical transmission events of HERVs provide strong evidence that could indicate the origin of HERVs. One of the most ancient HERVs (HERVLs) reported to date, which integrated into an ancestor of all extant placental mammals more than 100 MYA, was identified based on this line of reasoning [49]. We found that most vertical transmission-related F-HERVs in human genomes were derived from those present in Hominidae, and the others came from Hylobates and Old World monkeys (Supplementary Data S3). We estimated that the integration times of these F-HERVs, which ranged from 9.1 MYA to 29.4 MYA (Figure 1C and Supplementary Data S3), depended on the time of separation, and this result was consistent with previous reports [50,51]. Vertical transmission events spanning long periods are difficult to track because of the strict definition of vertical transmission, which requires very high sequence similarity and coverage of HERVs and their flanking sequences. Mutations in HERVs show a positive correlation with time, and we were, therefore, unable to identify vertical transmission that may have occurred in the ancestors of New World monkeys (NWM) or Strepsirrhini. Another reason for the unsuccessful detection of vertical transmission was that the total number of F-HERVs found in the first step was small. If we were to consider the HERVs that have lost their RT domains, different results might be obtained.
HERVs are capable of causing homologous recombination due to their high sequence similarities. Many studies have analyzed HERV-related gene recombination by comparing the genomes of different individuals [52,53,54]. However, the available genomes from different individuals of the same species are insufficient. We therefore assumed that homologous recombination takes place between two HERVs of the same type (e.g., HERVHF) that share highly similar sequences but show differences in their LTRs. When homologous recombination occurs, such HERVs will exchange their internal sequences, leading to a pair of analogous but different LTRs. Our inference of homologous recombination rests on this assumption (Figure 2B,C & Supplementary Data S7), and the results should be treated with caution because recombination of endogenous retroviruses with microhomologous sequences has also been reported in other mammals [55].
Recently, HERVs have been reported to be associated with many human diseases, including cancer and infectious and autoimmune diseases, and the mechanisms underlying the functions of HERVs in these illnesses also vary (e.g., acting as promoters or enhancers to regulate gene expression or encoding peptides that participate in immune regulation) [39,40,56,57,58]. Therefore, it is important to consider HERVs that have the potential to be expressed. To identify potentially expressed F-HERVs in humans, we intersected the coordinates of all human F-HERVs with those of all known transcripts from public databases and found that some HERVs may be transcribed into ncRNAs and be functional under certain conditions, such as viral infections [40]. We also searched the RefSeq database and the nucleotide sequence (nt) database of NCBI with BLASTN for F-HERVs from nonhuman primates, and we did not find any credible transcripts strongly related to these HERVs. However, some of these F-HERVs showed high similarities with F-HERVs from humans, so we surmised they may have homologous functions.
In conclusion, we traced the F-HERVs present in the human genome back to nonhuman primates and found that some HERVs originated before the speciation of Hominidae, Hylobates, and Old World monkeys. In addition, some of the F-HERVs that we identified were possibly functional from the perspective of genomics or transcriptomics, likely indicating long-term co-option. Together, these findings could help us to better understand the deep origin and evolution of modern HERVs.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/v14071370/s1, Data S1: The common name, scientific name and classification of each primate we analyzed in this research; Data S2: The loci of each HERV we identified in the genomes of 49 primates; Data S3: The vertical transmission events in human genomes; Data S4: The alignments of the 5′LTR and flanking sequences of a widely vertically transmitted HERV-H in 17 species; Data S5: The alignments of the 3′LTR and flanking sequences of a widely vertically transmitted HERV-H in 17 species; Data S6: The alignments of the internal sequences of a widely vertically transmitted HERV-H in 17 species; Data S7: The BLASTN results for the LTRs of HERVs possibly involved in host gene recombination; Data S8: The alignments of the 5′LTR sequences of HERVs in Figure 2B; Data S9: The alignments of the 3′LTR sequences of HERVs in Figure 2B; Data S10: The relationship between HERVs in human genomes and human ncRNAs.

Author Contributions

Conceptualization, J.C.; Methodology, Y.L.; Validation, Y.L.; Formal Analysis, Y.L.; Investigation, Y.L.; Resources, G.Z.; Data Curation, Y.L.; Writing—Original Draft Preparation, Y.L. and J.C.; Writing—Review & Editing, J.C.; Visualization, Y.L.; Supervision, J.C. and G.Z.; Project Administration, J.C.; Funding Acquisition, J.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (31970176).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available in the article and the Supplementary Materials. Additional data related to this article may be acquired from the authors.

Acknowledgments

The authors would like to express gratitude to the Primate Sequencing Consortium for early access to the newly sequenced primate genomes.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Johnson, W.E. Origins and evolutionary consequences of ancient endogenous retroviruses. Nat. Rev. Microbiol. 2019, 17, 355–370.
  2. Griffiths, D.J. Endogenous retroviruses in the human genome sequence. Genome Biol. 2001, 2, reviews1017.1.
  3. Greenwood, A.D.; Ishida, Y.; O’Brien, S.P.; Roca, A.L.; Eiden, M.V. Transmission, Evolution, and Endogenization: Lessons Learned from Recent Retroviral Invasions. Microbiol. Mol. Biol. Rev. 2018, 82, e00044-17.
  4. Garcia-Montojo, M.; Doucet-O’Hare, T.; Henderson, L.; Nath, A. Human endogenous retrovirus-K (HML-2): A comprehensive review. Crit. Rev. Microbiol. 2018, 44, 715–738.
  5. Jansz, N.; Faulkner, G.J. Endogenous retroviruses in the origins and treatment of cancer. Genome Biol. 2021, 22, 147.
  6. Martin, M.A.; Bryan, T.; Rasheed, S.; Khan, A.S. Identification and cloning of endogenous retroviral sequences present in human DNA. Proc. Natl. Acad. Sci. USA 1981, 78, 4892–4896.
  7. Hayward, A.; Cornwallis, C.K.; Jern, P. Pan-vertebrate comparative genomics unmasks retrovirus macroevolution. Proc. Natl. Acad. Sci. USA 2015, 112, 464–469.
  8. Vargiu, L.; Rodriguez-Tome, P.; Sperber, G.O.; Cadeddu, M.; Grandi, N.; Blikstad, V.; Tramontano, E.; Blomberg, J. Classification and characterization of human endogenous retroviruses; mosaic forms are common. Retrovirology 2016, 13, 7.
  9. Bannert, N.; Kurth, R. The evolutionary dynamics of human endogenous retroviral families. Annu. Rev. Genom. Hum. Genet. 2006, 7, 149–173.
  10. Escalera-Zamudio, M.; Greenwood, A.D. On the classification and evolution of endogenous retrovirus: Human endogenous retroviruses may not be ‘human’ after all. APMIS 2016, 124, 44–51.
  11. Mager, D.L.; Stoye, J.P. Mammalian Endogenous Retroviruses. Microbiol. Spectr. 2015, 3, MDNA3-0009-2014.
  12. Tristem, M. Identification and characterization of novel human endogenous retrovirus families by phylogenetic screening of the human genome mapping project database. J. Virol. 2000, 74, 3715–3730.
  13. Nakagawa, S.; Takahashi, M.U. gEVE: A genome-based endogenous viral element database provides comprehensive viral protein-coding sequences in mammalian genomes. Database 2016, 2016, baw087.
  14. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410.
  15. Quinlan, A.R. BEDTools: The Swiss-Army Tool for Genome Feature Analysis. Curr. Protoc. Bioinform. 2014, 47, 11.12.1–11.12.34.
  16. Buchfink, B.; Reuter, K.; Drost, H.G. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat. Methods 2021, 18, 366–368.
  17. O’Leary, N.A.; Wright, M.W.; Brister, J.R.; Ciufo, S.; Haddad, D.; McVeigh, R.; Rajput, B.; Robbertse, B.; Smith-White, B.; Ako-Adjei, D.; et al. Reference sequence (RefSeq) database at NCBI: Current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 2016, 44, D733–D745.
  18. Xu, X.; Zhao, H.; Gong, Z.; Han, G.Z. Endogenous retroviruses of non-avian/mammalian vertebrates illuminate diversity and deep history of retroviruses. PLoS Pathog. 2018, 14, e1007072.
  19. Ellinghaus, D.; Kurtz, S.; Willhoeft, U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinform. 2008, 9, 18.
  20. Katoh, K.; Misawa, K.; Kuma, K.; Miyata, T. MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002, 30, 3059–3066.
  21. Capella-Gutierrez, S.; Silla-Martinez, J.M.; Gabaldon, T. trimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 2009, 25, 1972–1973.
  22. Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; von Haeseler, A.; Lanfear, R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 2020, 37, 1530–1534.
  23. Csardi, G.; Nepusz, T. The igraph software package for complex network research. InterJ. Complex Syst. 2006, 1695, 1–9.
  24. Kumar, S.; Stecher, G.; Suleski, M.; Hedges, S.B. TimeTree: A Resource for Timelines, Timetrees, and Divergence Times. Mol. Biol. Evol. 2017, 34, 1812–1819.
  25. Yu, G. Scatterpie: Scatter Pie Plot. 2021. Available online: https://guangchuangyu.github.io/scatterpie/ (accessed on 1 December 2016).
  26. Hughes, J.F.; Coffin, J.M. Evidence for genomic rearrangements mediated by human endogenous retroviruses during primate evolution. Nat. Genet. 2001, 29, 487–489.
  27. Howe, K.L.; Achuthan, P.; Allen, J.; Allen, J.; Alvarez-Jarreta, J.; Amode, M.R.; Armean, I.M.; Azov, A.G.; Bennett, R.; Bhai, J.; et al. Ensembl 2021. Nucleic Acids Res. 2021, 49, D884–D891.
  28. Zhao, L.; Wang, J.; Li, Y.; Song, T.; Wu, Y.; Fang, S.; Bu, D.; Li, H.; Sun, L.; Pei, D.; et al. NONCODEV6: An updated database dedicated to long non-coding RNA annotation in both animals and plants. Nucleic Acids Res. 2021, 49, D165–D171.
  29. Xiong, Y.; Eickbush, T.H. Origin and evolution of retroelements based upon their reverse transcriptase sequences. EMBO J. 1990, 9, 3353–3362.
  30. De Parseval, N.; Heidmann, T. Human endogenous retroviruses: From infectious elements to human genes. Cytogenet. Genome Res. 2005, 110, 318–332.
  31. Mager, D.L.; Freeman, J.D. HERV-H endogenous retroviruses: Presence in the New World branch but amplification in the Old World primate lineage. Virology 1995, 213, 395–404.
  32. Sverdlov, E.D. Retroviruses and primate evolution. Bioessays 2000, 22, 161–171.
  33. Yi, J.M.; Kim, H.S. Evolutionary implication of human endogenous retrovirus HERV-H family. J. Hum. Genet. 2004, 49, 215–219.
  34. Goodchild, N.L.; Wilkinson, D.A.; Mager, D.L. Recent evolutionary expansion of a subfamily of RTVL-H human endogenous retrovirus-like elements. Virology 1993, 196, 778–788.
  35. Grandi, N.; Cadeddu, M.; Blomberg, J.; Mayer, J.; Tramontano, E. HERV-W group evolutionary history in non-human primates: Characterization of ERV-W orthologs in Catarrhini and related ERV groups in Platyrrhini. BMC Evol. Biol. 2018, 18, 6.
  36. Holloway, J.R.; Williams, Z.H.; Freeman, M.M.; Bulow, U.; Coffin, J.M. Gorillas have been infected with the HERV-K (HML-2) endogenous retrovirus much more recently than humans and chimpanzees. Proc. Natl. Acad. Sci. USA 2019, 116, 1337–1346.
  37. Lee, J.W.; Kim, H.S. Endogenous retrovirus HERV-I LTR family in primates: Sequences, phylogeny, and evolution. Arch. Virol. 2006, 151, 1651–1658.
  38. Yi, J.M.; Kim, T.H.; Huh, J.W.; Park, K.S.; Jang, S.B.; Kim, H.M.; Kim, H.S. Human endogenous retroviral elements belonging to the HERV-S family from human tissues, cancer cells, and primates: Expression, structure, phylogeny and evolution. Gene 2004, 342, 283–292.
  39. Shah, A.H.; Gilbert, M.; Ivan, M.E.; Komotar, R.J.; Heiss, J.; Nath, A. The role of human endogenous retroviruses in gliomas: From etiological perspectives and therapeutic implications. Neuro Oncol. 2021, 23, 1647–1655.
  40. Srinivasachar Badarinarayan, S.; Sauter, D. Switching Sides: How Endogenous Retroviruses Protect Us from Viral Infections. J. Virol. 2021, 95, e02299-20.
  41. Xiang, Y.; Liang, H. The Regulation and Functions of Endogenous Retrovirus in Embryo Development and Stem Cell Differentiation. Stem Cells Int. 2021, 2021, 6660936.
  42. Campbell, I.M.; Gambin, T.; Dittwald, P.; Beck, C.R.; Shuvarikov, A.; Hixson, P.; Patel, A.; Gambin, A.; Shaw, C.A.; Rosenfeld, J.A.; et al. Human endogenous retroviral elements promote genome instability via non-allelic homologous recombination. BMC Biol. 2014, 12, 74.
  43. Trombetta, B.; Fantini, G.; D’Atanasio, E.; Sellitto, D.; Cruciani, F. Evidence of extensive non-allelic gene conversion among LTR elements in the human genome. Sci. Rep. 2016, 6, 28710.
  44. Wilson, K.D.; Ameen, M.; Guo, H.; Abilez, O.J.; Tian, L.; Mumbach, M.R.; Diecke, S.; Qin, X.; Liu, Y.; Yang, H.; et al. Endogenous Retrovirus-Derived lncRNA BANCR Promotes Cardiomyocyte Migration in Humans and Non-human Primates. Dev. Cell 2020, 54, 694–709.e699.
  45. Zhou, B.; Qi, F.; Wu, F.; Nie, H.; Song, Y.; Shao, L.; Han, J.; Wu, Z.; Saiyin, H.; Wei, G.; et al. Endogenous Retrovirus-Derived Long Noncoding RNA Enhances Innate Immune Responses via Derepressing RELA Expression. MBio 2019, 10, e00937-19.
  46. Hu, T.; Pi, W.; Zhu, X.; Yu, M.; Ha, H.; Shi, H.; Choi, J.H.; Tuan, D. Long non-coding RNAs transcribed by ERV-9 LTR retrotransposon act in cis to modulate long-range LTR enhancer function. Nucleic Acids Res. 2017, 45, 4479–4492. [Google Scholar] [CrossRef] [Green Version]
  47. Kojima, K.K. AcademH, a lineage of Academ DNA transposons encoding helicase found in animals and fungi. Mob. DNA 2020, 11, 15. [Google Scholar] [CrossRef]
  48. Xiong, Y.; Eickbush, T.H. Similarity of reverse transcriptase-like sequences of viruses, transposable elements, and mitochondrial introns. Mol. Biol. Evol. 1988, 5, 675–690. [Google Scholar] [CrossRef] [Green Version]
  49. Lee, A.; Nolan, A.; Watson, J.; Tristem, M. Identification of an ancient endogenous retrovirus, predating the divergence of the placental mammals. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2013, 368, 20120503. [Google Scholar] [CrossRef] [Green Version]
  50. Jern, P.; Sperber, G.O.; Blomberg, J. Divergent patterns of recent retroviral integrations in the human and chimpanzee genomes: Probable transmissions between other primates and chimpanzees. J. Virol. 2006, 80, 1367–1375. [Google Scholar] [CrossRef] [Green Version]
  51. Magiorkinis, G.; Blanco-Melo, D.; Belshaw, R. The decline of human endogenous retroviruses: Extinction and survival. Retrovirology 2015, 12, 8. [Google Scholar] [CrossRef] [Green Version]
  52. Bosch, E.; Jobling, M.A. Duplications of the AZFa region of the human Y chromosome are mediated by homologous recombination between HERVs and are compatible with male fertility. Hum. Mol. Genet. 2003, 12, 341–347. [Google Scholar] [CrossRef] [PubMed]
  53. Robberecht, C.; Voet, T.; Zamani Esteki, M.; Nowakowska, B.A.; Vermeesch, J.R. Nonallelic homologous recombination between retrotransposable elements is a driver of de novo unbalanced translocations. Genome Res. 2013, 23, 411–418. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Weckselblatt, B.; Hermetz, K.E.; Rudd, M.K. Unbalanced translocations arise from diverse mutational mechanisms including chromothripsis. Genome Res. 2015, 25, 937–947. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Löber, U.; Hobbs, M.; Dayaram, A.; Tsangaras, K.; Jones, K.; Alquezar-Planas, D.E.; Ishida, Y.; Meers, J.; Mayer, J.; Quedenau, C.; et al. Degradation and remobilization of endogenous retroviruses by recombination during the earliest stages of a germ-line invasion. Proc. Natl. Acad. Sci. USA 2018, 115, 8609–8614. [Google Scholar] [CrossRef] [Green Version]
  56. Ariza, M.E.; Williams, M.V. A human endogenous retrovirus K dUTPase triggers a TH1, TH17 cytokine response: Does it have a role in psoriasis? J. Investig. Dermatol. 2011, 131, 2419–2427. [Google Scholar] [CrossRef] [Green Version]
  57. Volkman, H.E.; Stetson, D.B. The enemy within: Endogenous retroelements and autoimmune disease. Nat. Immunol. 2014, 15, 415–422. [Google Scholar] [CrossRef] [Green Version]
  58. Srinivasachar Badarinarayan, S.; Shcherbakova, I.; Langer, S.; Koepke, L.; Preising, A.; Hotter, D.; Kirchhoff, F.; Sparrer, K.M.J.; Schotta, G.; Sauter, D. HIV-1 infection activates endogenous retroviral promoters regulating antiviral gene expression. Nucleic Acids Res. 2020, 48, 10890–10908. [Google Scholar] [CrossRef]
Figure 1. The distribution, classification, and vertical transmission of F-HERVs in 49 primates. (A) Phylogenetic tree of all identified F-HERVs. The classifications of F-HERVs are indicated in different colors, and “Unknown” denotes F-HERVs that could not be classified by their phylogenetic relationships with reference sequences or by the similarity of their RT domains to reference RT sequences. Ultrafast bootstrap approximation (UFBoot) values over 95 are shown beside the nodes. (B) Histogram of the number of F-HERVs in each species, colored by classification. (C) Diagram of the criteria used to screen for vertical transmission events. The blue, pink, and green boxes represent the flanking sequences (Flanking), LTR sequences (5′LTR or 3′LTR), and internal sequences (Int) of F-HERVs, respectively. The dashed boxes indicate the BLASTN searches performed, with the threshold of each search shown in red. The white segments represent mismatches or gaps in the BLASTN alignments. The detailed procedure of vertical transmission identification is provided in the Materials and Methods. (D) The left panel shows the rooted phylogenetic tree and the divergence times of 50 species (Tupaia glis was used as the root); the area of the pie at each node indicates the number of possible vertical transmission events. The right panel shows the number of vertical transmission-related F-HERVs in each species studied, colored by classification. (E) Alignments of the 5′LTR and flanking sequences (upper) and the 3′LTR and flanking sequences (lower) of one widely vertically transmitted F-HERV-H in 17 species. The species names are listed on the left, and red boxes indicate the LTR sequences of the F-HERV-H. The full alignments of the LTR and flanking sequences are provided in the Supplementary Materials.
Figure 2. F-HERVs may be involved in host genomic recombination or transcribed into ncRNAs. (A) Pie chart of the number of F-HERVs with clustered LTRs (dark red) and F-HERVs with non-clustered LTRs (blue green). (B) The phylogenetic tree (upper) shows an example of LTR separation in one human F-HERV-H. The separation of the two LTRs is indicated in red, and ultrafast bootstrap approximation (UFBoot) values over 95 are shown beside the nodes. The BLASTN results for these two pairs of LTRs are shown in the table (lower). “qcovhsp” indicates the query coverage per HSP, and the red arrows indicate the better alignments. The detailed alignments are shown in the Supplementary Data. (C) Diagram of the criteria used to screen for F-HERV-related host genomic recombination events. The pink and green boxes represent the LTR sequences (5′LTR or 3′LTR) and internal sequences (Int) of F-HERVs, respectively. The dashed boxes indicate the BLASTN searches performed, with the threshold of each search shown in red. The details of the genomic rearrangement analysis are provided in the Materials and Methods. (D) Histogram of the number of recombination-related F-HERVs in each species, colored by classification. “Unknown” denotes F-HERVs that could not be classified by their phylogenetic relationships with reference sequences or by the similarity of their RT domains to reference RT sequences. (E) Histogram of the number of F-HERVs that share the same location and orientation as known ncRNAs on each human chromosome, colored by classification.
Table 1. Annotation of HERV-related ncRNAs in humans.
| Chromosome | Start | End | HERV name | Strand | Related ncRNA |
|---|---|---|---|---|---|
| 1 | 22997488 | 23002547 | Homo_sapiens_1_23000272-23002212-HERVHF | - | NONHSAG057580.1 |
| 1 | 43087974 | 43091095 | Homo_sapiens_1_43087972-43089561-HERVHF | + | NONHSAG056389.1 |
| 1 | 68386003 | 68391994 | Homo_sapiens_1_68388791-68390135-HERVHF | + | NONHSAG001773.3 |
| 1 | 82354581 | 82360561 | Homo_sapiens_1_82356924-82357985-HERVHF | - | ENSG00000233290 |
| 1 | 82955299 | 82961592 | Homo_sapiens_1_82956772-82959208-HERVHF | + | ENSG00000230817 |
| 1 | 209451675 | 209454012 | Homo_sapiens_1_209451456-209452546-HERVHF | + | NONHSAG057153.1 |
| 1 | 224840009 | 224846095 | Homo_sapiens_1_224842067-224843662-HERVHF | - | ENSG00000286719 |
| 1 | 228942542 | 228942868 | Homo_sapiens_1_228948757-228950104-HERVHF | + | NONHSAG057288.1 |
| 1 | 232120241 | 232123943 | Homo_sapiens_1_232120704-232121847-HERVHF | + | NONHSAG004630.2 |
| 1 | 241433890 | 241439885 | Homo_sapiens_1_241436435-241438237-HERVHF | + | ENSG00000287516 |
| 2 | 5000768 | 5003355 | Homo_sapiens_2_5003505-5005037-HERVHF | + | NONHSAG077068.1 |
| 2 | 16791950 | 16797713 | Homo_sapiens_2_16793801-16795145-HERVHF | - | NONHSAG078497.1 |
| 2 | 34789818 | 34796058 | Homo_sapiens_2_34792231-34793755-HERVHF | + | NONHSAG077257.1 |
| 2 | 38080800 | 38086513 | Homo_sapiens_2_38082627-38083973-HERVHF | - | ENSG00000138061 |
| 2 | 67333734 | 67337603 | Homo_sapiens_2_67334137-67335887-HERVHF | + | NONHSAG028042.3 |
| 2 | 69789900 | 69795859 | Homo_sapiens_2_69792965-69796021-HUERSP | - | NONHSAG028084.2 |
| 2 | 77965137 | 77970868 | Homo_sapiens_2_77967627-77968976-HERVHF | + | NONHSAG077496.1 |
| 2 | 192506078 | 192513184 | Homo_sapiens_2_192509946-192511342-HERVHF | + | NONHSAG030125.2 |
| 2 | 215922303 | 215928129 | Homo_sapiens_2_215924899-215926434-HERVHF | + | NONHSAG078151.1 |
| 2 | 224225331 | 224230988 | Homo_sapiens_2_224227814-224229160-HERVHF | + | NONHSAG078214.1 |
| 2 | 237606784 | 237611630 | Homo_sapiens_2_237609479-237610820-HERVHF | + | NONHSAG110040.1 |
| 3 | 21189031 | 21194139 | Homo_sapiens_3_21190643-21192440-HERVHF | - | ENSG00000282987 |
| 3 | 54634482 | 54638068 | Homo_sapiens_3_54636349-54637755-HERVHF | - | ENSG00000265992 |
| 3 | 112418410 | 112423366 | Homo_sapiens_3_112419312-112420768-HERVHF | - | NONHSAG035734.2 |
| 3 | 115798715 | 115799166 | Homo_sapiens_3_115795176-115796709-HERVHF | - | NONHSAG085690.1 |
| 3 | 155274423 | 155278762 | Homo_sapiens_3_155276457-155278448-HERVHF | - | NONHSAG036456.2 |
| 3 | 186660747 | 186663692 | Homo_sapiens_3_186660542-186661888-HERVHF | + | ENSG00000113905 |
| 4 | 3927445 | 3930682 | Homo_sapiens_4_3929901-3931242-HERVHF | + | NONHSAG087348.2 |
| 4 | 17000545 | 17003928 | Homo_sapiens_4_17000127-17001778-HERVHF | + | NONHSAG037572.2 |
| 4 | 24500975 | 24501427 | Homo_sapiens_4_24503534-24505060-HERVHF | + | NONHSAG037630.2 |
| 4 | 27974874 | 27981319 | Homo_sapiens_4_27976550-27977552-HERVHF | + | NONHSAG037691.2 |
| 4 | 92271492 | 92275299 | Homo_sapiens_4_92273363-92274770-HERVHF | - | ENSG00000249152 |
| 4 | 103553770 | 103557353 | Homo_sapiens_4_103555460-103556971-HERVHF | - | ENSG00000250920 |
| 4 | 128640901 | 128644450 | Homo_sapiens_4_128642726-128644003-HERVHF | - | NONHSAG088517.2 |
| 4 | 145698823 | 145703505 | Homo_sapiens_4_145701612-145702617-HERVHF | + | ENSG00000237136 |
| 4 | 152741345 | 152747172 | Homo_sapiens_4_152743196-152744540-HERVHF | - | NONHSAG039129.2 |
| 4 | 175461163 | 175467003 | Homo_sapiens_4_175463047-175464647-HERVHF | - | ENSG00000249945 |
| 5 | 92826033 | 92829706 | Homo_sapiens_5_92826486-92827829-HERVHF | + | ENSG00000248588 |
| 5 | 136303790 | 136307028 | Homo_sapiens_5_136303833-136305180-HERVHF | + | ENSG00000250947 |
| 5 | 161245405 | 161254586 | Homo_sapiens_5_161251016-161252646-HERVHF | + | NONHSAG090654.1 |
| 6 | 16259010 | 16264893 | Homo_sapiens_6_16260854-16262201-HERVHF | - | ENSG00000282024 |
| 6 | 18754142 | 18756902 | Homo_sapiens_6_18755932-18757277-HERVHF | - | NONHSAG043117.2 |
| 6 | 80509795 | 80515805 | Homo_sapiens_6_80511941-80513513-HERVHF | - | NONHSAG113295.1 |
| 6 | 94553917 | 94559610 | Homo_sapiens_6_94555806-94557152-HERVHF | - | NONHSAG044390.2 |
| 6 | 97779489 | 97785327 | Homo_sapiens_6_97782122-97783636-HERVHF | + | ENSG00000271860 |
| 6 | 123582333 | 123588007 | Homo_sapiens_6_123584156-123585562-HERVHF | - | ENSG00000186439 |
| 6 | 125701846 | 125707764 | Homo_sapiens_6_125703727-125705069-HERVHF | - | ENSG00000237742 |
| 6 | 126851224 | 126854456 | Homo_sapiens_6_126851273-126852794-HERVHF | + | NONHSAG044785.3 |
| 6 | 131295347 | 131301206 | Homo_sapiens_6_131297975-131299503-HERVHF | + | NONHSAG093612.2 |
| 6 | 131338799 | 131344566 | Homo_sapiens_6_131340338-131342739-HERVHF | + | NONHSAG093612.2 |
| 6 | 131903830 | 131907420 | Homo_sapiens_6_131904209-131905555-HERVHF | + | ENSG00000236673 |
| 6 | 144923164 | 144928866 | Homo_sapiens_6_144925698-144927040-HERVHF | + | NONHSAG095837.2 |
| 7 | 26024199 | 26029809 | Homo_sapiens_7_26026061-26027405-HERVHF | - | NONHSAG047156.2 |
| 7 | 34300132 | 34300573 | Homo_sapiens_7_34301985-34303122-HERVHF | - | NONHSAG047318.2 |
| 7 | 102975230 | 102978736 | Homo_sapiens_7_102976263-102977254-HERVHF | + | ENSG00000230257 |
| 7 | 125920130 | 125924112 | Homo_sapiens_7_125920071-125921895-HERVHF | + | ENSG00000197462 |
| 7 | 155238821 | 155244070 | Homo_sapiens_7_155240740-155241657-HERVK | - | NONHSAG049243.2 |
| 8 | 71676972 | 71680514 | Homo_sapiens_8_71677433-71678968-HERVHF | + | ENSG00000254277 |
| 8 | 90090224 | 90093794 | Homo_sapiens_8_90091914-90093348-HERVHF | - | ENSG00000104327 |
| 8 | 97200769 | 97206658 | Homo_sapiens_8_97202388-97204973-HERVHF | + | NONHSAG098987.1 |
| 8 | 114284546 | 114287727 | Homo_sapiens_8_114284508-114286044-HERVHF | + | ENSG00000254339 |
| 8 | 132080235 | 132086002 | Homo_sapiens_8_132081909-132083445-HERVHF | - | ENSG00000132297 |
| 9 | 12950832 | 12954130 | Homo_sapiens_9_12950845-12952399-HERVHF | + | NONHSAG101172.2 |
| 9 | 80137297 | 80143055 | Homo_sapiens_9_80139873-80141469-HERVHF | + | NONHSAG052646.2 |
| 9 | 85461120 | 85466955 | Homo_sapiens_9_85461166-85462902-HERVHF | + | NONHSAG052703.2 |
| 9 | 115475420 | 115478923 | Homo_sapiens_9_115475976-115477349-HERVHF | + | NONHSAG053288.3 |
| 10 | 6797081 | 6802954 | Homo_sapiens_10_6798770-6800364-HERVHF | - | NONHSAG005151.3 |
| 10 | 25716420 | 25722928 | Homo_sapiens_10_25718978-25720776-HERVHF | + | ENSG00000280809 |
| 11 | 6366039 | 6371662 | Homo_sapiens_11_6368276-6369885-HERVHF | + | NONHSAG007525.2 |
| 11 | 27629072 | 27632889 | Homo_sapiens_11_27630864-27632291-HERVHF | - | ENSG00000254934 |
| 11 | 94641661 | 94647315 | Homo_sapiens_11_94644134-94645475-HERVHF | + | ENSG00000255666 |
| 11 | 96499960 | 96506627 | Homo_sapiens_11_96501724-96503654-HERVHF | + | ENSG00000183340 |
| 11 | 96590439 | 96593677 | Homo_sapiens_11_96590449-96591982-HERVHF | + | ENSG00000254587 |
| 11 | 130565609 | 130570121 | Homo_sapiens_11_130566060-130567548-HERVHF | - | NONHSAG010050.2 |
| 11 | 130753494 | 130756702 | Homo_sapiens_11_130755373-130756704-HERVHF | - | NONHSAG010050.2 |
| 12 | 4018623 | 4023691 | Homo_sapiens_12_4021109-4022208-HERVHF | + | ENSG00000256969 |
| 12 | 11462168 | 11468022 | Homo_sapiens_12_11463877-11465381-HERVHF | - | ENSG00000121335 |
| 12 | 34269097 | 34274242 | Homo_sapiens_12_34268101-34269869-HERVHF | + | NONHSAG010874.2 |
| 12 | 70444894 | 70450107 | Homo_sapiens_12_70446553-70447732-HERVK | - | NONHSAG011664.2 |
| 12 | 86941530 | 86944748 | Homo_sapiens_12_86941432-86943069-HERVHF | + | NONHSAG064903.1 |
| 13 | 42868001 | 42871007 | Homo_sapiens_13_42869513-42870545-HERVHF | - | NONHSAG013351.2 |
| 13 | 48866771 | 48872457 | Homo_sapiens_13_48868391-48870330-HERVHF | - | NONHSAG067525.1 |
| 13 | 51169866 | 51175008 | Homo_sapiens_13_51172521-51173517-HERVHF | + | NONHSAG013541.3 |
| 13 | 54127417 | 54133159 | Homo_sapiens_13_54129305-54130960-HERVHF | - | ENSG00000234787 |
| 13 | 66142250 | 66147037 | Homo_sapiens_13_66143157-66144503-HERVHF | - | NONHSAG013698.2 |
| 13 | 79276611 | 79279830 | Homo_sapiens_13_79276654-79278001-HERVHF | + | NONHSAG067153.1 |
| 14 | 38193319 | 38196529 | Homo_sapiens_14_38193317-38194783-HERVHF | + | ENSG00000258649 |
| 14 | 41521426 | 41521883 | Homo_sapiens_14_41518469-41520184-HERVHF | + | NONHSAG014802.2 |
| 14 | 48262389 | 48263146 | Homo_sapiens_14_48256895-48258584-HERVHF | - | ENSG00000287492 |
| 15 | 74354141 | 74359786 | Homo_sapiens_15_74355867-74357936-HERVHF | + | ENSG00000260266 |
| 15 | 87831107 | 87837024 | Homo_sapiens_15_87833731-87835137-HERVHF | + | NONHSAG017784.2 |
| 16 | 60078536 | 60084582 | Homo_sapiens_16_60081354-60082700-HERVHF | + | NONHSAG071739.1 |
| 16 | 65229803 | 65233421 | Homo_sapiens_16_65231504-65233039-HERVHF | - | ENSG00000260834 |
| 18 | 28693028 | 28696068 | Homo_sapiens_18_28694974-28696314-HERVHF | - | NONHSAG075074.1 |
| 18 | 56417745 | 56421344 | Homo_sapiens_18_56418118-56419466-HERVHF | + | NONHSAG074828.1 |
| 18 | 57064647 | 57070296 | Homo_sapiens_18_57068491-57069834-HERVHF | - | ENSG00000258609 |
| 18 | 73327171 | 73330369 | Homo_sapiens_18_73327166-73328509-HERVHF | + | ENSG00000261780 |
| 19 | 22568269 | 22575022 | Homo_sapiens_19_22570768-22572352-HERVHF | + | NONHSAG025320.2 |
| 20 | 12756027 | 12759632 | Homo_sapiens_20_12756400-12757916-HERVHF | + | NONHSAG031288.2 |
| 20 | 40269047 | 40274769 | Homo_sapiens_20_40271576-40272881-HERVHF | + | NONHSAG081519.1 |
| 21 | 17124024 | 17127764 | Homo_sapiens_21_17123959-17125734-HERVHF | + | NONHSAG110806.1 |
| 21 | 26227947 | 26233485 | Homo_sapiens_21_26229594-26231104-HERVHF | - | NONHSAG032575.2 |
| 21 | 42800845 | 42803999 | Homo_sapiens_21_42802518-42804296-HERVHF | - | NONHSAG083198.1 |
| X | 71264372 | 71272628 | Homo_sapiens_X_71266645-71268493-HERVHF | + | ENSG00000147140 |
| X | 94698818 | 94701832 | Homo_sapiens_X_94700559-94702091-HERVHF | - | NONHSAG054922.2 |
| X | 111543806 | 111549675 | Homo_sapiens_X_111546380-111547978-HERVHF | + | NONHSAG055109.3 |
| X | 122227333 | 122227787 | Homo_sapiens_X_122224556-122226109-HERVHF | + | NONHSAG055239.2 |
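The HERV names in Table 1 appear to encode the species, chromosome, coordinates, and classification of each element. The following is a minimal Python sketch, assuming the pattern `<Genus>_<species>_<chromosome>_<start>-<end>-<classification>` (inferred from the table entries, not stated by the authors), that parses such names and tallies them per chromosome and classification, in the spirit of the histogram in Figure 2E. The names `HervLocus`, `parse_herv_name`, and `tally_by_chromosome` are hypothetical helpers introduced here for illustration only.

```python
import re
from collections import Counter
from typing import Iterable, NamedTuple

# Assumed naming pattern, inferred from the entries of Table 1:
#   <Genus>_<species>_<chromosome>_<start>-<end>-<classification>
NAME_RE = re.compile(
    r"^(?P<species>[A-Za-z]+_[a-z]+)_(?P<chrom>[^_]+)_"
    r"(?P<start>\d+)-(?P<end>\d+)-(?P<group>\w+)$"
)


class HervLocus(NamedTuple):
    """Fields recovered from one HERV name (hypothetical container)."""
    species: str
    chrom: str
    start: int
    end: int
    group: str


def parse_herv_name(name: str) -> HervLocus:
    """Split a Table 1 style HERV name into its components."""
    match = NAME_RE.match(name)
    if match is None:
        raise ValueError(f"unrecognized HERV name: {name!r}")
    return HervLocus(
        match["species"],
        match["chrom"],
        int(match["start"]),
        int(match["end"]),
        match["group"],
    )


def tally_by_chromosome(names: Iterable[str]) -> Counter:
    """Count HERVs per (chromosome, classification) pair."""
    return Counter(
        (locus.chrom, locus.group) for locus in map(parse_herv_name, names)
    )


# A few names copied from Table 1.
example_names = [
    "Homo_sapiens_1_23000272-23002212-HERVHF",
    "Homo_sapiens_2_69792965-69796021-HUERSP",
    "Homo_sapiens_7_155240740-155241657-HERVK",
    "Homo_sapiens_X_71266645-71268493-HERVHF",
]
counts = tally_by_chromosome(example_names)
```

Applied to the full HERV name column, `tally_by_chromosome` would give per-chromosome counts by classification. The sketch does not reproduce the authors' criteria for matching F-HERVs to ncRNAs, which are described in the Materials and Methods.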
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Li, Y.; Zhang, G.; Cui, J. Origin and Deep Evolution of Human Endogenous Retroviruses in Pan-Primates. Viruses 2022, 14, 1370. https://doi.org/10.3390/v14071370

