Article

The Genome of the Steller Sea Lion (Eumetopias jubatus)

by Harwood H. Kwan 1,2, Luka Culibrk 1,3, Gregory A. Taylor 1, Sreeja Leelakumari 1, Ryan Tan 1, Shaun D. Jackman 1, Kane Tse 1, Tina MacLeod 1, Dean Cheng 1, Eric Chuah 1, Heather Kirk 1, Pawan Pandoh 1, Rebecca Carlsen 1, Yongjun Zhao 1, Andrew J. Mungall 1, Richard Moore 1, Inanc Birol 1,2, Marco A. Marra 1,2, David A.S. Rosen 4,5, Martin Haulena 5 and Steven J. M. Jones 1,2,6,*
1 Canada’s Michael Smith Genome Sciences Centre, British Columbia Cancer, Vancouver, BC V5Z-4S6, Canada
2 Department of Medical Genetics, University of British Columbia, Vancouver, BC V6T-1Z4, Canada
3 Department of Graduate Studies, Bioinformatics, University of British Columbia, Vancouver, BC V6T-1Z4, Canada
4 Institute for the Oceans and Fisheries, University of British Columbia, Vancouver, BC V6T-1Z4, Canada
5 Vancouver Aquarium, Vancouver, BC V6G 3E2, Canada
6 Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC V5A-1S6, Canada
* Author to whom correspondence should be addressed.
Genes 2019, 10(7), 486; https://doi.org/10.3390/genes10070486
Submission received: 15 April 2019 / Revised: 20 June 2019 / Accepted: 21 June 2019 / Published: 26 June 2019
(This article belongs to the Section Animal Genetics and Genomics)

Abstract

The Steller sea lion is the largest member of the Otariidae family and is found in the coastal waters of the northern Pacific Rim. Here, we present the Steller sea lion genome, determined through DNA sequencing approaches that utilized microfluidic partitioning library construction, as well as nanopore technologies. These methods constructed a highly contiguous assembly with a scaffold N50 length of over 14 megabases, a contig N50 length of over 242 kilobases and a total length of 2.404 gigabases. As a measure of completeness, 95.1% of 4104 highly conserved mammalian genes were found to be complete within the assembly. Further annotation identified 19,668 protein coding genes. The assembled genome sequence and underlying sequence data can be found at the National Center for Biotechnology Information (NCBI) under the BioProject accession number PRJNA475770.

1. Introduction

Steller sea lions (Eumetopias jubatus) inhabit the coastal waters of the subarctic and are mainly found in the northern Pacific Rim, stretching from central California to northern Japan [1]. Genetic analyses have identified three distinct Steller sea lion populations: the eastern, western and Asian populations [2,3]. The Steller sea lion is the largest member of the sea lion and fur seal family, Otariidae [1,4], and among pinnipeds is exceeded in size only by the walrus and the northern and southern elephant seals. Like other otariids, the Steller sea lion is amphibious, hauling out onto land to reproduce, rear pups, molt and rest, while its time in the water is spent feeding on a wide range of fish as well as cephalopods [5,6]. Despite being top-level carnivores, Steller sea lions are still susceptible to predation, primarily by larger aquatic species such as killer whales (orcas) and sharks [7].
Significant population declines in the 1970s and 1980s eliminated more than 80% of the Steller sea lion population, leading to the species being listed as “threatened”, with the western regional populations being reclassified as “endangered” in 1997 [1]. Although the exact reason for the population decline has yet to be determined, many suspected causes have been identified, including overfishing of fish stocks in the Gulf of Alaska; increased predation by orcas and sharks; indirect effects of climate change; the effects of diseases, parasites and contaminants; and hunting by humans [8,9,10].
Recent research has explored the genetic basis of mammalian adaptation to the marine environment, revealing a multitude of genes with accelerated evolutionary rates in marine mammalian lineages [11]. These genes appear to be enriched in pathways that control many functional adaptations for marine animals, including sensory systems, muscle physiology and lipid metabolism [11]. An increase in the number of mammalian species with complete genetic information may aid in the progression and confirmation of these studies.
Here we present the genomic sequence and gene annotation resources for the Steller sea lion. This assembly will assist in the conservation process of Steller sea lions, as well as contribute to the comparative genomic analysis of marine mammals, aiding in our understanding of how marine animals may have adapted in their transition back to the water.

2. Materials and Methods

The genome was assembled from a 10x Genomics Chromium microfluidic partitioned genomic DNA (gDNA) library and was subsequently modified and scaffolded using a nanopore library. Sequencing was performed using an Illumina HiSeqX (Illumina, San Diego, CA, USA) instrument and a MinION sequencer (Oxford Nanopore Technologies (ONT), Oxford, UK) at Canada’s Michael Smith Genome Sciences Centre, BC Cancer. The animal under study was a female Steller sea lion (ISIS/GAN: 26980396). She was born in the wild in British Columbia, Canada in 2000 and brought to the aquarium as a pup. She has lived at the Vancouver Aquarium ever since. The Vancouver Aquarium is accredited by the American Association of Zoos and Aquariums (AZA). Peripheral blood was drawn as part of a routine veterinary preventative health care plan carried out under the AZA credentialing requirements. Surplus blood was used for DNA sequencing.
The microfluidic partitioned library was constructed as follows: High molecular weight (HMW) gDNA was extracted from the peripheral blood with the QIAGEN MagAttract HMW DNA Kit (QIAGEN, Germantown, MD, USA), using the HMW gDNA extraction protocol as detailed in the Chromium Genome Reagent Kits Version 2 User Guide (PN-120229) (10x Genomics, Pleasanton, CA, USA). Quality of the gDNA was then assessed using pulsed-field gel electrophoresis (PFGE). Gel Bead-In-Emulsions (GEMs) were then created from a library of Genome Gel Beads combined with 1 ng of gDNA, Master Mix, and partitioning oil, using the 10x Genomics Chromium Controller instrument with a micro-fluidic Genome chip (PN-120216). The GEMs were then subjected to an isothermal amplification step. Bar-coded DNA fragments were extracted and underwent Illumina library construction, as detailed in the Chromium Genome Reagent Kits Version 2 User Guide (PN-120229). Library yield was measured through quantitative PCR (qPCR). Library fragment size and distribution were measured using an Agilent 2100 Bioanalyzer DNA 1000 chip (Santa Clara, CA, USA), with a target fragment size of 500 bp. The library was then run on an Illumina HiSeqX sequencer (Illumina) with a paired-end protocol to produce 150 bp reads for downstream genome construction. This resulted in 797.9 million 150 bp reads, which corresponded to an estimated 34.65-fold genome coverage. Of the reads, 56.02% were nonduplicates and could be phased, while 10.63% were duplicates. The estimated weighted mean molecule length for the library was 38.03 Kb.
The nanopore library was constructed as follows: high-quality gDNA was extracted as previously described. The gDNA was subjected to size selection and sequencing preparation using the Library Loading Bead Kit R9 (ONT) (EXP-LLB001). Adapters were added using the Ligation Sequencing Kit 1DR9 (ONT) (SQK-LSK108). The MinION flow cell was prepared with the Flow Cell Priming Kit (ONT) (EXP-FLP001). The prepared gDNA was loaded onto the MinION flow cell and was run in the MinION sequencer to produce long reads. The sequencing resulted in around 877 thousand reads containing 9 billion sequenced bases, with a read N50 of 22.7 Kbp, corresponding to approximately 4-fold genome coverage.
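As a rough cross-check of the depth figures quoted for the two libraries, raw fold coverage can be computed as total sequenced bases divided by genome length. The sketch below uses the final assembly length reported in this article as the denominator, which is an assumption (the true genome size may differ), and the function name is hypothetical. The raw linked-read figure comes out higher than the 34.65-fold estimate quoted above; that estimate is presumably an effective coverage that discounts some reads, although the article does not state how it was derived.

```python
# Raw sequencing depth = total sequenced bases / genome length.
# Assumption: the final assembly length stands in for the genome size.
ASSEMBLY_BP = 2_404_049_571


def fold_coverage(total_bases: float, genome_bp: float = ASSEMBLY_BP) -> float:
    """Return raw fold coverage for a given number of sequenced bases."""
    if genome_bp <= 0:
        raise ValueError("genome length must be positive")
    return total_bases / genome_bp


# Linked-read library: 797.9 million reads of 150 bp each.
linked_read_bases = 797.9e6 * 150
# Nanopore library: about 9 billion bases (a rounded figure in the text).
nanopore_bases = 9e9

linked_cov = fold_coverage(linked_read_bases)
nanopore_cov = fold_coverage(nanopore_bases)

print(f"linked reads: {linked_cov:.1f}x raw")
print(f"nanopore:     {nanopore_cov:.1f}x raw")
```

The nanopore value rounds to the approximately 4-fold coverage stated in the text.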
Genome assembly was performed on the paired-end sequence reads from the partitioned library using Supernova (version 2.1.1, 10x Genomics, San Francisco, CA, USA). The initial Supernova assembly was 2.404 Gbp with a genomic scaffold N50 length of 41.76 Mbp and a contig N50 length of 174.4 Kbp (Table 1). The phase block N50 length of the initial Supernova assembly was 426.69 Kbp. Improvements to the initial assembly were made by correcting misassemblies, re-scaffolding through the incorporation of the nanopore long read data, and by gap filling.
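The N50 statistics quoted throughout follow the usual definition: the length of the sequence at which the running total, taken from longest to shortest, first reaches half of the total assembly length. The following is a minimal sketch of that definition with made-up toy lengths. The function names are hypothetical, and the assembly tools used here may apply their own conventions (for example, minimum length cut-offs).

```python
from typing import Iterable, Tuple


def n50_l50(lengths: Iterable[int]) -> Tuple[int, int]:
    """Return (N50, L50) for a collection of sequence lengths.

    N50 is the length of the sequence at which the cumulative sum,
    taken from longest to shortest, first reaches half the total.
    L50 is the number of sequences needed to reach that point.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("no lengths given")
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            return length, count
    raise AssertionError("unreachable for non-empty input")


# Toy example (not data from this study): total is 300, half is 150.
toy = [80, 70, 50, 40, 30, 20, 10]
toy_n50, toy_l50 = n50_l50(toy)
print(toy_n50, toy_l50)  # 70 2
```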
Briefly, misassemblies in the genome were identified and corrected using Tigmint (version 1.1.2, Canada’s Michael Smith Genome Sciences Centre; parameters -span at 20; -window at 1000; -minsize at 2000; -as at 0.65; -nm at 5; -dist at 50,000; -mapq at 0; -trim at 0; -t at 8) [12]. Altogether, 625 cuts were made by Tigmint, reducing the scaffold N50 length to 13.55 Mbp. Scaffolding was then performed using the Assembly Roundup by Chromium Scaffolding algorithm (ARCS) (version 1.0.1, Canada’s Michael Smith Genome Sciences Centre; parameters -c at 5; -e at 30000; -r at 0.05) [13], which improves the contiguity of the assembly through organization using the 10x Genomics linked reads. Further scaffolding on the assembly was done with LINKS (version 1.8.5, Canada’s Michael Smith Genome Sciences Centre; parameters -d at 4000; -k at 15; -e at 0.1; -l at 5; -a at 0.3; -t at 5; -o at 0; -z at 500; -p at 0.001; -x at 0) [14], using information from the uncorrected nanopore long reads to assist in the scaffolding. Altogether, the scaffolding steps improved the scaffold N50 length to 14.02 Mbp. Gaps were filled with Sealer (version 2.0.2, Canada’s Michael Smith Genome Sciences Centre; parameters -k at 90 to 120, step 10) [15], using the Illumina reads from the Chromium library to populate the Bloom filter. The nanopore reads were not utilized for Sealer as they have a high sequencing error rate relative to the Illumina reads. Altogether, Sealer closed a total of 6681 gaps, improving the contig N50 to 242.2 Kbp.
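To make the relationship between gaps, contigs and scaffolds concrete, the sketch below treats each run of N characters inside a scaffold as one gap and the stretches between gaps as contigs. This is only bookkeeping on toy sequences; it is not how Sealer closes gaps, and the helper names are hypothetical. It also checks that the 6681 closed gaps agree with the gap counts listed in Table 1 for the ARCS/LINKS and Sealer stages.

```python
import re
from typing import List

# One gap = one uninterrupted run of N (or n) characters.
GAP = re.compile(r"[Nn]+")


def split_scaffold(seq: str) -> List[str]:
    """Split a scaffold into contigs at every run of Ns."""
    return [contig for contig in GAP.split(seq) if contig]


def count_gaps(seq: str) -> int:
    """Count internal gaps, ignoring any leading or trailing Ns."""
    return len(GAP.findall(seq.strip("Nn")))


# Toy scaffold (not real data): three contigs joined by two gaps.
before = "ACGT" + "N" * 10 + "GGCC" + "N" * 5 + "TTAA"
# The same scaffold after the first gap is replaced by real sequence.
after = "ACGT" + "ACGTAC" + "GGCC" + "N" * 5 + "TTAA"

print(count_gaps(before), len(split_scaffold(before)))  # 2 3
print(count_gaps(after), len(split_scaffold(after)))    # 1 2

# Gap counts from Table 1: ARCS/LINKS stage minus Sealer stage.
gaps_closed = 24_145 - 17_464
print(gaps_closed)  # 6681
```

Closing a gap merges two neighbouring contigs into one longer contig, which is why gap filling raises the contig N50 while leaving the scaffold N50 unchanged.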
Benchmarking Universal Single-Copy Orthologs (BUSCO) [16], a program which attempts to reconstruct a set of conserved mammalian genes in a genome, was used as an assessment for genome completeness after every step of the assembly (Table 1). The reconstruction rate of BUSCO genes improved slightly after each stage of the assembly process, with 3904 genes identified in the assembly from a set of 4104 highly conserved mammalian genes. An additional 102 genes were present but in a fragmented state.
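The completeness percentages follow directly from the counts given above. As a small worked check, the sketch below converts the complete and fragmented counts into percentages of the 4104-gene set. The missing count is derived here by subtraction and is not a figure reported in the article; it assumes that every gene is either complete, fragmented or missing.

```python
# Counts quoted in the text for the final assembly.
TOTAL_GENES = 4104
COMPLETE = 3904
FRAGMENTED = 102

# Derived, not reported: assumes the three categories partition the set.
missing = TOTAL_GENES - COMPLETE - FRAGMENTED


def pct(count: int, total: int = TOTAL_GENES) -> float:
    """Return count as a percentage of the gene set."""
    return 100.0 * count / total


print(f"complete:   {pct(COMPLETE):.1f}%")
print(f"fragmented: {pct(FRAGMENTED):.1f}%")
print(f"missing:    {pct(missing):.1f}% ({missing} genes)")
```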
The genome was annotated using the NCBI Eukaryotic Genome Annotation Pipeline and considered of sufficient quality to become a RefSeq reference genome [17]. Within the genome, 19,668 coding genes were identified, along with 3786 non-coding genes, 6814 pseudogenes, and 69 immunoglobulin/T-cell receptor gene segments.

3. Results and Discussion

The final assembled Eumetopias jubatus genome consisted of 2,404,049,571 sequenced bases with a scaffold N50 of 14.02 Mbp, representing a good overview of the 18 chromosome pairs in the Steller sea lion [18]. Examination of the predicted Steller sea lion genes revealed that 25 of the top 100 frequently lost genes in marine mammals identified by Chikina et al. [11] were missing or were severely altered (by frameshift or truncation), resulting in a nonfunctional protein. The missing or altered genes were: SSTR4, PDE1C, GRIN3B, PDEC1, LHFP, TMEM235, ASIC4, CYP3A34, PRSS3, FMO1, C1QTNF8, GSTM4, KRTAP12-3, CALML3, OR51I2, OR51I1, OR10Z1, OR5C1, OR4E2, OR6K6, OR4S1, OR51V1, OR52K1, OR10G3, OR1I1. The Steller sea lion genome also contained a homozygous lesion in PON1, a gene found to have accrued deleterious lesions in all marine mammal lineages, while remaining functional in terrestrial mammals [19]. As the majority of these genes are olfactory and gustatory, their loss helps to solidify evidence that these genes are marine-accelerated, and that reductions in the sense of taste and smell are ubiquitous in marine mammals (despite their importance in social interactions) [11,20]. Additionally, the loss of PON1 may be crucial in the future conservation of the species, due to the role the gene plays in mammals in the defense against neurotoxicity from specific man-made organophosphorus compounds [19].
The closest relative to the Steller sea lion with a sequenced genome is the California sea lion (Zalophus californianus) [21]. Both species have a diploid karyotype with 18 chromosome pairs [18]. Alignment of the Steller sea lion assembly to the California sea lion assembly with the Burrows-Wheeler Aligner MEM algorithm (BWA-MEM) (version 0.7.17) [22] yielded a variant rate of 1 variant in every 183 bases, indicating approximately 99.5% sequence similarity between the two genomes. An overview of assembly statistics of both genomes is presented in Table 2, where the Steller sea lion assembly is shown to be more contiguous at the contig level, while Dovetail Hi-C scaffolding of the California sea lion yielded longer scaffolds. An examination with BUSCO on both genomes using the 4104 gene mammalian dataset suggests similar completeness in both genomes, with a slight advantage to the California sea lion.
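The similarity figure can be reproduced from the variant rate with one line of arithmetic. The sketch below assumes that each variant counts as a single differing position and ignores indel lengths and unaligned sequence, so it is an approximation rather than a formal identity measure. The function name is hypothetical.

```python
def identity_from_variant_rate(bases_per_variant: float) -> float:
    """Approximate identity given one variant per `bases_per_variant` bases."""
    if bases_per_variant <= 0:
        raise ValueError("bases_per_variant must be positive")
    return 1.0 - 1.0 / bases_per_variant


# One variant in every 183 aligned bases, as reported above.
identity = identity_from_variant_rate(183)
print(f"{identity:.2%}")  # 99.45%
```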
The alignment of the Steller sea lion assembly to the California sea lion assembly through BWA-MEM (version 0.7.17) [22] was subsequently visualized as a Jupiter plot (Figure 1) [23], a Circos [24] based genome assembly consistency plot used to view large scale translocations or other large structural variations. The connecting bands within the circle represent regions of synteny, whereas the blocks on the arc of the circle represent the largest scaffolds in the assembly. The lack of diagonal lines extending from the middle of the scaffold block suggests there are no definite breaks in the synteny between the two assemblies at 10 Kb resolution.
Both assemblies have been annotated by the NCBI Eukaryotic Genome Annotation Pipeline [17]. A comparison of the annotation statistics is presented in Table 3. While the California sea lion appears to have a higher identified gene count, the majority of this difference can be attributed to regions predicted to be non-coding genes. Variant calling using samtools [25] and snpEff [26] revealed 91,797 coding variant changes between the two species, with 33,860 missense variants and 44,597 synonymous variants. The genes containing the most variants are: AHNAK2 (424 SNVs), a zinc-finger protein-like gene (386 SNVs), a basic proline-rich protein-like gene (239 SNVs), THAP12 (211 SNVs), VWA2 (200 SNVs), AKAP17A (154 SNVs), PPFIA3 (173 SNVs), FLG2 (230 SNVs), GLYCTK (189 SNVs), a SPATA31D3-like gene (159 SNVs), and RCOR2 (155 SNVs). In-depth phylogenetic assessment will be needed to determine the genetic differences responsible for speciation between the two sea lions.
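The statement that non-coding genes account for most of the difference in gene count can be checked against the Table 3 values. The following sketch tabulates the per-category difference between the two annotations and confirms that the categories add up to the reported totals. The dictionary keys are labels chosen here for illustration.

```python
# Gene counts per category, copied from Table 3 of this article.
CATEGORIES = ("protein_coding", "non_coding", "pseudogenes", "ig_tcr_segments")

steller = {"total": 30_336, "protein_coding": 19_668, "non_coding": 3_786,
           "pseudogenes": 6_814, "ig_tcr_segments": 68}
california = {"total": 32_113, "protein_coding": 19_617, "non_coding": 5_644,
              "pseudogenes": 6_785, "ig_tcr_segments": 67}

# Sanity check: the four categories should sum to each reported total.
for name, counts in (("Steller", steller), ("California", california)):
    assert sum(counts[c] for c in CATEGORIES) == counts["total"], name

# Difference (California minus Steller) for each category and overall.
diff = {c: california[c] - steller[c] for c in CATEGORIES}
total_diff = california["total"] - steller["total"]

for category in CATEGORIES:
    print(f"{category:16s} {diff[category]:+6d}")
print(f"{'total':16s} {total_diff:+6d}")
```

The non-coding difference (1858) slightly exceeds the overall difference (1777) because the other three categories are marginally smaller in the California sea lion annotation.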
The Steller sea lion assembly shows that microfluidic partitioned libraries greatly improve assembly of complete genomes. Additional information from nanopore reads is also shown to improve scaffolding, resulting in a high-quality reference genome from multiple sources of genomic sequence. A reference Steller sea lion genome may assist the understanding of the genetic effects of population decline, and ultimately aid in the conservation process. Additionally, the genome, alongside the California sea lion reference genome, can serve as a strong starting point for evolutionary studies regarding the divergence of sea lions from other pinnipeds.

Author Contributions

Conceptualization, H.H.K., L.C., G.A.T. and S.J.M.J.; Data curation, H.H.K., R.T., K.T., D.C., E.C., R.C. and R.M.; formal analysis, H.H.K.; funding acquisition, M.A.M. and S.J.M.J.; investigation, H.H.K., S.L., T.M., H.K., D.A.S.R. and M.H.; methodology, H.H.K., L.C., G.A.T., S.D.J., P.P. and S.J.M.J.; project administration, G.A.T. and S.J.M.J.; resources, M.A.M., M.H. and S.J.M.J.; software, H.H.K., L.C., G.A.T., R.T., S.D.J. and I.B.; supervision, K.T., D.C., E.C., P.P., R.C., Y.Z., A.J.M., R.M., I.B., M.A.M., M.H. and S.J.M.J.; validation, H.H.K., L.C. and G.A.T.; Visualization, H.H.K.; writing—original draft, H.H.K., L.C., G.A.T. and S.J.M.J.; writing—review and editing, H.H.K., L.C., G.A.T., S.L., R.T., S.D.J., K.T., T.M., D.C., E.C., H.K., P.P., R.C., Y.Z., A.J.M., R.M., I.B., M.A.M., D.A.S.R., M.H. and S.J.M.J.

Funding

The sequencing of the Steller sea lion genome was funded through the CanSeq150 program of Canada’s Genomics Enterprise (www.cgen.ca).

Acknowledgments

We would like to thank Genome Canada for their support, and the BC Cancer Foundation and Canada’s Michael Smith Genome Sciences Centre for their contribution to infrastructure and operations.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Pendleton, G.W.; Pitcher, K.W.; Fritz, L.W.; York, A.E.; Raum-Suryan, K.L.; Loughlin, T.R.; Calkins, D.G.; Hastings, K.K.; Gelatt, T.S. Survival of Steller sea lions in Alaska: A comparison of increasing and decreasing populations. Can. J. Zool. 2006, 84, 1163–1172.
  2. Bickham, J.W.; Patton, J.C.; Loughlin, T.R. High variability for control-region sequences in a marine mammal: Implications for conservation and biogeography of Steller sea lions (Eumetopias jubatus). J. Mammal. 1996, 77, 95–108.
  3. Baker, A.R.; Matson, C.W.; Trujillo, R.G.; Bickham, J.W.; Loughlin, T.R.; Burkanov, V.; Calkins, D.G.; Wickliffe, J.K. Variation of mitochondrial control region sequences of Steller sea lions: The three-stock hypothesis. J. Mammal. 2005, 86, 1075–1084.
  4. Perez, M.A.; Merrick, R.L.; Loughlin, T.R. Eumetopias jubatus. Mamm. Species 1987, 283, 1–7.
  5. Fiscus, C.H.; Baines, G.A. Food and feeding behavior of Steller and California sea lions. J. Mammal. 1966, 47, 195–200.
  6. Sinclair, E.H.; Zeppelin, T.K. Seasonal and spatial differences in diet in the western stock of Steller sea lions (Eumetopias Jubatus). J. Mammal. 2002, 83, 973–990.
  7. Gelatt, T.; Sweeny, K. The IUCN Red List of Threatened Species. Available online: https://www.iucnredlist.org/species/8239/45225749 (accessed on 19 February 2019).
  8. Loughlin, T.; York, A. An accounting of the sources of Steller sea lion, Eumetopias jubatus, Mortality. Mar. Fish. Rev. 2000, 62, 40–45.
  9. Springer, A.M.; Estes, J.A.; van Vliet, G.B.; Williams, T.M.; Doak, D.F.; Danner, E.M.; Forney, K.A.; Pfister, B. Sequential megafaunal collapse in the North Pacific Ocean: An ongoing legacy of industrial whaling? Proc. Natl. Acad. Sci. USA 2003, 100, 12223–12228.
  10. Fritz, L.W.; Hinckley, S. A critical review of the regime shift-“junk food”-nutritional stress hypothesis for the decline of the western stock of Steller sea lion. Mar. Mammal Sci. 2005, 21, 476–518.
  11. Chikina, M.; Clark, N.L.; Robinson, J.D. Hundreds of genes experienced convergent shifts in selective pressure in marine mammals. Mol. Biol. Evol. 2016, 33, 2182–2192.
  12. Jackman, S.D.; Coombe, L.; Chu, J.; Warren, R.L.; Vandervalk, B.P.; Yeo, S.; Xue, Z.; Mohamadi, H.; Bohlmann, J.; Jones, S.J.M.; et al. Tigmint: Correcting assembly errors using linked reads from large molecules. BMC Bioinform. 2018, 19, 393.
  13. Yeo, S.; Coombe, L.; Warren, R.L.; Chu, J.; Birol, I. ARCS: Scaffolding genome drafts with linked reads. Bioinformatics 2017, 34, 725–731.
  14. Warren, R.L.; Yang, C.; Vandervalk, B.P.; Behsaz, B.; Lagman, A.; Jones, S.J.M.; Birol, I. LINKS: Scalable, alignment-free scaffolding of draft genomes with long reads. GigaScience 2015, 4, 35.
  15. Paulino, D.; Warren, R.L.; Vandervalk, B.P.; Raymond, A.; Jackman, S.D.; Birol, I. Sealer: A scalable gap-closing application for finishing draft genomes. BMC Bioinform. 2015, 16, 230.
  16. Waterhouse, R.M.; Seppey, M.; Simão, F.A.; Manni, M.; Ioannidis, P.; Klioutchnikov, G.; Kriventseva, E.V.; Zdobnov, E.M. BUSCO Applications from quality assessments to gene prediction and phylogenomics. Mol. Biol. Evol. 2017, 35, 543–548.
  17. Pruitt, K.D.; Brown, G.R.; Hiatt, S.M.; Thibaud-Nissen, F.; Astashyn, A.; Ermolaeva, O.; Farrell, C.M.; Hart, J.; Landrum, M.J.; McGarvey, K.M.; et al. RefSeq: An update on mammalian reference sequences. Nucleic Acids Res. 2013, 42, D756–D763.
  18. Árnason, Ú. Comparative chromosome studies in Pinnipedia. Hereditas 1974, 76, 179–225.
  19. Meyer, W.K.; Jamison, J.; Richter, R.; Woods, S.E.; Partha, R.; Kowalczyk, A.; Kronk, C.; Chikina, M.; Bonde, R.K.; Crocker, D.E.; et al. Ancient convergent losses of Paraoxonase 1 yield potential risks for modern marine mammals. Science 2018, 361, 591.
  20. Pitcher, B.J.; Harcourt, R.G.; Schaal, B.; Charrier, I. Social olfaction in marine mammals: Wild female Australian sea lions can identify their pup’s scent. Biol. Lett. 2011, 7, 60–62.
  21. Peart, C.R.; Pophaly, S.D.; Breen, M.; Gulland, F.M.D.; Johnson, J.A.; Neely, B.A.; Wolf, J.B.W. Zalophus Californianus v.2.2. NCBI. Available online: https://www.ncbi.nlm.nih.gov/bioproject/511654 (accessed on 15 February 2019).
  22. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 2009, 25, 1754–1760.
  23. Chu, J. Jupiter Plot: A Circos-Based Tool To Visualize Genome Assembly Consistency (Version 1.0). Zenodo. Available online: https://zenodo.org/record/1241235#.XA92q2hKiUk (accessed on 21 February 2019).
  24. Krzywinski, M.; Schein, J.; Birol, I.; Connors, J.; Gascoyne, R.; Horsman, D.; Jones, S.J.; Marra, M.A. Circos: An information aesthetic for comparative genomics. Genome Res. 2009, 19, 1639–1645.
  25. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R.; 1000 Genome Project Data Processing Subgroup. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009, 25, 2078–2079.
  26. Cingolani, P.; Platts, A.; Wang, L.L.; Coon, M.; Nguyen, T.; Wang, L.; Land, S.J.; Lu, X.; Ruden, D.M. A Program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin) 2012, 6, 80–92.
Figure 1. A Jupiter plot illustrating the global genome alignment of the Steller sea lion genome (right) to the California sea lion genome (left). Alignment was accomplished with BWA-MEM. Connections within the circle represent alignment between the two assemblies. California sea lion scaffolds over 10 Mb in length were selected. The longest Steller sea lion scaffolds which sum to the same amount of sequence were also selected. Only alignments over 10 Kb in length are displayed.
Table 1. Comparison of assembly statistics for steps in the assembly process.
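| Assembly | Total Size (Gbp) | No. of Gaps | Contig N50 (Kbp) | No. of Scaffolds | Scaffold N50 (Mbp) | Longest Scaffold (Mbp) | BUSCO Complete Genes (of 4104) |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Supernova | 2.404 | 24,113 | 174.4 | 7238 | 41.76 | 146.1 | 95.0% (3899) |
| Tigmint | 2.404 | 24,086 | 174.4 | 7748 | 13.55 | 59.02 | 95.0% (3901) |
| ARCS/LINKS | 2.404 | 24,145 | 174.4 | 7689 | 14.02 | 59.02 | 95.1% (3903) |
| Sealer | 2.404 | 17,464 | 242.4 | 7689 | 14.02 | 59.02 | 95.1% (3904) |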
Table 2. Assembly statistics of the Steller sea lion and the California sea lion.
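| Assembly | Total Size (Gbp) | No. of Contigs | Contig N50 (Kbp) | Contig L50 | No. of Scaffolds | Scaffold N50 (Mbp) | Scaffold L50 | BUSCO Complete Genes (of 4104) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Steller sea lion | 2.404 | 24,747 | 242.4 | 2995 | 7472 | 14.02 | 54 | 95.1% (3904) |
| California sea lion | 2.367 | 57,871 | 97.7 | 7181 | 10,423 | 143.4 | 7 | 95.4% (3912) |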
Table 3. Annotation summary for the sea lion assemblies.
| Assembly | Total Count | Protein Coding | Non-Coding | Pseudogenes | Immunoglobulin/T-Cell Receptor Gene Segments |
| --- | --- | --- | --- | --- | --- |
| Steller sea lion | 30,336 | 19,668 | 3786 | 6814 | 68 |
| California sea lion | 32,113 | 19,617 | 5644 | 6785 | 67 |
