Next Article in Journal
New Species of Mallocybe and Pseudosperma from North China
Next Article in Special Issue
Kinesin Motors in the Filamentous Basidiomycetes in Light of the Schizophyllum commune Genome
Previous Article in Journal
Genomic and Experimental Analysis of the Insecticidal Factors Secreted by the Entomopathogenic Fungus Beauveria pseudobassiana RGM 2184
Previous Article in Special Issue
Establishment of an Efficient Polyethylene Glycol (PEG)-Mediated Transformation System in Pleurotus eryngii var. ferulae Using Comprehensive Optimization and Multiple Endogenous Promoters
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The First Whole Genome Sequence Discovery of the Devastating Fungus Arthrinium rasikravindrae

1
College of Forestry, Sichuan Agricultural University, Chengdu 611130, China
2
Department of Plant Pathology, College Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
*
Author to whom correspondence should be addressed.
J. Fungi 2022, 8(3), 255; https://doi.org/10.3390/jof8030255
Submission received: 27 January 2022 / Revised: 24 February 2022 / Accepted: 26 February 2022 / Published: 2 March 2022
(This article belongs to the Special Issue Genetics and Breeding of Basidiomycetes for Biotechnology)

Abstract

:
Devastating fungi are one of the most important biotic factors associated with numerous infectious diseases not only in plants but in animals and humans too. Arthrinium rasikravindrae a devastating fungus is responsible for severe infections in a large number of host plants all over the world. In the present study, we analyzed the whole genome sequence of devastating fungus A. rasikravindrae strain AQZ-20, using Illumina Technology from the Novogene Bio-informatics Co., Ltd. Beijing, China. To identify associated annotation results, various corresponding functional annotations databases were utilized. The genome size was 48.24 MB with an N90 (scaffolds) length of 2,184,859 bp and encoded putative genes were 11,101, respectively. In addition, we evaluated the comparative genomic analyses with 4 fungal strains of Ascomycetes. Two related species showed a strong correlation while others exhibited a weak correlation with the A. rasikravindrae AQZ-20 fungus. This study is a discovery of the genome-scale assembly, as well as annotation for A. rasikravindrae. The results obtained from the whole genome sequencing and genomic resources developed in this study will contribute significantly to genetic improvement applications against diseases caused by A. rasikravindrae. In addition, the phylogenetic tree, followed by genomic RNA, transcriptomic, proteomic, metabolic, as well as pathogenic data reported in current research will provide deep insight for further studies in the future.

1. Introduction

Fungi are the most destructive and diverse groups of pathogenic fungi responsible for many lethal diseases in plants, especially those belonging to the family Poaceae and Cyperaceae. Such fungi exist in nature as endo-parasites and are widely distributed all over the world [1,2]. As per the National Centre for Biotechnology Information (NCBI), the taxonomic classification of this fungus is under Fungi, Dikarya, Ascomycota, Saccharomyces, Pezizomycotina, Leotiomyceta, Sordariomyceta, Sordariomycetes, Xylariomycetidae, Xylariales, Apiosporaceae, Arthrinium, A. rasikravindrae and was first reported by Singh et al. [3] from Norway. Previously, A. rasikravindrae fungus was reported as A. phaeospermum and then re-determined as A. rasikravindrii but recently, it has orthographically been corrected to A. rasikravindrae. The current name is now widely recognized as well as accepted and also used in Mycobank (http://www.mycobank.org/ assessed on 16 August 2020) and Index Fungorum (http://www.indexfunorum.org assessed on 16 August 2020) [3,4]. Numerous associations were originating dead culms of Ph. aurea from Europe as well by [5]. To date, 31 different species of this genus have been reported from all over the world [6]. The data available in NCBI GenBank indicates that 20 entries have been mentioned so far based on the diseased samples collected from different hosts and geographical regions globally including Brazil, China, India, Japan, Netherlands, Norway, Thailand, and Switzerland, [7]. Many studies from China have shown that there exists a rich species diversity of Arthrinium in the bamboo (Bambusa vulgaris) plant [4,8]. Out of the total 31 Arthrinium species, 17 have been reported from host plants [4,8,9,10]. These hosts are widely distributed in nature and have never been comprehensively surveyed [11]. To date, 32 species of Arthrinium fungi have been reported [6], The Arthrinium genus inhabit a large number of hosts and substrates, including air, lichens, marine algae, plants, soil debris, and even human tissues [9,11].
This genus can be associated with phytopathogens, such as A. ovatum and A. hydei from Hong Kong, A. kogelbergense from South Africa, A. malaysianum and A. pseudospegazzinii from Malaysia, A. phragmites from Italy, A. pseudosinense from the Netherlands, A. xenocordella in Zimbabwe, A. arundinis in America, [12], A. sacchari in Canada and A. phaeospermum in China, respectively [12,13]. Based on the recent studies conducted on A. phaeospermum, only two cases have been noticed where A. phaeospermum acted as a causal agent of infectious diseases in human beings [14,15,16], however, many others are known to produce diverse bioactive compounds with a variety of pharmacological applications [17,18], In addition, some fungal species associated with genus Arthrinium are known to produce some toxic compounds that inhibit the disease-causing potential of a broad range of human pathogens including filamentous fungi, bacteria and yeasts [19]. The study of Li et al. [14] has found that A. phaeospermum produces growth-promoting substances in C. kobomugi [20]. Previous studies [21,22] have shown that a non-protein toxic compound mediates the physiological, biochemical, metabolic activities, and pathogenicity of A. phaeospermum, and is hence responsible for the devastation in hybrid bamboo tissues. Additionally, A. phaeospermum has also been reported to synthesize protein-based toxic compounds as well [23,24,25,26] and hybrid bamboo cells have been reported to contain a constricting position on the semi-permeable membrane [27]. It was also shown that proteinaceous toxic compounds damage the mitochondrial membrane, increase lipid peroxidation of mitochondria, disrupt membrane integrity, as well as inhibit respiration [28]. This shows the evidence that toxin protein played an essential part in the pathogenicity of A. phaeospermum [29]. Metabolic reports have illustrated that A. phaeospermum is involved in the synthesis of numerous biochemical compounds including tetrahydroxy anthraquinone pigment, succinic acid, ergosterol, phenols (C18O5) [30], gibberellic acid [31] 3-nitropropionic acid [32], and arthrichitin (C33H46N4O9), respectively [33]. For instance, the study of Shrestha et al. [34,35] revealed that A. phaeospermum possesses various bioactive enzymes that degrade plant cell wall constituents, including exo-cellulase, endo-nuclease, xylanase, ligninase, and β-glucosidase, which perform imperative functions in the host plant cell wall degradation [14]. However, no studies have reported beneficial or dangerous effects of A. rasikravindrae on living organisms. To date, the complete genome sequences of 178 human and 24 plant-related fungi have been obtained [14]. Out of these, various devastating pathogens including Ustilago maydis in 2003 [36], the draft genome sequence of Alternaria alternate in 2015 [37], Candida auris in 2021, Penicillium digitatum [38], Verticelium albo-atrum [39], Candida albicans [40], Arthrinium phaeospermum AP-Z13, and Diporthe capsici in 2020 have been successfully sequenced [14,41].
Previous studies have reported that various organisms have been utilized for genomic transformations as individuals or cells, such as the Cancer Genome Association and Arabidopsis Genome Project [42]. Significant efforts have been made in the past five years to improve assemblies of genomes, and the improvement of nearly complete genomes using long-read sequencing technologies [43]. Simultaneously, there are no whole genomic studies have been found of A. rasikravindrae worldwide.
The objectives of the current research are (1) to search the whole genome sequence of the A. rasikravindrae strain AQZ-20, (2) to evaluate the species-specific and extended gene families by performing comparative genomic analysis with the four Xylariales fungal strains classified under Ascomycetes. Our findings will provide enhanced insights for future detailed studies of A. rasikravindrae.

2. Materials and Methods

2.1. Fungal Material

In this research, the A. rasikravindrae strain was isolated from the bamboo (Lignania intermedia) shoots given by the Key Laboratory of Forest Conservation, Sichuan Agricultural University, Chengdu, China. Pure culture of A. rasikravindrae strain was inoculated on (PDA) medium in 90-mm Petri dishes and cultivated for seven days at 25 °C. Later, the collected fresh mycelia samples were sent to the NovaSeq Illumina PE150 Novogene Bioinformatics Technology Co. Ltd. Beijing to extract the DNA and construct the whole genome sequence of the strain A. rasikravindrae AQZ-20. Strain identifiers of comparative studies for each isolate, accession numbers to the genomes, Species, Naming in GenBank, Total Length (MB), GC (%), N50, Scaffolds, and Contigs are listed in Supplementary Table S1. The sequencing metrics for each genome were obtained from NCBI.

2.2. Extraction of Genome DNA, Library Construction, Sequencing, and Assembly

Extraction of DNA from the mycelia samples was performed using the SDS method [44]. The purity and integrity of the DNA was observed by Agarose gel-electrophoresis, while Qubit was utilized for quantification. Later, an assemblage for (SMRT) sequencing was created with an interruption size of 20 kb using the SMRTbell™ Template Prep kit, as well as its attribution was measured on the Qubit 2.0 Fluorometer (Thermo Scientific, Buildings 3 & 6 & 7, No. 27, Xinjinqiao Road, Pudong New Area, Shanghai, China). The size of the interruption was resolved using Agilent 2100 (Agilent Technologies, No. 3 Wangjing North Road, Chaoyang District, Beijing, China). Lastly, the whole genome of A. rasikravindrae was sequenced in exploitation by the PacBio Sequel platform [45]. The SMRT Link [41,46,47] was utilized to accomplish the construction of reads as well as to acquire the explorative consequences. The redundant sequences in the preliminary assembly outcomes were separated, as well as the reads were matched with the assembled sequence. By deceiving the GC content of the assembled sequence as well as the extended depth of reads, the GC bias, as well as recurrent sequences of the genome, were figured, as well as the part with assembly mistakes were chastised to acquire the best assembly results [41].

2.3. Gene Prediction and Genome Assembly

The whole-genome information for A. rasikravindrae was obtained using AUGUSTUS [48] as well as the homologous GeneWise software, based on the homologous protein sequence of A. phaeospermum (taken from NCBI) as the remarkable sequence. The two outcomes were then incorporated with EVM and confirmed using PASA in the second round to acquire the reasoning consequences for the coding genes. These aggregations were then used to screen the assembled genomes for repeats using Repeat Masker (RM) version open-4.0.7 [49], RM was utilized to predict the dispersed repeats sequence (DRs). Tandem Repeats (TR) Finder [50], was utilized to predict the TR sequence (TRs). For ribosomal RNA (rRNA) prevision, all the genome was investigated against rRNA sequences, while for the transfer RNA (tRNA), (tRNAs) genes were determined utilizing the tRNAscan-SEv 2.0 [51], as well as rRNAmmer [52] which was utilized to anticipate rRNAs, sRNAs, and in advance, microRNA (miRNA); small nucleolar (snRNA) genes were predicted using BLAST against the Rfam database [53,54].
The predictive genes protein sequences were equivalenced and functional annotations were assigned to the genes using functional databases, including (KOG), [55], (GO), (KEGG) database [56,57], NR Protein Database (NR), [58], P450, Pfam, Swiss-Prot, [59] and (TCDB), [60], were also used for functional annotation results. A whole-genome Blast [61] was executed against the preceding seven databases and the secretory proteins were foreseen using the SignalP [62] database whereas secondary metabolism gene clusters were predicted using antiSMASH [62]. Furthermore, (PHI) and fungal virulence factor database (DFVF) were utilized to examine and confirm the pathogenicity of the pathogens [63]. Carbohydrate-active enzymes were expected using the Carbohydrate-Active enzymes Database [64]. The genome sequence of strain A. rasikravindrae AQZ-20 was deposited in the GenBank database with the following accession numbers: JACVVL000000000. The assembly and sequenced genome raw data reported in this paper are subordinated with NCBI BioProject: PRJNA661692 and BioSample: SAMN16069862 within GenBank. The SRA accession number is SRR12768822. All the deposited data will be published online after article publication.

2.4. Comparative Genomic Analysis

To compare the genome data sets, the genome sequence of Arthrinium phaeospermum (accession number: QYRS00000000.1) [14] was used as the credited genome for A. rasikravindrae homological taxons. The genomes of Arthrinium malaysianum STlab-iicb (accession number: (QUSE00000000.1), Fusarium oxysporum FO2 [35] (accession number: (AAXH00000000.1) [65], and Fusarium proliferatum ET1 (accession number: (FJOF00000000), a pathogenic fungal species responsible for causing similar infections were used for comparison with genomic information obtained for A. rasikravindrae. The alignment of the genome among the representative genome as well as cited genome was evaluated using the MUMmer [66] while LASTZ [67,68,69] was utilized to conduct pairwise comparisons between A. rasikravindrae and the four reference genomes. The gene family was constructed with multiple software: Blast [70] and collinearity between the genomes was performed by MCScan X software, [71], whereas MEGA software was used for phylogenetic analysis, and the phylogenetic tree was built by the Neighbour- Joining method [9].

3. Results

3.1. Assembly of the Genome and Characteristics of Genome Analysis

The material used for whole-genome analysis was isolated from bamboo shoots. The whole-genome analysis of the Arthrinium rasikravindrae was carried out and the number of reads in raw offline information was obtained. The preliminary assembly results showed that A. rasikravindrae contained 33 contigs and the maximum length of the assembled contig was noticed as 4,830,709 bp. The N50 contig length was 2,356,085 bp and total contig length was 45,879,981 bp. Then, reads were straightened to the assembly sequence and the ultimate assembly results after alignment and assessment also showed similar results as noticed in the preliminary assembly. The assembled contig GC content was 52.66% (Table 1).

3.2. Gene Prediction

The analysis of genomic data predicted a total of 11,101 coding genes with an entire length of 45,874,955 bp, accounting for 31.73% of the total genome length. The coding genes mean length was noticed as 1311 bp (Table 2). Furthermore, we analyzed the genomic data for dispersed repeat sequences (DRs) as summarized in Table 2. The DRs were classified into six categories namely LTR, DNA, LINE, SINE, RC, and Unknown which accounted for 0.14%, 0.09%, 0.08%, 0.01%, 0.007% and 0.001% of the total genome with average length of 81, 74, 89, 96, 78 and 71 base pairs, respectively. The tandem repeats sequences (TRs), Minisatellite DNA, and Microsatellite DNA were recorded for 0.61%, 0.43% and 0.08% of the total genome with a repeat size of 1~456 bp, 10~60 bp, and 2~6 bp, respectively (Table 3). In addition, we also noticed RNAs in A. rasikravindrae as summarized in Table 4. The highest number of RNAs were recorded as tRNA (268), followed by snRNA (15), sRNA (2), and mi RNA (1), respectively.

3.3. Gene Annotation

To further verify the 11,101 coding genes and functional characterization, the gene annotation analysis was carried out using different databases including CAZy.CAZy is a database of Carbohydrate-Active enZYmes (CAZymes), DFVF, GO (Gene Ontology), KEGG (Kyoto Encyclopedia of Genes and Genomes), KOG (Eukaryotic Orthologous Groups of proteins), NR (Nuclear Receptor), P450, Pfam, PHI (The Pathogen-Host Interactions database (PHI-base), SwissProt, (http://www.expasy.ch/sprot, assessed on 4 September 2021), and TCDB (The Transporter Classification Database), respectively.
The gene annotation by the GO platform annotated 6792 coding genes in A. rasikravindrae, as categorized into three major classes, i.e., molecular function (11 sub-classes), cellular component (13 sub-classes), and biological process (25 sub-classes) as shown in Figure 1. Among the three classes, six functional entries were noticed with more than 2000 annotated genes. These entries included cellular process (3162 genes), metabolic process (3526 genes), cell (2351), cell part (2351), binding (3302), and catalytic activities (3560), respectively.
To further classify the function of identified coding genes in A. rasikravindrae, the KEGG pathway annotation was performed. The KEGG database annotated 8893 genes and classification graph of KEGG metabolic pathways has been shown in Figure 2. The biological pathways are divided into six categories, each of which is subdivided, and each category is labeled with the relevant information. Among all the categories, the highest number of annotated genes were noticed in metabolic pathways (2650), within which the carbohydrate metabolism exhibited the most abundant number of genes.
KOG database annotated about 2085 genes of functional categories based on the protein functions (Figure 3). The protein functions are primarily centered in the amino acid transport and metabolism (164 genes), energy production and conversion (179 genes), general function diction only (242 genes), post translation modification, protein turnover, and chaperones (249 genes), translation, ribosomal structure and biology (214 genes), and other aspects, respectively.

3.4. NR, Pfam, and Swiss-Prot Annotation Database

The NR database annotation is summarized in Supplementary Figure S1. Our results showed the top 20 species that exhibited matching genes with the annotated genes of A. rasikravindrae. The highest number of genes were matched with Pseudornasarlella vexala (2526), followed by Pestalotiopsis fici (1978) and Hypoxylon sp. (1041), respectively. However, the species Colletotrichum incanum showed the lowest number of matching genes, i.e., 32 in the NR annotation database. In addition, 197 genes were matched in fungal sp. (Supplementary Figure S1). The number of annotated genes in the Pfam and Swiss-prot database were 6792 and 3241, respectively.

3.5. TCBD Function Classification

We employed the TCBD platform for functional classification of transporters in A. rasikravindrae (Supplementary Figure S2). Transporters canister transport certain virulence factors or compounds concealed by pathogenic fungi once they contaminate hosts so that causal agents can contaminate particular plants more effortlessly; consequently, they are similarly deliberated as a class of pathogenic aspects in pathogenic fungi. The results showed that there were 539 genes annotated in the TCDB database and grouped in nine categories as shown in Supplementary Figure S3. The highest number of transporter genes were noted in electrochemical potential driven transporters (197 genes), monitored by primary active transporters (151), accessory factors involved in transport (33), transmembrane electron carriers (3), channels/pools (81), group transporters (5), and incompletely characterized transport systems (69), respectively (Supplementary Figure S3).

3.6. The Carbohydrate Enzyme Classification and Annotation

The CAZy database was employed to explore the carbohydrate enzyme classification and annotation in the genome of A. rasikravindrae (Supplementary Figure S4). This database mapped 624 genes and these genes were further grouped into six CAZy classes. The highest number of genes (309) were allocated in GH (glycoside hydrolases) class, followed by the CAZy class oxidoreductase Auxiliary Activities (AA), carbohydrates Carbohydrate-Binding Module (CBM), Glycosyltransferases (GT), Carbohydrate Esterases (CE), and Polysaccharide Lyase (PL), respectively, in which 113, 112, 107, 58 and 10 genes were assigned.

3.7. Secreted Proteins Prediction

Typically, many pathogenesis-related and unrelated secreted proteins can be predicted in genomes using bioinformatics and computer-based prediction algorithms, which help to elucidate the molecular mechanisms of pathogen-plant interactions. A total of 1326 signal peptide protein structures, 2271 proteins with transmembrane structures, and 1081 secreted proteins were predicted in the genomic data of Arthrinium rasikravindrae.

3.8. P450

The fungal cytochrome P450 database (FCPD) is a superfamily of enzymes involved in the production and modification of different compounds. The annotation results for the cytochrome (Figure 4). Most (98) of the genes were originated to encode the E-class cyt P450, group I.
A total of 8731 P450 genes were isolated from 113 fungi and oomycetes. According to the position of genes in the database, they were divided into 16 groups, which were aggregated into 2579 by triple MCL Class. P450 enzymes contributed to the construction of vital metabolites in organisms but play an important role in adapting to dissimilar environments via the modification of harmful compounds too. Using blast software, the amino acid sequence of the target species was compared with the P450 database, and the gene of the target species and its corresponding functional annotation was assigned to obtain the annotation results (Figure 4). Since each sequence alignment result may be more than one, to ensure its biological significance, an optimal ratio was reserved for annotation of the gene. The blast results are in the format of M8. At the same time, the annotated results of some databases are also provided. In addition, the P450 gene exhibits conservative sequence characteristics, namely the heme binding domain motif CG and the oxygen binding motif Er of the k-helix. According to these two sequence characteristics, the P450 gene was found, and finally, the P450 gene sequence based on blast and sequence characteristics was provided. Unless otherwise specified, the sample names in the following contents represent the target species (Figure 4).

3.9. Gene Clusters of Secondary Metabolites

The gene clusters for secondary metabolites were predicted. There were 47 genes and four clusters encoding (NRPS), as well as 82 genes and eight clusters in T1PKS. In addition, 13 genes and one cluster in NRPS, T1PKS (NRPS, Non-Ribosomal Peptide Synthetase Cluster; T1PKS, Type I Pks (PolyKetide Synthase)), while 15 genes and one cluster in T1PKS, Indole were noticed. Furthermore, our results also revealed 32 genes and five clusters in terpene and 26 genes and two clusters in NRPS-like, respectively (Supplementary Figure S4).

3.10. Pathogen Host Interaction (PHI)

There are 1415 genes annotated in the PHI database. The assigned genes for pathogenicity of A. rasikravindrae in the pathogen host interaction (PHI) database were classified into 29 categories (Supplementary Figure S5). Results revealed that the maximum (520) number of matched genes was related to the functional class of “unaffected pathogenicity”. The second highest functional class (496) belongs functional class of “reduced virulence”. In addition, 130 matched genes were in the NA function class, 105 matched genes were found in the loss of pathogenicity, 71 in the lethal function class, and 20 in the GO function class, respectively (Supplementary Figure S5). Amongst them, the resemblance of A02487 and A03535 genes of A. rasikravindrae and between CAM as well as GzGPA1 genes of Magnaporthe oryzae was 100% and 99.7%, respectively (Supplementary Figure S5).

3.11. DFVF

DFVF database is an inclusive database of recognized fungal virulence issues. It collects 2058 pathogenic genes released by 228 fungal strains from 85 genera. In the present genomic study, there were 453 genes in DFVF, 3.85% predicted genes were accounted. Among them, 99.7% similarity was found between the A02487 gene of A. rasikravindrae and the A03535 gene of Fusarium oxysporum.

3.12. The Circular Whole-Genome Map of A. rasikravindrae

The outermost loop is the situation coordinates of the genome sequence, from the outer to the inner side, respectively, the GC content of the genome is counted by the window (genome/1000) bp and the step size (genome/1000) bp. The blue part inward designated that the whole genome sequence GC content of this portion is poorer than the average GC content, and the purple part outwards is the conflict. The high peak value indicates the superior difference from the average GC content. Genome GC skews value: window (genome/1000) bp, step size (genome/1000) bp, the particular algorithm is GC/G + C. The innermost green portion shows that the GC content of the section is inferior to the C content, and the outer pink portion is contradictory. The gene density (in the window genome/1000 bp, step size genome/1000 bp, respectively, counts the gene thickness of coding genes, rRNA snRNA tRNA, the darker the color, the greater the gene mass in the window), and chromosome duplication.

3.13. Comparative Genomic Analysis

3.13.1. Analysis of Collinearity

The genomic analysis of collinearity between A. rasikravindrae and A. phaeospermum AP-Z13, A. malaysianum STlab-iicb, F. proliferatum ET1, and F. oxysporum Fo2 genomes is shown in Figure 5a–d. The collinearity analysis showed the highest collinear correlation between A. rasikravindrae and A. phaeospermum, followed by A. rasikravindrae and A. malaysianum. For instance, the collinearity analysis showed that there were 11,040 collinearity blocks in A. rasikravindrae and A. phaeospermum, and the entire base length (35,846,460 bp) in the collinear block of A. phaeopsermum was elucidated for 73.99% of the entire gene length (48,449,939 bp) presenting great collinear correlation between the two fungal species. Similarly, 8988 collinearity blocks in A. rasikravindrae and A. malaysianum and the entire base length (23,555,373 bp) in the collinear block of A. malaysianum was elucidated for 51.09% of the entire gene length (46,110,019 bp), representing the second great collinear correlation amongst A. rasikravindrae and A. malaysianum. However, the direction of collinearity was very low regarding the remaining two fungal species, i.e., F. oxysporum and F. proliferatum. The collinear block number within A. rasikravindrae and F. oxysporum and F. proliferatum was 1740 and 1161, respectively, and the whole base length was elucidated for 2.92% as well as 1.77% of the entire gene length.

3.13.2. The Core and Pan Genomic Analysis

The core genes are a set of genes with direct involvement in the disease pathway, while the pan-genes are the entire set of genes within a clade. The number of specific genes in A. rasikravindrae and A. phaeospermum was found to be 6337 and 8372, harmoniously, the core genes number was 2839, and the pan genes number was 427. The specific genes number in A. malaysianum was 8055, the core genes number was 3480, and the pan genes number was 520, respectively. The specific genes number in F. oxysporum Fo2 was 8492, the core genes number was 6809, and the pan genes number was 3396, respectively. The specific genes number in F. proliferatum was 11,158, the core genes number was 6935, and the pan genes number was 3438, respectively (Figure 6a).

3.13.3. Analysis Gene Family

The gene families of A. rasikravindrae, A. phaeospermum, A. malaysianum, F. oxysporum, and F. proliferatum were subjected to cluster analysis. Cluster analysis was performed to find the unique genes family of core and pan-genome. The total and the unique number of gene families of A. rasikravindrae and other spp. were presented in the form of Venn diagrams. The specific genes number, the total gene family’s number, and the number of families’ unique genes are displayed in Figure 6b.
Based on the identity value of the database sequence and target sequence these are the 10 common genes shown in Figure 6a. The genes exhibiting identity values higher than 90 were A03535 (99.7%) unknown function, A11076 (99.7%), A10939 (97%) unidentified or un classified sequence, A02329 (95.8%) other sequence, A08957 (95.5%) unidentified or un classified sequence, A08380 (93.4%) synthetic construct, A00946 (93.3%) conceptual translation, A08495 (91.5%) unknown function, A07052 (91%) unknown function and A08159 (90.7%) synthetic construct, (Table 5.) respectively, in the DFVF database. The genes with identity values higher than 90 were A02487 (100%), A03535 (99.7%), A08794 (97.5%), A06181 (97.4%), A08957 (95.5%), A10642 (95.5%), A10939 (95.3%), A01584 (95.2%), A03803 (94.9%), A02448 (94.8%), A03859 (94.3%), A08892 (93.8%), A05643 (93.5%), A00274 (93.5%), A09431 (92.7%), A05678 (92.6%), A09487 (92.2%), A01337 (91.3%), A08159 (90.7%), A09292 (90.3%), A03055 (90.2%) and A01326 (90%), respectively, in PHI.

4. Discussion

A. rasikravindrae, is a phytopathogenic fungus that provides a promising tool for epidemiological analysis and is expected to replace traditional methods in the near future. Whole genome sequencing is expected to revolutionize the surveillance and diagnosis of infectious diseases due to its high resolution [72]. The Pacbio RSII sequencing platform is an innovative third-generation sequencing platform at home and abroad. It has the advantages of long read length, high precision, and high sensitivity [73]. Our present study estimated the whole genome analysis and established that the entire genome size was 48.24-Mb (Figure 7). The SMRT technology directed by the PacBio platform can convey more extremely accurate long reads. Thus, we generated genome sequencing merging PacBio long-reads sequencing assembly. The sequencing data generated by the PacBio platform showed the contig N90 of 2,184,859 bp while encoded putative genes were 11,101. It can faultlessly collaborate with PacBio to whole high superiority genome assembly. Our study revealed 2839 core and 427 pan genes between A. rasikravindrae and A. phaeospermum as shown in (Figure 6a). All these findings verified that the PacBio platform assembly facilitated obtaining high-quality genome data. Our results are consistent with the studies of Chin et al. [74], and Huo et al. [75].
The entire genome of the A. rasikravindrae assembly was sequenced by this platform and matched with the genomes of A. phaeospermum AP-Z13, A. malaysianum, F. oxysporum, and F. proleferatum. The result showed the high similarity between A. phaeospermum and A. rasikravindrae of the same genus (Supplementary Table S1 and Figure 8). Our results advise that A. rasikravindrae might be elaborate in the production of some secondary metabolites, cell wall degrading enzymes, and gene transcription regulation processes protease, which is closely associated with pathogenesis.
Improved genome assembly and exact gene collinearity ensure the inference of thousands of plausible homologs arising from evolutionary events, polyploidization, or speciation. Collinear homologs were likely to be generated simultaneously in corresponding events [43]. Our results showed 11,040 collinearity blocks in A. rasikravindrae and A. phaeospermum which resembled 73.99% of the entire gene length (48,449,939 bp), thus representing a large linear correlation between both fungal species. Similarly, the 8988 collinearity blocks in A. rasikravindrae and A. malaysianum resembled approximately 51.09% of the entire gene length (46,110,019 bp) as shown in Figure 5.
From a taxonomic point of view, the current results obtained from the genome sequencing of A. rasikravindrae show a surprising side of the fungal kingdom and diversity among closely related species. However, A. rasikravindrae and A. phaeospermum both belong to class Sordariomycetes, hence similar methods are often utilized for detailed studies. Furthermore, whole-genome sequence studies revealed that the genes encoding the proteins were intact in both species A. phaeospermum AP-Z13 and A. rasikravindrae AQZ-20 of the same genus Arthrinium (Figure 8). Our findings suggest that the genes encoding proteins mainly include A03865, a gene located in the small nuclear ribonucleoprotein as well as A05515, A06181, and A02487 involved in putative protein, GTPase, actin cytoskeleton organization, cell polarity, and gene expression or gene silencing, respectively.
Additionally, the A. rasikravindrae gene annotated in the NR database exhibited a higher gene identity with P. vexata, M. oryzae or P. fici and lower gene identity with Colletotrichum incanum, Magnaporthiopsis poae, and Nectria heamatococca, respectively (Supplementary Figure S2). Previous studies reported that higher gene identity with P. vexata, M. oryzae, and P. fici can cause Fusarium ear blight in the host Hordeum vulgare. Among the NRPS gene clusters encoded by the A. rasikravindrae cluster, the largest number of A. rasikravindrae genes was paired with Pseudornasarlella vexala (Supplementary Figure S2). In addition, studies have found that species associated with the genus Arthrinium are also the casual agents of dead bamboo columns disease in Bambusa indica [69].
Furthermore, the A02487 (Q9UWF0) and A03535 (I1RNF9) genes in A. rasikravindrae were paired with the CAM gene of M. oryzae, which caused rice blast and head blast in host Triticum aestivum [76], and the GzGPA1 gene of Fusarium graminearum, which caused Fusarium ear blight [77]. Therefore, the A02487 and A03535 genes might be associated with rice blast pathogenicity [9], and Fusarium ear blight in Barley. The GzGPA1 pathogenicity of the barley-directed strain was the same as that of the wild-type strain. GzGPA1 is a conserved low salt protein that encodes 353 (AA), highly similar to A. nidulans FadA in the genome database of F. graminearum (93% identity), and also negatively controls mycotoxin production. F. graminearum is the causative agent of small grain head blight disease and produces mycotoxins deoxynivalenol (DON) and zearalenone (ZEA) in diseased masses, hence its exposure is a danger to the health of both animals and humans. In G. zeae sexual expansion, GzGPA1 played a crucial role [76]. Deletion of GzGPA1 is somewhat valuable in PDA asexual expansion, by the evolutionary proportions of the modified being homologous on Barley wild type strain.
However, the similarity between the A02487 or A03535 genes of A. rasikravindrae and F. oxysporum, as well as the CAM and GzGPA1 genes of Magnaporthe oryzae was 100% and 99.7%, respectively. In M. oryzae, the homology between A. rasikravindrae and the A06181 gene was 97.4% in wheat and rice [76]. A. rasikravindrae is a common plant pathogen that induces actin cytoskeleton organization and cell polarity; establishment of appressorium and altered gene expression silencing, which are closely related to pathogenesis. However, four of the 10 core genes in the DFVF database have different functions, as shown in Table 5.

5. Conclusions

In the current research, the Pac bio RSII platform was utilized to complete the genome sequence of A. rasikravindrae strain AQZ-20, which revealed unique features in fungal genomes. The genomic study contained functional annotations, such as gene clusters of secondary metabolite synthesis, pathogen-host interaction database, and fungal virulence factors database to analyze preliminary pathogenicity analysis. In addition, the gene families were identified and the whole genome compilation results were determined as a final product. The whole-genome sequencing of A. rasikravindrae provided important data and the theoretical basis for transcriptomic, proteomic, and metabolic studies of this fungus and further enriched the A. rasikravindrae database showing specific association with strain AQZ-20. The results of gene analysis of A. rasikravindrae and genome sequence evaluation among several fungal species will contribute towards detailed further research about the fungal pathogenicity in forestry, medicine, and agriculture. Simultaneously, the preliminary comparison and analysis of several Arthrinium genomes provide a foundation to study the evolutionary system of Arthrinium, elucidate its pathogenesis, and support the prevention and control of diseases caused by the pathogenic fungi of Arthrinium. Moreover, this study will provide valuable resources for genomic studies and reference information for the management of different endophytic Arthrinium species as well as other related fungi.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/jof8030255/s1, Table S1. Statistical assembly summary for comparative fungal. Figure S1. NR annotation of species. Figure S2. TCBD functional classification. Figure S3. Carbohydrate enzyme classification and annotation. Figure S4. Secondary metabolic gene Cluster. Figure S5. PHI phenotype classification.

Author Contributions

Conceptualization, A.Q.M. and T.Z.; Methodology, A.Q.M. and J.C.; Software, A.Q.M. and T.Z.; Validation, A.Q.M. and T.Z.; Formal Analysis, A.Q.M. and T.Z.; Investigation, A.Q.M., T.Z. and S.L.; Resources, T.Z.; Data Curation, A.Q.M., Z.G.N.G., S.M.M. and G.W.; Writing—Original Draft Preparation, A.Q.M.; Writing–Review and Editing, A.Q.M. and T.Z.; Visualization, A.Q.M. and T.Z.; Supervision, T.Z.; Project Administration, T.Z.; Funding Acquisition, T.Z. and S.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 32171795.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The whole-genome sequence for Arthrinium rasikravindrae strain AQZ-20 data has been submitted to the database of GenBank with accession no. JACVVL000000000. The assembly and sequenced genome raw data reported in this paper are associated with NCBI BioProject: PRJNA661692 and BioSample: SAMN16069862 within GenBank. The SRA accession number is SRR12768822. The authors state that the current study existing the necessary data for the conclusion and supplementary material will be fully exposed.

Acknowledgments

We are highly thankful to the National Centre for Biotechnology Information (NCBI), (https://www.ncbi.nlm.nih.gov/genome, assessed on 6 July 2021) for giving the vast majority of the sequencing information for growths utilized in this investigation. We are also grateful to the Chinese Science Foundation for approval of conducting this research.

Conflicts of Interest

The authors declared no conflict of interest.

References

  1. Hawksworth, D.L.; Lücking, R. Fungal Diversity Revisited: 2.2 to 3.8 Million Species. Microbiol. Spectr. 2017, 5, 5.4.10. [Google Scholar] [CrossRef] [PubMed]
  2. Wang, M.; Tan, X.M.; Liu, F.; Cai, L. Eight new Arthrinium species from China. MycoKeys 2018, 34, 1–24. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Singh, S.M.; Yadav, L.S.; Singh, P.N.; Hepat, R.; Sharma, R.; Singh, S.K. Arthrinium rasikravindrii sp. nov. from Svalbard, Norway. Mycotaxon 2012, 122, 449–460. [Google Scholar] [CrossRef]
  4. Dai, D.Q.; Phookamsak, R.; Wijayawardene, N.N.; Li, W.J.; Bhat, D.J.; Xu, J.C.; Taylor, J.E.; Hyde, K.D.; Chukeatirote, E. Bambusicolous fungi. Fungal Divers. 2017, 82, 1–105. [Google Scholar] [CrossRef]
  5. Pintos, Á.; Alvarado, P.; Planas, J.; Jarling, R. Six new species of Arthrinium from Europe and notes about A. Caricicola and other species found in Carex spp. Hosts. MycoKeys 2019, 49, 15–48. [Google Scholar] [CrossRef] [Green Version]
  6. Kirk, P.M.; Cannon, P.F.; Stalpers, J.; Minter, D.W. Dictionary of the Fungi, 10th ed.; CABI: Wallingford, UK, 2008. [Google Scholar]
  7. Rana, S.; Singh, P.N.; Gaikwad, S.B.; Singh, S.K. Morphology, phylogeny and ex situ conservation of Arthrinium rasikravindrae (Apiosporaceae: Xylariales): A new record from India. Kavaka 2017, 5, 1–5. [Google Scholar]
  8. Dai, D.Q.; Jiang, H.B.; Tang, L.Z.; Bhat, D.J. Two new species of Arthrinium (Apiosporaceae, Xylariales) associated with bamboo from Yunnan, China. Mycosphere 2016, 7, 1332–1345. [Google Scholar] [CrossRef]
  9. Crous, P.W.; Groenewald, J.Z. A phylogenetic re-evaluation of Arthrinium. IMA Fungus 2013, 4, 133–154. [Google Scholar] [CrossRef] [Green Version]
  10. Senanayake, I.C.; Maharachchikumbura, S.; Hyde, K.D.; Bhat, J.D.; Jones, E.B.G.; McKenzie, E.H.C.; Dai, D.Q.; Daranagama, D.A.; Dayarathne, M.C.; Goonasekara, I.D.; et al. Towards unraveling relationships in Xylariomycetidae (Sordariomycetes). Fungal Divers. 2015, 73, 73–144. [Google Scholar] [CrossRef]
  11. Jiang, N.; Li, J.; Tian, C. Arthrinium species associated with bamboo and reed plants in China. Fungal Syst. Evol. 2018, 2, 1–9. [Google Scholar] [CrossRef] [Green Version]
  12. Cano, C.M. First Report of Arthrinium arundinis Causing Kernel Blight on Barley. Plant Dis. 1992, 76, 1077B. [Google Scholar] [CrossRef]
  13. Li, B.J.; Liu, P.Q.; Jiang, Y.; Weng, Q.Y.; Chen, Q.H. First Report of Culm Rot Caused by Arthrinium phaeospermum on Phyllostachys viridis in China. Plant Dis. 2016, 100, 1013. [Google Scholar] [CrossRef]
  14. Li, S.; Tang, Y.; Fang, X.; Qiao, T.; Han, S.; Zhu, T. Whole-genome sequence of Arthrinium phaeospermum, a globally distributed pathogenic fungus. Genomics 2019, 112, 919–929. [Google Scholar] [CrossRef] [PubMed]
  15. Zhao, Y.M.; Deng, C.R.; Chen, X. Arthrinium phaeospermum causing dermatomycosis a new record of China. Mycosystema 1990, 3, 232–235. [Google Scholar]
  16. Pansanit, A.; Pripdeevech, P. Antibacterial secondary metabolites from an endophytic fungus, Arthrinium sp. MFLUCC16-1053 isolated from Zingiber cassumunar. Mycology 2018, 9, 264–272. [Google Scholar] [CrossRef] [Green Version]
  17. Wang, X.; Boevink, P.; McLellan, H.; Armstrong, M.; Bukharova, T.; Qin, Z.; Birch, P.R. A Host KH RNA-Binding Protein Is a Susceptibility Factor Targeted by an RXLR Effector to Promote Late Blight Disease. Mol. Plant 2015, 8, 1385–1395. [Google Scholar] [CrossRef] [Green Version]
  18. Wang, J.-F.; Xu, F.-Q.; Wang, Z.; Lu, X.; Wan, J.-T.; Yang, B.; Zhou, X.-F.; Zhang, T.-Y.; Tu, Z.-C.; Liu, Y. A new naphthalene glycoside from the sponge-derived fungus Arthrinium sp. ZSDS1-F3. Nat. Prod. Res. 2014, 28, 1070–1074. [Google Scholar] [CrossRef]
  19. Ramos, H.P.; Braun, G.H.; Pupo, M.; Said, S. Antimicrobial activity from endophytic fungi Arthrinium state of Apiospora montagnei Sacc. and Papulaspora immersa. Braz. Arch. Biol. Technol. 2010, 53, 629–632. [Google Scholar] [CrossRef]
  20. Hamayun, M.; Khan, S.A.; Iqbal, I.; Ahmad, B.; Lee, I.-J. Isolation of a Gibberellin-producing fungus (Penicillium sp. MH7) and Growth Promotion of Crown Daisy (Chrysanthemum coronarium). J. Microbiol. Biotechnol. 2010, 20, 202–207. [Google Scholar] [CrossRef] [Green Version]
  21. Lina, Z. Study on the basic properties of Arthroboma subsporum non-protein toxoid. J. Sichuan Agric. Univ. 2011, 29, 3–7. [Google Scholar] [CrossRef]
  22. Shujiang, L. Effects of Arthrobium subsporum Non-Protein Toxoid on Physiological Metabolism of Hybrid Bamboo. Phytopathol. Res. 2011, 41. [Google Scholar] [CrossRef]
  23. Zhu, T.; Li, S. Guan-nan. Study on the conditions for the production of protein toxoids of Arthrophora nidulans. J. Sichuan Agric. Univ. 2011, 29, 2–6. [Google Scholar]
  24. Tianhui, Z. Detection of Protein Toxins from Hybrid Bamboo Blight Fungus by Protein A Indirect Enzyme-linked Immunoassay. J. Sichuan For. Sci. Technol. 2012, 33, 8–11. [Google Scholar] [CrossRef]
  25. Li, S.-J.; Zhu, T.-H.; Zhu, H.-M.; Liang, M.; Qiao, T.-M.; Han, S.; Che, G.-N. Purification of Protein AP-Toxin from Arthrinium phaeospermum Causing Blight in Bambusa pervariabilis × Dendrocalamopisis grandis and Its Metabolic Effects on Four Bamboo Varieties. Phytopathology 2013, 103, 135–145. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Shujiang, L. Toxicity-producing Inducible Factors of Hybrid Bamboo Shoot Blight. J. Northeast For. Univ. 2013, 41, 104–109. [Google Scholar] [CrossRef]
  27. Li, S.; Zhu, T. Binding site of toxic protein from Arthrinium phaeospermum on plasmalemma of hybrid bamboo. J. Jhejiang Univ. (Agri. Life Sic.) 2012, 38, 355–361. [Google Scholar]
  28. Li, S.J.; Qiao, T.M.; Han, S.; Zhu, T.H.; Che, G.N. Effects of protein toxin from Arthrinium phaeospermum on biophysical characteristic and respiration of Bambusa pervariabilis × dendrocalamopsis grandis mitochondria. Sci. Silyae Sin. 2014, 50, 119–125. [Google Scholar]
  29. Li, S.-J.; Zhu, T.-H. Purification of the toxin protein PC from Arthrinium phaeospermum and its effect on the defence enzymes of Bambusa pervariabilis × Dendrocalamopsis grandisvarieties. For. Pathol. 2013, 44, 96–106. [Google Scholar] [CrossRef]
  30. Van Eijk, G.W. Bostrycin, a tetrahydroanthraquinone pigment and some other metabolites from the fungus Arthrinium phaeospermum. Experientia 1975, 31, 783–784. [Google Scholar] [CrossRef]
  31. Khan, S.A.; Hamayun, M.; Kim, H.-Y.; Yoon, H.-J.; Seo, J.-C.; Choo, Y.-S.; Lee, I.-J.; Kim, S.-D.; Rhee, I.-K.; Kim, J.-G. A new strain of Arthrinium phaeospermum isolated from Carex kobomugi Ohwi is capable of gibberellin production. Biotechnol. Lett. 2008, 31, 283–287. [Google Scholar] [CrossRef]
  32. Wei, D.L.; Chang, S.C.; Lin, S.C.; Doong, M.L.; Jong, S.C. Production of 3-Nitropropionic Acid by Arthrinium Species. Curr. Microbiol. 1994, 28, 1–5. [Google Scholar] [CrossRef]
  33. Vijayakumar, E.K.S.; Roy, K.; Chatterjee, S.; Deshmukh, S.K.; Ganguli, B.N.; Fehlhaber, H.-W.; Kogler, H. ChemInform Abstract: Arthrichitin. A New Cell Wall Active Metabolite from Arthrinium phaeospermum. ChemInform 1997, 28, 192. [Google Scholar] [CrossRef]
  34. Shrestha, P.; Szaro, T.M.; Bruns, T.D.; Taylor, J.W. Systematic Search for Cultivatable Fungi That Best Deconstruct Cell Walls of Miscanthus and Sugarcane in the Field. Appl. Environ. Microbiol. 2011, 77, 5490–5504. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Shrestha, P.; Ibáñez, A.B.; Bauer, S.; I Glassman, S.; Szaro, T.M.; Bruns, T.D.; Taylor, J.W. Fungi isolated from Miscanthus and sugarcane: Biomass conversion, fungal enzymes, and hydrolysis of plant cell wall polymers. Biotechnol. Biofuels 2015, 8, 38. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Kämper, J.; Kahmann, R.; Bölker, M.; Ma, L.-J.; Brefort, T.; Saville, B.J.; Banuett, F.; Kronstad, J.W.; Gold, S.E.; Müller, O.; et al. Insights from the genome of the biotrophic fungal plant pathogen Ustilago maydis. Nature 2006, 444, 97–101. [Google Scholar] [CrossRef] [PubMed]
  37. Nguyen, H.D.T.; Lewis, C.T.; Lévesque, C.A.; Gräfenhan, T. Draft Genome Sequence of Alternaria alternata ATCC 34957. Genome Announc. 2016, 4, e01554-15. [Google Scholar] [CrossRef] [Green Version]
  38. Marcet-Houben, M.; Ballester, A.-R.; de la Fuente, B.; Harries, E.; Marcos, J.F.; González-Candelas, L.; Gabaldón, T. Genome sequence of the necrotrophic fungus Penicillium digitatum, the main postharvest pathogen of citrus. BMC Genom. 2012, 13, 646. [Google Scholar] [CrossRef] [Green Version]
  39. Shi-Kunne, X.; Faino, L.; Berg, G.C.M.V.D.; Thomma, B.P.H.J.; Seidl, M.F. Evolution within the fungal genus Verticillium is characterized by chromosomal rearrangement and gene loss. Environ. Microbiol. 2017, 20, 1362–1373. [Google Scholar] [CrossRef] [Green Version]
  40. Bartelli, T.F.; Bruno, D.D.C.F.; Briones, M.R. Whole-Genome Sequences and Annotation of the Opportunistic Pathogen Candida albicans Strain SC5314. Genome Announc. 2018, 6, e01475-17. [Google Scholar] [CrossRef] [Green Version]
  41. Fang, X.; Qin, K.; Li, S.; Han, S.; Zhu, T. Whole genome sequence of Diaporthe capsici, a new pathogen of walnut blight. Genomics 2020, 112, 3751–3761. [Google Scholar] [CrossRef]
  42. Sohn, J.-I.; Nam, J.-W. The present and future of de novowhole-genome assembly. Brief. Bioinform. 2016, 19, bbw096-40. [Google Scholar] [CrossRef] [Green Version]
  43. Garg, V.; Dudchenko, O.; Wang, J.; Khan, A.W.; Gupta, S.; Kaur, P.; Han, K.; Saxena, R.K.; Kale, S.M.; Pham, M.; et al. Chromosome-length genome assemblies of six legume species provide insights into genome organization, evolution, and agronomic traits for crop improvement. J. Adv. Res. 2021. [Google Scholar] [CrossRef]
  44. Lim, H.J.; Lee, E.; Yoon, Y.; Chua, B.; Son, A. Portable lysis apparatus for rapid single-step DNA extraction of Bacillus subtilis. J. Appl. Microbiol. 2016, 120, 379–387. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Wang, Y.; Zhang, X.; Lu, C.; Li, X.; Zhou, J.; Wang, J. Lanthanum: A novel inducer for enhancement of fungal laccase production by Shiraia bambusicola. J. Rare Earths 2020. [Google Scholar] [CrossRef]
  46. Ardui, S.; Ameur, A.; Vermeesch, J.R.; Hestand, M.S. Single molecule real-time (SMRT) sequencing comes of age: Applications and utilities for medical diagnostics. Nucleic Acids Res. 2018, 46, 2159–2168. [Google Scholar] [CrossRef] [Green Version]
  47. Reiner, J.; Pisani, L.; Qiao, W.; Singh, R.; Yang, Y.; Shi, L.; Khan, W.A.; Sebra, R.; Cohen, N.; Babu, A.; et al. Cytogenomic identification and long-read single molecule real-time (SMRT) sequencing of a Bardet–Biedl Syndrome 9 (BBS9) deletion. Npj Genom. Med. 2018, 3, 3. [Google Scholar] [CrossRef]
  48. Stanke, M.; Diekhans, M.; Baertsch, R.; Haussler, D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics 2008, 24, 637–644. [Google Scholar] [CrossRef] [Green Version]
  49. Saha, S.; Bridges, S.; Magbanua, Z.V.; Peterson, D.G. Empirical comparison of ab initio repeat finding programs. Nucleic Acids Res. 2008, 36, 2284–2294. [Google Scholar] [CrossRef] [Green Version]
  50. Benson, G. Tandem repeats finder: A program to analyze DNA sequences. Nucleic Acids Res. 1999, 27, 573–580. [Google Scholar] [CrossRef] [Green Version]
  51. Lowe, T.M.; Eddy, S.R. tRNAscan-SE: A Program for Improved Detection of Transfer RNA Genes in Genomic Sequence. Nucleic Acids Res. 1997, 25, 955–964. [Google Scholar] [CrossRef]
  52. Lagesen, K.; Hallin, P.; Rødland, E.A.; Staerfeldt, H.-H.; Rognes, T.; Ussery, D.W. RNAmmer: Consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007, 35, 3100–3108. [Google Scholar] [CrossRef] [PubMed]
  53. Gardner, P.P.; Daub, J.; Tate, J.G.; Nawrocki, E.P.; Kolbe, D.L.; Lindgreen, S.; Wilkinson, A.C.; Finn, R.D.; Griffiths-Jones, S.; Eddy, S.R.; et al. Rfam: Updates to the RNA families database. Nucleic Acids Res. 2008, 37, D136–D140. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Nawrocki, E.P.; Kolbe, D.L.; Eddy, S.R. Infernal 1.0: Inference of RNA alignments. Bioinformatics 2009, 25, 1335–1337. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Galperin, M.; Makarova, K.S.; Wolf, Y.; Koonin, E.V. Expanded microbial genome coverage and improved protein family annotation in the COG database. Nucleic Acids Res. 2014, 43, D261–D269. [Google Scholar] [CrossRef]
  56. Kanehisa, M.; Goto, S.; Kawashima, S.; Okuno, Y.; Hattori, M. The KEGG resource for deciphering the genome. Nucleic Acids Res. 2004, 32, D277–D280. [Google Scholar] [CrossRef] [Green Version]
  57. Kanehisa, M.; Goto, S.; Hattori, M.; Aoki-Kinoshita, K.F.; Itoh, M.; Kawashima, S.; Katayama, T.; Araki, M.; Hirakawa, M. From genomics to chemical genomics: New developments in KEGG. Nucleic Acids Res. 2006, 34, D354–D357. [Google Scholar] [CrossRef]
  58. Li, W.; Jaroszewski, L.; Godzik, A. Tolerating some redundancy significantly speeds up clustering of large protein databases. Bioinformatics 2002, 18, 77–82. [Google Scholar] [CrossRef]
  59. Bairoch, A. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 2000, 28, 45–48. [Google Scholar] [CrossRef]
  60. Sayer, M.H., Jr.; Reddy, V.S.; Tamang, D.G.; Västermark, Å. The Transporter Classification Database. Nucleic Acids Res. 2014, 42, D251–D258. [Google Scholar] [CrossRef]
  61. Cantarel, B.L.; Coutinho, P.M.; Rancurel, C.; Bernard, T.; Lombard, V.; Henrissat, B. The Carbohydrate-Active EnZymes database (CAZy): An expert resource for Glycogenomics. Nucleic Acids Res. 2009, 37, D233–D238. [Google Scholar] [CrossRef]
  62. Medema, M.H.; Blin, K.; Cimermancic, P.; De Jager, V.; Zakrzewski, P.; Fischbach, M.A.; Weber, T.; Takano, E.; Breitling, R. antiSMASH: Rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. Nucleic Acids Res. 2011, 39, W339–W346. [Google Scholar] [CrossRef] [PubMed]
  63. Urban, M.; Pant, R.; Raghunath, A.; Irvine, A.G.; Pedro, H.; Hammond-Kosack, K. The Pathogen-Host Interactions database (PHI-base): Additions and future developments. Nucleic Acids Res. 2014, 43, D645–D655. [Google Scholar] [CrossRef] [Green Version]
  64. Jia, B.; Raphenya, A.R.; Alcock, B.; Waglechner, N.; Guo, P.; Tsang, K.K.; Lago, B.A.; Dave, B.M.; Pereira, S.; Sharma, A.N.; et al. CARD 2017: Expansion and model-centric curation of the comprehensive antibiotic resistance database. Nucleic Acids Res. 2016, 45, D566–D573. [Google Scholar] [CrossRef] [PubMed]
  65. Ma, L.-J.; Van Der Does, H.C.; Borkovich, K.A.; Coleman, J.J.; Daboussi, M.-J.; Di Pietro, A.; Dufresne, M.; Freitag, M.; Grabherr, M.; Henrissat, B.; et al. Comparative genomics reveals mobile pathogenicity chromosomes in Fusarium. Nature 2010, 464, 367–373. [Google Scholar] [CrossRef] [PubMed]
  66. Kurtz, S.; Phillippy, A.; Delcher, A.L.; Smoot, M.; Shumway, M.; Antonescu, C.; Salzberg, S.L. Versatile and open software for comparing large genomes. Genome Biol. 2004, 5, R12. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  67. Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef]
  68. Chiaromonte, F.; Yap, V.B.; Miller, W. Scoring pairwise genomic sequence alignments. In Biocomputing 2002; World Scientific: Singapore, 2001; pp. 115–126. [Google Scholar] [CrossRef]
  69. Harris, R.S. Improved Pairwise Alignment of Genomic DNA. Ph.D. Thesis, Pennsylvania State University, State College, PA, USA, 2007. [Google Scholar]
  70. Krzywinski, M.; Schein, J.; Birol, I.; Connors, J.; Gascoyne, R.; Horsman, D.; Jones, S.J.; Marra, M.A. Circos: An information aesthetic for comparative genomics. Genome Res. 2009, 19, 1639–1645. [Google Scholar] [CrossRef] [Green Version]
  71. Wang, Y.; Tang, H.; DeBarry, J.D.; Tan, X.; Li, J.; Wang, X.; Lee, T.-H.; Jin, H.; Marler, B.; Guo, H.; et al. MCScanX: A toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012, 40, e49. [Google Scholar] [CrossRef] [Green Version]
  72. Land, M.; Hauser, L.; Jun, S.-R.; Nookaew, I.; Leuze, M.R.; Ahn, T.-H.; Karpinets, T.; Lund, O.; Kora, G.; Wassenaar, T.; et al. Insights from 20 years of bacterial genome sequencing. Funct. Integr. Genom. 2015, 15, 141–161. [Google Scholar] [CrossRef] [Green Version]
  73. Jiang, N.; Liang, Y.M.; Tian, C.M. A novel bambusicolous fungus from China, Arthrinium chinense (Xylariales). Sydowia 2020, 72, 77–83. [Google Scholar]
  74. Chin, C.-S.; Alexander, D.H.; Marks, P.; Klammer, A.A.; Drake, J.; Heiner, C.; Clum, A.; Copeland, A.; Huddleston, J.; Eichler, E.E.; et al. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods 2013, 10, 563–569. [Google Scholar] [CrossRef] [PubMed]
  75. Huo, J.; Zhong, S.; Du, X.; Cao, Y.; Wang, W.; Sun, Y.; Tian, Y.; Zhu, J.; Chen, J.; Xuan, L.; et al. Whole-genome sequence of Phellinus gilvus (mulberry Sanghuang) reveals its unique medicinal values. J. Adv. Res. 2020, 24, 325–335. [Google Scholar] [CrossRef] [PubMed]
  76. Dean, R.; Van Kan, J.A.L.; Pretorius, Z.A.; Hammond-Kosack, K.E.; Di Pietro, A.; Spanu, P.D.; Rudd, J.J.; Dickman, M.; Kahmann, R.; Ellis, J.; et al. The Top 10 fungal pathogens in molecular plant pathology. Mol. Plant Pathol. 2012, 13, 414–430. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  77. Brown, N.A.; Bass, C.; Baldwin, T.K.; Chen, H.; Massot, F.; Carion, P.W.C.; Urban, M.; van de Meene, A.M.L.; Hammond-Kosack, K.E. Characterisation of the Fusarium graminearum-Wheat Floral Interaction. J. Pathog. 2011, 2011, 626345. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. GO functional annotation of the A. rasikravindrae genome. In the (GO) annotation, the protein-coding sequences of the fungus A. rasikravindrae genome are divided into three major categories, (MF), (CC), and (Bprocess); different colors represent 49 sub-classes from these three major categories. Although in this figure the x-axis represents the class of genes and y-axis represents the percent of genes (%), while the z-axis represents the number of genes with a scale up to 6793.
Figure 1. GO functional annotation of the A. rasikravindrae genome. In the (GO) annotation, the protein-coding sequences of the fungus A. rasikravindrae genome are divided into three major categories, (MF), (CC), and (Bprocess); different colors represent 49 sub-classes from these three major categories. Although in this figure the x-axis represents the class of genes and y-axis represents the percent of genes (%), while the z-axis represents the number of genes with a scale up to 6793.
Jof 08 00255 g001
Figure 2. Represents the (KEGG) functional annotation of the A. rasikravindrae genome. In this figure, the major classes with their names and the number of genes from the concerned sub-class divisions are represented. The x-axis represents the scale for several genes up to 1000. KEGG functional annotation is divided into six major classes viz; Cellular process, Environmental information processing, Genetic information processing, Human diseases, Metabolism or Organismal system, and 40 subclasses. Each sub-class has been represented with a distinct color.
Figure 2. Represents the (KEGG) functional annotation of the A. rasikravindrae genome. In this figure, the major classes with their names and the number of genes from the concerned sub-class divisions are represented. The x-axis represents the scale for several genes up to 1000. KEGG functional annotation is divided into six major classes viz; Cellular process, Environmental information processing, Genetic information processing, Human diseases, Metabolism or Organismal system, and 40 subclasses. Each sub-class has been represented with a distinct color.
Jof 08 00255 g002
Figure 3. Clusters of orthologous groups of proteins (KOG) functional classification of proteins in A. rasikravindrae. In this figure, the x-axis shows the functions of the class and the y-axis shows the number of matched genes. Although KOG function classification is divided into 26 groups categorized by A–Z. Different colors with their names and the number of genes is also mentioned.
Figure 3. Clusters of orthologous groups of proteins (KOG) functional classification of proteins in A. rasikravindrae. In this figure, the x-axis shows the functions of the class and the y-axis shows the number of matched genes. Although KOG function classification is divided into 26 groups categorized by A–Z. Different colors with their names and the number of genes is also mentioned.
Jof 08 00255 g003
Figure 4. Functional classification of cytochrome P450 in the A. rasikravindrae genome. In this figure, the x-axis shows the P450 class, y-axis represents the number of matched genes with a scale from 0–100. P450 classification includes 12 classes.
Figure 4. Functional classification of cytochrome P450 in the A. rasikravindrae genome. In this figure, the x-axis shows the P450 class, y-axis represents the number of matched genes with a scale from 0–100. P450 classification includes 12 classes.
Jof 08 00255 g004
Figure 5. Parallel collinearity comparison analysis between Arthrinium rasikravindrae AQZ-20 and the A. phaeospermum AP-Z13, A. malaysianum STlab-iicb, F. proliferatum ET1, F. oxysporum Fo2, and A. rasikravindrae species. This figure shows the comparative genomic analysis between A. rasikravindrae and four other closely related fungus from the same genus Ascomycetes. In which two fungus are from the same family while the other two are from different families. (a) collinearity between A. rasikravindrae and A. phaeospermum strain AP-Z13, (b) collinearity between A. rasikravindrae and A. malaysianum STlab-iib, (c) collinearity between A. rasikravindrae and F. proliferatum ET1, and (d) collinearity between A. rasikravindrae and F. oxysporum Fo2 species. While at top left side of these figures (ad) scale has been shown with different colors and corresponding names. The parallel lines with boxes upside and downside represent the contigs number while these are counted by the scale 0–60 MB upside the parallel lines.
Figure 5. Parallel collinearity comparison analysis between Arthrinium rasikravindrae AQZ-20 and the A. phaeospermum AP-Z13, A. malaysianum STlab-iicb, F. proliferatum ET1, F. oxysporum Fo2, and A. rasikravindrae species. This figure shows the comparative genomic analysis between A. rasikravindrae and four other closely related fungus from the same genus Ascomycetes. In which two fungus are from the same family while the other two are from different families. (a) collinearity between A. rasikravindrae and A. phaeospermum strain AP-Z13, (b) collinearity between A. rasikravindrae and A. malaysianum STlab-iib, (c) collinearity between A. rasikravindrae and F. proliferatum ET1, and (d) collinearity between A. rasikravindrae and F. oxysporum Fo2 species. While at top left side of these figures (ad) scale has been shown with different colors and corresponding names. The parallel lines with boxes upside and downside represent the contigs number while these are counted by the scale 0–60 MB upside the parallel lines.
Jof 08 00255 g005aJof 08 00255 g005b
Figure 6. (a) Core and Pan Gene orthological Venn diagram of the AQZ-20, AP-Z13, STlab-iicb, ET1, and Fo2. The Venn map of A. rasikravindrae AQZ-20 was obtained according to the annotation results of the A. phaeospermum AP-Z13, A. malaysianum STlab-iicb, F. oxysporum Fo2, and F. proliferatum ET1 protein predicted genes. It showed 10 core genes. Besides, 6241, 8356, 8033, 8439, and 10,756 specific genes were found in the AQZ-20, AP-Z13, STlab-iib, Fo2, and ET1 species, respectively. The total pan genes 48,296 were found in core and pan gene analysis, whereas the total 57,168 genes and other 48,286 were dispensable. (b) Gene family Orthologous Venn diagram of AQZ-20, AP-Z13, STlab-iicb, Fo2 and ET1. The analysis of the gene family of A. rasikravindrae and AP-Z13, STlab-iicb, Fo2 and ET1 are shown in the orthology Venn diagram. Amongst them, the homologous single-copy genes number of A. rasikravindrae, F. proliferatum ET1, A. malaysianum STlab-iicb, A. phaeospermum AP-Z13, and F. oxysporum Fo2 was 107, 471, 49, 60, or 101, respectively. This diagram shows the five devastating fungus species’ correlation with their corresponding colors.
Figure 6. (a) Core and Pan Gene orthological Venn diagram of the AQZ-20, AP-Z13, STlab-iicb, ET1, and Fo2. The Venn map of A. rasikravindrae AQZ-20 was obtained according to the annotation results of the A. phaeospermum AP-Z13, A. malaysianum STlab-iicb, F. oxysporum Fo2, and F. proliferatum ET1 protein predicted genes. It showed 10 core genes. Besides, 6241, 8356, 8033, 8439, and 10,756 specific genes were found in the AQZ-20, AP-Z13, STlab-iib, Fo2, and ET1 species, respectively. The total pan genes 48,296 were found in core and pan gene analysis, whereas the total 57,168 genes and other 48,286 were dispensable. (b) Gene family Orthologous Venn diagram of AQZ-20, AP-Z13, STlab-iicb, Fo2 and ET1. The analysis of the gene family of A. rasikravindrae and AP-Z13, STlab-iicb, Fo2 and ET1 are shown in the orthology Venn diagram. Amongst them, the homologous single-copy genes number of A. rasikravindrae, F. proliferatum ET1, A. malaysianum STlab-iicb, A. phaeospermum AP-Z13, and F. oxysporum Fo2 was 107, 471, 49, 60, or 101, respectively. This diagram shows the five devastating fungus species’ correlation with their corresponding colors.
Jof 08 00255 g006
Figure 7. The circular whole-genome map of A. rasikravindrae. It is explained in the figure that from the exterior to the interior are: 1. Contigs (>1 MB in length); 2. GC content: deliberated as the % of G + C in 1 kb non-overlapping windows.
Figure 7. The circular whole-genome map of A. rasikravindrae. It is explained in the figure that from the exterior to the interior are: 1. Contigs (>1 MB in length); 2. GC content: deliberated as the % of G + C in 1 kb non-overlapping windows.
Jof 08 00255 g007
Figure 8. Inter-specific phylogenetic tree of A. rasikravindrae AQZ-20, A. phaeospermum AP.Z13, A. malaysianum STlab-iicb, F. proliferatum ET1, and F. oxysporum Fo2. As per the evaluation results of species analysis, only duplicate genes are recognized by a gene family. A. phaeospermum, A. malysianum, and A. rasikravindrae have shown the same evolutionary branch and have no greater evolutionary distance. F. oxysporum Fo2 and F. proiferatum ET1 have a different evolutionary branch from A. rasikravindrae and has greater evolutionary distance. As per the NCBI taxonomy database, A. phaeospermum, A. malaysianum STlab-iicb, A. rasikravindrae and F. proliferatum ET1, and F. oxysporum Fo2, belong to Xylariales, so, these are very closely linked to A. rasikravindrae.
Figure 8. Inter-specific phylogenetic tree of A. rasikravindrae AQZ-20, A. phaeospermum AP.Z13, A. malaysianum STlab-iicb, F. proliferatum ET1, and F. oxysporum Fo2. As per the evaluation results of species analysis, only duplicate genes are recognized by a gene family. A. phaeospermum, A. malysianum, and A. rasikravindrae have shown the same evolutionary branch and have no greater evolutionary distance. F. oxysporum Fo2 and F. proiferatum ET1 have a different evolutionary branch from A. rasikravindrae and has greater evolutionary distance. As per the NCBI taxonomy database, A. phaeospermum, A. malaysianum STlab-iicb, A. rasikravindrae and F. proliferatum ET1, and F. oxysporum Fo2, belong to Xylariales, so, these are very closely linked to A. rasikravindrae.
Jof 08 00255 g008
Table 1. Gene assembly data of A. rasikravindrae genome.
Table 1. Gene assembly data of A. rasikravindrae genome.
Arthrinium rasikravindraeCharacteristic
Contigs32
Max_Length (bp)4,831,660
N50_Length (bp)2,356,432
Total length (bp)45,874,955
GC (%)52.66
Genome size (bp)45,874,955
Gene number (#)11,101
Gene total length (bp)14,554,907
Gene average length (bp)1311
Gene length/Genome (%)31.73
Table 2. A. rasikravindrae genome data related dispersed repeat sequences (DRs).
Table 2. A. rasikravindrae genome data related dispersed repeat sequences (DRs).
TypeNumber (#)Entire Length (bp)In Genome (%)Average Length (bp)
LTR83464,2210.1481
DNA62241,4350.090374
LINE45937,2260.081189
SINE7268940.01596
RC4232770.007178
Unknown128530.001971
Total2041139,0960.303281
Table 3. A. rasikravindrae genome data related tandem repeat sequences (TRs).
Table 3. A. rasikravindrae genome data related tandem repeat sequences (TRs).
TypeNumber (#)Repeat Size (bp)Entire Length (bp)In Genome (%)
TR64421~456282,6540.6161
Mini-satellite DNA459610~60199,8590.4357
Micro-satellite DNA9302~639,1320.0853
Table 4. A. rasikravindrae genome data related RNAs.
Table 4. A. rasikravindrae genome data related RNAs.
TypeNumber (#)Average Length (bp)Entire Length (bp)
tRNA2688723,415
5 s (denovo)541166243
5.8 s (denovo)000
18 s (denovo)117951795
28 s (denovo)133403340
sRNA2236473
snRNA151592388
miRNA19595
Table 5. Gene product name and functions of DFVF 10 genes.
Table 5. Gene product name and functions of DFVF 10 genes.
Gene ID/Functions
A03535Unknown
A11076CATALYTIC ACTIVITY: ATP + a protein = ADP + a phosphoprotein; SIMILARITY: Contains 1 protein kinase domain.
A10939Tubulin is the major constituent of microtubules. It binds two moles of GTP, one at an exchangeable site on the beta chain and one at a non-exchangeable site on the alpha-chain (By similarity); SUBUNIT: Dimer of alpha and beta chains (By similarity); SIMILARITY: Belongs to the tubulin family.
A02329Guanine nucleotide-binding proteins (G proteins) are involved as a modulator or transducers in various transmembrane signaling systems. The beta and gamma chains are required for the GTPase activity, for replacement of GDP by GTP, and for G protein- effector interaction; SUBUNIT: G proteins are composed of 3 units, alpha, beta, and gamma; SIMILARITY: Belongs to the WD repeat G protein beta family; SIMILARITY: Contains 7 WD repeats.
A08957Unknown function
A08380Unknown function
A00946Mitogen-activated protein kinase is involved in a signal transduction pathway that is activated by changes in the osmolarity of the extracellular environment. Controls osmotic regulation of transcription of target genes (By similarity). Involved in the virulence and conidia formation. Mediates tannic acid-induced laccase expression and cryparin expression; CATALYTIC ACTIVITY: ATP + a protein = ADP + a phosphoprotein; COFACTOR: Magnesium (By similarity); ENZYME REGULATION: Activated by tyrosine and threonine phosphorylation (By similarity). Hypoviruses like CHV1-EP713 induce inactivation by lowering the degree of phosphorylation in response to various environmental stresses; SUBCELLULAR LOCATION: Cytoplasm (By similarity). Nucleus (By similarity); DOMAIN: The TXY motif contains the threonine and tyrosine residues whose phosphorylation activates the MAP kinases; PTM: Dually phosphorylated on Thr-171 and Tyr-173, which activates the enzyme (By similarity). Phosphorylated in response of osmotic stress; SIMILARITY: Belongs to the protein kinase superfamily. Ser/Thr protein kinase family. MAP kinase subfamily. HOG1 sub-subfamily.; SIMILARITY: Contains 1 protein kinase domain.
A08495Unknown function
A07052Unknown function
A08159Unknown function
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Majeedano, A.Q.; Chen, J.; Zhu, T.; Li, S.; Gishkori, Z.G.N.; Mastoi, S.M.; Wang, G. The First Whole Genome Sequence Discovery of the Devastating Fungus Arthrinium rasikravindrae. J. Fungi 2022, 8, 255. https://doi.org/10.3390/jof8030255

AMA Style

Majeedano AQ, Chen J, Zhu T, Li S, Gishkori ZGN, Mastoi SM, Wang G. The First Whole Genome Sequence Discovery of the Devastating Fungus Arthrinium rasikravindrae. Journal of Fungi. 2022; 8(3):255. https://doi.org/10.3390/jof8030255

Chicago/Turabian Style

Majeedano, Abdul Qayoom, Jie Chen, Tianhui Zhu, Shujiang Li, Zeeshan Ghulam Nabi Gishkori, Sumbul Mureed Mastoi, and Gang Wang. 2022. "The First Whole Genome Sequence Discovery of the Devastating Fungus Arthrinium rasikravindrae" Journal of Fungi 8, no. 3: 255. https://doi.org/10.3390/jof8030255

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop