Next Article in Journal
Sericin Ser3 Ectopic Expressed in Posterior Silk Gland Affects Hemolymph Immune Melanization Response via Reducing Melanin Synthesis in Silkworm
Previous Article in Journal
Drivers of Insect Community Change along the Margins of Mountain Streams in Serra da Estrela Natural Park (Portugal)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

FibH Gene Complete Sequences (FibHome) Revealed Silkworm Pedigree

1
State Key Laboratory of Silkworm Genome Biology, Southwest University, Chongqing 400715, China
2
Biological Science Research Center, Southwest University, Chongqing 400715, China
3
Integrative Science Center of Gerplasm Greation in Western China (CHONGQING) Science City & Southwest University, Chongqing 400715, China
*
Authors to whom correspondence should be addressed.
Insects 2023, 14(3), 244; https://doi.org/10.3390/insects14030244
Submission received: 11 February 2023 / Revised: 23 February 2023 / Accepted: 24 February 2023 / Published: 28 February 2023
(This article belongs to the Section Insect Systematics, Phylogeny and Evolution)

Abstract

:

Simple Summary

The fibroin heavy chain (FibH) is the core protein of silkworm silk, and a systematic analysis of its sequence (primary structure) is essential for understanding silkworm silk fibers. However, only a few FibH sequences have been identified. The recently published high-resolution silkworm pan-genome was useful in analyzing the FibH gene systematically. In this study, we extracted the complete FibH sequences of wild silkworms, local and improved strains from the silkworm pan-genome, bringing the total number of known silkworm FibH genes to 286. We characterized in detail the FibH variation during domestication and breeding and established a silkworm pedigree based on the FibH gene sequence. Additionally, we investigated the relationship between the cis-regulatory elements of FibH gene and silk yield in wild and domesticated silkworms. Our study laid the foundation for artificial silk as well as provided guidance for silkworm breeding.

Abstract

The highly repetitive and variable fibroin heavy chain (FibH) gene can be used as a silkworm identification; however, only a few complete FibH sequences are known. In this study, we extracted and examined 264 FibH gene complete sequences (FibHome) from a high-resolution silkworm pan-genome. The average FibH lengths of the wild silkworm, local, and improved strains were 19,698 bp, 16,427 bp, and 15,795 bp, respectively. All FibH sequences had a conserved 5′ and 3′ terminal non-repetitive (5′ and 3′ TNR, 99.74% and 99.99% identity, respectively) sequence and a variable repetitive core (RC). The RCs differed greatly, but they all shared the same motif. During domestication or breeding, the FibH gene mutated with hexanucleotide (GGTGCT) as the core unit. Numerous variations existed that were not unique to wild and domesticated silkworms. However, the transcriptional factor binding sites, such as fibroin modulator-binding protein, were highly conserved and had 100% identity in the FibH gene’s intron and upstream sequences. The local and improved strains with the same FibH gene were divided into four families using this gene as a marker. Family I contained a maximum of 62 strains with the optional FibH (Opti-FibH, 15,960 bp) gene. This study provides new insights into FibH variations and silkworm breeding.

1. Introduction

Bombyx mori, the domesticated silkworm, is an economically important insect that has played a vital role in human economic and cultural history. According to evolutionary analysis and archeological evidence, domesticated silkworms originated in the middle and lower reaches of the Yellow River and were domesticated from Chinese wild silkworms [1,2,3]. The Chinese trimolter may have been the first domesticated local strain. Following selection and dissemination, the trimolter silkworm gradually evolved into a Chinese tetramolter strain, a European strain, and a tropical variety [4]. Local strains were bred into improved strains, including Chinese and Japanese improved strains to improve silk quality and yield. The Chinese improved strains were chosen from the Chinese tetramolter strains, whereas the Japanese improved strains were derived primarily from European strains [4]. Evolutionary analysis based on conserved genes revealed the origin of silkworms but could not reveal the silkworm breeding history. With the exception of the breeding process of some improved strains, the pedigree of local and some improved strains has remained unclear.
Silkworm silk is mainly composed of three fibroin and five sericin proteins, with the fibroin heavy chain (FibH), encoded by the FibH gene, having the highest content and molecular weight [5,6,7,8]. The first reported FibH gene is 16,788 bp long, consisting of two exons (67 bp, 15,750 bp) and one intron. Exon II comprises non-repetitive sequences at both ends and numerous tandem repeats [9,10]. We previously obtained the complete FibH sequences of seven wild (Bombyx mandarina) and seventeen domesticated silkworms using CACS method [11]. Their exon I is identical, the non-repetitive sequences at both ends of exon II are very conservative, and the introns are dominated by single-base mutations. The repetitive sequence of exon II is composed of the same repeat units, but their number and arrangement order differ, resulting in considerable differences in the sequence and length of the FibH gene between wild and domesticated silkworms. Wild silkworms had the longest average FibH and the most variation, followed by local strains, whereas improved strains had the shortest average FibH, and the fewest variations. Wild silkworms and local strains do not share the same FibH gene. This gene is identical in all improved strains, except for two mutants (grr and nds), which are closely related to their breeding history. The parents of strain 872 were strains 46 and 782, the grandfathers were strains 794 and 796, and the parents of strain 54A were strains 532 and 794, implying that the FibH gene could be used as a marker gene for identifying silkworm pedigree.
There are thousands of silkworm strains, including wild silkworms, local and improved strains, and genetic mutants, but only 25 strains have known FibH sequences [9,12,13]. Tong et al. recently established a high-resolution pan-genomic database using deeply re-sequencing and long-read sequencing [3]. In this study, we extracted FibH sequences from the silkworm pan-genome, characterized the FibH variation, and developed a silkworm pedigree. To investigate the relationship between cis-regulatory elements and silk yield, the intron, upstream, and downstream sequences of the FibH gene were examined. Our results improve our understanding of the FibH gene in addition to providing insights for silkworm breeding.

2. Materials and Methods

2.1. Data Source

Tong et al. assembled 545 high-resolution silkworm genomes and deposited them in the China National GeneBank DataBase (CNGBdb; https://db.cngb.org/ (accessed on 10 November 2022)) [3]. Sample IDs (such as BomW2, BomL1, and BomP18) in Supplementary Table S2 were used to retrieve data from the CNGBdb. Each sample ID contained the assembled chromosome and contig data, each with a unique assembly number. The assembled chromosome data were downloaded for subsequent analyses.

2.2. Sequence Analysis

The complete FibH sequence, as well as its upstream and downstream sequences, were extracted from the assembled chromosomal genome using the terminal sequence (20 bp) of the Dazao strain’s FibH gene as a barcode. A few FibH sequences were investigated and aligned using Bioedit 7.0.9.0 and Jalview 2.11.2.5 [14]. The online software mafft (https://mafft.cbrc.jp/alignment/server/ (accessed on 12 November 2022)) [15] was used to perform multiple sequence alignment of all FibH genes and draw a phylogenetic tree based on the average linkage (UPGMA) algorithm. TNR and RC sequences were aligned by online software mafft and their identity was calculated by DNAman version 9.0 (Lynnon Biosoft, San Ramon, CA, USA). RNA22 [16] and RNAhybrid [17] were used to predict miRNA targeting sites existing in the 3′ UTR of all FibH genes.

2.3. Motif Identification

A novel motif in the FibH gene was discovered by MEME in MEME-Suite 5.5.0 [18,19]. The parameters were set to default, except for the number of motifs to be searched (four motifs), the minimum motif width (18 bp), the minimum number of sites per motif (10 sites), and the distribution of any number of repetitions (anr) in FibH. Each amorphous region of the FibH gene had one Nde I site [11], which was used to distinguish the amorphous regions. We used MAST in the MEME-Suite to scan Nde I sites. All the parameters were set to default, except for the motif sequence (restriction site of Nde I) and the FibH sequence.

2.4. Visualization

TBtools software [20] was used to visualize the motifs detected and scanned by MEME and MAST in MEME-Suite. The phylogenetic tree was modified using the online software iTOL (https://itol.embl.de/ (accessed on 1 December 2022)) [21]. The aligned introns as well as the upstream and downstream sequences were visualized through the NCBI Multiple Sequence Alignment Viewer 1.22.2 (https://www.ncbi.nlm.nih.gov/projects/msaviewer/ (accessed on 5 December 2022)). A map was constructed using the software Mark O’Travel 3.2.1 (https://download.cnet.com/Mark-O-Travel-your-travel-map-Where-you-ve-been/3000-20428_4-76133240.html (accessed on 1 December 2022)); the geographical distribution of silkworms was marked on the map by Adobe Illustrator 2021 (Adobe Systems, San Jose, CA, USA).

3. Results

3.1. FibH Gene of High Quality from the Silkworm Pan-Genome

Oxford Nanopore Technologies (ONTs) have long reads and systematic errors that are significantly biased and primarily located in the homopolymer and tandem repetition regions [22]. In a previous study, the FibH gene from Cpf1-based enrichment and ONT (CEO)-enriched sequencing (coverage depth 87.07×) lost three bases in polythymine and polyguanine of introns, resulting from sequencing errors [13]. Tong et al. de novo assembled 545 silkworm genomes with average sequencing depth of 97× through long-read sequencing [3]. To assess these genomes, we first extracted FibH sequences from the genomes of three local strains: L38 (81×), L86 (72×), and L193 (103×). Compared to the FibH of Damao, C108 and Bilian strains obtained by CACS [11], L38 and Damao, L193 and C108, Bilian, and L86 had identical FibH gene, indicating that the pan-genome was of high quality and could be used for entire FibH analysis (Figure 1, Tables S1 and S2).

3.2. FibH Gene Complete Sequences (FibHome) of 264 Silkworms

Tong et al. assembled 545 high-resolution silkworm genomes, including 39 wild silkworms, 160 local and 118 improved strains, and 228 genetic stocks [3]. Excluding FibH with polyguanine and polyN (N representing any base), we extracted 264 complete FibH sequences from the genomes of wild (20), local (138), and improved silkworm strains (106) (Figure 2). All FibH genes were composed of the same motif (AGGWRSYGGWGCTGGYTCAGGWGCTGGTGCTGGTKCAGGWGCTGGTGCT, W = T or A, R = G or A, S = G or C, Y = T or C, K = T or G) and were rich in GC (Figure 2). Except for W14, which had 60.0364% GC, the GC content of the other FibH sequences ranged from 58.9449% to 59.6878% (Tables S1 and S2). Most of the silkworms in genetic stocks were mutants of unknown origin, so we did not focus on their FibH genes.
The length of the FibH sequence extracted from wild silkworms varied from 24,162 bp (W14) to 17,664 bp (W12), with an average length of 19,698 bp. The local strains’ FibH genes ranged in size from 14 to 19 kb, with an average length of 16,427 bp. FibH genes of the same length (15,960 bp) were observed in 11.59% of local strains (L79, L90, L91, L102, etc.). The improved strains had an average FibH of 15,795 bp. Except for P204 (20,541 bp), the FibH genes of other improved strains were distributed in 13–19 kb. Of the improved strains, 46.23%, 23.58%, 7.55%, and 4.72% had 15,960 bp, 15,318 bp, 15,315 bp, and 14,874 bp FibH genes, respectively (Figure 2). These results implied that the FibH gene is shortened during domestication.

3.3. Conserved Terminal Non-Repetitive and Variable Repetitive Cores of FibH Genes

FibH gene coding sequences in wild and domesticated silkworms were composed of 5′ and 3′ terminal non-repetitive (5′ and 3′ TNR, respectively) sequences and a repetitive core (RC). The 3′ TNR of 264 silkworm FibH genes from the pan-genome was highly conserved, with 99.99% identity. Except for the 3′ TNR of one improved strain (P204) and two wild silkworms (W17, W35) with base substitutions, the others were the same (Figure 3A). In contrast, the variation of the 5′ TNR was more than that of the 3′ TNR, with 99.74% identity (Figure 3A). One local strain (L96) had a base insertion, resulting in a frameshift mutation and failure of silkworm spinning. This may be due to a sequencing error. Two silkworm strains (L93 and P201) had one base substitution at the same position, and six local silkworm strains had three consecutive base deletions (CAT) at the same position. These variations originate from single nucleotide polymorphisms in the silkworm strains. Some wild silkworms had 1–4 base substitutions, but no insertions or deletions (indels) were identified. The 5′ TNR of the other silkworms was the same (Figure 3A). The high conservation of 5′ and 3′ TNR implied that they may play key role in silk formation.
The RC of FibH gene (16.12% identity) was more variable in length and sequence than the 5′ and 3′ TNR, which is consistent with previous studies [11]. The RC of wild silkworms was the longest and each wild silkworm had a unique RC with 21.83% identity (Figure 2). The average RC of the local strains was shorter than that of the wild silkworms and had 23.91% identity. The improved strains had the shortest RC and least difference, with 48.40% identity (Figure 3B). The same RC of 14,319 bp and 13,692 bp was shared by 48.11% and 23.58% of improved silkworms, respectively (Table S3).

3.4. FibH Gene Mutated with GGTGCT as the Core Unit

The FibH of some silkworms differed from one another in the pan-genome by only one repeat, that is, the differential sequence. All differential sequences were classified into four groups. The differential sequence of Group 1 had only six bases (GGTGCT). The sequence difference between Groups 2 and 3 was 18 bp, and the two groups differed by only one base, and both encoded the hexapeptides GAGAGS, which are the repetitive units of FibH. Group 4 had a 54–4302 bp differential sequence that could encode a complete peptide. The first six bases of the other three groups of differential sequences, with the exception of Group 1, were GGTGCT (encoding dipeptide GA), and the last three bases were TCA (serine) (Figure 4, Table S4). Both dipeptides, GA and serine, are components of the repetitive GAGAGS unit. There were two or more differential sequences containing GGTGCT between FibH genes of other silkworms. These findings indicated the FibH gene mutated with hexanucleotide GGTGCT as the core unit.

3.5. Cis-Regulatory Elements Regulating the FibH Gene Were Highly Conserved

The fiber content of FibH, as the main component of cocoon silk, determines the silk yield [23]. The FibH gene is highly expressed in the fifth instar larvae and the silkworm’s posterior silk gland and is tightly regulated by cis-regulatory elements (including E, C sites, and TATA-box) and trans-regulatory factors (including Bmdimm, SGFs, FMBPs, Fkh, and POU-M2) [24,25,26,27,28,29]. We found those cis-regulatory elements in the 5 kb upstream sequences of 264 FibH genes were highly conserved and had 100% identity (Figure 5A). In addition to the upstream sequence, the intron is also involved in the regulation of the FibH gene [24]. The FibH gene has only one intron. The L84 intron was 1007 bp long, and the other introns were 958–988 bp long, with 971 bp and 986 bp accounting for 51.14% and 37.50%, respectively. Binding sites for the transcription factor FMBPs were present on all of the silkworm FibH introns, suggesting their important role in regulating FibH expression (Figure 5B).
There are currently few studies on the downstream regulatory sequence of FibH, and it remains unknown if it regulates the FibH gene. The 3′ UTR is the first 85 bp of the downstream 5 kb sequence of FibH. Except for W35 and L62, all silkworms had the same 3′ UTR. There is an E-box in the downstream sequence of the 3′ UTR that may be involved in regulating the FibH gene. Bmo-miR-2739 was the first miRNA discovered in domesticated silkworms, and it can bind to the 3′ UTR to regulate target genes [30,31]. The RNA22 and RNAhybrid predicted target sites existed in the 3′ UTR of all FibH genes. There was no specific sequence or variation in the upstream and downstream sequences or introns of FibH gene of wild and domesticated silkworms (Figure 5C). These findings suggested that cis-regulatory elements are essential for FibH expression, but are not the cause of silk yield differences between silkworm strains, which are likely to be mainly determined by trans-acting factors.

3.6. Silkworm Pedigree Revealed by FibHome

In a previous study, we observed that improved strains (breeding strains, excluding mutants grr and nds) had identical FibH genes [11], which is consistent with their breeding process (Figure 6A). In the pan-genome, strains P181 and P177, P68 and 794, 796 had the same FibH gene. P181 (JingsongB) is a binary hybrid of P177 (781A) and 757, while P68 (Yun8B) is a ternary hybrid of strain 794, 796, and 9042, and strain 872 is a ternary hybrid of 794, 796, and 792. Strain 872 (15,960 bp) had the same FibH gene as 794 and 796, but differed from P40 (strain 782, 15,920 bp), indicating that the FibH gene of strain 872 was derived from strain 794 and 796. These results suggested that FibH could be used as a marker gene for silkworm identification.
In the FibH sequence extracted by us, not all wild silkworms had the same FibH gene. As a result, they could not be clustered (Figure 6B). There were mainly five families of local strains, with a maximum of 16 strains in one family (Figure 6C). There were four improved strain families with 5–47 silkworm strains (Figure 6D). Local and improved strains with identical FibH genes were classified into four large clusters: Family I (62 silkworms), II (31 silkworms), III (16 silkworms), and IV (6 silkworms) (Figure 6E and Table 1). The local strains of Families I and III were distributed in many Asian and European countries (mainly China and Japan). The improved strains were all from China and Japan. Both local and improved strains of Family II were obtained from China. Family IV had only one local strain and five improved strains from Romania and Japan (Figure 6F). These results implied that silkworms, particularly local strains, are not confined to one geographic region but spread throughout many countries, with improved strains primarily bred in China and Japan.

4. Discussion

FibH is the main component of silkworm silk, and its sequence (primary structure) is the structural basis for the excellent mechanical properties of silk fiber [32,33]. The analysis of its sequence is helpful for understanding the formation mechanism of silk fibers and silk formation. The FibH gene has been studied for more than 30 years; however, few complete FibH sequences have been reported because of its long sequence (up to 20 kb), rich GC, and repetitive sequences [9,13,34]. Single-molecule sequencing has been used to resolve the complete FibH sequence in recent years [3,4], but it is expensive. In this study, we have fully mined the pan-genome data and extracted the FibH and its upstream and downstream sequences from wild silkworms, local and improved strains, and some genetic mutants, providing a comprehensive understanding of the FibH gene and silkworm breeding.
We attempted to describe the complete sequence of the FibH gene and have established the CEO and CACS methods successively [11,13]. The CEO method based on Cpf1 and ONT had an 87.07× coverage depth and 2.29-fold target enrichment. The FibH obtained by CEO was identical to that obtained by P50T [11], except that three bases were deleted in the intron. CACS is a cloning and sequencing method based on CRISPR/Cas9 and PacBio single molecule sequencing. The 24 silkworm FibH sequences obtained by CACS were translated continuously without frameshift mutations. The 545 high-resolution pan-genomes were derived using whole-genome sequencing (NGS and ONT sequencing). Among the FibH sequences we extracted, some had polyguanine and polyN (N representing any base), as well as single-base variations that led to premature termination of translation. CEO is expensive, and the enrichment factor is affected by Cpf1/crRNA activity and chromosomal fragments, which are suitable for target gene sequencing of a small number of samples. In contrast, CACS has a low cost and high accuracy, but the cloning cycle is long; therefore, it is suitable for obtaining target genes or fragments of interest from chromosomes. The cycle of whole-genome sequencing is short, and the cost and accuracy are related to the sequencing depth; as a result, it is more appropriate for an unknown genome or mass resequencing.
We obtained a total of 286 silkworm (27 wild silkworms, 146 local, and 113 improved strains) FibH sequences combined this and previous study [11]. The optimal FibH (Opti-FibH) gene, which has a length of 15,960 base pairs, was found in 24.48% (70/286) of silkworms, including Family I. The maximum strength of silk is positively correlated with the length of the FibH gene. The longer the FibH, the stronger the silk fiber [11]. However, domesticated silkworms are protected by humans and do not need silk as strong as wild silkworms; the FibH gene is too short, silk is weak, and silk yield is low [11], which is in contrast to domestication and breeding. In addition to Opti-FibH, there were many FibH genes with similar lengths, but none of them were preferentially selected. We speculated that Opti-FibH may be more conducive to spinning and textile processing.
With FibH gene as marker gene, all silkworms were divided into four large families, and each family had at least six members containing local and improved strains. Except for these four families, the rest had few members and only local strains. Moreover, we obtained the FibH sequences of four heterozygotes (L185, L194, P21, and P25) from the pan-genome. The FibH genes of L185 and L194 were different from those of other strains. P25 had one FibH gene (15,315 bp) belonging to Family I, while the other was different from other FibH genes. The two FibH genes (15,960 bp and 15,315 bp) of P21 belonged to Families I and III, respectively, indicating that P21 was associated with these two families, but we could not determine its parents. More written records and genome-wide analyses are needed to identify the parents of the silkworms.
In conclusion, we globally characterized hundreds of silkworm FibH sequences from the pan-genome, revealed the relationship between the cis-regulatory element and silk yield, and established a silkworm pedigree. This study provides a theoretical foundation for silk improvement and silkworm breeding.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/insects14030244/s1, Table S1: The silkworm strains in detail; Table S2: The FibH gene sequences from silkworm pan-genome; Table S3: Sequence and length of all repetitive cores; Table S4: Differential sequence between FibH genes.

Author Contributions

Conceptualization, S.M. and Q.X.; methodology, W.L. and S.M.; software, T.Z. and Q.Z.; validation, W.L.; formal analysis, W.L.; investigation, W.L.; resources, S.M.; data curation, T.Z.; writing—original draft preparation, W.L.; writing—review and editing, L.J., S.M., and Q.X.; visualization, N.Z., Q.Z., and T.Z.; supervision, Q.X.; project administration, Q.X.; funding acquisition, Q.X. and S.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by grants from the National Natural Science Foundation of China (No. 32122084), Chongqing Natural Science Foundation (No. cstc2020jcyj-bshX0092; cstc2021ycjh-bgzxm0005), PhD Start-up Foundation of Southwest University (No. SWU120012), and Fundamental Research Funds for Central Universities (no. SWU-KT22042).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Xia, Q.; Guo, Y.; Zhang, Z.; Li, D.; Xuan, Z.; Li, Z.; Dai, F.; Li, Y.; Cheng, D.; Li, R.; et al. Complete Resequencing of 40 Genomes Reveals Domestication Events and Genes in Silkworm (Bombyx). Science 2009, 326, 433–436. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Sun, W.; Yu, H.; Shen, Y.; Banno, Y.; Xiang, Z.; Zhang, Z. Phylogeny and Evolutionary History of the Silkworm. Sci. China Life Sci. 2012, 55, 483–496. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Tong, X.; Han, M.-J.; Lu, K.; Tai, S.; Liang, S.; Liu, Y.; Hu, H.; Shen, J.; Long, A.; Zhan, C.; et al. High-Resolution Silkworm Pan-Genome Provides Genetic Insights into Artificial Selection and Ecological Adaptation. Nat. Commun. 2022, 13, 5619. [Google Scholar] [CrossRef] [PubMed]
  4. Xiang, H.; Liu, X.; Li, M.; Zhu, Y.; Wang, L.; Cui, Y.; Liu, L.; Fang, G.; Qian, H.; Xu, A.; et al. The Evolutionary Road from Wild Moth to Domestic Silkworm. Nat. Ecol. Evol. 2018, 2, 1268–1279. [Google Scholar] [CrossRef]
  5. Inoue, S.; Tanaka, K.; Arisaka, F.; Kimura, S.; Ohtomo, K.; Mizuno, S. Silk Fibroin of Bombyx Mori Is Secreted, Assembling a High Molecular Mass Elementary Unit Consisting of H-Chain, L-Chain, and P25, with a 6:6:1 Molar Ratio. J. Biol. Chem. 2000, 275, 40517–40528. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Peng, Z.; Yang, X.; Liu, C.; Dong, Z.; Wang, F.; Wang, X.; Hu, W.; Zhang, X.; Zhao, P.; Xia, Q. Structural and Mechanical Properties of Silk from Different Instars of Bombyx Mori. Biomacromolecules 2019, 20, 1203–1216. [Google Scholar] [CrossRef] [PubMed]
  7. Dong, Z.; Guo, K.; Zhang, X.; Zhang, T.; Zhang, Y.; Ma, S.; Chang, H.; Tang, M.; An, L.; Xia, Q.; et al. Identification of Bombyx Mori Sericin 4 Protein as a New Biological Adhesive. Int. J. Biol. Macromol. 2019, 132, 1121–1130. [Google Scholar] [CrossRef]
  8. Guo, K.; Zhang, X.; Zhao, D.; Qin, L.; Jiang, W.; Hu, W.; Liu, X.; Xia, Q.; Dong, Z.; Zhao, P. Identification and Characterization of Sericin5 Reveals Non-Cocoon Silk Sericin Components with High β-Sheet Content and Adhesive Strength. Acta Biomater. 2022, 150, 96–110. [Google Scholar] [CrossRef]
  9. Zhou, C.-Z. Fine Organization of Bombyx Mori Fibroin Heavy Chain Gene. Nucleic Acids Res. 2000, 28, 2413–2419. [Google Scholar] [CrossRef] [Green Version]
  10. Zhou, C.-Z.; Confalonieri, F.; Jacquet, M.; Perasso, R.; Li, Z.-G.; Janin, J. Silk Fibroin: Structural Implications of a Remarkable Amino Acid Sequence. Proteins 2001, 44, 119–122. [Google Scholar] [CrossRef]
  11. Lu, W.; Ma, S.; Sun, L.; Zhang, T.; Wang, X.; Feng, M.; Wang, A.; Shi, R.; Jia, L.; Xia, Q. Combined CRISPR Toolkits Reveal the Domestication Landscape and Function of the Ultra-Long and Highly Repetitive Silk Genes. Acta Biomater. 2023, 158, 190–202. [Google Scholar] [CrossRef] [PubMed]
  12. Kawamoto, M.; Kiuchi, T.; Katsuma, S. SilkBase: An Integrated Transcriptomic and Genomic Database for Bombyx Mori and Related Species. Database 2022, 2022, baac040. [Google Scholar] [CrossRef] [PubMed]
  13. Lu, W.; Lan, X.; Zhang, T.; Sun, H.; Ma, S.; Xia, Q. Precise Characterization of Bombyx Mori Fibroin Heavy Chain Gene Using Cpf1-Based Enrichment and Oxford Nanopore Technologies. Insects 2021, 12, 832. [Google Scholar] [CrossRef]
  14. Waterhouse, A.M.; Procter, J.B.; Martin, D.M.A.; Clamp, M.; Barton, G.J. Jalview Version 2--a Multiple Sequence Alignment Editor and Analysis Workbench. Bioinformatics 2009, 25, 1189–1191. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Katoh, K.; Kuma, K.; Toh, H.; Miyata, T. MAFFT Version 5: Improvement in Accuracy of Multiple Sequence Alignment. Nucleic Acids Res. 2005, 33, 511–518. [Google Scholar] [CrossRef] [PubMed]
  16. Miranda, K.C.; Huynh, T.; Tay, Y.; Ang, Y.-S.; Tam, W.-L.; Thomson, A.M.; Lim, B.; Rigoutsos, I. A Pattern-Based Method for the Identification of MicroRNA Binding Sites and Their Corresponding Heteroduplexes. Cell 2006, 126, 1203–1217. [Google Scholar] [CrossRef] [Green Version]
  17. Rehmsmeier, M.; Steffen, P.; Höchsmann, M.; Giegerich, R. Fast and Effective Prediction of MicroRNA/Target Duplexes. RNA 2004, 10, 1507–1517. [Google Scholar] [CrossRef] [Green Version]
  18. Bailey, T.L.; Boden, M.; Buske, F.A.; Frith, M.; Grant, C.E.; Clementi, L.; Ren, J.; Li, W.W.; Noble, W.S. MEME SUITE: Tools for Motif Discovery and Searching. Nucleic Acids Res. 2009, 37, W202–W208. [Google Scholar] [CrossRef]
  19. Bailey, T.L.; Johnson, J.; Grant, C.E.; Noble, W.S. The MEME Suite. Nucleic Acids Res. 2015, 43, W39–W49. [Google Scholar] [CrossRef] [Green Version]
  20. Chen, C.; Chen, H.; Zhang, Y.; Thomas, H.R.; Frank, M.H.; He, Y.; Xia, R. TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data. Mol. Plant 2020, 13, 1194–1202. [Google Scholar] [CrossRef]
  21. Letunic, I.; Bork, P. Interactive Tree Of Life v2: Online Annotation and Display of Phylogenetic Trees Made Easy. Nucleic Acids Res. 2011, 39, W475–W478. [Google Scholar] [CrossRef] [PubMed]
  22. Cretu Stancu, M.; van Roosmalen, M.J.; Renkens, I.; Nieboer, M.M.; Middelkamp, S.; de Ligt, J.; Pregno, G.; Giachino, D.; Mandrile, G.; Espejo Valle-Inclan, J.; et al. Mapping and Phasing of Structural Variation in Patient Genomes Using Nanopore Sequencing. Nat. Commun. 2017, 8, 1326. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Zhang, Y.; Zhao, P.; Dong, Z.; Wang, D.; Guo, P.; Guo, X.; Song, Q.; Zhang, W.; Xia, Q. Comparative Proteome Analysis of Multi-Layer Cocoon of the Silkworm, Bombyx Mori. PLoS ONE 2015, 10, e0123403. [Google Scholar] [CrossRef] [PubMed]
  24. Takiya, S.; Kokubo, H.; Suzuki, Y. Transcriptional Regulatory Elements in the Upstream and Intron of the Fibroin Gene Bind Three Specific Factors POU-M1, Bm Fkh and FMBP-1. Biochem. J. 1997, 321, 645–653. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Kimoto, M.; Tsubota, T.; Uchino, K.; Sezutsu, H.; Takiya, S. LIM-Homeodomain Transcription Factor Awh Is a Key Component Activating All Three Fibroin Genes, FibH, FibL and Fhx, in the Silk Gland of the Silkworm, Bombyx Mori. Insect Biochem. Mol. Biol. 2015, 56, 29–35. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Zhou, C.; Zha, X.; Shi, P.; Wei, S.; Wang, H.; Zheng, R.; Xia, Q. Multiprotein Bridging Factor 2 Regulates the Expression of the Fibroin Heavy Chain Gene by Interacting with Bmdimmed in the Silkworm Bombyx Mori: MBF2 and Expression of the Fibroin Heavy Chain Gene. Insect Mol. Biol. 2016, 25, 509–518. [Google Scholar] [CrossRef]
  27. Zhou, C.; Zha, X.; Shi, P.; Zhao, P.; Wang, H.; Zheng, R.; Xia, Q. Nuclear Hormone Receptor BmFTZ-F1 Is Involved in Regulating the Fibroin Heavy Chain Gene in the Silkworm, Bombyx Mori. Biochim. Biophys. Acta (BBA)—Gen. Subj. 2016, 1860, 2529–2536. [Google Scholar] [CrossRef]
  28. Liu, L.; Wang, Y.; Li, Y.; Guo, P.; Liu, C.; Li, Z.; Wang, F.; Zhao, P.; Xia, Q.; He, H. Insights into the Repression of Fibroin Modulator Binding Protein-1 on the Transcription of Fibroin H-Chain during Molting in Bombyx Mori. Insect Biochem. Mol. Biol. 2019, 104, 39–49. [Google Scholar] [CrossRef]
  29. Ma, Y.; Luo, Q.; Ou, Y.; Tang, Y.; Zeng, W.; Wang, H.; Hu, J.; Xu, H. New Insights into the Proteins Interacting with the Promoters of Silkworm Fibroin Genes. Sci. Rep. 2021, 11, 15880. [Google Scholar] [CrossRef]
  30. Yu, X.; Zhou, Q.; Cai, Y.; Luo, Q.; Lin, H.; Hu, S.; Yu, J. A Discovery of Novel MicroRNAs in the Silkworm (Bombyx Mori) Genome. Genomics 2009, 94, 438–444. [Google Scholar] [CrossRef] [Green Version]
  31. Chen, E.; Chen, Z.; Li, S.; Xing, D.; Guo, H.; Liu, J.; Ji, X.; Lin, Y.; Liu, S.; Xia, Q. MicroRNAs Bmo-MiR-2739 and Novel-MiR-167 Coordinately Regulate the Expression of the Vitellogenin Receptor in Bombyx Mori Oogenesis. Development 2020, 147, dev.183723. [Google Scholar] [CrossRef] [PubMed]
  32. Guo, C.; Zhang, J.; Jordan, J.S.; Wang, X.; Henning, R.W.; Yarger, J.L. Structural Comparison of Various Silkworm Silks: An Insight into the Structure–Property Relationship. Biomacromolecules 2018, 19, 906–917. [Google Scholar] [CrossRef] [PubMed]
  33. Fang, G.; Sapru, S.; Behera, S.; Yao, J.; Shao, Z.; Kundu, S.C.; Chen, X. Exploration of the Tight Structural–Mechanical Relationship in Mulberry and Non-Mulberry Silkworm Silks. J. Mater. Chem. B 2016, 4, 4337–4347. [Google Scholar] [CrossRef] [PubMed]
  34. Mita, K.; Ichimura, S.; James, T.C. Highly Repetitive Structure and Its Organization of the Silk Fibroin Gene. J. Mol. Evol. 1994, 38, 583–592. [Google Scholar] [CrossRef]
Figure 1. Comparison of the fibroin heavy chain (FibH) genes from pan-genome and as obtained by CACS. The FibH genes of Damao, C108, and Bilian were obtained by CACS [11], and L38, L86, and L193 were from the pan-genome.
Figure 1. Comparison of the fibroin heavy chain (FibH) genes from pan-genome and as obtained by CACS. The FibH genes of Damao, C108, and Bilian were obtained by CACS [11], and L38, L86, and L193 were from the pan-genome.
Insects 14 00244 g001
Figure 2. FibH gene complete sequences (FibHome) of 264 silkworms from the pan-genome. All FibH sequences were unaligned and arranged by length. Motif, orange; Wild silkworms, red; Local silkworms, green; Improved silkworms, blue.
Figure 2. FibH gene complete sequences (FibHome) of 264 silkworms from the pan-genome. All FibH sequences were unaligned and arranged by length. Motif, orange; Wild silkworms, red; Local silkworms, green; Improved silkworms, blue.
Insects 14 00244 g002
Figure 3. Conserved analysis of the FibH gene. (A) Similarity analysis of TNR of 264 FibH sequences. 5′ or 3′ TNR, 5′ or 3′ terminal non-repetitive. (B) Similarity analysis of RC of wild silkworms, local and improved strains. RC, repetitive core.
Figure 3. Conserved analysis of the FibH gene. (A) Similarity analysis of TNR of 264 FibH sequences. 5′ or 3′ TNR, 5′ or 3′ terminal non-repetitive. (B) Similarity analysis of RC of wild silkworms, local and improved strains. RC, repetitive core.
Insects 14 00244 g003
Figure 4. FibH gene with differential sequence. The nucleotide sequence is the differential sequence between two adjacent silkworm FibH genes. All differential sequences were divided into four groups. The FibH gene and the amorphous region identified by Nde I are represented by black horizontal lines and red blocks, respectively.
Figure 4. FibH gene with differential sequence. The nucleotide sequence is the differential sequence between two adjacent silkworm FibH genes. All differential sequences were divided into four groups. The FibH gene and the amorphous region identified by Nde I are represented by black horizontal lines and red blocks, respectively.
Insects 14 00244 g004
Figure 5. Comparison of the introns and upstream and downstream sequences of the FibH gene in wild silkworms, local and improved strains. (AC) Upstream, intron, and downstream sequence alignment of the FibH gene. Red lines and green triangles indicate sequence differences and cis-regulatory elements, respectively.
Figure 5. Comparison of the introns and upstream and downstream sequences of the FibH gene in wild silkworms, local and improved strains. (AC) Upstream, intron, and downstream sequence alignment of the FibH gene. Red lines and green triangles indicate sequence differences and cis-regulatory elements, respectively.
Insects 14 00244 g005
Figure 6. Silkworm pedigree based on FibHome. (A) Silkworm family of improved strains. Domesticated silkworms with identical FibH genes from a previous study [11], yellow background; domesticated silkworms from the pan-genome, red font. (B) The phylogenetic tree of wild silkworms. (C) The phylogenetic tree of local strains. (D) The phylogenetic tree of improved strains. (E) The phylogenetic tree of silkworms containing wild silkworms, local and improved strains. The branches of the same color had the identical FibH gene except for the black ones, in which the thickened branches were improved strains. (F) Geographical distribution of silkworms in Families I, II, III, and IV. Local strains, circle; Improved strains, Pentacle. Family I, red; Family II, green; Family III, blue; Family IV, yellow.
Figure 6. Silkworm pedigree based on FibHome. (A) Silkworm family of improved strains. Domesticated silkworms with identical FibH genes from a previous study [11], yellow background; domesticated silkworms from the pan-genome, red font. (B) The phylogenetic tree of wild silkworms. (C) The phylogenetic tree of local strains. (D) The phylogenetic tree of improved strains. (E) The phylogenetic tree of silkworms containing wild silkworms, local and improved strains. The branches of the same color had the identical FibH gene except for the black ones, in which the thickened branches were improved strains. (F) Geographical distribution of silkworms in Families I, II, III, and IV. Local strains, circle; Improved strains, Pentacle. Family I, red; Family II, green; Family III, blue; Family IV, yellow.
Insects 14 00244 g006
Table 1. Silkworm pedigree with the same FibH gene.
Table 1. Silkworm pedigree with the same FibH gene.
FamilySample IDSampleFibH Gene (bp)
Family IL79, L90, L102, L109, L113, L172, L173, L174, L176, L178, L182, L183, L184, L200, L214, P20, P22, P35, P38, P46, P47, P49, P50, P51, P53, P54, P55, P59, P63, P64, P68, P70, P72, P73, P97, P103, P113, P119, P121, P125, P150, P153, P155, P157, P158, P161, P168, P169, P171, P172, P174, P175, P180, P183, P184, P185, P186, P195, P196, P197, P199, P208 6215,960
Family IIL87, L94, L97, L105, L201, L208, P99, P101, P104, P105, P107, P108, P111, P114, P122, P124, P129, P131, P149, P156, P167, P176, P177, P178, P181, P187, P188, P191, P192, P193, P1943115,318
Family IIIL47, L71, L75, L92, L98, L179, L180, L193, P18, P26, P27, P33, P34, P41, P163, P2021615,315
Family IVL63, P37, P67, P74, P76, P79614,874
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lu, W.; Zhang, T.; Zhang, Q.; Zhang, N.; Jia, L.; Ma, S.; Xia, Q. FibH Gene Complete Sequences (FibHome) Revealed Silkworm Pedigree. Insects 2023, 14, 244. https://doi.org/10.3390/insects14030244

AMA Style

Lu W, Zhang T, Zhang Q, Zhang N, Jia L, Ma S, Xia Q. FibH Gene Complete Sequences (FibHome) Revealed Silkworm Pedigree. Insects. 2023; 14(3):244. https://doi.org/10.3390/insects14030244

Chicago/Turabian Style

Lu, Wei, Tong Zhang, Quan Zhang, Na Zhang, Ling Jia, Sanyuan Ma, and Qingyou Xia. 2023. "FibH Gene Complete Sequences (FibHome) Revealed Silkworm Pedigree" Insects 14, no. 3: 244. https://doi.org/10.3390/insects14030244

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop