Next Article in Journal
Roles of the Unsaturated Fatty Acid Docosahexaenoic Acid in the Central Nervous System: Molecular and Cellular Insights
Next Article in Special Issue
GmWRKY81 Encoding a WRKY Transcription Factor Enhances Aluminum Tolerance in Soybean
Previous Article in Journal
Novel Elastic Threads for Intestinal Anastomoses: Feasibility and Mechanical Evaluation in a Porcine and Rabbit Model
Previous Article in Special Issue
A Sequence Variation in GmBADH2 Enhances Soybean Aroma and Is a Functional Marker for Improving Soybean Flavor
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identification of a Candidate restorer-of-fertility Gene Rf3 Encoding a Pentatricopeptide Repeat Protein for the Cytoplasmic Male Sterility in Soybean

1
Soybean Research Institute, Jilin Academy of Agricultural Sciences, Changchun 130033, China
2
Key Laboratory of Hybrid Soybean Breeding of the Ministry of Agriculture and Rural Affairs, Changchun 130033, China
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2022, 23(10), 5388; https://doi.org/10.3390/ijms23105388
Submission received: 15 April 2022 / Revised: 9 May 2022 / Accepted: 10 May 2022 / Published: 11 May 2022
(This article belongs to the Special Issue Soybean Molecular Breeding and Genetics)

Abstract

:
The cytoplasmic male sterility/restorer-of-fertility (CMS/Rf) system plays a vital role in high-efficiency hybrid seed production in crops, including soybean (Glycine max (L.) Merr.). The markers linked to fertility restoration and the restorer-of-fertility (Rf) genes are essential because they can facilitate the breeding of new CMS lines and production of commercial hybrid soybean seeds. To date, several soybean Rf genes have been mapped to various genetic loci in diverse genetic populations. However, the mapping range of restorer genes remains narrow, with relatively limited practical applicability. Therefore, in the present study, F2 and F3 segregating populations derived from the CMS line JLCMS5A crossed with the restorer line JLR2 were developed and used for Rf3 gene fine mapping. Genetic investigation indicated that the restorer line JLR2 was controlled by a single dominant gene, Rf3. By integrating bulk-segregant analysis and next-generation sequencing, a 4 Mb region on chromosome 9 was identified, which was most likely the target region harboring the candidate gene responsible for fertility restoration. This region was further narrowed down to 86.44 Kb via fine mapping in F2 and F3 populations using SSR, InDel, and dCAPS markers. This region contained 10 putative genes (Glyma.09G171100Glyma.09G172000). Finally, Glyma.09G171200, which encodes a mitochondria-targeted pentatricopeptide repeat protein, was proposed as the potential candidate for Rf3 using sequence alignment and expression analysis in restorer and CMS lines. Based on single-nucleotide polymorphisms in Glyma.09G171200, a CAPS marker co-segregated with Rf3 named CAPS1712 was developed. Our results will be fundamental in the assisted selection and creation of potent lines for the production and rapid selection of novel restorer lines.

1. Introduction

Soybean (Glycine max (L.) Merr.), which originated in China and has been cultivated for over 5000 years, is a great source of quality protein and vegetable oil. In recent years, the demand for soybean has grown rapidly, but the supply remains inadequate. Hybrid breeding is an important approach to increase the yield and improve the quality of crops, and it has been successfully applied in rice, maize, wheat, rapeseed, and other crops. However, a high degree of self-pollination and availability of limited germplasm resources restrict the use and spread of commercial hybrid cultivars.
Cytoplasmic male sterility (CMS) is the maternally inherited inability to produce functional pollen, which is widespread in vascular plants, being recorded in nearly 200 species worldwide [1]. Fertility loss in CMS plants is often attributed to unusual chimeric open reading frames (ORFs) in the mitochondrial genome [2], and it can be reversed by the restorer-of-fertility (Rf) genes located in the nuclear genome [3]. The restoration of CMS via Rf genes can be achieved at the genomic, transcriptomic, proteomic, and metabolic levels [4]. To date, the CMS/Rf systems have been widely identified and applied to improve the yield of many crops, such as rice [5], maize [6], wheat [7], pepper [8], and Brassicaceae crops [9]. Therefore, the use of a CMS/Rf system is an ideal strategy for large-scale hybrid soybean production.
To date, Rf genes have been successfully isolated from several plants, including Rf2 and Rf4 in maize [6,10]; Rf-PPR592 in petunia [11]; Rfo in radish [12]; Rf1, Rf2, Rf4, Rf5, Rf6, and Rf17 in rice [1,13,14,15,16,17]; Rf1 in sugar beet [18]; Rfp, Rfn, and Rfh in rapeseed [19,20,21]; and Rf1 and Rf3 in wheat [7], among others. Most of these Rf genes encode pentatricopeptide repeat (PPR) proteins localized in the mitochondria. In addition, several other types of Rf genes encode non-PPR proteins. For instance, Rf2 and Rf4 in maize encode an aldehyde dehydrogenase [10] and a basic helix–loop–helix (bHLH) transcription factor [6], respectively. Moreover, Rf2 and Rf17 in rice encode a glycine-rich protein [14] and an acyl-carrier protein synthase [17], respectively. Finally, Rf1 in sugar beet encodes a peptidase [18].
In soybean, only GmPPR576 has been identified on chromosome 16 between the markers BARCSOYSSR_16_1067 and BARCSOYSSR_16_1078, and it is considered the restorer-of-fertility gene of the NJCMS1A line [22]. According to Wang et al. (2021), GmPPR576, which encodes a PPR protein, belongs to the restorer-of-fertility-like (RFL) Rf gene family and is targeted to the mitochondria. In addition, several soybean Rf genes have been mapped to various genetic loci in different genetic populations, and most of these genes are located on chromosome 16. For instance, Zhao et al. (2007) and Wang et al. (2010) reported that in the restorer line JIHUI 1, the genetic distances between the Rf gene and the SSR markers Sctt011 and Satt547 were, respectively, 3.6 and 5.4 cM [23,24]. Meanwhile, Dong et al. (2012) suggested that in the restorer line FuHui9, the Rf gene was mapped on chromosome 16, between the markers BARCSOYSSR_16_1064 and BARCSOYSSR_16_1082, at the distances of 0.59 and 0.83 cM, respectively [25]. Wang et al. (2016) showed that the M-type CMS line restorer-of-fertility gene Rf-m is located within a 162.4 Kb region on chromosome 16 and is flanked on each side by markers GmSSR1602 and GmSSR1610 at a distance of 0.11 and 0.25 cM, respectively [26]. Rf genes have also been found on other chromosomes. For instance, Yang et al. (2007) used N8855 sterile and restorer lines as materials and selected two polymorphic markers linked to Rf genes, which are located on chromosomes 5 and 7 [27]. However, the mapping range of most Rf genes in soybean is limited, and the underlying molecular mechanisms remain unknown, thus restricting the development of commercial varieties.
In this context, the identification and characterization of Rf genes is an effective and important approach to rapidly establish a CMS/Rf system for commercial application in hybrid soybean production. Therefore, in the present study, Rf3 in soybean was physically mapped using published and newly developed markers. Specifically, an 86.44 Kb candidate interval on chromosome 9 was identified by integrating bulk-segregant analysis and next-generation sequencing (BSA-Seq) using F2 and F3 segregation populations. In this region, Glyma.09G171200—a PPR family gene localized in the mitochondria—was considered a candidate for Rf3 based on sequence alignment and expression analysis. Our findings will serve as a reference in the assisted selection and development of potent lines in the production and rapid selection of novel restorer lines. Our results will lay a theoretical foundation for exploring the mechanisms underlying the CMS/Rf system in soybean.

2. Results

2.1. Gametophyte Sterility in the Soybean CMS Line JLCMS5A

JLR2 is a restorer line of the soybean CMS line JLCMS5A. JLCMS5A was indistinguishable from JLR2 during vegetative growth, except that JLCMS5A produced small and fleshy seedless pods after flowering (Figure 1A–C). JLR2, JLCMS5A, F1, and F2 generation pollen were subjected to the I2-KI test. Microscopic observations revealed that all pollens produced by the restorer line JLR2 were viable (Figure 1D). Conversely, the CMS line JLCMS5A produced only non-viable pollen grains (Figure 1E). The F1 hybrids were semi-fertile, with only half of the pollen being viable (Figure 1F). The I2–KI test was also performed on the F2 generation pollen of JLCMS5A × JLR2, and the pollen fertility of 217 F2 plants was investigated. There were 121 fertile pollen plants and 96 pollen semi-sterile plants, meeting the 1:1 ratio according to the chi-square test [χ2 = 1.44 < χ2(0.05,1) = 3.84]. Based on the pollen fertility segregation ratios of F2 plants, we confirmed that JLCMS5A is gametophyte sterile. Thus, the fertility restoration of JLCMS5A by JLR2 is a qualitative trait controlled by a pair of dominant genes, which were named Rf3.

2.2. BSA-Seq

Based on microscopic observations, DNA bulks of two parent lines (male and female) as well as fertile (F) and semi-fertile (SF) plants from the F2 segregation population were prepared and subjected to BSA-Seq. From the sequencing analysis, 85.10 Gb of raw data were obtained; the average Q20, Q30, and GC content values were 97.22%, 92.40%, and 36.76%, respectively. After filtering out reads containing unknown bases (N), low-quality data, and only adaptor reads, a total of 12,707,252,100 bp clean reads were obtained from the male sample, with the Q20 value of 96.59%, Q30 value of 91.16%, and GC content of 36.80%. Meanwhile, 10,205,530,500 bp clean reads were obtained from the female sample, with the Q20 value of 97.39%, Q30 value of 92.68%, and GC content of 37.99%. Finally, 31,090,225,200 and 30,918,562,800 bp clean reads were obtained from the F and SF samples, with the Q20 values of 97.72% and 97.18%, Q30 values of 93.54% and 92.22%, and GC content of 36.14% and 36.10%, respectively. We then mapped these clean reads to the reference genome, which allowed for the mapping of 99.48% reads on average. The sequencing depths for the four samples ranged from 8.64× to 24.59×, with an average depth of 17.09×, and 96.22% of the genome had at least 1× coverage (Table 1). As a result, all samples have enough data, qualified sequences, normal GC distribution, and normal comparison results, and the above data can be used for subsequent variation detection and correlation analysis.
To conduct analyses of SNPs and InDels, SNPs and InDels were first called between DNA bulks and the Zhonghong13 reference genome using GATK3.8. A total of 2,627,469 SNPs and 474,499 InDels were identified between the SF and F bulks. There were significantly more SNPs than InDels. Most of the SNPs were in the intergenic region (n = 2,011,863), followed by intronic (n = 252,156), upstream (n = 141,803), downstream (n = 127,175), exonic (n = 88,602), upstream/downstream (n = 5344), and splicing (n = 526) regions (Figure 2A). Moreover, some SNPs were predicted to produce non-synonymous (n = 50,820), synonymous (n = 36,335), stop–gain (n = 1175), and stop–loss (n = 270) effects, giving rise to phenotypic variations (Figure 2B).
Furthermore, of the 474,499 InDels, over 70% were in the intergenic region (n = 332,160), followed by the intronic (n = 58,203), upstream (n = 41,715), downstream (n = 35,367), exonic (n = 5262), upstream/downstream (n = 1598), and splicing (n = 194) regions (Figure 2C). According to the functional annotation, the InDels variations included frameshift insertions (n = 1367), frameshift deletions (n = 1339), non-frameshift deletions (n = 1282), non-frameshift insertions (n = 1116), stop–gain variations (n = 109), and stop–loss variations (n = 22), which may lead to many types of phenotypic changes, including fertility transition (Figure 2D). SNPs and InDels were retained for rapid genetic mapping.

2.3. Fine Mapping of Rf3 Gene in JLR2

According to the result of BSA-Seq, we screened for the markers of homozygous differences between the two parents. However, these markers neither possessed more than two alleles nor showed any polymorphism between the F and FS bulks. Following quality control, only 1,557,956 polymorphic marker loci (SNPs and InDels) were used for the calculation of the All index (SNPs and InDels). Then, the ΔAll-index values for all sites and windows were obtained by calculating the difference in the All-index values between the two offsprings, and a Manhattan diagram of the ΔAll-index values was drawn. A candidate region significantly associated with male sterility traits in the range of 38–42 Mb on chromosome 9 was identified based on the ΔAll index, suggesting that Rf3 may be located in this range. To date, no other published Rf gene of soybean has been reported in this range, indicating that Rf3 is a novel restorer-of-fertility gene.
To further narrow the Rf3 locus, 85 pairs of SSR markers on chromosome 9 of soybean were used for BSA. Sixteen SSR markers showed polymorphism between the two parents, as well as between the two F2 bulked samples. Of these, eight polymorphic SSR markers were used to analyze 217 individual plants in the F2 population and construct a genetic map of the Rf3 gene (Figure 3A). To narrow down the region harboring Rf3, eight pairs of InDel markers and four pairs of dCAPS markers were designed between SSR_09_1161 and SSR_09_1170. Among these, one InDel (InDel09-3) and one dCAPS marker (dCAPs09-2) were polymorphic. We evaluated F2 and F3 plants using these 10 markers and calculated their genetic distances. These results revealed that Rf3 is located in the interval between dCAPs09-2 and SSR_09_1170 (Figure 3B,C).

2.4. Rf3 Gene Location

According to the physical map of the reference genome of Williams 82, the locations of the markers dCAPs09-2 and SSR_09_1170 are at 39.60 and 39.68 Mb, respectively (Figure 4A,B), at a physical distance of 86.44 Kb. This 86.44 Kb region harbors 10 putative genes (Glyma.09G171100Glyma.09G172000) (Figure 4C).
Subsequently, the genomic sequences of 10 genes in this region were cloned from the CMS line JLCMS5A [S (rf3rf3)] and the restorer line JLR2 [N (Rf3Rf3)]. Following sequence alignments, we found only two genes (Glyma.09G171200 and Glyma.09G171800) that differed between JLCMS5A and JLR2 (Figure 4C). Glyma.09G171200 harbored 47 SNPs, leading to 31 amino acid variations (Tables S1 and S2). However, most of the mutated amino acids were concentrated on the right subunit of the Glyma.09G171200 protein, and these mutations also altered the interactions between amino acids (Figure S1). Glyma.09G171800 contained two transitions: A to G at 294 bp and G to A at 475 bp. The last transition led to amino acid variation (K to E).
To confirm whether the SNPs in these two genes are related to fertility restoration, we cloned the two genes from the restorer and CMS lines of another soybean cultivar and performed genome sequence alignment. In these restorer and CMS lines, the CDS of Glyma.09G171200 contained SNPs, but the sequence was normal in all restorer cultivars and Williams 82 (the above restorer line and Williams 82 have been proved to have fertility recovery ability by experiments on test cross) (Figure S2). In the CMS lines with the same background, Glyma.09G171200 possessed SNPs at the same positions (Figure S2). However, the SNP at the 294 bp position in the CDS of Glyma.09G171800 was consistent with that in the CMS lines and Williams 82. Conversely, at the 475 bp site, the sequence was normal in all restorer cultivars and Williams 82 (Figure S3). Regarding function, only one annotated gene (Glyma.09G171200) belongs to the PPR protein family (information on the 10 genes is presented in Table 2). Based on these results, we concluded that Glyma.09G171200 may be the Rf3 gene in JLR2.

2.5. Restorer-of-Fertility Molecular Markers

Based on the SNPs of Glyma.09G171200 and Glyma.09G171800, CAPS markers named CAPS1712 and CAPS1718 were developed. These two markers are useful for predicting the phenotypes of the offspring of JLCMS5A×JLR2. After amplification and digestion by Sca I restriction endonuclease, the CAPS1712 marker produced different bands in fertile, sterile, and semi-fertile plants. Two fragments, located between the 800 and 1200 bp loci, were considered homozygous at the restorer-of-fertility gene locus (Rf3Rf3), indicating fertile plants. A single band greater than 2000 bp was considered homozygous but lacked the restorer-of-fertility gene allele (rf3rf3), indicating sterile plants. Three fragments were considered heterozygous at the restorer-of-fertility gene locus (Rf3rf3), indicating semi-fertile plants (Figure 5).
For CAPS1718, after amplification and digestion by BspEI restriction endonuclease, the CAPS1718 marker also produced different bands in fertile, sterile, and semi-fertile plants. Two fragments, one located between the 500 and 800 bp loci and the other located between the 1200 and 2000 bp loci, were considered homozygous at the restorer-of-fertility gene locus (Rf3Rf3), indicating fertile plants. A single band on 2000 bp loci was considered homozygous but lacked the restorer-of-fertility gene allele (rf3rf3), indicating sterile plants. Three fragments were considered heterozygous at the restorer-of-fertility gene locus (Rf3rf3), indicating semi-fertile plants (Figure 5).

2.6. Expression and Localization of Candidate Rf3

To verify whether Glyma.09G171200 and Glyma.09G171800 were the candidate genes for Rf3, qRT-PCR analysis was performed for analyzing the transcript levels of these two genes in five different organs of soybean, including root, stem, leaf, flower, and bud of the restorer and CMS lines (Figure 6). The expression levels of Glyma.09G171200 at all stages were significantly higher in the restorer line than in the CMS line, especially in the stem, leaf, and flower organs. However, the transcript levels of Glyma.09G171800 at stem, flower, and bud stages were slightly upregulated in the restorer line compared to the CMS line and downregulated in the root and leaf stages of the restorer line. Moreover, the gene expression level of Glyma.09G171200 was significantly higher than Glyma.09G171800 in all detective organs. Based on these results, the expression of Glyma.09G171200 may better match the expression profile of the candidate gene.
Subsequently, ProtComp 9.0 was first used to predict the subcellular localization of the Glyma.09G171200 and Glyma.09G171800 protein. The results showed significant probability of mitochondrial localization of Glyma.09G171200 and extracellular (secreted) localization of Glyma.09G171800. Furthermore, SignalP—4.1 software predicted that only Glyma.09G171200 encodes a protein containing pentatricopeptide repeats and had potential mitochondrial localization signals at its N-terminus. Since most reported Rf genes encode PPR proteins localized to the mitochondria, we propose Glyma.09G171200 as the candidate gene for Rf3, and further subcellular localization analysis was performed.
To further confirm subcellular localization of Glyma.09G171200, we expressed the GFP-fused Rf3 protein (green fluorescent protein) via a plasmid driven by the CaMV 35S promoter in tobacco protoplasts. The protein was targeted to the mitochondria based on the observation of the exclusively co-localized GFP signal with the MSTP-labeled mitochondrial signal (Figure 7). The fluorescence of MSTP and Rf3 seems to appear on the cell membrane or wall as mitochondria are squeezed to the edge of the cell wall by cytoplasmic inclusions, such as vacuoles. These results are in line with the ProtComp 9.0 prediction. Therefore, we propose Glyma.09G171200 as the strongest candidate for Rf3.

2.7. Phylogenetic Analysis of the Rf3 Protein

By searching the articles of Rf genes, we found 19 proteins (15 PPR and 4 non-PPR proteins) from different plants, including Arabidopsis thaliana, Brassica napus, Petunia hybrida, Glycine max, Medicago truncatula, Triticum aestivum, Oryza sativa, and Zea mays. To understand the evolutionary relationships of Glyma.09G171200 with other PPR proteins, we constructed a phylogenetic tree, including 20 proteins (including Glyma.09G171200). The phylogenetic analysis revealed that these proteins were mainly divided into four groups. The first group included six PPR proteins from cruciferous plants (Arabidopsis thaliana, Raphanus sativus, and Brassica napus). Notably, Glyma.09G171200 belonged to the second group. In this group, the genetic distance between Glyma.09G171200 and Rf1 of the leguminous plant Medicago truncatula was the smallest, and the genetic distance between Glyma.09G171200 and GmPPR576 of Glycine max was slightly farther. The third group included six PPR proteins from graminaceous crops (Oryza sativa and Triticum aestivum). The fourth group included the four non-PPR proteins (Figure 8). From these phylogenetic relationships, we concluded that the Glyma.09G171200 protein from JLR2 belongs to the PPR protein, clustered with the previously reported GmPPR576 protein from Glycine max. The result showed that Glyma.09G171200 gene is consistent with most of the characteristics of Rf genes.

3. Discussion

The CMS/Rf system has been proven effective to produce hybrid seeds in crops. Considering the importance of these systems, numerous map-based cloning studies have been performed on Rf genes, and several genes have been isolated from various crops.

3.1. Fertility Restorer Type in JLR2 Is Controlled by an Uncharacterized New Gene

In soybean, recent research on Rf genes has mainly focused on CMS-RN, CMS-M, and CMS-N8855 types, among others. In the CMS-RN type, Wang et al. used 103 F2 individuals, derived from a cross between the CMS line JLCMS82A and the restorer line JIHUI 1, as the mapping population. The authors demonstrated that the mode of restoration was gametophytic and controlled by a single dominant locus (Rf). The genetic distances between the Rf gene and the SSR markers Sctt011 and Satt547 were 3.6 and 5.4 cm, respectively [24]. In the CMS-M type, Wang et al. suggested that Rf-m in the restorer line WR016 is a monogenic dominant gene located within a 162.4 Kb region on chromosome 16, which is flanked on each side by the markers GmSSR1602 and GmSSR1610, at a distance of 0.11 and 0.25 cM, respectively [26]. In the CMS-N8855 type, Yang et al. showed that two independent Rf loci from NJCMS1A are linked to Satt626 on chromosome 7 and Satt300 on chromosome 5 at the genetic distances of 9.75 and 11.18 cm, respectively [27]. Furthermore, Wang et al. suggested that the CMS line NJCMS1A is gametophyte sterility, and the fertility restoration of NJCMS1C to NJCMS1A is a quality trait controlled by a single gene locus. Moreover, GmPPR576, targeted to the mitochondria, was confirmed to be a fertility restorer gene in NJCMS1A, located on chromosome 16 between the markers BARCSOYSSR_16_1067 and BARCSOYSSR_16_1078 [22].
In the present study, we confirmed that JLCMS5A is gametophyte sterility, and the fertility restoration of JLCMS5A by JLR2 is a qualitative trait controlled by a pair of dominant fertility restoration genes, which we named Rf3. This result is consistent with the observations of Zhao et al. and Wang et al. that the mode of fertility restoration is gametophytic [23,24]. We constructed a genetic map of Rf3 and noted that it is located within an 86.44 Kb region defined by the markers dCAPs09-2 and SSR_09_1170 on chromosome 9. The mapping interval of the Rf3 gene locus is different from the previously reported loci [22,23,24,26,27], suggesting that the fertility restorer type in JLR2 is controlled by an uncharacterized novel gene.

3.2. Multiple PPR Genes and Fertility Restoration

In vascular plants, CMS, which is typically linked to an aberrant chimeric mitochondrial gene in the mitochondria, is conferred by genomic incompatibility between the mitochondria and nuclei [28]. In most cases, the male sterile phenotype can be recovered by a specific set of nuclear genes, termed the Rf genes, which encode RNA-binding PPR proteins [4,29,30]. PPR proteins, most of which are likely localized to the plastids and/or mitochondria, constitute a large nuclear-encoded RNA-binding protein family, which is further divided into P, PLS, and S subfamilies [31,32,33].
Numerous studies have indicated that the sterile phenotype of CMS lines can be restored by Rf genes, which encode PPR proteins. For instance, Rf-PPR592, the first Rf gene detected in petunia, encodes 14 PPR motifs [11]. The Rfo gene of radish encodes 16 PPR motifs [12]. Rfp, Rfn, and Rfh genes of rapeseed harbor PPR motifs [19,20,21]. In rice, Rf1 encodes 16 PPR motifs [13]; while Rf4, Rf5, and Rf6 encode typical P-type PPR motifs [1,15,16]. Similarly, Rf1 and Rf3 in wheat harbor PPR motifs [7].
In the present study, the mapping interval of the Rf3 gene was 86.44 Kb. The annotation against the soybean Williams 82 reference genome revealed that this interval harbors 10 genes (Table 2) (Glyma.09G171100Glyma.09G172000). Genomic sequence analysis revealed that two genes, Glyma.09G171200 and Glyma.09G171800, exhibit SNPs that resulted in protein mutations in all Rf3 genotype lines. However, only Glyma.09G1712000 encodes a PPR protein. This result is consistent with previous reports on other major crops in which the Rf genes belong to the PPR family [1,7,11,12,13,15,16,19,20,21,22].
PPR family proteins are mainly involved in transcript processing in the chloroplasts and mitochondria [34]. All PPR-type Rf genes known thus far are targeted to the mitochondria [4]. By integrating ProtComp and subcellular localization analysis, we determined that Glyma.09G1712000 is also localized to the mitochondria. Therefore, within the mapping interval, Glyma.09G1712000, which encodes a mitochondria-targeted PPR protein, is the strongest candidate gene for Rf3.
In conclusion, the F2 and F3 populations were selected as genetic populations to map Rf3. Using BSA-Seq analysis with SSR, InDel, and dCAPS markers, Rf3 locus was identified in an 86.44 Kb candidate interval on chromosome 9. This region contains 10 putative genes. Among these, Glyma.09G171200, which encodes a mitochondria-targeted PPR protein, was proposed as the strongest candidate for Rf3 based on sequence alignment and expression analysis. Finally, a CAPS marker co-segregated with Rf3, named CAPS1712, was developed. Although the present study proposes Glyma.09G171200 as the restorer-of-fertility gene for JLCMS5A, the function of this gene warrants further exploration.

4. Materials and Methods

4.1. Plant Material

RN-type soybean CMS line JLCMS5A [S (rf3rf3)] was selected as the female parent, and it was crossed with the CMS restorer line JLR2 [N (Rf3Rf3)] in 2018. F1 plants [S (Rf3rf3)] were self-pollinated to create F2 mapping populations in 2019. In 2020, the CMS line, restorer line, and F2 seeds were cultivated for pollen sterility analysis and Rf3 gene mapping at the experimental station of Jilin Academy of Agricultural Sciences, Fanjiatun, Changchun City, China (43°31′ N, 124°49′ E). All plant materials were provided by the Soybean Research Institute, Jilin Academy of Agricultural Sciences, Changchun, China. Cultivation practices, including soil preparation, fertilization, and irrigation, were consistent across all experiments.

4.2. Genetic Analysis

To investigate the inheritance of sterility, at least three blooming flowers from 217 F2 plants (from the cross of JLCMS5A with JLR2) were sampled. Anthers were dissected, squashed into an aqueous solution of 1% I2-KI, and observed under a microscope (Olympus, Japan). To determine the genetic model of Rf3, the observed values of fertile and semi-fertile plants were obtained, and the expected segregation ratio compatibility was assessed using the chi-square test in Excel.

4.3. Library Preparation and BSA-Seq

Total genomic DNA was extracted from the young leaves of soybean plants using the Plant Genomic DNA Kit (CWBIO, Jiangsu, China), according to the manufacturer’s instructions. The quality of the genomic DNA was monitored using 1% agarose gel electrophoresis. The concentration of each DNA sample was measured using the Nanodrop 2000 spectrophotometer (Thermo Fisher, Waltham, MA, USA) and adjusted to 1.5 μg DNA per sample. For BSA-Seq, four DNA bulks were prepared from male parents, female parents, fertile plants, and semi-fertile plants in the F2 population. The male and female parent bulks were constructed by mixing 10 μL of DNA each from 10 plants (labeled male and female), and the fertile and semi-fertile bulks were constructed by mixing 5 μL of DNA each from 20 plants exhibiting the fertile and semi-fertile phenotypes (labeled F and FS).
Libraries were prepared using the Truseq Nano DNA HT Sample Preparation Kit (Illumina, San Diego, CA, USA), following the manufacturer’s recommendations. Briefly, four DNA samples were fragmented by sonication to a size of 350 bp, and the DNA fragments were end-polished, A-tailed, and ligated with a full-length adapter for BSA-Seq, followed by PCR amplification. The PCR products were purified on the AMPure XP system. The libraries were analyzed for size distribution using the Agilent2100 Bioanalyzer and quantified using real-time PCR. Finally, the prepared libraries were sequenced using a paired-end read protocol, with 150 bp of data collected per run, on the Illumina Hiseq 4000 platform.

4.4. Sequencing Data Processing

To map the obtained sequences to the reference genome, clean reads were obtained from raw data by filtering the adaptor sequences, low-quality sequences, and unknown nucleotides (N), and stored in the FASTQ format [35]. The clean reads were mapped to the reference genomic sequence of Zhonghuang13 (https://ngdc.cncb.ac.cn/gwh/Assembly/125/show (accessed on 3 August 2020)) [36] using the Burrows–Wheeler aligner (BWA) (settings: mem -t 4 -k 32 -M -R) [37]. The alignment files were converted to the SAM/BAM format using SAMtools [38].
To identify and annotate the variants, reads at each position were filtered using the Genome Analysis Toolkit (GATK) package with the minimum base and mapping quality [39], and variant calling was performed for single-nucleotide polymorphisms (SNPs) and small InDels across bulks (settings: -Window 4, -filter “QD < 4.0|| FS > 60.0 || MQ < 40.0”, -G_filter “GQ < 20”). To determine the physical position of each variant, ANNOVAR [40] was used for aligning and annotating the SNPs or InDels.
To determine the candidate interval, sliding window analysis was applied to the frequency distribution of SNPs (SNP index) in the population of bulked individuals, and the SNP index was calculated for all SNP positions. SNP positions with an SNP index of <0.3 or >0.7 and the read depth of <7 from the two sequences were excluded, as these may represent spurious SNPs called due to sequencing and alignment errors. The Δ(SNP index) was calculated by subtracting the SNP indices of male and female bulk pools or the F and FS bulk pools. Quantitative trait loci (QTLs) were identified in the positive or negative peak regions with 95% confidence intervals in 10,000 bootstrap replicates. The selected SNPs and InDels in the peak regions were used to annotate and screen for the potential functional variants.

4.5. Marker Development and PCR Genotyping

To increase the marker density of the Rf3 candidate region, various PCR-based markers, including SSRs, InDels, and SNPs, were developed based on the Zhonghuang13 genome [36] sequence and the BSA-Seq data. The PCR primer pairs showing polymorphisms between JLCMS5A and JLR2, as well as between fertile and semi-fertile F2 plants, were used to genotype 217 F2 plants (Table S3). For SSR analysis, PCR was performed in a 20 μL reaction mixture containing 50 ng DNA template, 1 μL each of forward and reverse primers (1 μM), and 12 μL of 2× Es Taq MasterMix (CWBIO, Jiangsu, China). The reaction conditions were as follows: initial cycle of 10 min at 95 °C, followed by 32 cycles of 30 s at 95 °C, 30 s at 50 °C, and 30 s at 72 °C, and a final cycle of 10 min at 72 °C. Denatured PCR products were separated on a 6% polyacrylamide gel in 1× TBE buffer and visualized using the silver staining method. Gel images were documented with a digital camera. For dCAPS analysis, the same PCR amplification protocol as that for SSR analysis was used, but the PCR products were digested with Xho I and the fragments separated on a 1% agarose gel. Based on the differences in genotypes as assessed using polymorphic markers, recombinants were identified in the 217 F2 plants and used to fine map Rf3.

4.6. Gene Cloning and Sequencing

The number and functional annotation of genes in the fine mapping interval were determined based on the soybean reference genome (https://phytozome-next.jgi.doe.gov/ (accessed on 10 July 2021)) and BSA-Seq data. For the above genes, PCR amplification was performed with primer pairs covering the full-length coding sequence (CDS) of each gene from the JLCMS5A and JLR2 lines, and genes with long sequences were divided into several segments for amplification with the corresponding number of primer pairs (Table S3). The PCR amplicons were purified and cloned using the pMDTM19-T Vector Cloning Kit (TaKaRa, Beijing, China). At least three randomly selected positive colonies for each amplicon were sequenced and assembled. Sequences of the above genes from the JLCMS5A and JLR2 lines were aligned using DNAMAN. For phylogenetic reconstruction, MEGA-X was used. The statistical method was maximum likelihood, and the phylogeny was examined using the bootstrap method with 1000 bootstrap values.

4.7. Expression Analysis

To validate their expression in the root, stem, leaf, flower, and bud of JLCMS5A and JLR2, the candidate genes were analyzed by real-time quantitative PCR (qRT-PCR). Total RNA was extracted using the RNApure Plant Kit (DNase I) (CWBIO, Jiangsu, China), and first-strand cDNA was synthesized using the SuperRT cDNA Synthesis Kit (CWBIO, Jiangsu, China) according to the manufacturer’s protocol. The housekeeping gene Cons4 (GenBank accession number: BU578186) was used as the reference gene. All primer sequences used in the present study are listed in Table S3. qRT-PCR was performed in triplicate using the ChamQ Universal SYBR qPCR Master Mix (Vazyme, Nanjing, China) on the StepOnePlus Real-Time PCR System (Thermo Fisher, Waltham, MA, USA). Statistical analysis was performed using the well-known 2−ΔΔCT method [41]. All data are expressed as mean ± standard deviation.

4.8. Subcellular Localization

ProtComp (http://www.softberry.com/ (accessed on 15 October 2021)) was used to predict the subcellular localization of Rf3. Further, a transient expression system was used to study the subcellular localization of the Rf3 protein. Briefly, the Rf3 coding sequence, which lacked the stop codon, was generated through PCR using linker primers with the vector sequences ZF12 and ZR12 (Table S3). The subcellular localization vector pCAMBIA1302 was digested with Sal I and Pst I. The linker fragment was ligated to the linearized vector using T4 DNA Ligase (TaKaRa, Beijing, China) to construct the subcellular localization vector pCAMBIA1302: Rf3-GFP and pCAMBIA1302: MSTP-mKate for expression in Nicotiana tabacum L. protoplasts. The vector pCAMBIA1302-GFP was used as the control. The vector pCAMBIA1302: MSTP-mkate was used as mitochondrial marker. MSTP is a mitochondrial localization signal protein, whose amino acid sequence is MANRFRSGISFFKTIAVTDSVSSVRSKSLFPALRTYATASAQT [42]. The three vectors were injected into the abaxial surface of tobacco leaves, and the leaves were cultured for 72 h. Tobaccos were grown in pots containing a 1:1 mixture of forest vermiculite in a light chamber (400 μmol/m2/s) under 24 °C (day)/16 h and 3 °C (night)/8 h condition with 70% humidity. The images were obtained and observed under a laser confocal microscope (Nikon C2-ER, Japan). The excitation wavelengths for the green fluorescent protein (GFP) and far-red fluorescent protein (mKate) were 488 and 561 nm, respectively.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms23105388/s1.

Author Contributions

Conceptualization, C.Z.; gene mapping, Y.S., Y.Z. and S.J.; subcellular localization, Y.S. and Y.Z.; gene expression, Y.S. and C.L.; fertility identification, J.Z., H.Y. and W.Z.; population construction, B.P.; parent creation, L.Z.; writing—original draft preparation, Y.S.; writing—review and editing, Y.S. and C.Z.; funding acquisition, Y.S. and C.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China (32101766), Jilin Province Science and Technology Development Plan Project (20210302005NC), China Agriculture Research System (CARS-04), Jilin Province Agricultural Science and Technology Innovation Project (CXGC2020RCG005; CXGC2021RCB001; CXGC2021ZD002) and Fundamental Research Funds of Jilin Province (KYJF2021ZR110).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hu, J.; Wang, K.; Huang, W.; Liu, G.; Gao, Y.; Wang, J.; Huang, Q.; Ji, Y.; Qin, X.; Wan, L.; et al. The rice pentatricopeptide repeat protein RF5 restores fertility in Hong-Lian cytoplasmic male-sterile lines via a complex with the glycine-rich protein GRP162. Plant Cell 2012, 24, 109–122. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Chase, C.D. Cytoplasmic male sterility: A window to the world of plant mitochondrial-nuclear interactions. Trends Genet. 2007, 23, 81–90. [Google Scholar] [CrossRef] [PubMed]
  3. Carlsson, J.; Glimelius, K. Cytoplasmic male-sterility and nuclear encoded fertility restoration. In Plant Mitochondria; Springer: New York, NY, USA, 2011; pp. 469–491. [Google Scholar]
  4. Chen, L.; Liu, Y.G. Male sterility and fertility restoration in crops. Annu. Rev. Plant Biol. 2014, 65, 579–606. [Google Scholar] [CrossRef] [PubMed]
  5. Tang, H.; Xie, Y.; Liu, Y.G.; Chen, L. Advances in understanding the molecular mechanisms of cytoplasmic male sterility and restoration in rice. Plant Reprod. 2017, 30, 179–184. [Google Scholar] [CrossRef] [PubMed]
  6. Jaqueth, J.S.; Hou, Z.; Zheng, P.; Ren, R.; Nagel, B.A.; Cutter, G.; Niu, X.; Vollbrecht, E.; Greene, T.W.; Kumpatla, S.P. Fertility restoration of maize CMS-C altered by a single amino acid substitution within the Rf4 bHLH transcription factor. Plant J. 2020, 101, 101–111. [Google Scholar] [CrossRef]
  7. Melonek, J.; Duarte, J.; Martin, J.; Beuf, L.; Murigneux, A.; Varenne, P.; Comadran, J.; Specel, S.; Levadoux, S.; Bernath-Levin, K.; et al. The genetic basis of cytoplasmic male sterility and fertility restoration in wheat. Nat. Commun. 2021, 12, 1036. [Google Scholar] [CrossRef]
  8. Zhang, Z.; Zhu, Y.; Cao, Y.; Yu, H.; Bai, R.; Zhao, H.; Zhang, B.; Wang, L. Fine mapping of the male fertility restoration gene CaRf032 in Capsicum annuum L. Theor. Appl. Genet. 2020, 133, 1177–1187. [Google Scholar] [CrossRef]
  9. Yamagishi, H.; Bhat, S.R. Cytoplasmic male sterility in Brassicaceae crops. Breed. Sci. 2014, 64, 38–47. [Google Scholar] [CrossRef] [Green Version]
  10. Cui, X.; Wise, R.; Schnable, P. The rf2 nuclear restorer gene of male-sterile T-cytoplasm maize. Science 1996, 272, 1334–1336. [Google Scholar] [CrossRef]
  11. Bentolila, S.; Alfonso, A.A.; Hanson, M.R. A pentatricopeptide repeat-containing gene restores fertility to cytoplasmic male-sterile plants. Proc. Natl. Acad. Sci. USA 2002, 99, 10887–10892. [Google Scholar] [CrossRef] [Green Version]
  12. Brown, G.G.; Formanova, N.; Jin, H.; Wargachuk, R.; Dendy, C.; Patil, P.; Laforest, M.; Zhang, J.; Cheung, W.Y.; Landry, B.S. The radish Rfo restorer gene of Ogura cytoplasmic male sterility encodes a protein with multiple pentatricopeptide repeats. Plant J. 2003, 35, 262–272. [Google Scholar] [CrossRef] [PubMed]
  13. Komori, T.; Ohta, S.; Murai, N.; Takakura, Y.; Kuraya, Y.; Suzuki, S.; Hiei, Y.; Imaseki, H.; Nitta, N. Map-based cloning of a fertility restorer gene, Rf-1, in rice (Oryza sativa L.). Plant J. 2004, 37, 315–325. [Google Scholar] [CrossRef] [PubMed]
  14. Itabashi, E.; Iwata, N.; Fujii, S.; Kazama, T.; Toriyama, K. The fertility restorer gene, Rf2, for Lead Rice-type cytoplasmic male sterility of rice encodes a mitochondrial glycine-rich protein. Plant J. 2011, 65, 359–367. [Google Scholar] [CrossRef] [PubMed]
  15. Tang, H.; Luo, D.; Zhou, D.; Zhang, Q.; Tian, D.; Zheng, X.; Chen, L.; Liu, Y.G. The rice restorer Rf4 for wild-abortive cytoplasmic male sterility encodes a mitochondrial-localized PPR protein that functions in reduction of WA352 transcripts. Mol. plant 2014, 7, 1497–1500. [Google Scholar] [CrossRef] [Green Version]
  16. Huang, W.; Yu, C.; Hu, J.; Wang, L.; Dan, Z.; Zhou, W.; He, C.; Zeng, Y.; Yao, G.; Qi, J.; et al. Pentatricopeptide-repeat family protein RF6 functions with hexokinase 6 to rescue rice cytoplasmic male sterility. Proc. Natl. Acad. Sci. USA 2015, 112, 14984–14989. [Google Scholar] [CrossRef] [Green Version]
  17. Fujii, S.; Toriyama, K. Suppressed expression of RETROGRADE-REGULATED MALE STERILITY restores pollen fertility in cytoplasmic male sterile rice plants. Proc. Natl. Acad. Sci. USA 2009, 106, 9513–9518. [Google Scholar] [CrossRef] [Green Version]
  18. Kitazaki, K.; Arakawa, T.; Matsunaga, M.; Yui-Kurino, R.; Matsuhira, H.; Mikami, T.; Kubo, T. Post-translational mechanisms are associated with fertility restoration of cytoplasmic male sterility in sugar beet (Beta vulgaris). Plant J. 2015, 83, 290–299. [Google Scholar] [CrossRef] [Green Version]
  19. Liu, Z.; Yang, Z.; Wang, X.; Li, K.; An, H.; Liu, J.; Yang, G.; Fu, T.; Yi, B.; Hong, D. A mitochondria-targeted PPR protein restores pol cytoplasmic male sterility by reducing orf224 transcript levels in oilseed rape. Mol. Plant 2016, 9, 1082–1084. [Google Scholar] [CrossRef] [Green Version]
  20. Liu, Z.; Dong, F.; Wang, X.; Wang, T.; Su, R.; Hong, D.; Yang, G. A pentatricopeptide repeat protein restores nap cytoplasmic male sterility in Brassica napus. J. Exp. Bot. 2017, 68, 4115–4123. [Google Scholar] [CrossRef] [Green Version]
  21. Wang, H.; Xiao, Q.; Wei, C.; Chen, H.; Chen, X.; Dai, C.; Wen, J.; Ma, C.; Tu, J.; Fu, T.; et al. A mitochondria-localized pentatricopeptide repeat protein is required to restore hau cytoplasmic male sterility in Brassica napus. Theor. Appl. Genet. 2021, 134, 1377–1386. [Google Scholar] [CrossRef]
  22. Wang, T.; He, T.; Ding, X.; Zhang, Q.; Yang, L.; Nie, Z.; Zhao, T.; Gai, J.; Yang, S. Confirmation of GmPPR576 as a fertility restorer gene of cytoplasmic male sterility in soybean. J. Exp. Bot 2021, 72, 7729–7742. [Google Scholar] [CrossRef] [PubMed]
  23. Zhao, L.; Wang, Y.; Sun, H.; Zhao, H.; Cheng, Y.; Peng, B.; Wang, S.; Zhang, W.; Dong, Y. Identification of SSR markers linked to the fertility restorer gene for the CMS in soybean. Soybean Sci. 2007, 26, 835–839. [Google Scholar]
  24. Wang, Y.; Zhao, L.; Wang, X.; Sun, H. Molecular mapping of a fertility restorer gene for cytoplasmic male sterility in soybean. Plant Breed. 2010, 129, 9–12. [Google Scholar] [CrossRef]
  25. Dong, D.K.; Li, Z.; Yuan, F.J.; Zhu, S.L.; Chen, P.; Yu, W.; Yang, Q.H.; Fu, X.J.; Yu, X.M.; Li, B.Q.; et al. Inheritance and fine mapping of a restorer-of-fertility (Rf) gene for the cytoplasmic male sterility in soybean. Plant Sci. 2012, 188–189, 36–40. [Google Scholar] [CrossRef] [PubMed]
  26. Wang, D.; Zhang, L.; Li, J.; Hu, G.; Wu, Q.; Jiang, H.; Huang, Z.; Hovav, R. The restorer gene for soybean M-type cytoplasmic male sterility, Rf-m, is located in a PPR gene-rich region on chromosome 16. Plant Breed. 2016, 135, 342–348. [Google Scholar] [CrossRef]
  27. Yang, S.; Duan, M.; Meng, Q.; Qiu, J.; Fan, J.; Zhao, T.; Yu, D.; Gai, J. Inheritance and gene tagging of male fertility restoration of cytoplasmic-nuclear male-sterile line NJCMS1A in soybean. Plant Breed. 2007, 126, 302–305. [Google Scholar] [CrossRef]
  28. Hanson, M.R.; Bentolila, S. Interactions of mitochondrial and nuclear genes that affect male gametophyte development. Plant Cell 2004, 16, S154–S169. [Google Scholar] [CrossRef] [Green Version]
  29. Kim, Y.J.; Zhang, D. Molecular control of male fertility for crop hybrid breeding. Trends Plant Sci. 2018, 23, 53–65. [Google Scholar] [CrossRef]
  30. Kotchoni, S.O.; Jimenez-Lopez, J.C.; Gachomo, E.W.; Seufferheld, M.J. A new and unified nomenclature for male fertility restorer (RF) proteins in higher plants. PLoS ONE 2010, 5, e15906. [Google Scholar] [CrossRef] [Green Version]
  31. Lurin, C.; Andrés, C.; Aubourg, S.; Bellaoui, M.; Bitton, F.; Bruyère, C.; Caboche, M.; Debast, C.; Gualberto, J.; Hoffmann, B.; et al. Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins reveals their essential role in organelle biogenesis. Plant Cell 2004, 16, 2089–2103. [Google Scholar] [CrossRef] [Green Version]
  32. Small, I.; Peeters, N. The PPR motif—A TPR-related motif prevalent in plant organellar proteins. Trends Biochem. Sci. 2000, 25, 46–47. [Google Scholar] [CrossRef]
  33. Cheng, S.; Gutmann, B.; Zhong, X.; Ye, Y.; Fisher, M.F.; Bai, F.; Castleden, I.; Song, Y.; Song, B.; Huang, J.; et al. Redefining the structural motifs that determine RNA binding and RNA editing by pentatricopeptide repeat proteins in land plants. Plant J. 2016, 85, 532–547. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Schmitz-Linneweber, C.; Small, I. Pentatricopeptide repeat proteins: A socket set for organelle gene expression. Trends Plant Sci. 2008, 13, 663–670. [Google Scholar] [CrossRef] [PubMed]
  35. Cock, P.J.; Fields, C.J.; Goto, N.; Heuer, M.L.; Rice, P.M. The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants. Nucleic Acids Res. 2010, 38, 1767–1771. [Google Scholar] [CrossRef] [Green Version]
  36. Shen, Y.; Liu, J.; Geng, H.; Zhang, J.; Liu, Y.; Zhang, H.; Xing, S.; Du, J.; Ma, S.; Tian, Z. De novo assembly of a Chinese soybean genome. Sci. China Life Sci. 2018, 61, 871–884. [Google Scholar] [CrossRef]
  37. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef] [Green Version]
  38. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef] [Green Version]
  39. McKenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M.; et al. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010, 20, 1297–1303. [Google Scholar] [CrossRef] [Green Version]
  40. Wang, K.; Li, M.; Hakonarson, H. ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010, 38, e164. [Google Scholar] [CrossRef]
  41. Livak, K.J.; Schmittgen, T.D. Analysis of relative gene expression data using real-time quantitative PCR and the 2−ΔΔCT method. Methods 2001, 25, 402–408. [Google Scholar] [CrossRef]
  42. Heazlewood, J.L.; Tonti-Filippini, J.S.; Gout, A.M.; Day, D.A.; Whelan, J.; Millar, A.H. Experimental analysis of the Arabidopsis mitochondrial proteome highlights signaling and regulatory components, provides assessment of targeting prediction programs, and indicates plant-specific mitochondrial proteins. Plant Cell 2004, 16, 241–256. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Phenotypic and cytological characterization of the restorer and cytoplasmic male sterile (CMS) lines. (A) Restorer line JLR2. Scale bar: 25 cm. (B) CMS line JLCMS5A. Scale bar: 25 cm. (C) Pods of JLR2 (left) and JLCMS5A (right). Scale bar: 3 cm. (DF) Mature pollen grains of JLR2, JLCMS5A, and F1 plants stained with I2-KI.
Figure 1. Phenotypic and cytological characterization of the restorer and cytoplasmic male sterile (CMS) lines. (A) Restorer line JLR2. Scale bar: 25 cm. (B) CMS line JLCMS5A. Scale bar: 25 cm. (C) Pods of JLR2 (left) and JLCMS5A (right). Scale bar: 3 cm. (DF) Mature pollen grains of JLR2, JLCMS5A, and F1 plants stained with I2-KI.
Ijms 23 05388 g001
Figure 2. Statistics of the genomic loci of single-nucleotide polymorphisms (SNP) and InDel and their functional annotation. (A) Statistics of SNP genomic loci. (B) Statistics of SNP functional annotation. (C) Statistics of InDel genomic loci. (D) Statistics of InDel functional annotation.
Figure 2. Statistics of the genomic loci of single-nucleotide polymorphisms (SNP) and InDel and their functional annotation. (A) Statistics of SNP genomic loci. (B) Statistics of SNP functional annotation. (C) Statistics of InDel genomic loci. (D) Statistics of InDel functional annotation.
Ijms 23 05388 g002
Figure 3. (AC) Genetic mapping of the Rf3 locus on chromosome 9.
Figure 3. (AC) Genetic mapping of the Rf3 locus on chromosome 9.
Ijms 23 05388 g003
Figure 4. Physical mapping of the Rf3 locus. (A) The Rf3 locus is located on chromosome 9 according to BSA-Seq and primary mapping. (B) Fine mapping of the Rf3 region on chromosome 9. The Rf3 locus is mapped to an 86.44 Kb region between the markers dCAPs09-2 and SSR_09_1170. (C) The 86.44 Kb region containing 10 putative genes according to the Williams 82 reference genome. Gm09-11–Gm09-20 indicate Glyma.09G171100–Glyma.09G172000, respectively.
Figure 4. Physical mapping of the Rf3 locus. (A) The Rf3 locus is located on chromosome 9 according to BSA-Seq and primary mapping. (B) Fine mapping of the Rf3 region on chromosome 9. The Rf3 locus is mapped to an 86.44 Kb region between the markers dCAPs09-2 and SSR_09_1170. (C) The 86.44 Kb region containing 10 putative genes according to the Williams 82 reference genome. Gm09-11–Gm09-20 indicate Glyma.09G171100–Glyma.09G172000, respectively.
Ijms 23 05388 g004
Figure 5. The CAPS markers of Glyma.09G171200 and Glyma.09G171800 co-segregated with Rf3. M, marker; F, restorer line; S, cytoplasmic male sterile line; FS, semi-fertile F2 plants.
Figure 5. The CAPS markers of Glyma.09G171200 and Glyma.09G171800 co-segregated with Rf3. M, marker; F, restorer line; S, cytoplasmic male sterile line; FS, semi-fertile F2 plants.
Ijms 23 05388 g005
Figure 6. Gene expression analysis of Glyma.09G171200 and Glyma.09G171800 by qRT-PCR.
Figure 6. Gene expression analysis of Glyma.09G171200 and Glyma.09G171800 by qRT-PCR.
Ijms 23 05388 g006
Figure 7. Subcellular localization of Glyma.09G171200 in the mitochondria. (A) and (E) 35S: GFP and 35S: Rf3-GFP. (B) and (F) 35S: MSTP-mKate. (C) and (G) Bright field. (D) and (H) Merge of dark and bright field. Green shows the signal from GFP, red shows the signal from mKate. GFP, Rf3-GFP, and MSTP-mKate fusions under the control of the CaMV35S promoter were transiently expressed in tobacco protoplasts. Bar= 20 µm.
Figure 7. Subcellular localization of Glyma.09G171200 in the mitochondria. (A) and (E) 35S: GFP and 35S: Rf3-GFP. (B) and (F) 35S: MSTP-mKate. (C) and (G) Bright field. (D) and (H) Merge of dark and bright field. Green shows the signal from GFP, red shows the signal from mKate. GFP, Rf3-GFP, and MSTP-mKate fusions under the control of the CaMV35S promoter were transiently expressed in tobacco protoplasts. Bar= 20 µm.
Ijms 23 05388 g007
Figure 8. Phylogenetic analysis of Glyma.09G171200. The red branch indicates Glyma.09G171200. Bootstrap values of 1000 were used to assess the robustness of the neighbor-joining tree.
Figure 8. Phylogenetic analysis of Glyma.09G171200. The red branch indicates Glyma.09G171200. Bootstrap values of 1000 were used to assess the robustness of the neighbor-joining tree.
Ijms 23 05388 g008
Table 1. Quality appraisal of filtered reads in four samples.
Table 1. Quality appraisal of filtered reads in four samples.
SampleRaw Base (bp)Clean Base (bp)Q20 (%)Q30 (%)GC Content (%)Mapping Rate (%)Average Depth (×)Coverage at Least 1× (%)
Male12,707,411,10012,707,252,10096.5991.1636.8099.1811.0095.84
Female10,235,684,40010,205,530,50097.3992.6837.9999.558.6493.00
F31,164,198,00031,090,225,20097.7293.5436.1499.6024.1297.99
SF30,990,047,40030,918,562,80097.1892.2236.1099.5824.5998.06
Mean21,274,335,22521,230,392,65097.2292.4036.7699.4817.0996.22
Sum85,097,340,90084,921,570,600------
Note: Male, DNA pool of male parents; Female, DNA pool of female parents; F, DNA pool of fertile plants in F2; SF, DNA pool of semi-fertile plants in F2.
Table 2. Predicted genes and their information in the fine mapping region of Rf3.
Table 2. Predicted genes and their information in the fine mapping region of Rf3.
Gene NameCoding Region in the Longest TranscriptLocusDescription
Glyma.09G1711001239 bp39 607 686 bp~39 618 169 bp:+Homeodomain-like superfamily protein
Glyma.09G1712001494 bp39 623 177 bp~39 625 286 bp:−ATP binding; nucleic acid binding; helicases; pentatricopeptide repeat
Glyma.09G17130090 bp39 626 144 bp~39 626 702 bp:+Cytochrome b6-f complex, subunit 8
Glyma.09G171400105 bp39 627 087 bp~39 627 482 bp: −Photosystem II reaction center protein M
Glyma.09G171500357 bp39 630 511 bp~39 630 867 bp: −ATP synthase subunit alpha
Glyma.09G171600954 bp39 644 737 bp~39 649 123 bp: −Annexin 5
Glyma.09G1717001344 bp39 651 499 bp~39 653 374 bp:+Mitochondrial transcription termination factor family protein
Glyma.09G171800528 bp39 653 896 bp~39 655 287 bp:+Alpha/beta-Hydrolases superfamily protein
Glyma.09G1719001245 bp39 656 849 bp~39 660 683 bp: −Protein of unknown function (DUF760)
Glyma.09G1720002691 bp39 664 672 bp~39 673 958 bp:+Extra-large GTP-binding protein 3
Note: +/−, forward/reverse strand.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Sun, Y.; Zhang, Y.; Jia, S.; Lin, C.; Zhang, J.; Yan, H.; Peng, B.; Zhao, L.; Zhang, W.; Zhang, C. Identification of a Candidate restorer-of-fertility Gene Rf3 Encoding a Pentatricopeptide Repeat Protein for the Cytoplasmic Male Sterility in Soybean. Int. J. Mol. Sci. 2022, 23, 5388. https://doi.org/10.3390/ijms23105388

AMA Style

Sun Y, Zhang Y, Jia S, Lin C, Zhang J, Yan H, Peng B, Zhao L, Zhang W, Zhang C. Identification of a Candidate restorer-of-fertility Gene Rf3 Encoding a Pentatricopeptide Repeat Protein for the Cytoplasmic Male Sterility in Soybean. International Journal of Molecular Sciences. 2022; 23(10):5388. https://doi.org/10.3390/ijms23105388

Chicago/Turabian Style

Sun, Yanyan, Yan Zhang, Shungeng Jia, Chunjing Lin, Jingyong Zhang, Hao Yan, Bao Peng, Limei Zhao, Wei Zhang, and Chunbao Zhang. 2022. "Identification of a Candidate restorer-of-fertility Gene Rf3 Encoding a Pentatricopeptide Repeat Protein for the Cytoplasmic Male Sterility in Soybean" International Journal of Molecular Sciences 23, no. 10: 5388. https://doi.org/10.3390/ijms23105388

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop