Next Article in Journal
Studies of the Variability of Biologically Active Compounds and Antioxidant Activity in Organically, Biodynamically, and Naturally Grown and Fermented Fireweed (Chamerion angustifolium (L.) Holub) Leaves
Previous Article in Journal
Selecting High-Performing and Stable Pea Genotypes in Multi-Environmental Trial (MET): Applying AMMI, GGE-Biplot, and BLUP Procedures
Previous Article in Special Issue
Efficient Somatic Embryogenesis, Regeneration and Acclimatization of Panax ginseng Meyer: True-to-Type Conformity of Plantlets as Confirmed by ISSR Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Association Study of Seed Morphology Traits in Senegalese Sorghum Cultivars

1
USDA-ARS Plant Science Research Unit, St. Paul, MN 55108, USA
2
Department of Plant Pathology, University of Minnesota, Saint Paul, MN 55108, USA
3
Molecular Plant Sciences, Washington State University, Pullman, WA 99164, USA
4
Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
5
USDA-ARS Southern Plains Agricultural Research Center, College Station, TX 77845, USA
6
Department of Plant Pathology and Microbiology, Texas A&M University, College Station, TX 77843, USA
*
Author to whom correspondence should be addressed.
Plants 2023, 12(12), 2344; https://doi.org/10.3390/plants12122344
Submission received: 4 June 2023 / Revised: 13 June 2023 / Accepted: 15 June 2023 / Published: 16 June 2023
(This article belongs to the Special Issue Plant Reproduction and Embryonic Development)

Abstract

:
Sorghum is considered the fifth most important crop in the world. Despite the potential value of Senegalese germplasm for various traits, such as resistance to fungal diseases, there is limited information on the study of sorghum seed morphology. In this study, 162 Senegalese germplasms were evaluated for seed area size, length, width, length-to-width ratio, perimeter, circularity, the distance between the intersection of length & width (IS) and center of gravity (CG), and seed darkness and brightness by scanning and analyzing morphology-related traits with SmartGrain software at the USDA-ARS Plant Science Research Unit. Correlations between seed morphology-related traits and traits associated with anthracnose and head smut resistance were analyzed. Lastly, genome-wide association studies were performed on phenotypic data collected from over 16,000 seeds and 193,727 publicly available single nucleotide polymorphisms (SNPs). Several significant SNPs were found and mapped to the reference sorghum genome to uncover multiple candidate genes potentially associated with seed morphology. The results indicate clear correlations among seed morphology-related traits and potential associations between seed morphology and the defense response of sorghum. GWAS analysis listed candidate genes associated with seed morphologies that can be used for sorghum breeding in the future.

1. Introduction

Sorghum [Sorghum bicolor (L.) Moench] is a gluten-free cereal widely consumed throughout Africa [1,2]. It is a climate-resilient and drought-tolerant crop used for animal feed, biofuels, forage, ethanol production, and fodder preservation; it is arguably one of Africa’s most versatile food crops [3]. Plant genetics researchers have frequently employed community association panels to investigate the genetic basis of naturally occurring phenotypic variation in several traits [4]. Sorghum germplasm lines from West and Central Africa are cultivated in rainy and high-humidity regions and have been a significant source of critical agronomic traits such as fungal disease resistance [5]. To date, Senegalese germplasms have been extensively tested through genome-wide association studies (GWAS) to identify novel sources of resistance genes against fungal diseases such as anthracnose and head smut [5,6,7], but the germplasms have yet to be widely studied for other agronomically important traits such as seed morphology.
Morphological variations in seed characters, such as differences in seed size and shape, are important traits in plant identification and classification of taxa, and are useful parameters for analyzing plant biodiversity [8,9]. Seed morphology is of agronomic importance, as it reflects genetic, physiological, and ecological components that affect yield, quality, and market price [8]; for instance, acceptance of high lysine-containing sorghums is limited due to many problems associated with their opaque kernel characteristics [10,11]. Breeding targets encompass seed shape and size, as seed size-related traits bear particular importance, and a comprehensive understanding of the genes underlying seed morphology equips breeders with the capacity to develop novel cultivars harboring desired characteristics [12].
Sakamoto et al. evaluated 329 sorghum germplasms from various origins and identified SNPs potentially associated with seed morphology, including SNP loci S01_50413644, S04_59021202, and S05_9112888 based on multi-traits GWAS [13]. Zhang et al. identified 73 quantitative trait loci (QTL) related to grain color and tannin content in Chinese sorghum accessions, and a new recessive allelic variant of Tannin2 was discovered [14]. A GWAS conducted on a diverse set of 635 Ethiopian sorghum accessions found variations in loci harboring seed protein genes involved in seed storage, late embryogenesis, and tannin biosynthesis, all of which are associated with sorghum grain mold resistance [15].
To add to our understanding of sorghum seed morphology, in this study, 162 Senegalese sorghum germplasms along with controls (BTx623, PI609251, and PI659985, which are widely used cultivars in sorghum research) were evaluated for various seed morphologies, including seed area size, length, width, length-to-width ratio (LWR), perimeter, circularity, the distance between the intersection of length and width (IS) and center of gravity (CG), and seed darkness and brightness. In addition to identifying variation for each trait, possible correlations among the characteristics were evaluated across the subset of the Senegalese collection. Furthermore, Ahn et al. previously studied identical germplasms to evaluate resistance to anthracnose at the seedling and 8-leaf stages and for head smut based on distinctive spot appearance rate and average time for spot appearance on the first leaf under water [6,7,16,17]. By taking advantage of the previous studies, the seed morphology-related traits were also analyzed to identify potential correlations with the traits associated with anthracnose and head smut. Finally, the phenotypic data collected from over 16,000 seeds for the traits were combined with 193,727 single nucleotide polymorphisms (SNPs) throughout the genome to perform GWAS regarding single traits and multi-traits. The top candidate SNPs were tracked back to the reference sorghum genome, resulting in the identification of multiple candidate genes potentially associated with seed morphology-related traits.

2. Results

2.1. Seed Morphologies

Based on the two-tailed test, the ANOVA for the 165 accessions, including BTx623, PI609251, and PI659985, showed significant differences, with p < 0.0001 for all evaluated traits (raw data available through Supplementary Data S1). The Shapiro–Wilk test identified normal distribution for the distance between IS and CG (p = 0.086), but other traits showed abnormal distributions with p < 0.0001 (Figure 1). Brightness and circularity were skewed to the right, and the other traits had a few to multiple outliers (Figure 1). The top five cultivars for each trait are shown in Table 1 (detailed phenotypic data are available through Supplementary Data S1). For example, the area size for PI514293 was 6.67 ± 0.86 mm2, while that for PI514404 was 19.26 ± 3.41 mm2 (Table 1 and Figure 2). Similarly, the seed colors between PI514471 and PI514419 showed great contrast (Table 1 and Figure 3). Significant phenotypic variations were observed in other traits across the population as well.

2.2. Correlations among the Seed Morphology-Related Traits

Based on Pearson’s correlation analysis, all evaluated seed size-related traits correlate to each other except for perimeter-circularity and LWR-circularity (Figure 4 and Table 2). Seed brightness showed no correlation to all seven seed size-related traits indicating the seed color-related trait is highly independent of seed size-related traits.
A PCA using eight seed morphology-related traits revealed that PC1 and PC2 explain 75.83% of the overall variation (Figure 5). The plot of the partial contribution of variables for eight traits revealed that PC1 comprises area size, perimeter, length, width, and length-to-width ratio (Figure 6). Circularity and distance between IS and CG mainly contribute to PC2. Seed brightness contributes mostly to PC3.

2.3. Correlations between the Seed Morphology-Related Traits and Anthracnose and Head Smut Resistance Responses in Sorghum

Based on multivariate correlation studies between seed morphology-related traits and sorghum responses to anthracnose and head smut, we identified two major correlations supported with significant p-values. There was a moderately strong negative correlation between head smut spot appearance rate (%) and circularity (Figure 7a). On the other hand, head smut spot appearance rate (%) showed a moderate positive correlation to distance between IS and CG, which are directly associated with seed morphology and circularity (Figure 7b). Although p-values were higher than 0.05, seed area size and length were also negatively correlated to the spot appearance rate (p-values ≈ 0.05). Other seed morphology-related traits did not show correlation to the diseases’ associated traits.

2.4. Population Structure and GWAS Analysis

The population structure analysis revealed that there are two major groups across the accessions tested in this study (Figure 8). The dendrogram displayed nearly identical results to the admixture plot (Figure 9). Overall, the population structure analysis aligned with previous studies indicating that the botanical subrace played a major role in shaping the diversity patterns of the population [7,18,19]. PCA plots, along with phylogenetic trees, were omitted as it has been reported in a recent study in the population [7].
Figure 10 and Table 3 show the top SNPs identified from GWAS and their associated genes. LD heatmaps highlighted LD of regions near the statistically significant SNP loci, indicating low LD (Figure 11). Overall, 100 SNP variants passed the Bonferroni threshold before secondary filtering with a t-test. The number of SNPs for each trait varied (3 SNPs for area size, 11 SNPs for perimeter, 0 SNP for length, 9 SNPs for width, 3 SNPs for LWR, 0 SNP for distance between IS and CG, 56 SNPs for PCs and 18 SNPs for brightness). After filtering with a t-test, multiple SNPs were excluded from the list, leaving the top SNPs that can be used as novel sources for sorghum seed morphology-related traits in breeding programs.

3. Discussion

Seed weight and size are critical yield components, and selecting for large seeds has long been a goal in crop domestication [20,21]. Measures of size and shape in seeds and their correlation are equally essential in current breeding to improve yield or quality [8]. As one of the most important crops worldwide, sorghum seed morphologies and their associations with molecular markers are of potential use not only for breeding but for evaluating the role of specific genes in seed shape and size. This study explored eight important sorghum seed morphology traits in Senegalese sorghum accessions that have yet to be extensively studied regarding seed-related traits. Figure 4 and Table 2 show that many seed size-related traits, including area size, perimeter, length, width, and length-to-width ratio, were correlated. Circularity and distance between IS and CG were also associated with other seed size-related traits. However, based on PCA analysis (Figure 5 and Figure 6), these traits were not closely grouped with other seed traits. Seed brightness, primarily explained by PC3, was not significantly associated with the other seed traits.
In the multivariate correlation studies examining the relationship between seed morphology-related traits and sorghum responses to anthracnose and head smut, we observed two significant correlations. Firstly, we found a moderate negative correlation between the head smut spot appearance rate (%) and circularity. Secondly, the head smut spot appearance rate (%) showed a moderate positive correlation with the distance between IS and CG. Craig and Frederiksen conducted a seedling inoculation for sorghum using peat pellets against Sporisorium reilianum (Kühn) Langdon & Fullerton (syns. Sphacelotheca reiliana (Kühn) G.P. Clinton and Sorosporium reilianum (Kühn) McAlpine) causing sorghum head smut [16,22]. The sorghum seedlings at the 1-leaf stage were inoculated with teliospore cultures. After four days, the seedlings were submerged in water-filled test tubes, and the presence of brown or dark spots on the first leaf blade distinguished susceptible genotypes from resistant ones. Craig and Frederiksen explained the spots caused by the fungal pathogen, but it is unclear if the spots are present due to fungal infection or due to a plant defense response [16,22]. Although the observed correlations may be mere coincidences, Seiwa et al. reported results suggesting that seed size may play a role in conspecific negative distance-dependent seedling mortality and negative density-dependent seedling survival variation (CNDD), and that seed size may promote species coexistence by influencing distance-dependent pathogen attacks, especially those related to leaf diseases in eight tree species [23]. A study conducted in the Peruvian Amazon suggested a positive correlation between tree seed size and susceptibility to pathogen attack [24]. Specifically, the results indicated that larger and shade-tolerant seeds exhibited a higher vulnerability to pathogen attack than smaller seeds relying on light dependence [24]. A positive relationship between seed weight and susceptibility to pathogens was also found [24]. Ahn et al. [7] reported top candidate genes associated with the spot appearance when inoculated with S. reilianum at the seedling stage. The top candidate genes (F-box and leucine-rich repeat protein (Sobic.004G202700), ankyrin repeats (Sobic.002G174700) and xyloglucan endotransglucosylase (Sobic.004G273200)) were located near growth and development-related genes such as rhodanese-like domain-containing protein-like (Sobic.004G202600) [25], cellulase/endoglucanase (Sobic.004G202800) [26], protein kinase AFC1 (Sobic.004G202500) [27], serine/threonine protein kinase (Sobic.002G174801) [28], legume lectin domain (Sobic.002G174600) [29], transcription factor jumonji (jmj) family protein/zinc finger (C5HC2 type) family protein (Sobic.004G273100) [30], ubiquitin and ubiquitin-like proteins (Sobic.004G273300) [31], and MYB-like DNA-binding domain (Sobic.004G273000) [32]. Therefore, it is speculated that the correlations between the head smut spot appearance rate and the two morphology-related traits are due to strong genetic linkage.
Zinc finger proteins play essential roles in plant growth, development, and responses to abiotic stresses such as drought, salt, temperature, reactive oxygen species, and harmful metals [33]. As listed in Table 3, zinc finger-associated genes were linked with multiple SNPs potentially associated with seeds (SNP loci for area: S06_12058855, perimeter: S02_36482136 and S07_16244875, width: S06_12058855 and S07_16244901, and PCs: S06_54692844 and S06_54708681). MicroRNAs (miRNAs) play an essential role in regulating plant development by mediating target genes at transcriptional and post-transcriptional levels [34], and DUF3537 (perimeter-associated SNP locus S06_39973848) is one of the predicted target genes of miRNAs in Acacia crassicarpa [35]. DUF3537 is a member of the transmembrane protein family. Transmembrane proteins are recognized to play a role in biological stress response [36]. Examples include pathogen-induced cysteine-rich transmembrane proteins [37], suppressors of NPR1 Constitutive2 proteins [36], polygalacturonase-inhibiting proteins [38,39,40], and ankyrin repeat-containing proteins [41].
The role of ribonucleotide reductase (perimeter and width: SNP loci S02_36533975 and S02_36673863) in the de novo synthesis of deoxynucleoside triphosphates (dNTPs) in DNA replication and cell cycle progression is critical [42]. Homeodomain leucine-zipper (perimeter: SNP locus S01_72472059) interacts genetically to align morphogenesis and environmental responses by modulating phytohormone-signaling networks [43]. SNP locus S02_46916695 (perimeter) is located next to Sobic.001G447400 and associated with TPR and ankyrin repeat. The TPR-containing protein TTL1in Arabidopsis regulates plant responses to abscisic acid (ABA) in seeds and seedlings [44]. Ankyrin repeat-containing proteins are essential in cell growth, development, and response to hormones and environmental stresses [45], and ankyrin-TPR repeats gene clusters in rice are associated with panicle branching diversity [46]. SNP locus S06_51902835, one of the top SNPs associated with perimeter, was 2571 bp away from calcium/calmodulin-dependent protein kinase. In rice, OsDMI3 (calcium/calmodulin-dependent protein kinase)-mediated phosphorylation of OsMKK1 (MAPK) kinase activates the MAPK cascade and positively regulates abscisic acid responses in seed germination, root growth, and tolerance to both water stress and oxidative stress [47]. Glycosyltransferase, tagged with seed perimeter-associated SNP locus S02_46926845, is known to have roles in seed coat mucilage composition [48]. Auxin is a plant hormone central to plant growth and development from embryogenesis to senescence, and PB1 domain (perimeter: SNP locus S04_67200818) interactions in auxin response factor ARF5 and repressor IAA17 [49]. The closest gene from the SNP locus S01_52374442 (width) was Sobic.001G271500, which contains a leucine-rich repeat. Leucine-rich repeat proteins are critical for growth promotion, seed maturation, stress response, and enhanced seed production [50,51]. Leucine-rich repeat proteins such as the polygalacturonase-inhibiting proteins are involved in plant defense [52]. The superfamily of cytochrome P450 tagged by SNP locus S03_5165375 (PCs) plays critical roles in plant growth and development, biotic and abiotic stress responses, and metabolic diversification [53,54]. Considering that PC1, which was used as input data, predominantly consists of variables such as area size, perimeter length, length, width, and other PCs included as covariates, the multivariate GWAS results mainly reflect traits associated with seed size.
The SNP locus that S06_58756099 (PCs) tagged to farnesyl-pyrophosphate synthetase is known to be associated with plant development [55]. The N-terminal domains of Arabidopsis rhamnose synthases RHM1, 2, and 3 have UDP-D-glucose 4,6-dehydratase activity (PCs-related SNP loci S04_62452658), and rhamnose synthases are required for the development of root hairs and cotyledon pavement cells and the synthesis of seed mucilage [56]. SNP loci S06_8316056 and S03_5151221 (PCs) were related to protein transport-related genes. In plants, the Golgi apparatus is central to synthesizing complex cell wall polysaccharides and glycolipids in the plasma membrane and adding oligosaccharides to proteins destined to reach the cell wall, plasma membrane, or storage vacuoles [57]. Rho GTPases (PCs-related SNP locus S04_62432641) modulate plant growth and development. Similarly, PPR proteins (PCs-related SNP locus S04_62438345) also play important roles in seed development, plant growth and development, and stress responses [58]. WD40 repeat genes, which include SNP loci S03_51995650 (PCs) and S10_15656073 (brightness), are reported to be associated with anthocyanin accumulation, seed pigmentation, seed germination, seed growth, and biomass [59,60]. DNA helicases, a gene close to SNP locus S01_53382065 associated with seed brightness, are molecular motor proteins that have suggested roles in cell division/proliferation during flower development, maintenance of genomic methylation patterns, and the plant cell cycle, as well as in basic cellular activities [61].
The brightness-related SNP locus S01_2657706 is found near amyloid beta precursor protein-binding protein 1 (APPBP1). Numerous amyloids are involved in pathogenesis; however, plant amyloids are poorly studied but are known to play roles in the autonomous flowering pathway and post-translational modification [62]. Ribonuclease III (brightness-related SNP locus S01_66050539) is responsible for the processing and maturation of RNA precursors into functional rRNA, mRNA, and other small RNA. However, no reported role of seed color or development has been known for the genes. SNP locus S06_37776989 (width) tagged gene of unknown function (DUF493) and SNP locus S10_53260179 (brightness) were closely located on a locus that does not have any annotated gene nearby. These SNPs may be associated with seed morphology but could also be false positives caused by pure coincidences.
Most of the genes identified are involved in plant development and physiological processes with potential aspects of seed morphology. Bonferroni correction is often considered highly conservative [63], and in this study, 100 SNPs passed the Bonferroni threshold. Furthermore, we verified the average score/SNP through a simple t-test and filtered out any SNP that failed to pass p < 0.05. Even with these strict methods to minimize false positives, there are many novel SNPs potentially conferring changes in various seed morphologies. Hence, it is expected that most identified SNPs are genuinely associated with seed morphology-related traits that can be used for sorghum breeding in the future. Further studies should investigate the relationship between seed morphology-related characteristics and molecular markers to better understand seed morphology-related genes in the subset of the Senegalese sorghum collection and other collections within the National Plant Germplasm System of the US.

4. Materials and Methods

4.1. Seed Phenotypic Evaluation

A total of 162 cultivars from the Senegalese germplasm collection (complete list available in Supplementary Data S1) maintained by the USDA-ARS, Plant Genetic Resources Conservation Unit, Griffin, Georgia, and Controls (BTx623, PI609251, and PI659985) were evaluated for seed area size (mm2), length (mm), width (mm), length-to-width ratio (LWR), perimeter (mm), circularity (0–1 range, 0: not circular to 1: complete circle), the distance between the intersection of length & width (IS) and center of gravity (CG), and seed darkness & brightness (0–255 range, 0: complete black to 255: pure white). CG is the point where the seed’s mass is concentrated and IS are points where the width and length hit the boundary for the seed parameter [64].
Around 80 to 100 seeds were evaluated for each trait across all the cultivars and controls, except for PI659985, where the available number of seeds was only 50. Seed images were captured with Canon imageRUNNER ADVANCE C7270 (Canon Inc., Tokyo, Japan). With the SmartGrain (version 1.3) high-throughput phenotyping software, the scanned seed images were measured for seed area size, length, width, LWR, perimeter, circularity, and distance between IS and CG [64]. Any errors generated by SmartGrain were manually corrected for each image. Seed darkness and brightness were measured using a multi-point function in ImageJ version 1.54d [65].

4.2. Statistical Analysis

Tukey’s HSD test for all possible cultivar comparisons was performed with JMP Pro 15 (SAS Institute, Cary, NC, USA) in each trait for statistical analysis. One-way ANOVA was performed for each trait separately. Pearson’s correlation was calculated for all possible pairs of seed morphology-related traits with JMP Pro 15. For anthracnose and head smut resistance traits [6,7,16,17], both Pearson’s (for parametric traits: all traits except anthracnose score-related traits) and Spearman’s rank (for non-parametric traits including the 1–5 scale scoring for anthracnose-related traits) correlation tests were performed. The Shapiro–Wilk test was conducted to evaluate normal distribution in each trait. A principal component analysis (PCA) was performed using data from the eight measured traits. Additional PCA was performed using seven correlated data, except the traits for color for multi-variate GWAS analysis.

4.3. GWAS and Population Genomic Analysis

For GWAS, a total of 193,727 SNP data were extracted from an integrated sorghum SNPs dataset based on sorghum reference genome version 3.1.1 and initially genotyped using GBS [18,66,67,68]. Phenotypic data was input univariately to perform GWAS with a mixed linear model (MLM) through TASSEL version 5.2.55 [69] association mapping software to identify chromosomal locations associated with each trait. Moreover, a PCA was performed with TASSEL version 5.2.55 for the traits that showed high correlations (all traits except seed color in this study), and a multivariate GWAS [PC GWAS based on PCs for phenotypic data (PC1 = data and other PCs = covariates)] was conducted through MLM. False associations were minimized by removing SNPs with higher than 20% unknown alleles and SNPs with minor allele frequency (MAF) below 5%, resulting in 132,024 SNPs [70]. The SNPs that passed the Bonferroni threshold were mapped back to the publicly available sorghum reference genome to be tracked to the specific chromosome location based on the sorghum reference genome sequence, version 3.1.1, available at the Phytozome 13 (https://phytozome.jgi.doe.gov, (accessed on 25 May 2023)) [71]. The mean values for Senegalese germplasms with either of the two prevalent bases were determined for each of the prospective genes listed in Table 3. The differences in these mean values were verified to be significant (p < 0.05) using JMP Pro 15 (SAS Institute, Cary, NC, USA). SNPs that did not pass the t-test were excluded from the list of candidate SNPs.
PLINK v1.9 [72] was used for VCF file conversions and to randomly select 50,000 SNPs from the genotypic data for analysis of population structure in R v4.1.2 and R studio v1.4.1717. The packages FactoMineR v2.8 [73] and Factoextra v1.0.7 [74] were used to conduct PCA and determine optimal k-means clustering using the average silhouette method, respectively. ADMIXTURE v1.3.0 [75] was run with the optimal k-means clustering value to visualize population structure. SNPRelate v1.28.0 and gdsfmt v1.30.lil0 [76] were used to generate a dendrogram of the accessions from the genotype data, which was visualized using ggtree v3.2.1 [73] to further validate population structure, and assign accessions to genetic groups. LD heatmap v1.0-6 [77] was utilized to plot LD of local variants around statistically significant SNPs.

5. Conclusions

In this study, we analyzed seed morphology-related traits in a subset of Senegalese germplasm. Even though a low genetic diversity was found, the accessions showed a wide range of morphological traits. Intriguingly, there were potential associations between seed morphology-related traits and head smut spot appearance rate, explained by possible genetic linkages. Nearly all the candidate genes from GWAS analysis had known roles in plant growth and development. The identified genes’ functions can be validated by using modern and cutting-edge techniques such as real-time quantitative reverse transcription PCR (Real-time qRT-PCR), RNA sequencing analysis (RNA-Seq), and CRISPR-Cas9-associated gene editing. Although applying gene-editing techniques in monocot crops is challenging, rapid developments in gene editing technology will offer fast and precise functional validations of the candidate genes. On the other hand, it is essential to survey seed morphology-related traits in other sorghum populations to identify additional candidate genes and genes that overlap in multiple populations.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants12122344/s1, Supplementary Data S1 (Excel spreadsheet): Phenotypic data from all tested seeds.

Author Contributions

E.A.: Conceptualization, Data Curation, Methodology, Software, Validation, Formal analysis, Investigation, Original Draft. J.B.: Software, Validation, Formal analysis, Writing—review and editing. V.E.: Writing—review and editing. Y.L.: Methodology, Writing—review and editing. K.P.: Writing—review and editing. L.K.P.: Resources, Methodology, Writing—review and editing. C.M.: Project administration, Funding acquisition, Supervision, Writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by AFRI, NIFA, USDA grant number 20156800423492 and by The Feed the Future Innovation Lab for Collaborative Research on Sorghum and Millet, a United States Agency for International Development Cooperative grant number AID-OAA-A-13-00047 with title name: Enabling marker assisted selection for sorghum disease resistance in Senegal and Niger. E.A. and L.K.P. were supported by the U.S. Department of Agriculture, Agricultural Research Service. Mention of any trade names or commercial products in this article is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture. USDA is an equal opportunity provider and employer, and all agency services are available without discrimination.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We express our gratitude to the reviewers for their valuable feedback.

Conflicts of Interest

The authors declare that they have no known competing financial interest or personal relationship that could have influenced the work reported in this paper.

References

  1. Arouna, N.; Gabriele, M.; Pucci, L. The Impact of Germination on Sorghum Nutraceutical Properties. Foods 2020, 9, 1218. [Google Scholar] [CrossRef] [PubMed]
  2. Frankowski, J.; Przybylska-Balcerek, A.; Stuper-Szablewska, K. Concentration of Pro-Health Compound of Sorghum Grain-Based Foods. Foods 2022, 11, 216. [Google Scholar] [CrossRef] [PubMed]
  3. Adebo, O.A. African Sorghum-Based Fermented Foods: Past, Current and Future Prospects. Nutrients 2020, 12, 1111. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Mural, R.V.; Sun, G.; Grzybowski, M.; Tross, M.C.; Jin, H.; Smith, C.; Newton, L.; Andorf, C.M.; Woodhouse, M.R.; Thompson, A.M.; et al. Association Mapping across a Multitude of Traits Collected in Diverse Environments in Maize. GigaScience 2022, 11, giac080. [Google Scholar] [CrossRef]
  5. Cuevas, H.E.; Prom, L.K.; Rosa-Valentin, G. Population Structure of the NPGS Senegalese Sorghum Collection and Its Evaluation to Identify New Disease Resistant Genes. PLoS ONE 2018, 13, e0191877. [Google Scholar] [CrossRef] [Green Version]
  6. Ahn, E.; Fall, C.; Prom, L.K.; Magill, C. Genome-Wide Association Study of Senegalese Sorghum Seedlings Responding to a Texas Isolate of Colletotrichum Sublineola. Sci. Rep. 2022, 12, 13025. [Google Scholar] [CrossRef]
  7. Ahn, E.; Prom, L.K.; Hu, Z.; Odvody, G.; Magill, C. Genome-wide Association Analysis for Response of Senegalese Sorghum Accessions to Texas Isolates of Anthracnose. Plant. Genome 2021, 14, e20097. [Google Scholar] [CrossRef]
  8. Cervantes, E.; Martín, J.J.; Saadaoui, E. Updated Methods for Seed Shape Analysis. Scientifica 2016, 2016, 5691825. [Google Scholar] [CrossRef] [Green Version]
  9. Hareesh, V.S.; Sabu, M. Significance of Seed Morphology and Anatomy in the Systematics of Musaceae. Bot. J. Linn. Soc. 2023, 201, 1–35. [Google Scholar] [CrossRef]
  10. Rao, V.J.M.; Mehta, S.L.; Rao, N.G.P. Breeding Sorghums for High Protein Content and Quality. Indian. J. Genet. Plant Breed. 1984, 44, 305–313. [Google Scholar]
  11. Ejeta, G.; Axtell, J.D. Development of Hard Endosperm High Lysine Sorghum Lines. In Proceedings of the International Conference on Sorghum Nutritional Quality, West Lafayette, IN, USA, 1–26 March 1990; pp. 126–140. [Google Scholar]
  12. Biselli, C.; Bagnaresi, P.; Cavalluzzo, D.; Urso, S.; Desiderio, F.; Orasen, G.; Gianinetti, A.; Righettini, F.; Gennaro, M.; Perrini, R.; et al. Deep Sequencing Transcriptional Fingerprinting of Rice Kernels for Dissecting Grain Quality Traits. BMC Genom. 2015, 16, 1091. [Google Scholar] [CrossRef] [Green Version]
  13. Sakamoto, L.; Kajiya-Kanegae, H.; Noshita, K.; Takanashi, H.; Kobayashi, M.; Kudo, T.; Yano, K.; Tokunaga, T.; Tsutsumi, N.; Iwata, H. Comparison of Shape Quantification Methods for Genomic Prediction, and Genome-Wide Association Study of Sorghum Seed Morphology. PLoS ONE 2019, 14, e0224695. [Google Scholar] [CrossRef] [Green Version]
  14. Zhang, L.; Xu, J.; Ding, Y.; Cao, N.; Gao, X.; Feng, Z.; Li, K.; Cheng, B.; Zhou, L.; Ren, M.; et al. GWAS of Grain Color and Tannin Content in Chinese Sorghum Based on Whole-Genome Sequencing. Theor. Appl. Genet. 2023, 136, 77. [Google Scholar] [CrossRef] [PubMed]
  15. Nida, H.; Girma, G.; Mekonen, M.; Tirfessa, A.; Seyoum, A.; Bejiga, T.; Birhanu, C.; Dessalegn, K.; Senbetay, T.; Ayana, G.; et al. Genome-Wide Association Analysis Reveals Seed Protein Loci as Determinants of Variations in Grain Mold Resistance in Sorghum. Theor. Appl. Genet. 2021, 134, 1167–1184. [Google Scholar] [CrossRef] [PubMed]
  16. Ahn, E.; Prom, L.K.; Fall, C.; Magill, C. Response of Senegalese Sorghum Seedlings to Pathotype 5 of Sporisorium Reilianum. Crops 2022, 2, 142–153. [Google Scholar] [CrossRef]
  17. Ahn, E.; Fall, C.; Prom, L.K.; Magill, C. A Genome-Wide Association Study of Senegalese Sorghum Seedlings Responding to Pathotype 5 of Sporisorium Reilianum. Plants 2022, 11, 2999. [Google Scholar] [CrossRef]
  18. Hu, Z.; Olatoye, M.O.; Marla, S.; Morris, G.P. An Integrated Genotyping-by-Sequencing Polymorphism Map for Over 10,000 Sorghum Genotypes. Plant Genome 2019, 12, 180044. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  19. Morris, G.P.; Ramu, P.; Deshpande, S.P.; Hash, C.T.; Shah, T.; Upadhyaya, H.D.; Riera-Lizarazu, O.; Brown, P.J.; Acharya, C.B.; Mitchell, S.E.; et al. Population Genomic and Genome-Wide Association Studies of Agroclimatic Traits in Sorghum. Proc. Natl. Acad. Sci. USA 2013, 110, 453–458. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Gavazzi, G.; Sangiorgio, S. Seed Size: An Important Yield Component. In More Food: Road to Survival, 1st ed.; Pilu, R., Gavazzi, G., Eds.; Bentham Science: Sharjah, United Arab Emirates, 2017; pp. 143–167. ISBN 978-1-68108-467-1. [Google Scholar]
  21. Giordani, W.; Gama, H.C.; Chiorato, A.F.; Garcia, A.A.F.; Vieira, M.L.C. Genome-Wide Association Studies Dissect the Genetic Architecture of Seed Shape and Size in Common Bean. G3 Genes|Genomes|Genet. 2022, 12, jkac048. [Google Scholar] [CrossRef]
  22. Craig, J. Comparison of Sorghum Seedling Reactions to Sporisorium reilianum in Relation to Sorghum Head Smut Resistance Classes. Plant Dis. 1992, 76, 314. [Google Scholar] [CrossRef]
  23. Seiwa, K.; Masaka, K.; Konno, M.; Iwamoto, S. Role of Seed Size and Relative Abundance in Conspecific Negative Distance-Dependent Seedling Mortality for Eight Tree Species in a Temperate Forest. For. Ecol. Manag. 2019, 453, 117537. [Google Scholar] [CrossRef]
  24. Pringle, E.G.; Álvarez-Loayza, P.; Terborgh, J. Seed Characteristics and Susceptibility to Pathogen Attack in Tree Seeds of the Peruvian Amazon. Plant Ecol. 2007, 193, 211–222. [Google Scholar] [CrossRef]
  25. Selles, B.; Moseler, A.; Rouhier, N.; Couturier, J. Rhodanese Domain-Containing Sulfurtransferases: Multifaceted Proteins Involved in Sulfur Trafficking in Plants. J. Exp. Bot. 2019, 70, 4139–4154. [Google Scholar] [CrossRef] [PubMed]
  26. Perrot, T.; Pauly, M.; Ramírez, V. Emerging Roles of β-Glucanases in Plant Development and Adaptative Responses. Plants 2022, 11, 1119. [Google Scholar] [CrossRef]
  27. Garcia, A.V.; Al-Yousif, M.; Hirt, H. Role of AGC Kinases in Plant Growth and Stress Responses. Cell. Mol. Life Sci. 2012, 69, 3259–3267. [Google Scholar] [CrossRef]
  28. Zhang, X.; Wang, X.; Xu, K.; Jiang, Z.; Dong, K.; Xie, X.; Zhang, H.; Yue, N.; Zhang, Y.; Wang, X.-B.; et al. The Serine/Threonine/Tyrosine Kinase STY46 Defends against Hordeivirus Infection by Phosphorylating Γb Protein. Plant Physiol. 2021, 186, 715–730. [Google Scholar] [CrossRef]
  29. Naithani, S.; Komath, S.S.; Nonomura, A.; Govindjee, G. Plant Lectins and Their Many Roles: Carbohydrate-Binding and Beyond. J. Plant Physiol. 2021, 266, 153531. [Google Scholar] [CrossRef]
  30. Yamaguchi, N. Removal of H3K27me3 by JMJ Proteins Controls Plant Development and Environmental Responses in Arabidopsis. Front. Plant Sci. 2021, 12, 687416. [Google Scholar] [CrossRef]
  31. Lyzenga, W.J.; Stone, S.L. Abiotic Stress Tolerance Mediated by Protein Ubiquitination. J. Exp. Bot. 2012, 63, 599–616. [Google Scholar] [CrossRef] [Green Version]
  32. Katiyar, A.; Smita, S.; Lenka, S.K.; Rajwanshi, R.; Chinnusamy, V.; Bansal, K.C. Genome-Wide Classification and Expression Analysis of MYB Transcription Factor Families in Rice and Arabidopsis. BMC Genom. 2012, 13, 544. [Google Scholar] [CrossRef] [Green Version]
  33. Han, G.; Qiao, Z.; Li, Y.; Yang, Z.; Wang, C.; Zhang, Y.; Liu, L.; Wang, B. RING Zinc Finger Proteins in Plant Abiotic Stress Tolerance. Front. Plant Sci. 2022, 13, 877011. [Google Scholar] [CrossRef] [PubMed]
  34. Dong, Q.; Hu, B.; Zhang, C. MicroRNAs and Their Roles in Plant Development. Front. Plant Sci. 2022, 13, 824240. [Google Scholar] [CrossRef] [PubMed]
  35. Liu, W.; Yu, W.; Hou, L.; Wang, X.; Zheng, F.; Wang, W.; Liang, D.; Yang, H.; Jin, Y.; Xie, X. Analysis of MiRNAs and Their Targets during Adventitious Shoot Organogenesis of Acacia Crassicarpa. PLoS ONE 2014, 9, e93438. [Google Scholar] [CrossRef] [PubMed]
  36. Zhou, Y.; Wang, B.; Yuan, F. The Role of Transmembrane Proteins in Plant Growth, Development, and Stress Responses. Int. J. Mol. Sci. 2022, 23, 13627. [Google Scholar] [CrossRef]
  37. Pereira Mendes, M.; Hickman, R.; Van Verk, M.C.; Nieuwendijk, N.M.; Reinstädler, A.; Panstruga, R.; Pieterse, C.M.J.; Van Wees, S.C.M. A Family of Pathogen-Induced Cysteine-Rich Transmembrane Proteins Is Involved in Plant Disease Resistance. Planta 2021, 253, 102. [Google Scholar] [CrossRef]
  38. Cheng, S.; Li, R.; Lin, L.; Shi, H.; Liu, X.; Yu, C. Recent Advances in Understanding the Function of the PGIP Gene and the Research of Its Proteins for the Disease Resistance of Plants. Appl. Sci. 2021, 11, 11123. [Google Scholar] [CrossRef]
  39. Ellur, V.; Wei, W.; Chen, W. Using Chickpea Polygalacturonase-Inhibiting Protein as a Tool to Elucidate Molecular Mechanisms of Chickpea Interactions with Various Pathogens. In Proceedings of the 2018 Pacific Division Meeting, Portland, OR, USA, 25–27 June 2018; p. 44. [Google Scholar]
  40. Ellur, V. Roles of Chickpea Polygalacturonase-Inhibiting Proteins in Defense Against Pathogenic Fungi. Ph.D. Dissertation, Washington State University, Pullman, WA, USA, May 2022. [Google Scholar]
  41. Hu, P.; Ren, Y.; Xu, J.; Wei, Q.; Song, P.; Guan, Y.; Gao, H.; Zhang, Y.; Hu, H.; Li, C. Identification of Ankyrin-Transmembrane-Type Subfamily Genes in Triticeae Species Reveals TaANKTM2A-5 Regulates Powdery Mildew Resistance in Wheat. Front. Plant Sci. 2022, 13, 943217. [Google Scholar] [CrossRef]
  42. Gu, M.; Lu, Q.; Liu, Y.; Cui, M.; Si, Y.; Wu, H.; Chai, T.; Ling, H.-Q. Requirement and Functional Redundancy of Two Large Ribonucleotide Reductase Subunit Genes for Cell Cycle, Chloroplast Biogenesis and Photosynthesis in Tomato. Ann. Bot. 2022, 130, 173–187. [Google Scholar] [CrossRef]
  43. Brandt, R.; Cabedo, M.; Xie, Y.; Wenkel, S. Homeodomain Leucine-Zipper Proteins and Their Role in Synchronizing Growth and Development with the Environment: HD-ZIPs and Plant Development. J. Integr. Plant Biol. 2014, 56, 518–526. [Google Scholar] [CrossRef]
  44. Rosado, A.; Schapire, A.L.; Bressan, R.A.; Harfouche, A.L.; Hasegawa, P.M.; Valpuesta, V.; Botella, M.A. The Arabidopsis Tetratricopeptide Repeat-Containing Protein TTL1 Is Required for Osmotic Stress Responses and Abscisic Acid Sensitivity. Plant Physiol. 2006, 142, 1113–1126. [Google Scholar] [CrossRef] [Green Version]
  45. Zhao, J.-Y.; Lu, Z.-W.; Sun, Y.; Fang, Z.-W.; Chen, J.; Zhou, Y.-B.; Chen, M.; Ma, Y.-Z.; Xu, Z.-S.; Min, D.-H. The Ankyrin-Repeat Gene GmANK114 Confers Drought and Salt Tolerance in Arabidopsis and Soybean. Front. Plant Sci. 2020, 11, 584167. [Google Scholar] [CrossRef] [PubMed]
  46. Khong, G.N.; Le, N.T.; Pham, M.T.; Adam, H.; Gauron, C.; Le, H.Q.; Pham, D.T.; Colonges, K.; Pham, X.H.; Do, V.N.; et al. A Cluster of Ankyrin and Ankyrin-TPR Repeat Genes Is Associated with Panicle Branching Diversity in Rice. PLoS Genet. 2021, 17, e1009594. [Google Scholar] [CrossRef]
  47. Chen, M.; Ni, L.; Chen, J.; Sun, M.; Qin, C.; Zhang, G.; Zhang, A.; Jiang, M. Rice Calcium/Calmodulin-Dependent Protein Kinase Directly Phosphorylates a Mitogen-Activated Protein Kinase Kinase to Regulate Abscisic Acid Responses. Plant Cell 2021, 33, 1790–1812. [Google Scholar] [CrossRef]
  48. Phan, J.L.; Tucker, M.R.; Khor, S.F.; Shirley, N.; Lahnstein, J.; Beahan, C.; Bacic, A.; Burton, R.A. Differences in Glycosyltransferase Family 61 Accompany Variation in Seed Coat Mucilage Composition in Plantago spp. J. Exp. Bot. 2016, 67, 6481–6495. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  49. Kim, Y.; Park, C.; Cha, S.; Han, M.; Ryu, K.-S.; Suh, J.-Y. Determinants of PB1 Domain Interactions in Auxin Response Factor ARF5 and Repressor IAA17. J. Mol. Biol. 2020, 432, 4010–4022. [Google Scholar] [CrossRef] [PubMed]
  50. Osakabe, Y.; Maruyama, K.; Seki, M.; Satou, M.; Shinozaki, K.; Yamaguchi-Shinozaki, K. Leucine-Rich Repeat Receptor-Like Kinase1 Is a Key Membrane-Bound Regulator of Abscisic Acid Early Signaling in Arabidopsis. Plant Cell 2005, 17, 1105–1119. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  51. Shahollari, B.; Vadassery, J.; Varma, A.; Oelmüller, R. A Leucine-Rich Repeat Protein Is Required for Growth Promotion and Enhanced Seed Production Mediated by the Endophytic Fungus Piriformospora indica in Arabidopsis thaliana: Growth Regulation by P. Indica. Plant J. 2007, 50, 1–13. [Google Scholar] [CrossRef]
  52. Ellur, V.; Wei, W.; Vandemark, G.J.; Chen, W. Characterization of Cell Wall Proteins in Chickpea That Inhibit Fungal Hydrolytic Enzymes. In Proceedings of the Plant Health 2020, Online, 10–14 August 2020. [Google Scholar]
  53. Xu, J.; Wang, X.; Guo, W. The Cytochrome P450 Superfamily: Key Players in Plant Development and Defense. J. Integr. Agric. 2015, 14, 1673–1686. [Google Scholar] [CrossRef] [Green Version]
  54. Hansen, C.C.; Nelson, D.R.; Møller, B.L.; Werck-Reichhart, D. Plant Cytochrome P450 Plasticity and Evolution. Mol. Plant 2021, 14, 1244–1265. [Google Scholar] [CrossRef]
  55. Manzano, D.; Andrade, P.; Caudepón, D.; Altabella, T.; Arró, M.; Ferrer, A. Suppressing Farnesyl Diphosphate Synthase Alters Chloroplast Development and Triggers Sterol-Dependent Induction of Jasmonate- and Fe-Related Responses. Plant Physiol. 2016, 172, 93–117. [Google Scholar] [CrossRef] [Green Version]
  56. Saffer, A.M.; Carpita, N.C.; Irish, V.F. Rhamnose-Containing Cell Wall Polymers Suppress Helical Plant Growth Independently of Microtubule Orientation. Curr. Biol. 2017, 27, 2248–2259.e4. [Google Scholar] [CrossRef] [PubMed]
  57. Neumann, U. Protein Transport in Plant Cells: In and Out of the Golgi. Ann. Bot. 2003, 92, 167–180. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  58. Li, X.; Sun, M.; Liu, S.; Teng, Q.; Li, S.; Jiang, Y. Functions of PPR Proteins in Plant Growth and Development. Int. J. Mol. Sci. 2021, 22, 11274. [Google Scholar] [CrossRef] [PubMed]
  59. Gachomo, E.W.; Jimenez-Lopez, J.C.; Baptiste, L.J.; Kotchoni, S.O. GIGANTUS1 (GTS1), a Member of Transducin/WD40 Protein Superfamily, Controls Seed Germination, Growth and Biomass Accumulation through Ribosome-Biogenesis Protein Interactions in Arabidopsis Thaliana. BMC Plant Biol. 2014, 14, 37. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  60. Yang, X.; Wang, J.; Xia, X.; Zhang, Z.; He, J.; Nong, B.; Luo, T.; Feng, R.; Wu, Y.; Pan, Y.; et al. OsTTG1, a WD40 Repeat Gene, Regulates Anthocyanin Biosynthesis in Rice. Plant J. 2021, 107, 198–214. [Google Scholar] [CrossRef]
  61. Tuteja, N. Plant DNA Helicases: The Long Unwinding Road. J. Exp. Bot. 2003, 54, 2201–2214. [Google Scholar] [CrossRef] [Green Version]
  62. Antonets, K.S.; Nizhnikov, A.A. Amyloids and Prions in Plants: Facts and Perspectives. Prion 2017, 11, 300–312. [Google Scholar] [CrossRef] [Green Version]
  63. Perneger, T.V. What’s Wrong with Bonferroni Adjustments. BMJ 1998, 316, 1236–1238. [Google Scholar] [CrossRef]
  64. Tanabata, T.; Shibaya, T.; Hori, K.; Ebana, K.; Yano, M. SmartGrain: High-Throughput Phenotyping Software for Measuring Seed Shape through Image Analysis. Plant Physiol. 2012, 160, 1871–1880. [Google Scholar] [CrossRef] [Green Version]
  65. Rueden, C.T.; Schindelin, J.; Hiner, M.C.; DeZonia, B.E.; Walter, A.E.; Arena, E.T.; Eliceiri, K.W. ImageJ2: ImageJ for the next Generation of Scientific Image Data. BMC Bioinform. 2017, 18, 529. [Google Scholar] [CrossRef] [Green Version]
  66. Elshire, R.J.; Glaubitz, J.C.; Sun, Q.; Poland, J.A.; Kawamoto, K.; Buckler, E.S.; Mitchell, S.E. A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species. PLoS ONE 2011, 6, e19379. [Google Scholar] [CrossRef] [Green Version]
  67. Upadhyaya, H.D.; Wang, Y.-H.; Gowda, C.L.L.; Sharma, S. Association Mapping of Maturity and Plant Height Using SNP Markers with the Sorghum Mini Core Collection. Theor. Appl. Genet. 2013, 126, 2003–2015. [Google Scholar] [CrossRef] [PubMed]
  68. Wang, Y.-H.; Upadhyaya, H.D.; Burrell, A.M.; Sahraeian, S.M.E.; Klein, R.R.; Klein, P.E. Genetic Structure and Linkage Disequilibrium in a Diverse, Representative Collection of the C4 Model Plant, Sorghum bicolor. G3 Genes|Genomes|Genet. 2013, 3, 783–793. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  69. Bradbury, P.J.; Zhang, Z.; Kroon, D.E.; Casstevens, T.M.; Ramdoss, Y.; Buckler, E.S. TASSEL: Software for Association Mapping of Complex Traits in Diverse Samples. Bioinformatics 2007, 23, 2633–2635. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  70. Adeyanju, A.; Little, C.; Yu, J.; Tesso, T. Genome-Wide Association Study on Resistance to Stalk Rot Diseases in Grain Sorghum. G3 Genes|Genomes|Genet. 2015, 5, 1165–1175. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  71. McCormick, R.F.; Truong, S.K.; Sreedasyam, A.; Jenkins, J.; Shu, S.; Sims, D.; Kennedy, M.; Amirebrahimi, M.; Weers, B.D.; McKinley, B.; et al. The Sorghum bicolor Reference Genome: Improved Assembly, Gene Annotations, a Transcriptome Atlas, and Signatures of Genome Organization. Plant J. 2018, 93, 338–354. [Google Scholar] [CrossRef] [Green Version]
  72. Chang, C.C.; Chow, C.C.; Tellier, L.C.; Vattikuti, S.; Purcell, S.M.; Lee, J.J. Second-Generation PLINK: Rising to the Challenge of Larger and Richer Datasets. Gigascience 2015, 4, 7. [Google Scholar] [CrossRef]
  73. Husson, F.; Josse, J.; Lê, S. FactoMineR: An R Package for Multivariate Analysis. J. Stat. Softw. 2008, 25, 1–18. [Google Scholar]
  74. Kassambara, A.; Mundt, F. Package ‘Factoextra’. Extr. Vis. Results Multivar. Data Anal. 2017, 76, 1–84. [Google Scholar]
  75. Alexander, D.H.; Novembre, J.; Lange, K. Fast Model-Based Estimation of Ancestry in Unrelated Individuals. Genome Res. 2009, 19, 1655–1664. [Google Scholar] [CrossRef] [Green Version]
  76. Zheng, X.; Levine, D.; Shen, J.; Gogarten, S.M.; Laurie, C.; Weir, B.S. A High-Performance Computing Toolset for Relatedness and Principal Component Analysis of SNP Data. Bioinformatics 2012, 28, 3326–3328. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  77. Shin, J.-H.; Blay, S.; McNeney, B.; Graham, J. LDheatmap: An R Function for Graphical Display of Pairwise Linkage Disequilibria Between Single Nucleotide Polymorphisms. J. Stat. Softw. 2006, 16, 1–9. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Histograms for distribution in Senegalese cultivars regarding seed morphologies. (a) Area size (mm2), (b) perimeter (mm), (c) length (mm), (d) width (mm), (e) length-to-width ratio, (f) circularity (0–1 range), (g) distance between IS and CG (mm) and (h) brightness (0–255 range). Box plots above each histogram indicated mean value (diamond), percentiles, upper and lower whisker, and outliers. Median ranges are shown in red.
Figure 1. Histograms for distribution in Senegalese cultivars regarding seed morphologies. (a) Area size (mm2), (b) perimeter (mm), (c) length (mm), (d) width (mm), (e) length-to-width ratio, (f) circularity (0–1 range), (g) distance between IS and CG (mm) and (h) brightness (0–255 range). Box plots above each histogram indicated mean value (diamond), percentiles, upper and lower whisker, and outliers. Median ranges are shown in red.
Plants 12 02344 g001
Figure 2. A comparison of the area sizes for PI514293 vs. PI514404. (a) PI514293 has one of the smallest seeds in area size. (b) PI514404 showed one of the largest seeds in the population. The scale bar indicates 1 cm applied to both (a,b).
Figure 2. A comparison of the area sizes for PI514293 vs. PI514404. (a) PI514293 has one of the smallest seeds in area size. (b) PI514404 showed one of the largest seeds in the population. The scale bar indicates 1 cm applied to both (a,b).
Plants 12 02344 g002
Figure 3. A comparison of the seed colors for PI514471 vs. PI514419. (a) PI514471 has one of the darkest seeds among the screened cultivars. (b) PI514471 showed one of the brightest seeds in the population. The scale bar indicates 1 cm applied to both (a,b).
Figure 3. A comparison of the seed colors for PI514471 vs. PI514419. (a) PI514471 has one of the darkest seeds among the screened cultivars. (b) PI514471 showed one of the brightest seeds in the population. The scale bar indicates 1 cm applied to both (a,b).
Plants 12 02344 g003
Figure 4. Scatter plots visualizing correlations between evaluated traits based on Pearson’s r. Each dot indicates a sorghum accession and red lines indicate correlations between two traits.
Figure 4. Scatter plots visualizing correlations between evaluated traits based on Pearson’s r. Each dot indicates a sorghum accession and red lines indicate correlations between two traits.
Plants 12 02344 g004
Figure 5. The principal component analysis of eight seed morphological parameters was from the Senegalese sorghum germplasms (162 Senegalese and 3 control cultivars). PC1 vs. PC2 are shown. Red arrows indicate directions for the traits on the PCA plot.
Figure 5. The principal component analysis of eight seed morphological parameters was from the Senegalese sorghum germplasms (162 Senegalese and 3 control cultivars). PC1 vs. PC2 are shown. Red arrows indicate directions for the traits on the PCA plot.
Plants 12 02344 g005
Figure 6. The plot of the partial contribution of variables for eight traits. Principal component analysis from Senegalese sorghum cultivars of eight seed morphological parameters. Contributions of each trait toward PC1, PC2, and PC3 are shown.
Figure 6. The plot of the partial contribution of variables for eight traits. Principal component analysis from Senegalese sorghum cultivars of eight seed morphological parameters. Contributions of each trait toward PC1, PC2, and PC3 are shown.
Plants 12 02344 g006
Figure 7. The plots of correlations between head smut spot appearance rate (%) and seed morphology-related traits. (a) Head smut spot appearance rate (%) and circularity showed a moderate negative correlation (Pearson’s correlation = −0.47, p < 0.0001). (b) Head smut spot appearance rate (%) and distance between IS and CG showed a moderate positive correlation (Pearson’s correlation = 0.31, p < 0.0001). Each dot indicates a sorghum accession; blue lines indicate correlations between the two traits.
Figure 7. The plots of correlations between head smut spot appearance rate (%) and seed morphology-related traits. (a) Head smut spot appearance rate (%) and circularity showed a moderate negative correlation (Pearson’s correlation = −0.47, p < 0.0001). (b) Head smut spot appearance rate (%) and distance between IS and CG showed a moderate positive correlation (Pearson’s correlation = 0.31, p < 0.0001). Each dot indicates a sorghum accession; blue lines indicate correlations between the two traits.
Plants 12 02344 g007
Figure 8. The population structure of the Senegalese sorghum panel. (a) k-means clustering by average silhouette method for genotype data of accessions analyzed in this study. The optimal number of k-means clusters was equal to 2, indicative of 2 major genetic groups. (b) An admixture plot displays 2 groups with most accessions falling in just one group. Two different colors indicate two different populations across the accessions.
Figure 8. The population structure of the Senegalese sorghum panel. (a) k-means clustering by average silhouette method for genotype data of accessions analyzed in this study. The optimal number of k-means clusters was equal to 2, indicative of 2 major genetic groups. (b) An admixture plot displays 2 groups with most accessions falling in just one group. Two different colors indicate two different populations across the accessions.
Plants 12 02344 g008
Figure 9. A dendrogram generated from genotype data of the accessions analyzed in this study. There are 2 major genetic groups, indicated by the two main branches, with that majority of the accessions falling in the top branch, which could potentially contain numerous subpopulations.
Figure 9. A dendrogram generated from genotype data of the accessions analyzed in this study. There are 2 major genetic groups, indicated by the two main branches, with that majority of the accessions falling in the top branch, which could potentially contain numerous subpopulations.
Plants 12 02344 g009
Figure 10. The genome-wide association for Senegalese sorghum seed morphology-related traits. Manhattan plots based on univariate and multivariate GWAS display the top candidate SNPs associated with each trait (a) Manhattan plot for area size. (b) Manhattan plot for perimeter (c) Manhattan plot for width. (d) Manhattan plot of PCs based on seven seed size-related traits. (e) Manhattan plot of seed brightness. The line is a cut-off for the Bonferroni threshold ≈ 0.00000017.
Figure 10. The genome-wide association for Senegalese sorghum seed morphology-related traits. Manhattan plots based on univariate and multivariate GWAS display the top candidate SNPs associated with each trait (a) Manhattan plot for area size. (b) Manhattan plot for perimeter (c) Manhattan plot for width. (d) Manhattan plot of PCs based on seven seed size-related traits. (e) Manhattan plot of seed brightness. The line is a cut-off for the Bonferroni threshold ≈ 0.00000017.
Plants 12 02344 g010
Figure 11. LD heatmaps visualize LD of regions around statistically significant SNP loci. (a) Variant S6_12058855 was associated with seed area. (b) Variant S6_39973848 was associated with seed perimeter. The blue stars indicate locations of the SNP loci.
Figure 11. LD heatmaps visualize LD of regions around statistically significant SNP loci. (a) Variant S6_12058855 was associated with seed area. (b) Variant S6_39973848 was associated with seed perimeter. The blue stars indicate locations of the SNP loci.
Plants 12 02344 g011
Table 1. The top five cultivars for each seed morphology are shown.
Table 1. The top five cultivars for each seed morphology are shown.
Largest Area Size (mm2)Smallest Area Size (mm2)
AccessionMean ± S.D.AccessionMean ± S.D.
1. PI51440419.26 ± 3.411. PI5142936.67 ± 0.86
2. PI51428717.03 ± 2.262. PI5143086.94 ± 1.1
3. PI51436716.10 ± 2.033. PI5143107.11 ± 0.87
4. PI51446415.88 ± 2.714. PI5143887.56 ± 1.05
5. PI51445815.84 ± 1.765. PI5144747.71 ± 0.82
Longest perimeter (mm)Shortest perimeter (mm)
1. PI51440417.09 ± 1.631. PI51438810.78 ± 0.76
2. PI51428716.23 ± 1.342. PI51429310.79 ± 1.01
3. PI51436715.73 ± 1.133. PI51447410.92 ± 0.61
4. PI51428815.7 ± 1.244. PI51430810.99 ± 1.21
5. PI51428315.65 ± 1.535. PI51444911.07 ± 0.68
Longest length (mm)Shortest length (mm)
1. PI5144045.29 ± 0.461. PI5143083.67 ± 0.39
2. PI5144295.11 ± 0.32. PI5143103.72 ± 0.26
3. PI5144645.09 ± 0.573. PI5144343.74 ± 0.22
4. PI5144585.09 ± 0.344. PI5142933.76 ± 0.28
5. PI5144785.08 ± 0.315. PI5143883.78 ± 0.27
Longest width (mm)Shortest width (mm)
1. PI5144044.9 ± 0.481. PI5142932.49 ± 0.19
2. PI5142874.67 ± 0.352. PI5143082.62 ± 0.3
3. PI5143674.51 ± 0.33. PI5143102.67 ± 0.19
4. PI5142884.49 ± 0.374. PI5143882.72 ± 0.2
5. PI5144584.27 ± 0.265. PI5143252.74 ± 0.23
Highest LWRLowest LWR
1. PI5142931.51 ± 0.11. PI5142881.07 ± 0.04
2. PI5143251.48 ± 0.082. PI5142831.08 ± 0.04
3. PI5144711.47 ± 0.083. PI5142851.08 ± 0.05
4. PI5144621.45 ± 0.094. PI5144041.08 ± 0.04
5. PI5144461.43 ± 0.065. PI5143671.09 ± 0.06
Highest circularity (0–1 scale)Lowest circularity (0–1 scale)
1. PI5144250.83 ± 0.011. PI5142820.67 ± 0.08
2. PI5144400.83 ± 0.022. PI5142920.67 ± 0.07
3. PI5144200.83 ± 0.023. PI5142940.68 ± 0.06
4. PI5144340.83 ± 0.024. PI5142970.69 ± 0.07
5. PI5144090.83 ± 0.025. PI5142960.69 ± 0.07
The longest distance between IS and CG (mm)Shortest distance between IS and CG (mm)
1. PI5142880.33 ± 0.19PI5143880.19 ± 0.11
2. PI5142940.32 ± 0.17PI5143980.19 ± 0.13
3. PI5142920.32 ± 0.21PI5144460.2 ± 0.11
4. PI5142830.32 ± 0.17PI5144380.2 ± 0.12
5. PI5143410.32 ± 0.17PI5144490.2 ± 0.12
Brightest (0–255 scale)Darkest (0–255 scale)
1. PI514419231.38 ± 19.431. PI514471107.55 ± 10.76
2. PI514334228.78 ± 15.282. PI514323119.34 ± 20.39
3. PI514446228.05 ± 18.193. PI514462120.22 ± 12.47
4. PI514361223.18 ± 15.754. PI514293123.81 ± 20.84
5. PI514458222.77 ± 20.395. PI514466126.28 ± 15.52
Table 2. Detailed correlations among eight seed morphology-related traits. *** = p < 0.0001, ** = p < 0.001 and * = p < 0.01.
Table 2. Detailed correlations among eight seed morphology-related traits. *** = p < 0.0001, ** = p < 0.001 and * = p < 0.01.
Area SizePerimeterLengthWidthLWRCircularityIS and CGBrightness
Area size (mm2)1 ***0.95 ***0.93 ***0.97 ***−0.67 ***0.33 ***0.36 ***0.09
Perimeter (mm)0.95 ***1 ***0.91 ***0.95 ***−0.66 ***0.030.53 ***0.08
Length (mm)0.93 ***0.91 ***1 ***0.83 ***−0.39 ***0.27 **0.36 ***0.10
Width (mm)0.97 ***0.95 ***0.83 ***1 ***−0.83 ***0.23 *0.41 ***0.07
LWR−0.67 ***−0.66 ***−0.39 ***−0.83 ***1 ***−0.13−0.30 ***−0.07
Circularity0.33 ***0.030.27 **0.23 *−0.131 ***−0.45 ***0.06
IS and CG0.36 ***0.53 ***0.36 ***0.41 ***−0.30 ***−0.45 ***1 ***−0.01
Brightness0.090.080.100.07−0.070.06−0.011 ***
Table 3. Annotated genes nearest to the most significant SNPs associated with GWAS regarding seed morphologies. SNPs that passed the Bonferroni threshold (0.00000017) were retested with a t-test for two prevalent bases, filtering out possible false positive SNPs. Multivariate GWAS based on PCs generated more than 50 SNPs passing Bonferroni correction (before filtering out with t-test), and the table shows only the top ten SNPs. Average score/SNP was not calculated for SNPs generated by PCs-based GWAS.
Table 3. Annotated genes nearest to the most significant SNPs associated with GWAS regarding seed morphologies. SNPs that passed the Bonferroni threshold (0.00000017) were retested with a t-test for two prevalent bases, filtering out possible false positive SNPs. Multivariate GWAS based on PCs generated more than 50 SNPs passing Bonferroni correction (before filtering out with t-test), and the table shows only the top ten SNPs. Average score/SNP was not calculated for SNPs generated by PCs-based GWAS.
TraitChrLocationCandidate Gene and FunctionDistance (bp)SNP Base %TASSEL
p-Value
Mean Value/SNP
Area612,058,855 and one more SNP within 3 bpSobic.006G036400
Similar to OSJNBb0043H09.3 protein
Ring zinc finger
136,397A: 84%
G: 16%
0.000000028A: 12.65
G: 10.05
Perimeter639,973,848Sobic.006G056100
Protein of unknown function (DUF3,537)
0A: 85.2%
G: 14.8%
0.000000074A: 14.11
G: 12.44
Perimeter236,673,863Sobic.002G147466
Ribonucleotide reductase-related
0C: 92.9%
G: 7.1%
0.000000013C: 13.76
G: 14.7
Perimeter172,472,059Sobic.001G447400
Similar to homeobox-leucine zipper protein HAT22
2530A: 16.3%
C: 83.7%
0.000000033A: 12.06
C: 14.14
Perimeter246,916,695Sobic.002G154750
Tetratricopeptide repeat-containing protein (TPR)
Associated PlantFAMs-
Ankyrin-1
6908G: 16%
T: 84%
0.00000004G: 12.59
T: 14.09
Perimeter651,902,835 and one more within 26 bpSobic.006G161000
Calcium/calmodulin-dependent protein kinase kinase (CAMKK)
2571A: 15.2%
G: 84.8%
0.000000043A: 12.05
G: 14.13
Perimeter246,926,845Sobic.002G154800
Similar to glycosyl transferase family 1 protein-like
Mannosylfructose-phosphate synthase/MFPS
0C: 16.6%
G: 83.4%
0.000000063C: 12.38
G: 14.09
Perimeter716,244,875Sobic.007G093100
No annotation
Associated PlantFAMs-zinc-finger of the FCS-type
37,632C: 83.2%
T: 16.8%
0.000000077C: 14.13
T: 12.06
Perimeter467,200,818Sobic.004G340100
PB1 domain
2091A: 15.9%
G: 84.1%
0.00000011A: 12.03
G: 14.13
Perimeter236,482,136Sobic.002G147433
Zinc finger
137,836A: 6.4%
G: 93.6%
0.00000012A: 14.71
G: 13.74
Perimeter236,533,975Sobic.002G147466
Ribonucleotide reductase-related
139,787C: 93.5%
T: 6.5%
0.00000015C: 13.74
T: 14.71
Width612,058,855 and one more SNP within 3 bpSobic.006G036400
Similar to OSJNBb0043H09.3 protein
Ring zinc finger
136,397A: 84%
G: 16%
0.000000011A: 3.67
G: 3.2
Width637,776,989Sobic.006G050400
Protein of unknown function (DUF493)
0A: 11.3%
G: 88.7%
0.00000002A: 2.99
G: 3.64
Width236,673,863Sobic.002G147466
Ribonucleotide reductase-related
0C: 92.9%
G: 7.1%
0.000000068C: 3.56
G: 3.94
Width716,244,901 and one more SNP within 26 bpSobic.007G093100
No annotation
Associated PlantFAMs- zinc-finger of the FCS-type
37,606A: 16.8%
T: 83.2%
0.00000011A: 3.05
T: 3.68
Width152,374,442Sobic.001G271500
Leucine-rich repeat-containing protein
22,206G: 90.6%
T: 9.4%
0.00000014G: 3.63
T: 2.99
PCs35,165,375Sobic.003G058600
Cytochrome P450 CYP2 subfamily
0A: 7.5%
G: 92.5%
0.00000000044
PCs658,756,099Sobic.006G248000
Farnesyl-pyrophosphate synthetase
0C: 87.9%
T: 12.1%
0.0000000017
PCs462,452,658 and one more SNP within 11 bpSobic.004G282700
Similar to DTDP-D-glucose 4,6-dehydratase-like
2043G: 92.5
T: 7.5%
0.0000000044
PCs68,316,056Sobic.006G033200
Similar to Probable protein transport Sec1a
Syntaxin-binding protein 1 (STXBP1, MUNC18-1)
266,373G: 25.3%
T: 74.7%
0.0000000054
PCs654,692,844Sobic.006G193600
Zinc-finger of the FCS-type, C2-C2 (zf-FLZ)
214C: 94.2%
G: 5.8%
0.0000000071
PCs654,708,681 and one more SNP within 11 bpSobic.006G193700
Zinc-finger of the FCS-type, C2-C2 (zf-FLZ)
0G: 94.2%
T: 5.8%
0.0000000071
PCs35,151,221Sobic.003G058500
Golgi to er traffic protein 4 homolog
0C: 92.5%
T: 7.5%
0.0000000075
PCs462,432,641Sobic.004G282400
Rho guanosine triphosphatases (GTPase) activating protein 2
0C: 93.1
T: 6.9%
0.0000000095
PCs462,438,345Sobic.004G282500
Pentatricopeptide (PPR) repeat-containing protein-like
11C: 93.1
T: 6.9%
0.0000000095
PCs351,995,650Sobic.003G195900
Stomatal cytokinesis defective
Associated PlantFAMs-
WD40 repeat-containing protein
0A: 5.7%
G: 94.3
0.000000012
Brightness1015,656,073Sobic.010G126000
WD-40 repeat protein
17,962C: 92.5%
T: 7.5%
0.0000000093C: 198.62
T: 156.23
Brightness153,382,065Sobic.001G275300
ATP-dependent DNA helicase
11,647G: 7.7%
T: 92.3%
0.000000019G: 186.61
T: 198.24
Brightness1053,260,179Sobic.010G191301
No annotation
Highly expressed in panicle-upper anthesis
7571C: 7%
G: 93%
0.000000035C: 177.47
G: 197.87
Brightness12,657,706Sobic.001G035200
Amyloid beta precursor protein-binding protein 1 (APPBP1)
0A: 92.4%
G: 7.6%
0.000000058A: 198.38
G: 156.23
Brightness166,050,539Sobic.001G372400
No annotation
Associated PlantFAMs-
Ribonuclease III
806C: 94.3%
T: 5.7%
0.00000015C: 198.3
T: 155.85
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ahn, E.; Botkin, J.; Ellur, V.; Lee, Y.; Poudel, K.; Prom, L.K.; Magill, C. Genome-Wide Association Study of Seed Morphology Traits in Senegalese Sorghum Cultivars. Plants 2023, 12, 2344. https://doi.org/10.3390/plants12122344

AMA Style

Ahn E, Botkin J, Ellur V, Lee Y, Poudel K, Prom LK, Magill C. Genome-Wide Association Study of Seed Morphology Traits in Senegalese Sorghum Cultivars. Plants. 2023; 12(12):2344. https://doi.org/10.3390/plants12122344

Chicago/Turabian Style

Ahn, Ezekiel, Jacob Botkin, Vishnutej Ellur, Yoonjung Lee, Kabita Poudel, Louis K. Prom, and Clint Magill. 2023. "Genome-Wide Association Study of Seed Morphology Traits in Senegalese Sorghum Cultivars" Plants 12, no. 12: 2344. https://doi.org/10.3390/plants12122344

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop