Genome-Wide Association Study and Genomic Prediction on Plant Architecture Traits in Sweet Corn and Waxy Corn

Dang, Dongdong; Guan, Yuan; Zheng, Hongjian; Zhang, Xuecai; Zhang, Ao; Wang, Hui; Ruan, Yanye; Qin, Li

doi:10.3390/plants12020303

by Dongdong Dang ¹,²,³, Yuan Guan ², Hongjian Zheng ², Xuecai Zhang ³, Ao Zhang ¹, Hui Wang ²,*, Yanye Ruan ¹,* and Li Qin ¹,*

¹ Shenyang City Key Laboratory of Maize Genomic Selection Breeding, College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang 110866, China

² CIMMYT-China Specialty Maize Research Center, Crop Breeding and Cultivation Research Institute, Shanghai Academy of Agricultural Sciences, Shanghai 201403, China

³ International Maize and Wheat Improvement Center (CIMMYT), El Batan, Texcoco 56237, Mexico

* Authors to whom correspondence should be addressed.

Plants 2023, 12(2), 303; https://doi.org/10.3390/plants12020303

Submission received: 5 December 2022 / Revised: 1 January 2023 / Accepted: 3 January 2023 / Published: 9 January 2023

(This article belongs to the Topic Advanced Breeding Technology for Plants)

Abstract:

Sweet corn and waxy corn have a better taste and higher nutritional value than regular maize, and are widely planted and consumed throughout the world. Plant height (PH), ear height (EH), and tassel branch number (TBN) are key plant architecture traits that play an important role in improving grain yield in maize. In this study, a genome-wide association study (GWAS) and genomic prediction analysis were conducted on PH, EH, and TBN in a fresh edible maize population consisting of 190 sweet corn inbred lines and 287 waxy corn inbred lines. Phenotypic data from two locations showed high heritability for all three traits, with significant differences between sweet corn and waxy corn for both PH and EH. The three subgroups of sweet corn did not differ clearly for any of the three traits. Population structure analysis and principal component analysis (PCA) divided the whole population into three subgroups, i.e., sweet corn, waxy corn, and a subgroup mixing sweet and waxy corn. The GWAS was conducted with 278,592 SNPs obtained from resequencing data; 184, 45, and 68 significantly associated SNPs were detected for PH, EH, and TBN, respectively. The phenotypic variance explained (PVE) by these significant SNPs ranged from 3.5% to 7.0%. The results of this study lay the foundation for further understanding the genetic basis of plant architecture traits in sweet corn and waxy corn. Genomic selection (GS) is a new approach that uses whole-genome molecular markers to improve quantitative traits in large plant breeding populations. Marker number and marker quality are essential for the application of GS in maize breeding. GWAS can identify the markers most strongly associated with the traits, so it can be used to improve the prediction accuracy of GS.

Keywords:

genome-wide association study; genomic prediction; plant height; ear height; tassel branch number; sweet corn; waxy corn

1. Introduction

Maize (Zea mays L.) is the most important food, feed, and economic energy crop in the world. Its production safety plays an extremely important role in ensuring national grain production, promoting the development of animal husbandry, and improving people’s quality of life [1,2]. Sweet corn and waxy corn, new types of fresh edible maize, have been widely planted. They can replace vegetables or fruits because they taste sweet and juicy and have high nutritional value. Their content of vitamins, proteins, lysine, sugar, and fat is much higher than that of regular maize [3]. Sweet corn, which derives from mutations in the genes regulating the conversion of sugar to starch in the endosperm of the kernel, has a favorable flavor and is planted worldwide [4]. Waxy corn, a type of maize whose starch consists only of amylopectin, has been extensively planted in China and many other countries [5]. Molecular markers can help to characterize the genetic diversity of existing sweet corn and waxy corn germplasm resources, and gene mapping of plant architecture traits can reveal their genetic basis. Both will help improve the breeding efficiency of sweet corn and waxy corn and further promote the research and development of fresh edible maize varieties.

Ideal plant architecture is critical for increasing plant density. The key components of ideal plant architecture in maize include plant and ear height, leaf angle, ear architecture, root architecture, and tassel architecture. If PH and EH are too high, planting density, lodging resistance, and harvest index are reduced [6]; if they are too low, field permeability is impaired, the infection rate of diseases and pests increases, and biological yield decreases [7]. Tassel traits are an important factor affecting yield formation. Overly developed or stunted tassels reduce maize yield through excessive energy consumption, shading, or insufficient pollen supply [8,9]. Considering continuous population growth, environmental deterioration, and the decrease in arable land, moderately increasing planting density is the simplest and most effective way to achieve high grain yields. However, higher planting density promotes mutual shading among neighboring plants and limits the interception and utilization of light energy by individual plants. Improving plant architecture traits during the breeding of new varieties, with the help of biotechnology, can be used to increase grain yield. Therefore, the genetic basis for breeding high-yield hybrids needs to be clarified [10]. The PH, EH, and TBN of sweet corn and waxy corn differ from those of regular maize. With the development of molecular marker technology and gene mapping methods, studying these traits by genetic mapping can improve the understanding of their genetic basis, support the development of molecular markers, and improve breeding efficiency.

Genome-wide association studies (GWAS) are powerful tools for gene mapping in plants and animals and have been widely used for the genetic analysis of complex quantitative traits in many important crops. In recent years, many researchers have used GWAS to study the loci that control various traits in maize, such as PH and EH [11], yield [12], disease resistance [13], and grain dehydration [14]. Yin et al. [15] genotyped a nested association mapping (NAM) population by genotyping-by-sequencing and obtained 264,694 SNPs. A total of 105 SNPs and 22 QTLs significantly associated with PH and EH were identified by GWAS. On chromosome 1, GWAS identified a high-confidence QTL, QTL-chr1-ep, which was then subjected to linkage analysis in two recombinant inbred line (RIL) populations. Wu et al. [16] combined genome-wide association analysis and linkage analysis to map inflorescence size, measured as tassel branch number (TBN) and tassel length (TL). A total of 125 QTLs were identified by linkage analysis (63 for TBN and 62 for TL). In addition, 965 quantitative trait nucleotides (QTNs) were identified by GWAS. These QTLs/QTNs contain 24 known genes cloned from mutants. In genetic research on maize traits, it is generally accepted that PH is jointly controlled by major genes and minor polygenes and that its genetic basis is relatively complex, as is typical of a quantitative trait [17]. Therefore, studying traits related to maize plant architecture can not only improve the spatial distribution of maize plants and promote maize growth, but also support breeding for ideal plant architecture and molecular marker-assisted selection (MAS). However, virtually no research has been conducted on the plant architecture of sweet corn and waxy corn.

Genomic prediction is a method that uses markers to predict the genetic value of complex traits in offspring for selection and breeding [18]. When genomic prediction is used for selection, it is called genomic selection (GS). GS is a modified form of marker-assisted selection (MAS) in which markers from the whole genome are used to estimate the genomic-estimated breeding value (GEBV). A few studies have been conducted to dissect the genetic architecture of plant architecture in maize. In maize, GS has been investigated for improving several major plant architecture traits, e.g., seedling root traits [19], stalk strength [20], root traits [21], plant height [22], and husk traits [23]. There is no report on the study of plant architecture traits of sweet corn and waxy corn by genomic selection.

In this study, an association mapping panel of 477 sweet corn and waxy corn inbred lines was used to perform GWAS and dissect the genetic basis of the plant architecture traits PH, EH, and TBN. The main objectives of the present study were to (1) analyze the genetic diversity of elite Chinese sweet maize and waxy maize inbred lines; (2) use GWAS to locate the significant SNPs and quantitative trait loci controlling the three traits, identify candidate genes related to plant growth from the GWAS results, and annotate the functions of these candidate genes; and (3) estimate the prediction accuracy of genomic selection. The candidate genes and variant sites that control PH, EH, and TBN were mined, and the genetic evolution rules of key loci were analyzed. The results provide theoretical guidance for developing new germplasm resources and improving varieties more effectively.

2. Results

2.1. Phenotypic Data Analysis Results

The results of the phenotypic data analysis for the target traits PH, EH, and TBN are shown in Table 1. Broad variation was observed for all three traits in sweet corn and waxy corn. The coefficients of variation (CV) of PH, EH, and TBN ranged from 0.17 to 0.23, from 0.33 to 0.36, and from 0.37 to 0.45, respectively. PH ranged from 63 to 254 cm, EH ranged from 10 to 134 cm, and TBN ranged from 1 to 26; the absolute values of skewness and kurtosis of PH, EH, and TBN were less than 1, indicating a small degree of bias. The frequency distributions of PH, EH, and TBN were approximately normal (Figure 1). The heritability of all traits was high, exceeding 0.96 in the single-environment analyses. The heritabilities of PH, EH, and TBN in the multi-environment analyses were 0.75, 0.79, and 0.72, respectively. Both the genotype and the genotype × environment interaction variances were highly significant (p ≤ 0.001) (Table 1).

Significant differences between sweet corn and waxy corn were observed for both PH and EH. Waxy corn had higher mean PH and EH than sweet corn (Figure 2A). Among the three subgroups of sweet corn, the three plant architecture traits did not differ significantly (Figure 2B).

The correlations between environments for the same trait and the correlations between PH and EH are shown in Figure 3A,B. The correlation coefficients between the two environments for PH, EH, and TBN were 0.59, 0.64, and 0.57, respectively. The correlation coefficients of the BLUE values for the same trait between a single environment and multiple environments were high, i.e., greater than 0.80. The correlation coefficient between PH and EH was 0.75 for the BLUE values estimated from multiple environments, and 0.65 and 0.82 in the single-environment analyses of 2019 and 2020, respectively. The correlations between TBN and the other two traits were not estimated.

2.2. Results of SNP Characterization, LD Decay Distance, and Population Structure

The heat map representing the marker density on the ten maize chromosomes is shown in Figure 4A. There were 38,013, 32,224, 30,423, 35,688, 28,335, 22,698, 25,987, 24,306, 20,202 and 20,716 SNPs on chromosomes 1 to 10, respectively. Chromosome 1 had the most markers and chromosome 9 had the fewest. The corresponding marker densities were 123.24, 132.24, 127.82, 142.56, 125.18, 125.16, 139.86, 133.255, 123.94 and 135.90 SNPs per Mb, respectively. The markers were evenly distributed. In the filtered SNP dataset, the average missing rate across the SNPs was 0.12 and the average MAF was 0.16, which was suitable for the subsequent genome-wide association study (Figure 4B,C). We used the 278,592 SNPs to evaluate linkage disequilibrium (LD) decay in this association population; the LD decay distance was 50 kb at r² = 0.2 (Figure 4D). LD decay was slow, which is consistent with a high degree of domestication and strong selection intensity having reduced genetic diversity.

The results of the population structure analysis are shown in Figure 5. In general, the results of the population structure, PCA, and genetic distance or kinship analyses were consistent, and this core collection of waxy and sweet inbred lines could be divided into two or three major groups according to their pedigrees or genetic backgrounds. The curve flattened at K = 3, indicating that it was feasible to divide the population into three subgroups (Figure 5A,B). Subgroups 1, 2, and 3 contained 247, 164, and 66 lines, respectively. The principal component analysis also revealed three subgroups; the first two principal components explained most of the variance (Figure 5C) and corresponded to the three subgroups identified by the structure analysis (Figure 5D): a sweet corn subgroup, a waxy corn subgroup, and a sweet–waxy corn mixed subgroup.

2.3. Results of GWAS for Plant Architecture Traits

The GWAS was performed by combining the BLUE values of PH, EH, and TBN estimated across environments, the 278,592 high-quality SNPs, the first three principal components (PCs), and the kinship matrix. A GWAS based on a mixed linear model (MLM) was used, in which both kinship (K) and population structure were taken into account to avoid spurious associations. Q–Q plots showed that population structure was well controlled. An MLM can reduce the number of false positive markers, but it can also produce false negatives, i.e., some truly associated markers may not be identified.

In total, 184 SNPs significantly (p < 1 × 10⁻⁴) associated with PH were identified, spread across all 10 chromosomes (Figure 6). The phenotypic variance explained (PVE) by the significant SNPs ranged from 3.5% to 6.4%, with an average of 4.7%. Across locations, the largest number of significant SNPs was identified on chromosome 7 (85 SNPs) and the smallest number on chromosome 8 (6 SNPs). The p-values of the significantly associated SNPs ranged from 8.8 × 10⁻⁷ to 9.77 × 10⁻⁵. The most significant SNP, i.e., the one with the lowest p-value, was S7_121735865 on chromosome 7.

In total, 45 SNPs significantly (p < 1 × 10⁻⁴) associated with EH were identified, located on chromosomes 1, 2, 3, 4, 5, 6, 7, 9, and 10 (Figure 7). The PVE of these significantly associated SNPs ranged from 3.5% to 5.8%, with an average of 4.4%. The largest number of significant SNPs was identified on chromosome 5 (eight SNPs) and the smallest number on chromosome 10 (one SNP). The p-values of these significantly associated SNPs ranged from 2.94 × 10⁻⁶ to 9.11 × 10⁻⁵. The most significantly associated SNP was S6_34755019 on chromosome 6. Among the 45 SNPs significantly associated with EH, two were also significantly associated with PH, indicating pleiotropic effects on both PH and EH. The co-mapping of different traits to the same loci suggests that the genes controlling maize PH and EH are pleiotropic.

In total, 68 SNPs significantly (p < 1 × 10⁻⁴) associated with TBN were detected, located on chromosomes 1, 2, 3, 4, 5, 6, 7, 9, and 10 (Figure 8). The PVE of these significant SNPs ranged from 3.7% to 7.0%, with an average of 5.0%. The largest number of significant SNPs was identified on chromosome 1 (25 SNPs) and the smallest number on chromosome 5 (one SNP). The p-values of the significantly associated SNPs ranged from 4.11 × 10⁻⁷ to 9.99 × 10⁻⁵. The most significantly associated SNP was S4_184008951 on chromosome 4. No SNP had a PVE exceeding 10%, indicating that PH, EH, and TBN are jointly controlled by multiple minor-effect genes.

2.4. Candidate Genes Revealed by GWAS

Using B73 RefGen_v4 as the reference genome, 483 candidate genes were identified within the 50 kb regions upstream or downstream of the significant SNPs associated with the three plant architecture traits. Table 2 lists the candidate genes that have a functional annotation on the NCBI website and are related to maize growth and development. The most promising candidate genes for PH, EH, and TBN in this experiment were selected based on the expression levels of the candidate genes during plant growth and development and on their functional annotations on the NCBI website. The candidate genes were grouped into the following functions: photosynthesis, metabolism, plant hormones, cellular transport, transcriptional regulation, structural proteins, and cell division. These genes can directly or indirectly regulate the growth and development of maize plants. The details of all candidate genes associated with potential SNPs and their functional annotations are presented in Table S1.
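The 50 kb window search described above can be sketched in Python as follows. This is only an illustrative sketch, not the authors' pipeline: the gene records are hypothetical placeholders rather than real B73 RefGen_v4 annotations, and the code assumes that an SNP name such as S7_121735865 encodes chromosome 7 and position 121,735,865.

```python
def genes_near_snp(snp_chrom, snp_pos, genes, window=50_000):
    """Return names of genes overlapping [snp_pos - window, snp_pos + window]."""
    lo, hi = snp_pos - window, snp_pos + window
    return [
        name
        for name, chrom, start, end in genes
        if chrom == snp_chrom and start <= hi and end >= lo
    ]


# Hypothetical gene models (name, chromosome, start, end); not real annotations.
genes = [
    ("geneA", 7, 121_700_000, 121_705_000),  # inside the window
    ("geneB", 7, 121_790_000, 121_795_000),  # just beyond the window
    ("geneC", 6, 121_730_000, 121_736_000),  # different chromosome
]

# Assumed reading of the SNP name S7_121735865: chromosome 7, position 121735865.
hits = genes_near_snp(7, 121_735_865, genes)
print(hits)
```

A gene is counted as a candidate if any part of it overlaps the window; a stricter rule (for example, requiring the gene start to fall inside the window) would be an equally plausible reading of the text.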

2.5. Estimation of Genomic Prediction Accuracies

For all three traits, PH, EH, and TBN, the prediction accuracy increased rapidly when the number of markers increased from 0 to 500, and then increased only slightly as the number of markers kept increasing. The differences in prediction accuracy obtained with 3000, 5000, and 10,000 markers were not obvious. Adding markers significantly associated with each target trait effectively improved prediction accuracy (Figure 9A).

Prediction accuracy gradually improved as the training population size increased. When the training population was 10% of the total panel, the prediction accuracy of PH was 0.51; when it was 80% of the total panel, the prediction accuracy of PH was centered around 0.61. For EH, the prediction accuracy was centered around 0.62 when the training population was 10% of the total panel, and it also increased with the proportion of lines used for training. For TBN, the prediction accuracy was centered around 0.16 when the training population was 10% of the total panel and around 0.48 when it was 90%. Comparing the training population sizes showed that prediction accuracy rose markedly as the training population increased from 10% to 30% of the total panel, whereas the trend was nearly flat as it increased from 40% to 80%. The prediction accuracy of PH decreased at 90% (Figure 9B).

3. Discussion

In the present study, inbred lines representing the core collection of sweet and waxy corn germplasm in China were used to conduct GWAS and genomic prediction (GP) analyses on three plant architecture traits, i.e., PH, EH, and TBN. In the association mapping panel, PH, EH, and TBN exhibited extensive phenotypic variation and followed a normal distribution. Heritability was at a moderately high level; ANOVA for PH, EH, and TBN showed that the effects of genotype (G) and G × E interaction were significant, indicating that these three traits were mainly influenced by genetic effects (Table 1). The GWAS results showed that the PH, EH, and TBN of maize are typically controlled by multiple genes.

In the population structure analysis, although the value at K = 9 was the lowest, its decline slowed markedly at K = 3. Together with the kinship heat map and the PCA in this study, this indicates that the association population should be divided into three subgroups: sweet corn, waxy corn, and sweet–waxy corn (Figure 5). Different populations of the same population type can also differ greatly in LD decay rate because of their different genetic backgrounds. Domestication selection can decrease population genetic diversity and strengthen the linkage between loci. Therefore, in general, the higher the degree of domestication and the greater the selection intensity of a population, the slower the LD decay rate. Similarly, a decline in population genetic diversity caused by natural selection and genetic drift also slows LD decay [24]. The LD decay distance in this study was larger than that reported in other studies. In tropical maize, the average LD decay distance across all 10 chromosomes was 8.14 kb [25]. In subtropical maize, the average LD decay distance across all chromosomes was about 5–10 kb at r² = 0.2 [26]. The smaller the value, the greater the genetic diversity and the greater the genetic relationship between the populations. The LD decay rate in this study was similar to that in other sweet corn studies, in which the mean LD decay distance was 76 kb at a cut-off of r² = 0.2 [27].

In the correlation analysis of phenotypic traits, we found a significant correlation between PH and EH. Many previous studies have also confirmed that PH and EH are related [28]. In addition, the GWAS of the three traits found that EH and PH shared two SNPs, S3_219824021 and S5_37693709. Therefore, further study of the candidate genes at these loci will help to analyze the genetic mechanism of PH and EH in fresh edible maize. Previous research has used QTL mapping and GWAS to study the genetic architecture of PH and EH, but because of differences in population type and size, marker type and density, and the statistical methods used by each research group, the QTLs identified differed considerably, and it is difficult for a single study to reveal the genetic architecture of maize PH and EH. Previous genome-wide association studies of PH and EH were mainly carried out on common maize. This study used an association panel of fresh edible maize and compared the significant SNPs identified here with the regions mapped in previous studies. The SNPs for EH identified in this study, S5_101186696, S5_101191399, S5_101191576, S5_101416556, S5_101420833, S5_110982180, and S6_117338012, were located in bins 5.04/05; the SNPs for PH identified in this study, S6_109254482 and S6_113842238, were located in bins 6.04/05. These two regions are consistent with the “stable QTL” for PH and EH jointly mapped by Li using an F2:3 population and an RIL population [29]. The SNPs for TBN identified in this study, S6_157380718, S6_157381716, and S6_157391371, were located in the previously identified QTL and SNP region of bins 6.06–6.08, indicating that this region may be important for regulating maize TBN [30,31,32,33]. The results of this study deepen the understanding of the genetic basis of plant architecture traits in sweet corn and waxy corn and contribute to improving breeding efficiency and breeding new varieties.

Previous studies have cloned some genes related to TBN; for example, mutations in ramosa1 [34], ramosa2, and ramosa3 [35] increase TBN. Double mutants of the redundant SBP-box transcription factor genes unbranched2 and unbranched3 exhibit a reduced number of tassel branches and an increased number of ear rows [36]. The ramosa1 gene encodes a putative transcription factor that controls branching architecture in the maize tassel and ear. The candidate gene Zm00001d020430 mapped for TBN in this study encodes ra1 [37]. The cytochrome P450 (CYP) family plays a key role in plant evolution and metabolic diversification [38]. The genes Zm00001d017528, Zm00001d007924, and Zm00001d044120 encode cytochrome P450 superfamily proteins, which may regulate plant growth and development and affect plant phenotype through the regulation of metabolites. Zinc-finger proteins (ZFPs) are among the most important transcription factors in eukaryotes [39,40]. They play an important role in plant gene expression and regulation, growth, and senescence [41,42]. The candidate genes Zm00001d022427, Zm00001d010380, Zm00001d047539, Zm00001d034639, Zm00001d034642, Zm00001d007121, Zm00001d038926, Zm00001d027312, Zm00001d040302, and Zm00001d01801 in this experiment encode RING zinc finger domain superfamily proteins and zinc finger CCHC domain proteins, which may regulate the growth and development of plants. The genes Zm00001d022437, Zm00001d044162, Zm00001d023332, Zm00001d023336, and Zm00001d038451 encode WRKY gene family proteins. WRKY proteins are widely involved in regulating rice growth and development through growth regulator-mediated signaling pathways. The genes Zm00001d022442 and Zm00001d03169 encode plant basic leucine zipper (bZIP) transcription factor proteins [43]. Glycosyl-phosphatidyl inositol (GPI)-anchored proteins are associated with a variety of growth and developmental mechanisms [44]. The gene Zm00001d038682 encodes a GPI-anchored protein [45]. These candidate genes may play important roles in plant growth and inflorescence development, but their biological functions require further study. With the development of high-throughput sequencing technology and various gene editing technologies, direct selection of genotypes for crop phenotype improvement has become a reality. This study revealed candidate genes and possible molecular mechanisms regulating PH, EH, and TBN, providing important insights and genetic resources for the efficient breeding of maize with genetically improved PH, EH, and TBN.

Genomic selection is more accurate, especially for early selection. It uses high-density molecular markers to estimate all QTL effects and explain the genetic variation of most traits, whereas MAS uses fewer markers for trait selection; genomic selection is therefore more accurate than MAS. A previous study showed that GWAS-derived markers improved the prediction accuracy of GS [46]. This is consistent with the results of the present study, in which the prediction accuracy increased as significant markers were added, with the rate of increase gradually declining.

Genomic prediction and GS have been successfully applied to a variety of crops to accelerate genetic gain and improve complex traits in breeding programs [47,48]. The prediction accuracy increased with the training population size (TPS): when the TPS increased from 10% to 30%, the prediction accuracy increased rapidly, and when the TPS was increased further, the prediction accuracy increased only slightly. When 80% of the total genotypes were used as the training set, the prediction accuracy was higher and the standard error was smaller. Noman et al. showed that when the training population is small, the prediction accuracy increases as the training population grows [49]. However, beyond a certain point, the growth rate of prediction accuracy becomes very low, and breeders can decide on an acceptable prediction accuracy based on the actual situation.

4. Materials and Methods

4.1. Plant Material

This study utilized an association mapping panel composed of 477 fresh edible maize inbred lines, of which 190 were sweet corn inbred lines and 287 were waxy corn inbred lines, collected or developed by the Shanghai Academy of Agricultural Sciences, China. This panel represents a core collection of sweet corn and waxy corn germplasm in China and includes most of the parents of recently released waxy corn and sweet corn varieties. The 190 sweet corn inbred lines could be divided into three subgroups, i.e., enhanced sweet corn, super sweet corn, and ordinary sweet corn, according to the sweetness regulatory genes sugary-1 (su1), shrunken-2 (sh2), and sugary enhancer (se).

4.2. Phenotyping and Experimental Design

We evaluated 477 sweet corn and waxy corn inbred lines and measured three plant architecture traits: PH, EH, and TBN. The association panel of fresh edible maize was planted at Zhuanghang Experimental Station (N 30°53′, E 121°23′) of the Crop Research Institute of Shanghai Academy of Agricultural Sciences in Shanghai, and at the winter season breeding station (N 18°51′, E 110°03′) in Lingshui County, Hainan Province, in 2019 and 2020. The phenotypic data of PH and EH were collected in the summer of 2019 and 2020 from the trials planted in Shanghai, and the phenotypic data of TBN were collected from the trials planted in Hainan in the summer of 2020 and in Shanghai in the winter of 2020. Each plot was a single row 2.5 m in length, with 0.6 m between plots and 0.25 m between plants, at a planting density of 52,500 plants ha⁻¹. A randomized complete block design with two replications per trial was applied. Other field measures followed conventional management practices.

At the maturity stage, after the plant height of the inbred lines had stabilized, five plants from each row were randomly selected and measured with a tower ruler. The mean value of each trait was used for association analysis. PH was measured as the length from the root to the top of the tassel. EH was measured as the length from the root to the node of the uppermost ear.

4.3. Phenotypic Data Analysis

The phenotypic data were analyzed using Microsoft Excel 2007 software to generate descriptive statistics, including the mean, minimum, maximum, standard deviation (SD), coefficient of variation (CV), skewness, and kurtosis. The coefficient of variation was calculated as CV = SD/mean. The frequency distribution of the phenotypic data was also checked using Microsoft Excel 2007 software. Kurtosis and skewness were used to assess the normality of the frequency distribution. The corrplot package in R was used to plot the Pearson correlation analysis.
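As an illustration of these descriptive statistics, the following Python sketch computes the same quantities for one trait. It is not the authors' Excel workflow: the function name is hypothetical, the input values are toy numbers, and skewness and kurtosis are computed with simple moment estimators, which differ slightly from the bias-corrected sample formulas behind Excel's SKEW and KURT functions.

```python
import statistics


def describe(values):
    """Return mean, min, max, SD, CV, skewness and excess kurtosis of one trait."""
    n = len(values)
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)  # sample SD (n - 1 denominator)
    # Population central moments for moment-based skewness and kurtosis.
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return {
        "mean": mean,
        "min": min(values),
        "max": max(values),
        "sd": sd,
        "cv": sd / mean,               # CV = SD / mean, as in the text
        "skewness": m3 / m2 ** 1.5,
        "kurtosis": m4 / m2 ** 2 - 3,  # excess kurtosis (0 for a normal curve)
    }


# Toy values only; not data from the study.
stats = describe([1.0, 2.0, 3.0, 4.0, 5.0])
print(stats)
```

Absolute skewness and kurtosis values below 1, as reported in Section 2.1, are then read as indicating an approximately normal distribution.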

Best Linear Unbiased Estimator (BLUE) and generalized heritability were estimated in META-R [50].

The formula for calculating the BLUE value is:

Y_ijk = μ + Rep_i + Block_j(Rep_i) + Gen_k + ε_ijk

where Y_ijk is the plant architecture trait, μ is the overall mean effect, Rep_i is the effect of the ith replicate, Block_j (Rep_i) is the effect of the jth incomplete block within the ith replicate, Gen_k is the effect of the kth genotype, and ε_ijk is the effect of the error associated with three factors.

The formula for calculating the generalized heritability is:

H^{2} = \frac{\sigma_{g}^{2}}{\sigma_{g}^{2} + \sigma_{ge}^{2} / nEnv + \sigma_{\varepsilon}^{2} / (nEnv \times nRep)}

where σ_g² and σ_ε² are the genotype and error variance components, respectively, σ_ge² is the genotype × environment (G × E) interaction variance component, nEnv is the number of environments, and nRep is the number of replications. To calculate BLUE and generalized heritability, all effects were declared as random.
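The heritability formula above can be evaluated directly once the variance components are known. The Python sketch below is illustrative only: the function name and the variance component values are hypothetical, while nEnv = 2 and nRep = 2 match the two environments and two replications described in Section 4.2.

```python
def generalized_heritability(var_g, var_ge, var_e, n_env, n_rep):
    """H^2 = var_g / (var_g + var_ge / n_env + var_e / (n_env * n_rep))."""
    denominator = var_g + var_ge / n_env + var_e / (n_env * n_rep)
    return var_g / denominator


# Hypothetical variance components; not estimates from the study.
h2 = generalized_heritability(var_g=3.0, var_ge=2.0, var_e=4.0, n_env=2, n_rep=2)
print(round(h2, 2))  # 3 / (3 + 2/2 + 4/4) = 0.6
```

Because the G × E and error variances are divided by the number of environments and plots, adding environments or replications raises H² for fixed variance components.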

4.4. Genotyping and Genotypic Data Analysis

For genotyping, fresh young leaves of all accessions were collected, and genomic DNA was extracted using a DNA extraction kit. The panel of 477 inbred lines was genotyped by Novogene Company on the Illumina platform to detect single nucleotide polymorphisms (SNPs), with B73 RefGen_v4 as the reference genome for SNP calling. The raw reads were filtered via a standard quality control (QC) process, and the resulting clean reads were used for SNP calling. A total of 108,457,756 SNPs were obtained. These SNPs were then filtered using VCFtools software: SNPs with a missing rate < 20% and a minor allele frequency (MAF) > 0.05 were retained, resulting in a final set of 278,592 high-quality SNPs.
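The two filtering criteria can be sketched as follows. This is not the VCFtools implementation; it assumes genotypes are coded as the number of alternate alleles (0, 1, or 2) with `None` for missing calls, and the function name and example SNPs are hypothetical.

```python
def passes_qc(genotypes, max_missing=0.20, min_maf=0.05):
    """Keep a SNP if its missing rate is < 20% and its MAF is > 0.05."""
    called = [g for g in genotypes if g is not None]
    missing_rate = 1 - len(called) / len(genotypes)
    if not called or missing_rate >= max_missing:
        return False
    # Alternate-allele frequency in diploid lines: 2 alleles per called line.
    alt_freq = sum(called) / (2 * len(called))
    maf = min(alt_freq, 1 - alt_freq)
    return maf > min_maf

# Hypothetical SNPs scored on ten lines, for illustration only.
snp_a = [0, 1, 2, 0, 1, 2, 0, 1, None, 0]        # 10% missing, common allele
snp_b = [0, 0, 0, 0, 0, 0, 0, 0, 0, None]        # monomorphic, MAF = 0
snp_c = [0, 2, None, None, None, 1, 1, 0, 2, 1]  # 30% missing
kept = [name for name, g in [("snp_a", snp_a), ("snp_b", snp_b), ("snp_c", snp_c)]
        if passes_qc(g)]
```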

4.5. Analyses of Linkage Disequilibrium (LD), Population Structure, GWAS, and LD Block Analysis

Population structure was analyzed with the model-based clustering algorithm in ADMIXTURE version 1.3 [51]. Preliminary analyses were performed in multiple runs with consecutive K values from 1 to 12, and a five-fold cross-validation procedure was performed for each value of K. The most likely K value was determined from the cross-validation value reported by ADMIXTURE. Inbred lines with a membership probability greater than 0.5 were assigned to the corresponding cluster, and the results were plotted using TBtools software v1.098727 [52]. Principal component analysis (PCA) and clustering analyses were performed in R.
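The choice of K and the 0.5 membership rule can be sketched as below. The cross-validation errors and membership probabilities are hypothetical, the sketch assumes that the K with the lowest cross-validation error is preferred, and lines with no membership above 0.5 are simply left unassigned (`None`).

```python
def assign_cluster(memberships, threshold=0.5):
    """Return the index of the cluster whose membership exceeds the threshold."""
    best = max(range(len(memberships)), key=lambda k: memberships[k])
    return best if memberships[best] > threshold else None

# Hypothetical cross-validation errors per K, for illustration only.
cv_errors = {2: 0.61, 3: 0.55, 4: 0.58}
best_k = min(cv_errors, key=cv_errors.get)

# Hypothetical membership probabilities for three lines at K = 3.
q_matrix = [
    [0.80, 0.10, 0.10],
    [0.40, 0.35, 0.25],
    [0.05, 0.15, 0.80],
]
clusters = [assign_cluster(row) for row in q_matrix]
```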

The PopLDdecay 3.40 software (https://github.com/BGI-shenzhen/PopLDdecay (accessed on 13 April 2022)) [53] and Perl scripts were used to evaluate linkage disequilibrium (LD), which informs the number of markers required for GWAS as well as the detection efficiency and accuracy of GWAS.
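For intuition, LD between two SNPs is often summarized as r². The sketch below approximates r² as the squared Pearson correlation between genotype dosages (0, 1, 2), a common shortcut for unphased data; it is an assumption made for illustration and not necessarily the computation that PopLDdecay performs. The genotype vectors are hypothetical.

```python
def r_squared(x, y):
    """Squared Pearson correlation between two genotype dosage vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy ** 2 / (sxx * syy)

# Hypothetical dosages for six lines, for illustration only.
snp1 = [0, 0, 1, 2, 2, 1]
snp2 = [0, 0, 1, 2, 2, 1]   # identical pattern: complete LD
snp3 = [0, 2, 1, 0, 2, 1]   # unrelated pattern: no LD
ld_high = r_squared(snp1, snp2)
ld_none = r_squared(snp1, snp3)
```

Plotting the average r² against the physical distance between SNP pairs gives the LD decay curve.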

The GWAS was conducted in TASSEL 5.0 software [54] using a mixed linear model that incorporated PCA + K. Population structure (PCA) and the kinship (K) calculated among individuals were used to correct for population stratification. The first three principal components (PC1, PC2, and PC3), chosen from a scree plot, were included in the model as fixed-effect covariates. Considering the rigor of the mixed linear model, we conservatively chose a −log10(p-value) of 4.0 as the threshold for declaring SNPs significantly associated with the target traits PH, EH, and TBN. Manhattan plots and quantile–quantile (Q–Q) plots were produced using the “CMplot” package in R. The proportion of phenotypic variation explained by each marker was estimated as the phenotypic variance explained (PVE). Linkage disequilibrium heat maps were constructed using “LDBlockShow” [55].
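Applying the significance threshold to a table of association results is straightforward. In this sketch, the SNP names and p-values are hypothetical, and a SNP is treated as significant when −log10(p) is at least 4.0 (equivalently, p ≤ 1 × 10^−4).

```python
import math

THRESHOLD = 4.0  # -log10(p-value) cut-off stated in the text

def significant_snps(p_values, threshold=THRESHOLD):
    """Return the SNPs whose -log10(p) reaches the threshold."""
    return [snp for snp, p in p_values.items() if -math.log10(p) >= threshold]

# Hypothetical GWAS p-values, for illustration only.
p_values = {"snp_1": 3.2e-6, "snp_2": 0.02, "snp_3": 8.0e-5}
hits = significant_snps(p_values)
```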

4.6. Candidate Gene Identification and Annotation

All putative candidate genes within 50 kb of the detected loci were identified. Expression data and gene annotation information were collected from the MaizeGDB database (http://www.maizegdb.org (accessed on 19 May 2022)). The physical positions of the genes and SNPs were based on the maize B73 RefGen_v4 genome. Functional annotations and related information for the candidate genes were obtained from the Maize Genetics and Genomics Database (MaizeGDB) and the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/ (accessed on 1 June 2022)).
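The 50 kb search can be sketched as a simple interval check. The gene names and coordinates below are hypothetical, and the sketch assumes that the window extends 50 kb on either side of the SNP and that a gene counts as a candidate when any part of it falls inside that window.

```python
WINDOW = 50_000  # 50 kb, as stated in the text

def genes_near_snp(snp_chr, snp_pos, genes, window=WINDOW):
    """Return IDs of genes lying within `window` bp of the SNP."""
    hits = []
    for gene_id, chrom, start, end in genes:
        if chrom != snp_chr:
            continue
        # Distance is 0 when the SNP lies inside the gene body.
        distance = max(start - snp_pos, snp_pos - end, 0)
        if distance <= window:
            hits.append(gene_id)
    return hits

# Hypothetical gene models (id, chromosome, start, end), for illustration only.
genes = [
    ("gene_A", 1, 960_000, 965_000),      # 35 kb upstream of the SNP
    ("gene_B", 1, 1_060_000, 1_070_000),  # 60 kb downstream: outside window
    ("gene_C", 2, 990_000, 1_010_000),    # different chromosome
    ("gene_D", 1, 995_000, 1_005_000),    # SNP lies inside the gene
]
candidates = genes_near_snp(1, 1_000_000, genes)
```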

4.7. Genomic Prediction Analysis

Genomic prediction (GP) analysis was conducted with the ridge regression best linear unbiased prediction (RRBLUP) model in R [56]. To estimate the effect of marker density on GP accuracy, different numbers of significant markers identified by GWAS (100, 500, 1000, 3000, 5000, and 10,000) were used to estimate prediction accuracy for all the target traits. At each marker density, SNPs were randomly selected 500 times, and a five-fold cross-validation scheme with 500 repetitions was applied. To explore the effect of training population size on prediction accuracy, the training population size was increased from 10% to 90% of the total population size in steps of 10%, and prediction accuracy was estimated for all the target traits at each size.
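The cross-validation scheme can be sketched in Python with NumPy. This is not the rrBLUP package: it uses ordinary ridge regression with a fixed, hypothetical penalty `lam`, whereas rrBLUP estimates the shrinkage from variance components. Prediction accuracy is assumed here to be the Pearson correlation between predicted and observed values in the held-out fold, and all data are simulated.

```python
import numpy as np

def ridge_predict(Z_train, y_train, Z_test, lam):
    """Estimate marker effects by ridge regression and predict test lines."""
    mu = y_train.mean()
    n_markers = Z_train.shape[1]
    beta = np.linalg.solve(
        Z_train.T @ Z_train + lam * np.eye(n_markers),
        Z_train.T @ (y_train - mu),
    )
    return mu + Z_test @ beta

def cv_accuracy(Z, y, n_folds=5, lam=1.0, seed=0):
    """Mean correlation between predicted and observed values over folds."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), n_folds)
    accuracies = []
    for test_idx in folds:
        train_idx = np.setdiff1d(np.arange(len(y)), test_idx)
        y_hat = ridge_predict(Z[train_idx], y[train_idx], Z[test_idx], lam)
        accuracies.append(np.corrcoef(y_hat, y[test_idx])[0, 1])
    return float(np.mean(accuracies))

# Simulated genotypes (coded -1, 0, 1) and phenotypes, for illustration only.
rng = np.random.default_rng(1)
Z = rng.integers(0, 3, size=(200, 50)).astype(float) - 1.0
true_effects = rng.normal(0.0, 1.0, size=50)
y = Z @ true_effects + rng.normal(0.0, 1.0, size=200)
accuracy = cv_accuracy(Z, y)
```

Repeating `cv_accuracy` with different seeds, marker subsets (columns of `Z`), or training fractions mirrors the marker-density and training-population-size comparisons described above.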

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants12020303/s1, Table S1: SNPs, chromosomal positions and candidate genes identified by GWAS.

Author Contributions

H.W., Y.R. and L.Q. contributed to the study conception and design. Data collection was performed by D.D., Y.G. and A.Z. Data analysis was performed by D.D., H.Z. and X.Z. The first draft of the manuscript was written by D.D. and H.W. and revised by Y.R. and L.Q. All authors commented on previous versions of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Shanghai Agriculture Applied Technology Development Program (No. 1-1Shanghai Agricultural Science and Technology Innovation Word (2019)), the Shanghai Science and Technology Support Project (20392000400), the National Corn Industry Technology System (CARS-02-73), the Shanghai Modern Agricultural Industry Technology System (No. 10 of Shanghai Agricultural Industry Word (2017)), and the Shanghai Engineering Research Center of Specialty Maize (20DZ2255300).

Data Availability Statement

Data available in a publicly accessible repository.

Acknowledgments

The authors are very grateful to the CIMMYT-China Specialty Maize Research Center (CCSMRC) for the germplasm resources and support.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Su, C.; Wang, W.; Gong, S.; Zuo, J.; Li, S.; Xu, S. High Density Linkage Map Construction and Mapping of Yield Trait QTLs in Maize (Zea mays) Using the Genotyping-by-Sequencing (GBS) Technology. Front. Plant Sci. 2017, 8, 706.
2. Rouf Shah, T.; Prasad, K.; Kumar, P. Maize A potential source of human nutrition and health: A review. Cogent Food Agric. 2016, 2, 1166995.
3. Baveja, A.; Muthusamy, V.; Panda, K.K.; Zunjare, R.U.; Das, A.K.; Chhabra, R.; Mishra, S.J.; Mehta, B.K.; Saha, S.; Hossain, F. Development of multinutrient-rich biofortified sweet corn hybrids through genomics-assisted selection of shrunken2, opaque2, lcyE and crtRB1 genes. J. Appl. Genet. 2021, 62, 419–429.
4. Feng, X.; Pan, L.; Wang, Q.; Liao, Z.; Wang, X.; Zhang, X.; Guo, W.; Hu, E.; Li, J.; Xu, J.; et al. Nutritional and physicochemical characteristics of purple sweet corn juice before and after boiling. PLoS ONE 2020, 15, e233094.
5. Li, Z.; Hong, T.; Shen, G.; Gu, Y.; Guo, Y.; Han, J. Amino Acid Profiles and Nutritional Evaluation of Fresh Sweet–Waxy Corn from Three Different Regions of China. Nutrients 2022, 14, 3887.
6. Pan, Q.; Xu, Y.; Li, K.; Peng, Y.; Zhan, W.; Li, W.; Li, L.; Yan, J. The Genetic Basis of Plant Architecture in 10 Maize Recombinant Inbred Line Populations. Plant Physiol. 2017, 175, 858–873.
7. Lu, S.; Li, M.; Zhang, M.; Lu, M.; Wang, X.; Wang, P.; Liu, W. Genome-Wide Association Study of Plant and Ear Height in Maize. Trop. Plant Biol. 2020, 13, 262–273.
8. Gage, J.L.; Miller, N.D.; Spalding, E.P.; Kaeppler, S.M.; de Leon, N. TIPS: A system for automated image-based phenotyping of maize tassels. Plant Methods 2017, 13, 1–12.
9. Wartha, C.A.; Cargnelutti Filho, A.; Lúcio, A.D.; Follmann, D.N.; Kleinpaul, J.A.; Simões, F.M. Sample sizes to estimate mean values for tassel traits in maize genotypes. Genet. Mol. Res. 2016, 15, 1–13.
10. Cao, Y.; Zhong, Z.; Wang, H.; Shen, R. Leaf angle: A target of genetic improvement in cereal crops tailored for high-density planting. Plant Biotechnol. J. 2022, 20, 426–436.
11. Riedelsheimer, C.; Lisec, J.; Czedik-Eysenberg, A.; Sulpice, R.; Flis, A.; Grieder, C.; Altmann, T.; Stitt, M.; Willmitzer, L.; Melchinger, A.E. Genome-wide association mapping of leaf metabolic profiles for dissecting complex traits in maize. Proc. Natl. Acad. Sci. USA 2012, 109, 8872–8877.
12. Wang, B.; Lin, Z.; Li, X.; Zhao, Y.; Zhao, B.; Wu, G.; Ma, X.; Wang, H.; Xie, Y.; Li, Q.; et al. Genome-wide selection and genetic improvement during modern maize breeding. Nat. Genet. 2020, 52, 565–571.
13. Li, N.; Lin, B.; Wang, H.; Li, X.; Yang, F.; Ding, X.; Yan, J.; Chu, Z. Natural variation in ZmFBL41 confers banded leaf and sheath blight resistance in maize. Nat. Genet. 2019, 51, 1540–1548.
14. Li, S.; Zhang, C.; Yang, D.; Lu, M.; Qian, Y.; Jin, F.; Liu, X.; Wang, Y.; Liu, W.; Li, X. Detection of QTNs for kernel moisture concentration and kernel dehydration rate before physiological maturity in maize using multi-locus GWAS. Sci. Rep. 2021, 11, 1764.
15. Yin, X.; Bi, Y.; Jiang, F.; Guo, R.; Zhang, Y.; Fan, J.; Kang, M.S.; Fan, X. Fine mapping of candidate quantitative trait loci for plant and ear height in a maize nested-association mapping population. Front. Plant Sci. 2022, 13, 963985.
16. Wu, X.; Li, Y.; Shi, Y.; Song, Y.; Zhang, D.; Li, C.; Buckler, E.S.; Li, Y.; Zhang, Z.; Wang, T. Joint-linkage mapping and GWAS reveal extensive genetic loci that regulate male inflorescence size in maize. Plant Biotechnol. J. 2016, 14, 1551–1562.
17. Zhao, S.; Li, X.; Song, J.; Li, H.; Zhao, X.; Zhang, P.; Li, Z.; Tian, Z.; Lv, M.; Deng, C.; et al. Genetic dissection of maize plant architecture using a novel nested association mapping population. Plant Genome 2022, 15, e20179.
18. Meuwissen, T.H.; Hayes, B.J.; Goddard, M.E. Prediction of total genetic value using genome-wide dense marker maps. Genetics 2001, 157, 1819–1829.
19. Pace, J.; Yu, X.; Lübberstedt, T. Genomic prediction of seedling root length in maize (Zea mays L.). Plant J. 2015, 83, 903–912.
20. Liu, X.; Hu, X.; Li, K.; Liu, Z.; Wu, Y.; Wang, H.; Huang, C. Genetic mapping and genomic selection for maize stalk strength. BMC Plant Biol. 2020, 20, 1–16.
21. Sharma, S.; Pinson, S.R.M.; Gealy, D.R.; Edwards, J.D. Genomic prediction and QTL mapping of root system architecture and above-ground agronomic traits in rice (Oryza sativa L.) with a multitrait index and Bayesian networks. G3 Genes Genomes Genet. 2021, 11, 10.
22. Kadam, D.C.; Potts, S.M.; Bohn, M.O.; Lipka, A.E.; Lorenz, A.J. Genomic Prediction of Single Crosses in the Early Stages of a Maize Hybrid Breeding Pipeline. G3 (Bethesda) 2016, 6, 3443–3453.
23. Cui, Z.; Dong, H.; Zhang, A.; Ruan, Y.; He, Y.; Zhang, Z. Assessment of the Potential for Genomic Selection To Improve Husk Traits in Maize. G3 (Bethesda) 2020, 10, 3741–3749.
24. Morrell, P.L.; Toleno, D.M.; Lundy, K.E.; Clegg, M.T. Low levels of linkage disequilibrium in wild barley (Hordeum vulgare ssp. spontaneum) despite high rates of self-fertilization. Proc. Natl. Acad. Sci. USA 2005, 102, 2442–2447.
25. Ren, J.; Li, Z.; Wu, P.; Zhang, A.; Liu, Y.; Hu, G.; Cao, S.; Qu, J.; Dhliwayo, T.; Zheng, H.; et al. Genetic Dissection of Quantitative Resistance to Common Rust (Puccinia sorghi) in Tropical Maize (Zea mays L.) by Combined Genome-Wide Association Study, Linkage Mapping, and Genomic Prediction. Front. Plant Sci. 2021, 12, 1338.
26. Thirunavukkarasu, N.; Hossain, F.; Shiriga, K.; Mittal, S.; Arora, K.; Rathore, A.; Mohan, S.; Shah, T.; Sharma, R.; Namratha, P.M.; et al. Unraveling the genetic architecture of subtropical maize (Zea mays L.) lines to assess their utility in breeding programs. BMC Genom. 2013, 14, 877.
27. Ruanjaichon, V.; Khammona, K.; Thunnom, B.; Suriharn, K.; Kerdsri, C.; Aesomnuk, W.; Yongsuwan, A.; Chaomueang, N.; Thammapichai, P.; Arikit, S.; et al. Identification of Gene Associated with Sweetness in Corn (Zea mays L.) by Genome-Wide Association Study (GWAS) and Development of a Functional SNP Marker for Predicting Sweet Corn. Plants 2021, 10, 1239.
28. Fei, J.; Lu, J.; Jiang, Q.; Liu, Z.; Yao, D.; Qu, J.; Liu, S.; Guan, S.; Ma, Y. Maize plant architecture trait QTL mapping and candidate gene identification based on multiple environments and double populations. BMC Plant Biol. 2022, 22, 1–15.
29. Li, X.; Zhou, Z.; Ding, J.; Wu, Y.; Zhou, B.; Wang, R.; Ma, J.; Wang, S.; Zhang, X.; Xia, Z.; et al. Combined Linkage and Association Mapping Reveals QTL and Candidate Genes for Plant and Ear Height in Maize. Front. Plant Sci. 2016, 7, 833.
30. Wang, Y.; Chen, J.; Guan, Z.; Zhang, X.; Zhang, Y.; Ma, L.; Yao, Y.; Peng, H.; Zhang, Q.; Zhang, B.; et al. Combination of multi-locus genome-wide association study and QTL mapping reveals genetic basis of tassel architecture in maize. Mol. Genet. Genomics 2019, 294, 1421–1440.
31. Brown, P.J.; Upadyayula, N.; Mahone, G.S.; Tian, F.; Bradbury, P.J.; Myles, S.; Holland, J.B.; Flint-Garcia, S.; Mcmullen, M.D.; Buckler, E.S. Distinct Genetic Architectures for Male and Female Inflorescence Traits of Maize. PLoS Genet. 2011, 7, e1002383.
32. Yang, W.; Zheng, L.; He, Y.; Zhu, L.; Tao, Y. Fine mapping and candidate gene prediction of a major quantitative trait locus for tassel branch number in maize. Gene 2020, 757, 144928.
33. Xu, G.; Wang, X.; Huang, C.; Xu, D.; Li, D.; Tian, J.; Chen, Q.; Wang, C.; Liang, Y.; Wu, Y.; et al. Complex genetic architecture underlies maize tassel domestication. New Phytol. 2017, 214, 852–864.
34. Vollbrecht, E.; Springer, P.S.; Goh, L.; Buckler Iv, E.S.; Martienssen, R. Architecture of floral branch systems in maize and related grasses. Nature 2005, 436, 1119–1126.
35. Satoh-Nagasawa, N.; Nagasawa, N.; Malcomber, S.; Sakai, H.; Jackson, D. A trehalose metabolic enzyme controls inflorescence architecture in maize. Nature 2006, 441, 227–230.
36. Chuck, G.S.; Brown, P.J.; Meeley, R.; Hake, S. Maize SBP-box transcription factors unbranched2 and unbranched3 affect yield traits by regulating the rate of lateral primordia initiation. Proc. Natl. Acad. Sci. USA 2014, 111, 18775–18780.
37. Liu, X.; Galli, M.; Camehl, I.; Gallavotti, A. RAMOSA1 ENHANCER LOCUS2-Mediated Transcriptional Repression Regulates Vegetative and Reproductive Architecture. Plant Physiol. 2019, 179, 348–363.
38. Wang, B.; Shahzad, M.F.; Zhang, Z.; Sun, H.; Han, P.; Li, F.; Han, Z. Genome-wide analysis reveals the expansion of Cytochrome P450 genes associated with xenobiotic metabolism in rice striped stem borer, Chilo suppressalis. Biochem. Bioph. Res. Co. 2014, 443, 756–760.
39. Chen, Y.; Wang, G.; Pan, J.; Wen, H.; Du, H.; Sun, J.; Zhang, K.; Lv, D.; He, H.; Cai, R.; et al. Comprehensive Genomic Analysis and Expression Profiling of the C₂H₂ Zinc Finger Protein Family Under Abiotic Stresses in Cucumber (Cucumis sativus L.). Genes (Basel) 2020, 11, 171.
40. Li, Y.; Sun, A.; Wu, Q.; Zou, X.; Chen, F.; Cai, R.; Xie, H.; Zhang, M.; Guo, X. Comprehensive genomic survey, structural classification and expression analysis of C₂H₂-type zinc finger factor in wheat (Triticum aestivum L.). BMC Plant Biol. 2021, 21, 1–18.
41. Arrey-Salas, O.; Caris-Maldonado, J.C.; Hernández-Rojas, B.; Gonzalez, E. Comprehensive Genome-Wide Exploration of C₂H₂ Zinc Finger Family in Grapevine (Vitis vinifera L.): Insights into the Roles in the Pollen Development Regulation. Genes 2021, 12, 302.
42. Zhang, S.; Liu, J.; Zhong, G.; Wang, B. Genome-Wide Identification and Expression Patterns of the C₂H₂-Zinc Finger Gene Family Related to Stress Responses and Catechins Accumulation in Camellia sinensis [L.] O. Kuntze. Int. J. Mol. Sci. 2021, 22, 4197.
43. Sun, W.; Chen, D.; Xue, Y.; Zhai, L.; Zhang, D.; Cao, Z.; Liu, L.; Cheng, C.; Zhang, Y.; Zhang, Z. Genome-wide identification of AGO18b-bound miRNAs and phasiRNAs in maize by cRIP-seq. BMC Genomics 2019, 20, 1–11.
44. Samalova, M.; Carr, P.; Bromley, M.; Blatzer, M.; Moya-Nilges, M.; Latge, J.P.; Mouyna, I. GPI Anchored Proteins in Aspergillus fumigatus and Cell Wall Morphogenesis. Curr. Top Microbiol Immunol. 2020, 425, 167–186.
45. Wang, M.; Qu, H.; Zhang, H.; Liu, S.; Li, Y.; Zhang, C. Hormone and RNA-seq analyses reveal the mechanisms underlying differences in seed vigour at different maize ear positions. Plant Mol. Biol. 2019, 99, 461–476.
46. Ali, M.; Zhang, Y.; Rasheed, A.; Wang, J.; Zhang, L. Genomic Prediction for Grain Yield and Yield-Related Traits in Chinese Winter Wheat. Int. J. Mol. Sci. 2020, 21, 1342.
47. Wang, N.; Wang, H.; Zhang, A.; Liu, Y.; Yu, D.; Hao, Z.; Ilut, D.; Glaubitz, J.C.; Gao, Y.; Jones, E.; et al. Genomic prediction across years in a maize doubled haploid breeding program to accelerate early-stage testcross testing. Theor. Appl. Genet. 2020, 133, 2869–2879.
48. Zhang, A.; Wang, H.; Beyene, Y.; Semagn, K.; Liu, Y.; Cao, S.; Cui, Z.; Ruan, Y.; Burgueño, J.; San Vicente, F.; et al. Effect of Trait Heritability, Training Population Size and Marker Density on Genomic Prediction Accuracy Estimation in 22 bi-parental Tropical Maize Populations. Front. Plant Sci. 2017, 8, 1916.
49. Norman, A.; Taylor, J.; Edwards, J.; Kuchel, H. Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy. G3 (Bethesda) 2018, 8, 2889–2899.
50. Alvarado, G.; Rodríguez, F.M.; Pacheco, A.; Burgueño, J.; Crossa, J.; Vargas, M.; Pérez-Rodríguez, P.; Lopez-Cruz, M.A. META-R: A software to analyze data from multi-environment plant breeding trials. Crop J. 2020, 8, 745–756.
51. Mussmann, S.M.; Douglas, M.R.; Chafin, T.K.; Douglas, M.E. AdmixPipe: Population analyses in Admixture for non-model organisms. BMC Bioinform. 2020, 21, 1–9.
52. Chen, C.; Chen, H.; Zhang, Y.; Thomas, H.R.; Frank, M.H.; He, Y.; Xia, R. TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data. Mol. Plant 2020, 13, 1194–1202.
53. Zhang, C.; Dong, S.; Xu, J.; He, W.; Yang, T. PopLDdecay: A fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics 2019, 35, 1786–1788.
54. Bradbury, P.J.; Zhang, Z.; Kroon, D.E.; Casstevens, T.M.; Ramdoss, Y.; Buckler, E.S. TASSEL: Software for association mapping of complex traits in diverse samples. Bioinformatics 2007, 23, 2633–2635.
55. Dong, S.; He, W.; Ji, J.; Zhang, C.; Guo, Y.; Yang, T. LDBlockShow: A fast and convenient tool for visualizing linkage disequilibrium and haplotype blocks based on variant call format files. Brief. Bioinform. 2021, 22, 4.
56. Endelman, J.B. Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP. Plant Genome 2011, 4, 250–255.

Figure 1. Distribution of phenotypes for PH, EH, and TBN in maize. (A) PH. (B) EH. (C) TBN.

Figure 2. Comparison of PH, EH, and TBN traits among different maize types. (A) Comparison between sweet corn and waxy corn. (B) Comparison between different genotypes of sweet corn. Asterisks above the boxes indicate significant differences by Student’s t test: *** p < 0.001; ns, p > 0.05.

Figure 3. Distributions of and correlations between the three phenotypic traits. (A) Correlation between BLUE and individual environments for plant height and ear height. (B) Correlation between BLUE and individual environments for tassel branch number. The frequency distribution histograms of the traits are located on the diagonal, the area below the diagonal shows the scatter plots of the traits, and the area above shows the correlation coefficient between each pair of traits. *** indicates significance at p < 0.001.

Figure 4. Genotypic diversity and LD decay in the mapping panel. (A) Chromosome-specific SNP density in 1-Mb genomic intervals. The number of SNPs is represented on a green-to-red scale. (B) Frequency distribution of genotypic deletions. (C) Distribution of the minor allele frequency of the genotypes. (D) Whole-genome LD in the entire panel based on 477 maize inbred lines.

Figure 5. Analysis of genetic diversity. (A) ΔK-value of 477 inbred lines. (B) The Bayes cluster plot of 477 maize inbred lines when K = 3. (C) Principal component analysis. (D) Distribution of pairwise relative kinship calculated for 477 maize inbred lines.

Figure 6. GWAS results showing the significant SNPs associated with PH. (A) Manhattan plot. Each dot represents a SNP. The black solid line represents the threshold of 1 × 10^−4. (B) Quantile–quantile (Q–Q) plot. The red line is the reference line that an ideal Q–Q plot should follow.

Figure 7. GWAS results showing the significant SNPs associated with EH. (A) Manhattan plot. Each dot represents a SNP. The black solid line represents the threshold of 1 × 10^−4. (B) Quantile–quantile (Q–Q) plot. The red line is the reference line that an ideal Q–Q plot should follow.

Figure 8. GWAS results showing the significant SNPs associated with TBN. (A) Manhattan plot. Each dot represents a SNP. The black solid line represents the threshold of 1 × 10^−4. (B) Quantile–quantile (Q–Q) plot. The red line is the reference line that an ideal Q–Q plot should follow.

Figure 9. Genomic prediction accuracy of PH, EH, and TBN in the population. (A) Accuracy when the number of SNPs varied from 0 to 10,000. (B) Accuracy when the training population size (TPS) ranged from 10% to 90% of the total population size.

Table 1. Descriptive statistics, variance components, and broad-sense heritability (H²) for PH, EH, and TBN in the population.

Trait	Environment	Range	Mean ± SD	Skewness	Kurtosis	CV	Variance (G)	Variance (E)	Variance (G × E)	H²
PH	20SH	89.00–254.00	160.61 ± 26.92	0.37	0.50	0.17	702.62 **	62.58 **		0.97
	19SH	63.00–264.00	139.79 ± 31.94	−0.85	0.31	0.23	686.65 **	165.79 **		0.93
	BLUE	63.00–244.67	148.53 ± 26.31	0.10	0.50	0.18	447.64 **	118.75 **	254.99 **	0.75
EH	20SH	10.00–134.00	58.14 ± 20.82	0.18	0.10	0.36	414.84 **	48.63 **		0.96
	19SH	14.67–128.67	59.13 ± 20.62	−0.05	0.16	0.35	400.47 **	68.89 **		0.95
	BLUE	14.33–128.67	58.45 ± 19.46	0.12	−0.04	0.33	284.23 **	59.60 **	129.65 **	0.79
TBN	20HN	1.00–21.33	7.32 ± 3.33	0.53	0.77	0.45	9.28 **	2.17 **		0.93
	20SH	1.00–26.00	11.54 ± 4.05	0.34	−0.01	0.35	15.62 **	2.5 **		0.95
	BLUE	1.00–20.83	9.27 ± 3.47	0.32	0.31	0.37	7.31 **	2.34 **	4.97 **	0.72

** Significant at p < 0.01.

Table 2. Candidate genes for each significant SNP associated with traits and their encoding products.

Trait	Chr	SNP Physical Position	Gene ID	Encoding	Functions
EH	1	298683020	Zm00001d034639	Zinc finger protein ZAT12	transcriptional regulation
			Zm00001d034641	ZFP16-2	other
			Zm00001d034642	Zinc finger protein ZAT11	transcriptional regulation
	2	222818258	Zm00001d007123	FPF1	other
			Zm00001d007121	CW-type Zinc Finger	transcriptional regulation
	3	219824021	Zm00001d044117	MYBR41	transcriptional regulation
			Zm00001d044120	cytochrome P450 CYP51H12	metabolism
			Zm00001d044121	auxin-like 1 protein	plant hormones
	4	240547324	Zm00001d053756	SBP-domain protein2	other
			Zm00001d053753	calmodulin binding protein	metabolism
	6	158598786	Zm00001d038496	Cyclin-T1-5	cell division
	6	167169385	Zm00001d038930	Transcription factor MYB36	transcriptional regulation
	7	177703100	Zm00001d022437	probable WRKY transcription factor 70	transcriptional regulation
			Zm00001d022440	ABI32 ABI3VP1 type transcription factor	plant hormones
			Zm00001d022442	bZIP transcription factor	transcriptional regulation
PH	1	2773087	Zm00001d027317	rolled leaf 2	other
	1	244295536	Zm00001d032945	myosin-7B	structural proteins
	1	255274670	Zm00001d033231	Expansin-B4	other
	2	11369186	Zm00001d002374	SAUR20—auxin-responsive SAUR family member	plant hormones
	2	241314625	Zm00001d007869	UDP-glycosyltransferase 71B1	cellular transport
	2	242857556	Zm00001d007924	cytochrome P450 93G2	metabolism
	3	36650639	Zm00001d040302	Zinc finger CCCH type domain-containing protein ZFN-like 1	transcriptional regulation
	3	220846626	Zm00001d044162	WRKY-TF64	transcriptional regulation
	5	198868106	Zm00001d017528	cytochrome P450 86A2	metabolism
	5	212239163	Zm00001d018016	putative RING zinc finger domain superfamily protein	transcriptional regulation
	8	121528270	Zm00001d010601	ZmCHX5	other
	9	28153529	Zm00001d045600	NAC46	transcriptional regulation
	10	3595365	Zm00001d023332	putative WRKY DNA-binding domain superfamily protein	transcriptional regulation
	10	36382301	Zm00001d024008	auxin-responsive protein IAA5	plant hormones
TBN	1	29829368	Zm00001d028304	Homeobox-leucine zipper protein HOX19
	1	37949776	Zm00001d028515	NCBP	other
	1	64922019	Zm00001d029289	putative cytochrome P450 superfamily protein	metabolism
	1	204436402	Zm00001d031861	ethylene-responsive transcription factor ABI4	plant hormones
	1	232642160	Zm00001d032637	myosin 1	structural proteins
	2	2113968	Zm00001d001864	uracil-DNA glycosylase
			Zm00001d001865	ZmCip1, cytokinin-inducible protein	plant hormones
	2	3567341	Zm00001d001961	SAUR23—auxin-responsive SAUR family member
	2	4131167	Zm00001d001995	ribulose bisphosphate carboxylase/oxygenase activase 2, chloroplastic	photosynthesis
			Zm00001d002000	linoleate 9S-lipoxygenase6
	4	184008951	Zm00001d052229	ERF056	other
	4	246401386	Zm00001d054093	senescence-associated protein DIN1	transcriptional regulation
	6	157380718	Zm00001d038444	Transcription factor TCP11	transcriptional regulation
	7	113522699	Zm00001d020430	ra1-ramosa1	other

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
