Next Article in Journal
Bioinformatics Insights on Viral Gene Expression Transactivation: From HIV-1 to SARS-CoV-2
Next Article in Special Issue
FvMYB108, a MYB Gene from Fragaria vesca, Positively Regulates Cold and Salt Tolerance of Arabidopsis
Previous Article in Journal
Raman Spectroscopy as a Research and Diagnostic Tool in Clinical Hematology and Hematooncology
Previous Article in Special Issue
The Potential Role of Genic-SSRs in Driving Ecological Adaptation Diversity in Caragana Plants
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Association Studies on the Kernel Row Number in a Multi-Parent Maize Population

1
College of Agronomy and Biotechnology, Yunnan Agricultural University, Kunming 650201, China
2
Institute of Food Crops, Yunnan Academy of Agricultural Sciences, Kunming 650205, China
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2024, 25(6), 3377; https://doi.org/10.3390/ijms25063377
Submission received: 23 February 2024 / Revised: 11 March 2024 / Accepted: 12 March 2024 / Published: 16 March 2024
(This article belongs to the Special Issue Advance in Plant Abiotic Stress)

Abstract

:
Kernel row number (KRN) is a crucial trait in maize that directly influences yield; hence, understanding the mechanisms underlying KRN is vital for the development of high-yielding inbred lines and hybrids. We crossed four excellent panicle inbred lines (CML312, CML444, YML46, and YML32) with Ye107, and after eight generations of selfing, a multi-parent population was developed comprising four subpopulations, each consisting of 200 lines. KRN was accessed in five environments in Yunnan province over three years (2019, 2021, and 2022). The objectives of this study were to (1) identify quantitative trait loci and single nucleotide polymorphisms associated with KRN through linkage and genome-wide association analyses using high-quality genotypic data, (2) identify candidate genes regulating KRN by identifying co-localized QTLs and SNPs, and (3) explore the pathways involved in KRN formation and identify key candidate genes through Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses. Our study successfully identified 277 significant Quantitative trait locus (QTLs) and 53 significant Single Nucleotide Polymorphism (SNPs) related to KRN. Based on gene expression, GO, and KEGG analyses, SNP-177304649, SNP-150393177, SNP-135283055, SNP-138554600, and SNP-120370778, which were highly likely to be associated with KRN, were identified. Seven novel candidate genes at this locus (Zm00001d022420, Zm00001d022421, Zm00001d016202, Zm00001d050984, Zm00001d050985, Zm00001d016000, and Zm00014a012929) are associated with KRN. Among these, Zm00014a012929 was identified using the reference genome Mo17. The remaining six genes were identified using the reference genome B73. To our knowledge, this is the first report on the association of these genes with KRN in maize. These findings provide a theoretical foundation and valuable insights into the genetic mechanisms underlying maize KRN and the development of high-yielding hybrids through heterosis.

1. Introduction

Kernel row number (KRN) is a critical factor in maize yield and is an important trait for improving maize inbreds and developing high-yielding hybrids. The KRN of maize is a quantitative trait with high heritability that serves as a key indicator for evaluating the quality of inbred lines. Therefore, gathering genetic information about KRN is of great significance for maize breeding. Previous studies have shown that KRN is a quantitative trait controlled by multiple genes, and over 100 KRN-related QTLs have been identified in maize [1,2].
Since Thornsberry et al. [3] successfully introduced the genome-wide association study (GWAS) method into plant genetics research, GWAS has become a widely adopted method in various crops to identify loci associated with traits of interest. Maize, a cross-pollinated crop with rapid linkage disequilibrium decay, is well suited for association analysis. The development of molecular markers has significantly enhanced the effectiveness of QTL mapping for KRN [4]. Numerous studies have employed QTL mapping for KRN, revealing that the loci influencing KRN are distributed across all ten maize chromosomes. Using different mapping populations, more than 200 QTLs associated with KRN have been identified [5,6,7,8,9,10,11,12,13,14]. For instance, Lu et al. [13] conducted a study using a population of 397 F2:3 individuals across seven different environments and detected a significant KRN-related QTL, QTLqKRN7. Tian et al. [15] detected a major KRN-associated QTL, qKRN10, using three BC5F2:3 populations. Cai et al. [16] used an F2:3 population to identify three major QTLs associated with KRN in three different environments. In addition to identifying QTLs associated with KRN, researchers have conducted fine mapping to narrow the QTL interval, leading to the discovery of candidate genes. Calderon et al. [17] used a BC2S3 population to locate a KRN-related QTL and identified seven candidate genes located within a 203 kb region in the QTL. Bai et al. [18] detected a KRN-related QTL, qKRN5.04, and 10 candidate genes within a 300 kb region of the QTL. Although several candidate genes associated with KRN have been identified, only a limited number of functional genes associated with this trait have been successfully cloned. Liu et al. [19] and Wang et al. [20] achieved significant milestones by fine mapping and cloning KRN4 and KRN1, respectively, in maize. Furthermore, Chen et al. [21] made a notable contribution by identifying KRN2, a gene that displays variation between domesticated maize and its wild ancestor, teosinte. KRN2 serves as a pivotal quantitative trait locus for kernel row number in maize, with selection pressures in noncoding upstream regions resulting in decreased KRN2 expression. This reduction correlates with an augmented grain number, which is facilitated by an increase in the KRN.
Previous studies have revealed that relying solely on a single approach, such as GWAS or QTL mapping, has significant limitations for identifying candidate genes that regulate KRN. In recent years, researchers have increasingly combined two or more mapping methods to mine candidate genes associated with target traits. Xiao et al. [22] mapped the QTLs affecting ear traits in ten different recombinant inbred line (RIL) populations using three methods. They then validated and fine-mapped four new QTLs using candidate gene association analysis, expression QTL analysis, and heterozygous inbred line validation. They successfully fine-mapped the QTLs associated with KRN. Similarly, Liu et al. [23] employed a combination of linkage and association mapping to unravel the genetic architecture of maize KRN and evaluated phenotypic predictability using the detected loci. A GWAS study revealed 31 associated SNPs representing 17 genomic loci with effects in at least one of the five individual environments. In another study, Fei et al. [24] utilized a nested association mapping (NAM) population consisting of 1617 RILs and identified five QTLs related to KRN via joint linkage mapping (JLM). These QTLs were further validated through linkage mapping (SLM) and GWAS. They identified three cloned KRN genes through a QTL assay, and two new KRN-related QTLs, qKRN4.2 and qKRN9.1, were successfully identified. Understanding the genetic basis of maize KRN and identifying candidate genes that control KRN are crucial for breeding high-yielding inbred lines and hybrids. Although previous studies have identified numerous KRN-associated genes [25,26,27,28,29,30,31,32,33,34,35], few have applied GO and KEGG analyses in GWAS and QTL studies for candidate gene selection. Additionally, studies on the association between KRN and tropical inbred lines are limited. In this study, four excellent inbred lines with high genetic variation in KRN (CML312, CML444, YML32, and YML46) were selected from the Suwan and non-Reid heterotic groups of tropical regions as donor parents. These inbred lines were hybridized with the elite maize inbred line, Ye107, from the Reid heterotic group, to develop a multi-parent population. The KRN phenotypic data were collected in five different environments using this multi-parent population. Subsequently, GWAS analysis and QTL mapping were conducted on the 655 RILs, followed by GO and KEGG enrichment analyses. The objectives of this study were to (1) utilize high-quality genotypic data to perform linkage and GWAS analyses of KRN in a multi-parent maize population to identify QTLs and SNPs associated with maize KRN, (2) identify candidate genes associated with KRN based on co-localized SNP and QTL information, and (3) combine the results of GO and KEGG enrichment analyses to identify the pathways involved in KRN formation and identify key candidate genes.

2. Results

2.1. Phenotypic Analysis of KRN in the Multi-Parent Population

KRN data were collected from a multi-parent population consisting of 655 RILs across five different environments. The environments were as follows: two in 2019 at Dehong (19DH) and Baoshan (19BS), two in 2021 at Jinghong (21JI) and Yanshan (21YS), and one in 2022 at Yanshan (22YS). As shown in Table 1, the range of KRN variation in 19DH was from 6.5 to 16.67, with an average KRN of 12.16. For 21YS, the range was from 6 to 16, with an average of 11.9; for 22YS, the range was from 9 to 19.5, with an average of 12.76. At 21JH, the range spanned from 8 to 20, with an average of 13.24. In 19BS, the range was 5.33 to 16.67 with an average of 11.82. Phenotypic data across the environments revealed high broad-sense heritability for KRN, reaching 61.94% (Table 1, Figure 1).

2.2. Structure Analysis of the Multi-Parent Population

To mitigate the impact of population stratification during GWAS analysis, we employed a mixed linear model (MLM) that incorporated both the population structure (Q) and kinship matrix (K) to reduce false positives, as suggested in a previous study [36]. We analyzed 590,816 high-quality SNPs using the Plink v1.9 software and performed a genetic similarity analysis using Tassel5. The resulting phylogenetic tree grouped the 655 RILs into four subpopulations (Figure 2). Additionally, we conducted principal component analysis (PCA) using R v 4.3.0 software, which initially divided the 655 RILs into four populations, labeled sub-pop1 to sub-pop4 (Figure 2). However, because the sub-pop4 and sub-pop1 groups overlapped, they were merged into one population. Thus, the 655 RILs were classified into three groups: sub-pop2, sub-pop3, and sub-pop1_sub-pop4 (Figure 3). This result is consistent with the “three heterotic group” theory in maize that has been used to improve breeding efficiency [37,38,39]. This also confirmed the accuracy of classifying the maize lines into different heterotic groups based on their combining ability and grain yield.

2.3. Genome-Wide Association Analysis for KRN

We obtained genotyping data for 590,816 high-quality SNPs for GWAS analysis to identify the loci associated with KRN. GWAS was performed using both genotyping and phenotypic data from the five environments, utilizing the MLM model implemented in GEMMA software (https://github.com/genetics-statistics/GEMMA/releases, accessed on 1 June 2023). Using the formula −log10 (1/SNP number), we established a significant threshold of five. By applying this threshold, we identified 53 SNPs associated with KRNs across five environments associated with 38 genes. (Figure 4a–d and Table S1). Based on our stringent significant threshold, we identified the SNPs across various environments. In the 19DH environment, six SNPs were detected that were located on chromosomes 5 and 7. These SNPs collectively accounted for 8.8% and 6.4% of the phenotypic variance explained (PVE), respectively. Additionally, one SNP located on chromosome 6 in the 19BS environment contributed to 3.2% of the PVE. Furthermore, two SNPs were identified in the 21YS environment, located on chromosomes 2 and 8. In the 21JH environment, 41 SNPs were identified that were distributed across chromosomes 1, 2, 3, 4, 5, 6, 8, and 10, explaining a phenotypic variance ranging from 4.7% to 21%. In the 22YS environment, three SNPs were identified on chromosome 10, explaining 14.9% of PVE. Notably, these SNPs were not co-localized across the environments.

2.4. Detection of Candidate Genes

In the GWAS analysis, 53 significant SNPs were identified, and these SNPs explained KRN variation ranging from 3.24% to 21.01% (Table S1). Among these SNPs, seven had a phenotypic variation explained (PVE) of over 10%, and 14 genes were located within the range of 20 kb upstream and downstream of these seven SNPs (Table 2). In maize, KRN is largely influenced by the following tissues: meristem_16–19_day, ear primordium (2–4 mm), ear primordium (6–8 mm), and female spikelets. Among the genes listed in Table 3, Zm00001d016202 encodes an E3 ubiquitin protein ligase found in ear primordium [40], meristem, and COP1 proteins. COP1 is crucial in plant biology and participates in light signal transduction and growth regulation as an E3 ubiquitin ligase. It interacts with various proteins to control the expression of light-responsive genes, thereby influencing photomorphogenesis, flowering time, and photoperiod-related development. Based on functional annotation and expression analysis, Zm00001d016202 is regarded as a potential candidate gene regulating kernel row number (KRN). However, the remaining 13 genes did not appear to play a specific role in KRN regulation.
Through a search within 20 kb upstream and downstream of SNP-177304649, two genes, Zm00001d022420 and Zm00001d022421, were found to be co-localized in the 19DH environment. Although Zm00001d022420 was not expressed in the meristem_16–19_day or ear primordium, it showed an expression level of 61.5 in female spikelets. In contrast, Zm00001d022421 was expressed in all four tissues, with the highest expression level among all genes. Expression levels were 122.7, 156, 124.5, and 464.8 FPKM in meristem_16–19, ear primordium (2–4 mm), ear primordium (6–8 mm), and female spikelets, respectively. Both Zm00001d022420 and Zm00001d022421 encode ricin B-like lectins, which are carbohydrate-binding agglutinins involved in binding to polysaccharides with Lewis X, Lewis Y, or lacto-N-aminoglycan structures through their agglutinin domains [53] Additionally, by comparing with previously published genes related to ear development, we identified a bd1 (branched silkless1) gene located approximately 1 Mb downstream of SNP-177304649. The bd1 gene leads to increased branching in the male ear, deformities, and lack of filaments in the female ear [54,55]. Therefore, it is speculated that Zm00001d022420 and Zm00001d022421 may contribute to the reduction in KRN in maize.
To confirm the two candidate genes, Zm00001d022420 and Zm00001d022421, we conducted haplotype analysis using the Haploview 4.2 software (Figure 5). Zm00001d022420 exhibited two haplotypes: HAP-1 (TAA) and HAP-2 (CGC). The results indicated that in 19DH environments, HAP-2 (CGC) was not significantly higher than HAP-1 (TAA), and Zm00001d022420 did not have a predominant haplotype. Zm00001d022421 contained two haplotypes: HAP-1 (AA) and HAP-2 (GG). In the 19DH environment, HAP-1 (AA) was significantly more prevalent than HAP-2 (GG) was. Therefore, HAP-1 (AA) was considered to be the dominant haplotype for Zm00001d022421.

2.5. Linkage Analysis

2.5.1. Parental Line Variation for KRN

To facilitate the mapping of QTLs controlling KRN, this study selected five inbred lines with substantial differences in KRN as parents for constructing the multi-parent population. The KRN values of the parental lines were as follows: Ye107 had 8 rows, CML312 had 16 rows, CML444 had 16 rows, YML32 had 14 rows, and YML46 had 14 rows.

2.5.2. Construction of Linkage Maps

We constructed genetic linkage maps for two subpopulations (sub-pop3 and sub-pop4) made from crosses between the common parent Ye107, which exhibited the lowest KRN, and CML312 and CML444, both of which had the highest KRNs of 16. RILs from the two subpopulations were filtered with a k-mer completeness of 0.8 and a deviation segregation of 0.001. The filter markers were then grouped into bins, each containing 15 markers without linkage. JoinMap software (v 4.0) was used to sort the bin markers for each subpopulation using the maximum likelihood method, and genetic distances were calculated using the Kosambi function. Ultimately, 1802 bin markers were assigned to the two maps (Table S2). The linkage map was visualized using the Perl SVG module, which depicts the distribution of bins on each chromosome (Figure 6). The linkage map of sub-pop3 had a total length of 1045.83 cM with an average marker distance of 1.07 cM. The longest chromosome was chromosome 1, spanning 169 cM, and the shortest was chromosome 10, measuring 33.23 cM. The linkage map of sub-pop4 had a total length of 828.27 cM with an average marker spacing of 1.01 cM. The longest chromosome was chromosome 4 with a length of 151.08 cM, and the shortest was chromosome 9 with a length of 41.88 cM.
Collinearity analysis was performed by comparing marker positions on the genetic map (Figure S2) with a physical map of the two RIL populations. The order of most markers in both linkage maps in Figure S1 is consistent with each chromosome, indicating strong collinearity and high mapping accuracy during linkage mapping in this study.

2.5.3. QTL Mapping

QTL mapping was performed for KRN in two subpopulations (sub-pop3 and sub-pop4) of the multi-parent population across five different environments (19DH, 19BS, 21YS, 21JH, and 22YS). The best linear unbiased prediction (BLUP) for KRN was predicted for all environments. Subsequently, QTL mapping was conducted for BLUP and all individual environments for the two subpopulations using WinQTLcart 2.5 software and the composite interval mapping method (Figure 7). The LOD threshold was determined based on 1000 permutations with a significance level of p ≤ 0.05 [56]. QTLs with LOD scores ≥ 2.5 were considered significantly associated with KRN. Using this threshold, 178 QTLs were detected in five environments and BLUP for sub-pop3. These QTLs were distributed across chromosomes 1, 3, 4, 5, 7, and 9 (Table S3). Marker 95 on chromosome 1 in 19DH had the highest LOD (6.299). In sub-pop3, chromosome 4 had the highest number of QTL exceeding the threshold. In contrast, no QTLs exceeding the threshold of 2.5 were detected in the BLUP and 19DH environment for sub-pop4. However, 99 QTLs exceeding this threshold were identified in the remaining five environments and were distributed across chromosomes 1, 2, 4, and 5, with the highest number of QTLs detected on chromosome 4 (Table S3). Among the five environments and BLUP, mk388 on chromosome 4 in 21JH exhibited the highest LOD (5.657). Notably, 52 stable QTLs were consistently mapped in the five environments and BLUP in the two subpopulations (Table S4). The sub-pop3/22YS environment and sub-pop3/BLUP shared seven QTLs. For the pop4 subpopulation, the sub-pop4/21JH and sub-pop4/21YS environments shared one QTL, whereas the sub-pop4/19BS environment shared another QTL.
Across the five environments and BLUPs in the two subpopulations, we identified 677 genes, some of which were repeatedly identified in different subpopulations and some in different environments in the same subpopulation (Table 4). For instance, on chromosome 4, 648 genes were detected in sub-pop3 in the 21JH environment and sub-pop4 in 22YS environment. In sub-pop3 in the 22YS environment, 29 co-located genes were identified on chromosome 4 using BLUP. In some cases, significant QTLs were identified, but no candidate genes were detected. For instance, sub-pop3 in the 19BS environment and sub-pop4 in the 21YS environment shared a common QTL within the interval of 120,350,778–120,390,778 bp on chromosome 5. However, no gene was detected at this position in the B73 reference genome. Nevertheless, when the Mo17 reference genome was used, we successfully detected the gene Zm00014a012929 within this QTL. This result suggests that when no candidate gene is detected in the B73 reference genome, an alternate reference genome can potentially be used to identify candidate genes associated with significant QTL.

2.6. Joint Analysis of GWAS and QTL

By comparing the genes identified via GWAS with those obtained from QTL mapping, we identified 18 genes consistently associated with KRN in both analyses (Table 5). Among these genes, 11 were located on chromosome 1, two on chromosome 4, and the remaining five on chromosome 5. These 18 genes were consistently identified using GWAS and QTL analyses in two or more environments. For instance, in GWAS analysis conducted in the 21JH environment and QTL analysis in the pop3/19DH environment, 11 overlapping genes (Zm00001d031666, Zm00001d031667, Zm00001d031668, Zm00001d031669, Zm00001d031709, Zm00001d031713, Zm00001d031715, Zm00001eb036870, Zm00001eb036890, Zm00001d031648, and Zm00001d031772) were identified. The phenotypic variation explained (PVE) by these genes ranged from 4.73% to 8.41%. Furthermore, GWAS analysis in the 21JH environment and QTL analysis in the pop3/21JH and pop4/22YS environments identified two genes (Zm00001d050984 and Zm00001d050985) that were consistently associated with KRN, which explained 6.7% of the PVE for KRN. Similarly, GWAS analysis in the 21JH environment and QTL analysis in the pop4/21YS environment revealed four genes (Zm00001d016000, Zm00001d016202, Zm00001d016203, and Zm00001d016204) that were consistently linked to KRN, explaining 8.4% to 10.4% of the phenotypic variation for KRN. Interestingly, one molecular marker showed a significant association with KRN across the three environments through GWAS analysis in the 19DH environment, and QTL analysis in the pop3/19BS and pop4/21YS environments. However, no candidate genes were identified when the B73 genome reference was used for analysis. Intriguingly, when the Mo17 reference genome was used, a gene associated with KRN, Zm00014a012929, was identified, which explained 8.83% of the KRN variation. This underscores the importance of considering different reference genomes in candidate gene annotation, as it can provide valuable insights into the genes that regulate KRN.
To further validate the 18 co-located candidate genes, we examined their functions and expression levels in the maizeGDB (Table 6). The data in Table 5 showed that Zm00001d050984 and Zm00001d050985 were consistently identified in all three environments through either GWAS or QTL analysis. Zm00001d050985 is expressed in all ear-related tissues and encodes the Ultraviolet-B (UV-B) receptor UVR8, which serves as a light sensor in plants, detecting UV-B wavelengths. UVR8 regulates plant responses to UV-B light and promotes plant growth and development [57,58]. Zm00001d050984 belongs to the Rab GDI displacement factor A 2 (CHML) gene family [59,60]. It functions as a component of the Rab geranylgeranyltransferase (RGGT) complex, which is responsible for post-translational modifications by adding geranylgeranyl groups to the C-termini of Rab proteins. Although Zm00001d050984 was not significantly expressed in formation-related tissues, it shared the same SNP as Zm00001d050985. Furthermore, approximately 1 Mb downstream of this SNP, there is an fea2 (fasciated ear 2) gene known to cause flattened cobs and irregular KRNs [61,62,63]. Therefore, we considered Zm00001d050984 and Zm00001d050985 as potential candidate genes that influence KRN. Zm00001d016000 belongs to the Myb family of transcription factors. The 1R (R1/2, R3-MYB) subgroup of Myb transcription factors is primarily involved in various physiological processes such as plant morphology, secondary metabolism, circadian rhythm control, and flower fruit development [64]. It also plays a crucial role in diverse physiological processes, including flower organ development and pollen germination [65,66,67,68]. However, based on comprehensive analyses of gene function and expression levels [69,70,71,72], no additional information was found for the other seven genes (Table 6).

2.7. Haplotype Analysis

Based on the co-localized candidate gene information from GWAS and QTL analyses and the duplicated pathways obtained from KEGG/GO enrichment, we selected three candidate genes for haplotype analysis. Zm00001d050984 and Zm00001d050985, which were co-located in the three environments, and Zm00001d016000, which may be directly related to KRN, were selected for haplotype analysis. Zm00001d016000 had three haplotypes (hap-1, GGAGG; hap-2, CGAGG; and hap-3, GACAA), and there were no significant differences among these three haplotypes. Zm00001d050984 had two haplotypes, hap-1 (CC) and hap-2 (TT); however, the difference between the two haplotypes was not significant. Zm00001d050985 shares the same haplotype pattern as Zm00001d050884, and the difference between the two haplotypes was also not significant. Different haplotypes of the same gene may affect gene function, thereby influencing maize KRN. When there are no significant differences among the various haplotypes, they may also co-regulate the gene and make the same contribution to gene expression (Figure 8).

2.8. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) Enrichment Analysis

To precisely identify KRN-associated genes among the large number of candidate genes, we conducted GO and KEGG enrichment analyses to further narrow down the candidate genes identified by GWAS and linkage analysis. GO enrichment analysis provided multiple GO terms, covering three hierarchical levels: cellular components (CC), molecular functions (MF), and biological processes (BP) [81]. The results obtained from GO and KEGG analyses were encouraging. In sub-pop3 for the BLUP and 22YS environments, despite the identification of 29 overlapping genes (Table 4), GO enrichment analysis did not yield significant bioprocesses associated with these genes. Therefore, these 29 genes may not be considered KRN-associated candidate genes. In the case of the sub-pop3 in the 21JH environment and sub-pop4 in the 22YS environment, 648 co-located genes (Table 4) participated in 271 biological processes, primarily related to transport and metabolic processes. These genes were associated with 77 cellular components, including the Golgi apparatus, complexes, and other organelles, and 168 molecular functions involving bioregulation and enzyme activity (Table S5 and Figure S2). In terms of biological processes (BP), the highly correlated category was transmembrane transport, which includes processes such as “nucleotide-sugar transmembrane transport (GO:0015780)” and “pyrimidine nucleotide-sugar transmembrane transport (GO:0090481)”. In cellular components (CC), the highly correlated category was complexes, including the “catalytic complex (GO:1902494)”, “ubiquitin ligase complex (GO:0000151)”, “dynein complex (GO:0030286)”, “respiratory chain complex (GO:0098803)”, and “SAGA-type complex (GO:0070461)”, among others. For molecular function (MF), the highly correlated category included “catalytic activity, specifically acting on a glycoprotein (GO:0140103)”, “nucleotide-sugar transmembrane transporter activity (GO:0005338)”, and “pyrimidine nucleotide-sugar transmembrane transporter activity (GO:0015165)” (Table 7).
After comparing the genes identified from GWAS and QTL analyses with GO annotations, we identified the gene Zm00001d050984, which was associated with multiple GO terms. This gene was linked to ten GO terms encompassing both BP and MF. In BP, it participates in “intracellular signal transduction (GO:0035556), signal transduction (GO:0007165), signaling (GO:0023052), small GTPase-mediated signal transduction (GO:0007264), and cell communication (GO:0007154)”, indicating its involvement in various signaling pathways. In MF, it is associated with “nucleoside-triphosphatase regulator activity (GO:0060589), GTPase regulator activity (GO:0030695), GDP-dissociation inhibitor activity (GO:0005092), enzyme regulator activity (GO:0030234), and molecular function regulator (GO:0098772)”, suggesting its role in regulating enzyme activity (Table 8). This Zm00001d050984 gene appears to be crucial for coordinating multiple signaling pathways and regulating enzyme activity, highlighting its importance in cellular function and response to external stimuli.
After identifying the key cellular components, molecular functions, and biological processes associated with these genes, we conducted KEGG enrichment analysis to reveal their intricate relationships and networks. The results showed that the sub-pop3 for the BLUP and 22YS environments shared genes enriched in eight metabolic pathways, and the most highly correlated categories include “fatty acid elongation (zma00062)” and “ascorbate and aldarate metabolism (zma00053)”. However, the category with the highest number of participating genes was “biosynthesis of cofactors (zma01240)”. Those in sub-pop3 for the 21JH environment and sub-pop4 for the 22YS environment were enriched in 72 metabolic pathways (Table S6 and Figure S2). Some of the highly correlated categories included “butanoate metabolism (zma00650)”, “monoterpenoid biosynthesis (zma00902)”, and “circadian rhythm—plant (zma04712)” (Table 9). The authors suggested that genes associated with these pathways could be considered key candidate genes for further validation.
Through GO and KEGG enrichment analyses, we compared the pathway information with co-located genes identified by GWAS–QTL analyses. In addition to Zm00001d050984, which has multiple GO terms, we selected two additional genes, Zm00001d016202 and Zm00001d050985. Zm00001d016202 encodes the E3 ubiquitin protein ligase COP1. This gene coincides with the “ubiquitin ligase complex (GO:0000151)” term in the GO analysis of sub-pop3 in the 21JH environment and sub-pop4 in the 22YS environment. Zm00001d050985 encodes a UV-B wavelength receptor, which might be associated with the “Circadian rhythm—plant (zma04712)” term in the KEGG analysis for the sub-pop3 in the 21JH environment and sub-pop3 in the 22YS environment. It is possible that UVR8 signaling, particularly in response to UV-B radiation, indirectly affects circadian clock regulation in plants.

3. Discussion

3.1. 18 Co-Located Genes Were Confirmed to Be Associated with KRN

Linkage analysis and GWAS have been widely used to identify candidate genes associated with quantitative traits [82,83,84]. In this study, we developed a multi-parent population comprising 655 RILs by crossing Ye107 from the Reid heterotic group with CML312 and CML444 from the non-Reid group and YML32 and YML46 from the Suwan heterotic group. By combining the results from GWAS and genetic linkage analyses, we identified 18 candidate genes that were co-located in multiple environments (Table 5). The functions of these genes were consistent with those of the GO- and KEGG-enriched pathways. Although previous researchers have identified KRN-associated QTLs on the same chromosomes as in this study [16,85], the physical locations of these KRN-associated QTLs were different. Therefore, we believe that the candidate genes identified in this study are distinct from those reported in previous studies and represent novel candidate genes associated with KRN. Furthermore, our genetic linkage analysis results indicated that some significant QTLS overlapped with the genes that had already been cloned (Table S7). This finding enhances the reliability of the results of this study. In Table S7, the QTL on chromosome 5, localized in sub-pop4/21YS, coincided with qKRN5.04 identified by An et al. [86] who found that the Zm00001d016075 gene negatively regulated KRN. Although our SNP–QTL co-localization did not pinpoint this gene, the gene we co-localized, Zm00001d016000, was approximately four million bases upstream of Zm00001d016075 and belonged to the same QTL region. Therefore, we hypothesized that they may share similar functions. Given that Zm00001d016000 is located approximately four million bases upstream of Zm00001d016075 and falls within the same QTL region, we speculate that they may possess analogous functions, particularly regarding the regulatory effect on KRN.

3.2. Key Candidate Genes Linked to KRN Were Identified through GO and KEGG Analysis

In this study, we conducted a comprehensive analysis to identify and validate candidate genes using Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and gene cloning data. These analyses are instrumental in the selection and validation of potential genes associated with the traits of interest (KRN) in maize.
Previous studies have demonstrated the effectiveness of GO and KEGG enrichment analyses in screening candidate genes. For example, Tang et al. [87] used RNA-seq to analyze the transcriptomes of near-isogenic lines and identified 1960 differentially expressed genes (DEGs). Subsequently, the authors employed GO classification and KEGG enrichment analysis to reveal the significantly enriched pathways, shedding light on the genetic basis of KRN in maize. Notably, the Zm00001d020460 gene within the “DNA binding” pathway emerged as a candidate gene. They further constructed an overexpression vector for Zm00001d020460 of the YE478 haplotype, transformed the maize inbred line B104, and observed a significant reduction in kernel width in the T1 generation of the positive transgenic lines compared to the wild type.
In our study, we initially narrowed down gene selection by identifying identical GO/KEGG pathways in multiple environments. We then compared the functions of these pathways with those of co-localized genes to screen for candidate genes associated with KRN. Notably, the “circadian rhythm—plant” term was significantly associated with the gene Zm00001d050985, identified through GWAS and genetic linkage analysis. This gene encodes the UV-B wavelength receptor. Similarly, the “ubiquitin ligase complex” pathway in GO enrichment analysis corresponded to the gene Zm00001d016202, also identified in this study, which encodes the E3 ubiquitin protein ligase COP1. Furthermore, the co-located gene Zm00001d016000 belongs to the Myb family of transcription factors. Although its exact function is yet to be determined, previous research has suggested that different light wavelengths can regulate maize mesocotyl tissue growth, potentially mediated by circadian rhythmic-related genes, thereby affecting tissue plasticity [88,89]. For example, white, blue, and UV-B light can strongly induce anthocyanin accumulation, and the expression of two key structural genes was high in maize mesocotyls [90]. Furthermore, the overexpression of cryptochrome (CRY) OsCRY1b in rice results in the shortening of the petiole and sheath, and the inhibition of elongation by blue light is related to the interaction between OsCRY1 and E3 ubiquitin protein ligase RFWD2 (COP1) [91]. Additionally, MYB transcription factors play an important role in the response of plants to UV-B light [92]. MYB73 and MYB77 interact with UVR8 to regulate lateral root growth, whereas MYB13 induces secondary flavonoid metabolism and cotyledon expansion under UV-B irradiation. Therefore, we believe that these three genes may interact to jointly regulate the number of rows per ear in maize. In summary, our study harnessed the power of GO and KEGG analyses to identify three candidate genes associated with KRN. This not only streamlined the candidate gene selection process but also offered insights into the molecular mechanisms underlying KRN variations. Collectively, these findings collectively contribute to our understanding of the genetic factors that influence KRN in maize.

3.3. Alternative Reference Genome May Help Identify Candidate Gene

In this study, one SNP located between 120,350,778–120,390,778 bp on chromosome 5 was consistently identified in three different environments, including GWAS in the 19DH environment and QTL mapping in sub-pop3 for the 19BS environment, as well as in sub-pop4 for the 21YS environment. Interestingly, when the B73 reference genome was used, no genes were detected in proximity to this SNP. We used another commonly used reference genome, Mo17, and identified the Zm00014a012929 gene in the same region. Therefore, we believe that genes related to KRN in these lines may not be present in the B73 genome. This phenomenon could be attributed to the introduction of unique gene fragments into the temperate germplasm owing to introgression from the tropical germplasm into the temperate germplasm. This finding underscores the potential of tropical germplasms in improving maize germplasm, as it may carry distinct beneficial genes lacking in temperate germplasms. The lines used in this study were from “Suwan, Reid, and non-Reid heterotic groups”, as proposed by our research group. The presence of novel genes that control KRN in maize from these “three heterotic groups” explains why a candidate gene was identified when Mo17 was used as the reference genome. This outcome further validates the accuracy of the “three heterotic group” model proposed by our research team.
Many GWAS and QTL mapping studies in maize have utilized populations that include Mo17 and B73 or their derived lines. Zhang et al. [93] employed a B73×Mo17 (IBM) Syn10 double haploid (DH) population to successfully identify 100 QTLs and 138 SNPs controlling yield-related traits in maize through a combination of GWAS and QTL mapping. In their study, 52 candidate genes were identified, with one candidate gene, SNPSYN806, which is associated with the number of rows per spike (ERN) and is closely linked to SBP transcription factor 7 (GRMZM2G098557) [93]. In another study focusing on the interaction between B73×Mo17 populations, Lin et al. constructed a QTL map for starch content and identified a major QTL, Qsta9.1, located on chromosome 9. Subsequent GWAS and co-expression network analysis highlighted GRMZM2G110929 and GRMZM5G852704 as potential candidate genes influencing the starch content in maize grains [94].
In contrast, some studies independently analyzed Mo17 and B73 and subsequently compared their disparities. For example, Song et al. investigated the responses of susceptible B73 and resistant Mo17 maize strains to aphids at both metabolite and transcriptome levels. Surprisingly, both strains exhibited time-specific responses to aphid infestation, and even in the absence of herbivory effects, significant differences in gene expression were observed between the two strains [95].
It is worth noting that our study adopted a different approach from the previous ones; we used different reference genomes to identify the candidate genes. The results highlighted that the candidate gene Zm00014a012929 was exclusively obtained using the Mo17 reference genome and was absent when the B73 reference genome was used. This finding suggests that utilizing different maize reference genomes, especially in cases involving diverse maize heterosis groups, could enhance the chances of identifying pertinent candidate genes. Consequently, this approach holds promise for future research endeavors in maize research.

3.4. Key Candidate Genes Associated with KRN

In this study, we identified 277 significant QTLs and 38 significant SNPs that play crucial roles in determining KRN in maize. We further annotated the genes located within a 20 kb range upstream and downstream of these QTLs and SNPs, revealing multiple genes related to KRN. After considering factors such as frequency of occurrence, expression pattern, and functional annotation, and supplemented by GO and KEGG analyses, we identified seven candidate genes associated with KRN. During GWAS analysis, a significant SNP, SNP-177304649, was consistently identified in the 19DH environment. This SNP was associated with two genes, Zm00001d022420 and Zm00001d022421, which were expressed in the KRN-related tissues. Additionally, a bd1 (branchedsilkless1) gene was found downstream of this SNP. A total of 18 genes were identified through co-localization using GWAS and genetic linkage analysis. Among these, four genes were identified as candidate genes related to KRN. First, Zm00001d016202, annotated as the E3 ubiquitin-protein ligase COP1 and associated with the “ubiquitin ligase complex” pathway identified through GO annotation, exhibited a phenotypic variation of 10.64% and was expressed in tissues related to KRN. Next, we identified Zm00001d050985, a UV-B receptor associated with the “circadian rhythm—plant” pathway using KEGG analysis. Zm00001d050984, which is linked to ten GO terms, appears to be crucial for coordinating multiple signaling pathways and regulating enzyme activity, which may be closely related to the formation of KRN. Downstream of Zm00001d050985 and Zm00001d050984, another gene, fea2 (fasciated ear 2), was found to be associated with KRN and was expressed in KRN-related tissues. Zm00001d016000, a transcription factor of the Myb family, is highly expressed in tissues related to KRN [96]. Finally, for Zm00014a012929, although the gene function, expression level, and metabolic pathway remain unclear, it was consistently co-located in most environments in this study and was exclusively identified in the Mo17 reference genome. This gene may not only regulate KRN but also support the validation of the “three heterotic group” model proposed by our research group.
In conclusion, these seven candidate genes are likely to be involved in the regulation of KRN formation in maize. Future research should focus on conducting differential gene expression analyses and validating gene functions to confirm their role in KRN regulation in maize. This study not only provides new genetic markers and genomic resources for exploring the genetic structure and molecular mechanism of maize KRN but also lays a critical foundation for validating and cloning functional genes related to maize kernel row number formation.

4. Materials and Methods

4.1. Experimental Materials

We used Ye107, a foundational inbred line in China, as the common male parent. Additionally, we selected four parental inbred lines, CML312, CML444, YML46, and YML32, each characterized by significant genetic diversity in terms of KRN, as female parents. The hybridization of these four parental inbred lines with the common male parent Ye107 resulted in the development of four F1 hybrids. Subsequently, we conducted eight successive generations of selfing of the four F1s using the single-seed descent method, leading to the establishment of four distinct recombinant inbred line (RIL) subpopulations. The F8 generation subpopulations resulting from crosses between CML312, CML444, YML46, and YML32 with Ye107 were denoted as sub-pop1, sub-pop2, sub-pop3, and sub-pop4, respectively. Each RIL subpopulation comprised approximately 200 recombinant inbred lines, resulting in 655 RIL lines constituting the multi-parent population. Ye107 has a significant position as a foundation maize parent in China. This line and its subsequent inbred lines were categorized under the Reid group, which is a prominent heterotic group in maize breeding in China (Table 10).

4.2. Field Experimental Design and Phenotyping

RILs of the multi-parent population were planted in Dehong (DH) and Baoshan (BS) in Yunnan Province in 2019. The experiments were set up in a randomized complete block design (RCBD) with two replications at each location. Each experimental plot consisted of 4 m long rows with 0.70 m inter-row spacing. Plant-to-plant spacing was 25 cm, with 14 plants per row. Spring trials in 2021 and 2022 were conducted in Yanshan, Yunnan Province, using a Latin square design. Each row had a length of 2.5 m, with a space of 25 cm between plants, and there were 14 plants per row, with two replications. During the winter of 2021, the trial was conducted in Jinghong, Yunnan Province using a random block design. Each row had a length of 2.5 m, with 14 plants per row spaced at 25 cm, with two replications. After the maize cobs matured and dried, five ears were randomly selected from each plot to investigate the KRN.

4.3. Phenotypic Data Analysis

Excel 2019 and IBM SPSS Statistics 20 were used to conduct the normal distribution, variance, correlation, and coefficient of variation analyses. The coefficient of variation (CV) was calculated using the following formula: CV = (standard deviation SD/mean) × 100%. The R package ‘lem4’ was used to calculate heritability. Heritability (h2) was estimated using the formula proposed by Knapp et al. [97]: h2 = σg2/(σg2 + σge2/e + σε2/re), where σg2 represents the genetic variance, σgl2 is the variance of the genotype–environment interaction, σe2 is the error variance, n is the number of sites, and y is the number of years.

4.4. Genotyping

The parental lines and RILs of the multi-parent population were genotyped via the genotyping-by-sequencing (GBS) method. DNA was extracted from 655 RILs of the four subpopulations, along with their respective parents, using the method described by Elshire et al. [98]. After the DNA library of 300 bp size was constructed and sequenced on the Illumina NovaSeq 6000 platform (Illumina Inc., San Diego, CA, USA) with a read length of 2 × 150 bp, the raw sequencing reads were processed to remove reads with adapters and low-quality reads, resulting in high-quality filtered reads. Single nucleotide polymorphisms (SNPs) and insertions and deletions (InDels) were detected using GATK software v4.1.4.0. Subsequent association analysis was performed using 590,816 high-quality SNPs.

4.5. Structure Analysis

Phylogenetic tree analysis was conducted using Tassel v5.0 software, utilizing 590,816 high-quality SNPs to assess the genetic relationships among the 655 RILs. Principal component analysis was performed using R 4.3.2, and the results were visualized using the scatterplot3d software package.

4.6. Genome-Wide Association Analysis

Genome-wide association analysis was conducted using the GEMMA software (open model: https://github.com/genetics-statistics/GEMMA/releases, accessed on 1 June 2023) [99], and 590,816 high-quality SNPs were used in the analysis. The analysis employed a mixed linear model (MLM) for GWAS for both the average and BLUP values for all environments. Significance was determined at the p < 0.0001 level using the formula −log10 (1/Total number of SNPs) to assess the association between SNP markers and panicle number. The distribution of markers is shown in the Manhattan plot, and the Q-Q plot was used to evaluate the accuracy of the association analysis results.

4.7. Haplotype Analysis

Haploview v4.2 software was used to analyze genes that were consistently detected in multiple environments whose functions were related to KRN.

4.8. Candidate Gene Screening

SNPs obtained from GWAS results were annotated using the ANNOVAR software (v2013-05-20). Based on the annotated gene numbers, the SNPs were searched and compared in maizeGDB and NCBI (http://www.ncbi.nlm.nih.gov/ (accessed on 15 September 2023)) databases, leading to the identification and functional annotation of candidate genes.

4.9. Construction of Genetic Linkage Map

The JoinMap4.0 software was used for linkage map construction. The LOD threshold was determined using 1000 random permutation tests, with a significance level of p ≤ 0.05 [100]. QTLs were considered significant if the LOD threshold was ≥2.5. Linkage groups were defined based on the criteria of an LOD critical value ≥ 2.5, and a genetic linkage map was constructed using eligible SNP markers. SNP markers within each linkage group were sorted using the maximum likelihood method and unlinked markers were excluded. The genetic distance (cM) between markers was computed using the Kosambi mapping function. Polymorphic SNPs with distinct alleles in both parents were used to construct genetic maps of sub-pop3 and sub-pop4. Markers showing segregation distortion were filtered out using the Chi-square test (p < 0.001), and markers with an integrity <95% were also excluded.

4.10. QTL Mapping Analysis

QTL mapping was conducted using the composite interval mapping (CIM) method in Windows QTL Cartographer 2.0. Phenotypic data for KRN were integrated to the high-density genetic linkage map, and a step distance of 0.21 cM was used to identify parental polymorphic QTLs. The threshold was established using 1000 random permutations at a 95% confidence interval (CI). The LOD thresholds associated with flanking markers were set to 2.5 to identify QTLs controlling KRN.

4.11. GO and KEGG Enrichment Analysis

Based on the results of linkage disequilibrium decay distance analysis, genes located within 20kb upstream and downstream of the significant SNPs were functionally annotated using GO and KEGG analyses. The annotation process helped further validate and identify candidate genes associated with the target traits obtained through GWAS and QTL analyses. The p-values for GO and KEGG analyses were calculated using the following formula:
P = 1 i = 0 m 1 M i N M n i N n

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms25063377/s1.

Author Contributions

Data curation, X.Y. and F.J.; formal analysis, Y.B.; investigation, F.R.; supervision, X.F.; writing—original draft, Y.W.; writing—review and editing, R.K.S. and X.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China Project (U2202204), the Joint Laboratory Project of Yunnan Province Seed Industry (202205AR070001), and the Special Support for Seed Industry of Food Security in Yunnan Province (530000210000000013809).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

We appreciate the editors and reviewers for their valuable comments and time. We also thank Tianhui Kuang for his guidance in the GWAS analysis.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Zhou, Q.; Wang, P.X.; Cheng, B.J.; Zhu, S.; Xie, C. Meta-analysis of QTL for ear row number in maize. J. Maize Sci. 2014, 22, 35–40. [Google Scholar]
  2. Li, M.; Zhong, W.; Yang, F.; Zhang, Z. Genetic and molecular mechanisms of quantitative trait loci controlling maize inflorescence architecture. Plant Cell Physiol. 2018, 59, 448–457. [Google Scholar] [CrossRef] [PubMed]
  3. Thornsberry, J.M.; Goodman, M.M.; Doebley, J.; Kresovich, S.; Nielsen, D.; Buckler, E.S. Dwarf8 polymorphisms associate with variation in flowering time. Nat. Genet. 2001, 28, 286–289. [Google Scholar] [CrossRef] [PubMed]
  4. Lima, M.L.A.; Souza, C.L.; Bento, D.A.V.; de Souza, A.P.; Carlini-Garcia, L.A. Mapping QTL for grain yield and plant traits in a tropical maize population. Mol. Breed. 2006, 17, 227–239. [Google Scholar] [CrossRef]
  5. Lan, J.H.; Li, X.H.; Gao, S.R. QTL Analysis of Maize Yield Traits under Different Ecological Environments. Acta Agron. Sin. 2005, 31, 1253–1259. [Google Scholar]
  6. Liu, Z.H.; Tang, J.H.; Wei, X.Y.; Wang, C.L.; Tian, G.W.; Hu, Y.M.; Chen, W.C. QTL Analysis of Maize Ear Traits under Nitrogen Stress and Normal Conditions. Sci. Agric. Sin. 2007, 40, 2409–2417. [Google Scholar]
  7. Tan, W.W.; Li, Y.X.; Wang, Y.; Liu, C.; Liu, Z.; Peng, B.; Wang, D.; Zhang, Y.; Sun, B.; Shi, Y.; et al. QTL Analysis of Maize Ear Traits under Drought and Normal Moisture Conditions. Acta Agron. Sin. 2011, 37, 235–248. [Google Scholar] [CrossRef]
  8. Tan, W.W.; Wang, Y.; Li, Y.X.; Liu, C.; Liu, Z.; Peng, B.; Wang, D.; Zhang, Y.; Sun, B.; Shi, Y. QTL Analysis of Multiple Maize Ear Traits under Different Environments. Sci. Agric. Sin. 2011, 44, 233–244. [Google Scholar]
  9. Tang, J.H.; Yan, J.B.; Ma, X.Q.; Meng, Y.J.; Dai, J.R.; Li, J.S. Genetic Mechanisms of Maize Yield and Related Traits Analyzed Using Permanent F2 Populations. Acta Agron. Sin. 2007, 33, 1299–1303. [Google Scholar]
  10. Austin, D.F.; Lee, M. Comparative mapping in F 2:3 and F 6:7 generation of quantitative trait loci for grain yield and yield components in maize. Theor. Appl. Genet. 1996, 92, 817–826. [Google Scholar] [CrossRef] [PubMed]
  11. Guo, J.J.; Chen, Z.L.; Liu, Z.P.; Wang, B.; Song, W.; Li, W.; Chen, J.; Dai, J. Identification of genetic factors affecting plant density response through QTL mapping of yield component traits in maize (Zea mays L.). Euphytica 2011, 182, 409–422. [Google Scholar] [CrossRef]
  12. Li, J.Z.; Zhang, Z.W.; Li, Y.L.; Wang, Q.L.; Zhou, Y.G. QTL consistency and meta-analysis for grain yield components in three generations in maize. Theor. Appl. Genet. 2011, 122, 771–782. [Google Scholar] [CrossRef]
  13. Lu, M.; Xie, C.X.; Li, X.H.; Hao, Z.F.; Li, M.S.; Weng, J.F.; Zhang, D.G.; Bai, L. Mapping of quantitative trait loci for kernel row number in maize across seven environments. Mol. Breed. 2011, 28, 143–152. [Google Scholar] [CrossRef]
  14. Ma, X.Q.; Tang, J.H.; Teng, W.T.; Yan, J.B.; Meng, Y.J.; Li, J.S. Epistatic interaction is an important genetic basis of grain yield and its components in maize. Mol. Breed. 2007, 20, 41–51. [Google Scholar] [CrossRef]
  15. Tian, B.; Wang, J.; Wang, G. Confirmation of a major QTL on chromosome 10 for maize kernel row number in different environments. Plant Breed. 2014, 133, 184–188. [Google Scholar] [CrossRef]
  16. Cald, L.C.; Li, K.; Yang, X.H.; Li, J.S. Identification of large-effect QTL for kernel row number has the potential for maize yield improvement. Mol. Breed. 2014, 34, 1087–1906. [Google Scholar]
  17. Calderón, C.I.; Yandell, B.S.; Doebley, J.F. Fine Mapping of a QTL Associated with Kernel Row Number on Chromosome 1 of Maize. PLoS ONE 2016, 11, e0150276. [Google Scholar] [CrossRef]
  18. Bai, N.; Li, Y.X.; Jiao, F.C.; Chen, L.; Li, C.H.; Zhang, D.F.; Song, Y.C.; Wang, T.Y.; Li, Y.; Shi, Y.S. Fine Mapping and Genetic Effects Analysis of the Major Locus qKRN5.04 for Maize Ear Row Number. Acta Agron. Sin. 2017, 1, 63–71. [Google Scholar] [CrossRef]
  19. Liu, L.; Du, Y.; Shen, X.; Li, M.; Sun, W.; Huang, J.; Liu, Z.; Tao, Y.; Zheng, Y.; Yan, J.; et al. KRN4 controls quantitative variation in maize kernel row number. PLoS Genet. 2015, 11, e1005670. [Google Scholar] [CrossRef]
  20. Wang, J.; Lin, Z.; Zhang, X.; Liu, H.; Zhou, L.; Zhong, S.; Li, Y.; Zhu, C.; Lin, Z. krn1, a major quantitative trait locus for kernel row number in maize. New Phytol. 2019, 223, 1634–1646. [Google Scholar] [CrossRef]
  21. Chen, F. Analysis Methods of Endophytic Microorganisms in Plants Study on Differences in Endophytic Microbial Communities in Grains of Different Maize Germplasms. Ph.D. Thesis, Anhui Agricultural University, Hefei, China, 2022. [Google Scholar]
  22. Xiao, Y.; Tong, H.; Yang, X.; Xu, S.; Pan, Q.; Qiao, F.; Raihan, M.S.; Luo, Y.; Liu, H.; Zhang, X.; et al. Genome-wide dissection of the maize ear genetic architecture using multiple populations. New Phytol. 2016, 210, 1095–1106. [Google Scholar] [CrossRef] [PubMed]
  23. Liu, L.; Du, Y.; Huo, D.; Wang, M.; Shen, X.; Yue, B.; Qiu, F.; Zheng, Y.; Yan, J.; Zhang, Z. Genetic architecture of maize kernel row number and whole-genome prediction. Theor. Appl. Genet. 2015, 128, 2243–2254. [Google Scholar] [CrossRef] [PubMed]
  24. Fei, X.; Wang, Y.; Zheng, Y.; Shen, X.; Lizhu, E.; Ding, J.; Lai, J.; Song, W.; Zhao, H. Identification of two new QTLs of maize (Zea mays L.) underlying kernel row number using the HNAU-NAM1 population. BMC Genom. 2022, 23, 593. [Google Scholar] [CrossRef] [PubMed]
  25. An, Y.X. Genetic Analysis of Maize Ear Row Number and Candidate Gene Identification with Functional Analysis of Major QTLs. Ph.D. Thesis, Chinese Academy of Agricultural Sciences, Beijing, China, 2021. [Google Scholar]
  26. Chen, W.K.; Chen, L.; Zhang, X.; Yang, N.; Guo, J.; Wang, M.; Ji, S.; Zhao, X.; Yin, P.; Cai, L.; et al. Convergent selection of a WD40 protein that enhances grain yield in maize and rice. Science 2022, 375, eabg7985. [Google Scholar] [CrossRef]
  27. Dai, L.Q.; Wu, L.; Dong, Q.S.; Yan, G.; Qu, J.; Wang, P. Genome-Wide Association Analysis of Corn Kernel Length. J. Northwest AF Univ. (Nat. Sci. Ed.) 2018, 46, 20–28. [Google Scholar]
  28. Fei, J.; Jiang, Q.; Guo, M.; Lu, J.; Wang, P.; Liu, S.; Qu, J.; Ma, Y.; Guan, S. Fine Mapping and Functional Research of Key Genes for Photoperiod Sensitivity in Maize. Front. Plant Sci. 2022, 13, 890780. [Google Scholar] [CrossRef] [PubMed]
  29. Tian, F.; Bradbury, P.J.; Brown, P.J.; Hung, H.; Sun, Q.; Flint-Garcia, S.; Rocheford, T.R.; McMullen, M.D.; Holland, J.B.; Buckler, E.S. Genome-wide association study of leaf architecture in the maize nested association mapping population. Nat. Genet. 2011, 43, 159–162. [Google Scholar] [CrossRef] [PubMed]
  30. Yang, J.P.; Rong, T.Z.; Xiang, D.Q.; Tang, H.T.; Huang, L.J.; Dai, J.R. Genetic Mapping of Quantitative Traits in Maize. Acta Agron. Sin. 2005, 31, 188–196. [Google Scholar]
  31. Yan, J.B.; Tang, H.; Huang, Y.Q.; Zheng, Y.-L.; Li, J.-S. Quantitative trait loci mapping and epistatic analysis for grain yield and yield components using molecular markers with an elite maize hybrid. Euphytica 2006, 149, 121–131. [Google Scholar] [CrossRef]
  32. Yang, G.H.; Li, Y.L.; Wang, Q.L.; Zhou, Y.; Zhou, Q.; Shen, B.; Zhang, F.; Liang, X. Detection and integration of quantitative trait loci for grain yield components and oil content in two connected recombinant inbred line populations of high-oil maize. Mol. Breed. 2012, 29, 313–333. [Google Scholar] [CrossRef]
  33. Yu, Y.; Song, Y.; Li, Y.; Shi, Y.; Wang, T. QTL Analysis of Resistance to Asian Corn Borer in Maize Germplasm H21 and Mo17. J. Plant Genet. Resour. 2003, 4, 94–98. [Google Scholar]
  34. Zhou, H.; Zhang, X.; Liu, T.Y.; Yu, F.X. Data Analysis and Gene Discovery in High-Throughput Transcriptome Sequencing. Jiangxi Sci. 2012, 30, 607–611. [Google Scholar]
  35. Zhou, L.M.; Fang, W. Biological Functions and Subcellular Localization Analysis of Constitutively Active CIPK9 in Pollen Tube Growth. Guangxi Plant 2019, 39, 788–793. [Google Scholar]
  36. Remington, D.L.; Thornsberry, J.M.; Matsuoka, Y.; Wilson, L.M.; Whitt, S.R.; Doebley, J.; Kresovich, S.; Goodman, M.M.; Buckler, E.S., IV. Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proc. Natl. Acad. Sci. USA 2001, 98, 11479–11484. [Google Scholar] [CrossRef]
  37. Fan, X.; Tan, J.; Yang, J. Combining ability and heterotic grouping of ten temperate, Subtropical and tropical quality protein maizeinbreds. Maydica 2004, 49, 267–272. [Google Scholar]
  38. Fan, X.; Chen, H.; Tan, J.; Xu, C.; Zhang, Y.; Huang, Y.; Kang, M.S. A new maize heterotic pattern between temperate and tropical germplasms. Agron. J. 2008, 100, 917–923. [Google Scholar] [CrossRef]
  39. Fan, X.; Zhang, Y.; Yao, W.; Bi, Y.; Liu, L.; Chen, H.; Kang, M.S. Reciprocal Diallel Crosses Impact Combining Ability, Variance Estimation, and Heterotic Group Classification. Crop Sci. 2014, 54, 89–97. [Google Scholar] [CrossRef]
  40. Ling, J.J.; Li, J.; Zhu, D.; Deng, X.W. Noncanonical role of Arabidopsis COP1/SPA complex in repressing Bin2-mediated PIF3 phosphorylation and degradation in darkness. Proc. Natl. Acad. Sci. USA 2017, 114, 3539–3544. [Google Scholar] [CrossRef]
  41. Henrissat, B.; Callebaut, I.; Fabrega, S.; Lehn, P.; Mornon, J.P.; Davies, G. Conserved catalytic machinery and the prediction of a common fold for several families of glycosyl hydrolases. Proc. Natl. Acad. Sci. USA 1995, 92, 7090–7094. [Google Scholar] [CrossRef] [PubMed]
  42. Kodaira, K.S.; Qin, F.; Tran, L.S.; Maruyama, K.; Kidokoro, S.; Fujita, Y.; Shinozaki, K.; Yamaguchi-Shinozaki, K. Arabidopsis Cys2/His2 zinc-finger proteins AZF1 and AZF2 negatively regulate abscisic acid-repressive and auxin-inducible genes under abiotic stress conditions. Plant Physiol. 2011, 157, 742–756. [Google Scholar] [CrossRef] [PubMed]
  43. Meierhoff, K.; Felder, S.; Nakamura, T.; Bechtold, N.; Schuster, G. HCF152, an Arabidopsis RNA Binding pentatricopeptide repeat protein involved in the processing of chloroplast psbB-psbT-psbH-petB-petD RNAs. Plant Cell 2003, 15, 1480–1495. [Google Scholar] [CrossRef]
  44. Mili, S.; Pinol-Roma, S. LRP130, a pentatricopeptide motif protein with a noncanonical RNA-Binding domain, is bound in vivo to mitochondrial and nuclear RNAs. Mol. Cell. Biol. 2003, 23, 4972–4982. [Google Scholar] [CrossRef]
  45. Nakamura, T.; Yagi, Y.; Kobayashi, K. Mechanistic insight into tetratricopeptide repeat proteins as sequence-specific RNA-Binding proteins for organellar RNAs in plants. Plant Cell Physiol. 2012, 53, 1171–1179. [Google Scholar] [CrossRef]
  46. Narawongsanont, R.; KaBinpong, S.; Auiyawong, B.; Tantitadapitak, C. Cloning and Characterization of AKR4C14, a Rice Aldo–Keto Reductase, from Thai Jasmine Rice. Protein J. 2012, 31, 35–42. [Google Scholar] [CrossRef]
  47. Rojas, M.; Ruwe, H.; Miranda, R.G.; Zoschke, R.; Hase, N.; Schmitz-Linneweber, C.; Barkan, A. Unexpected functional versatility of the pentatricopeptide repeat proteins PGR3, PPR5, and PPR10. Nucleic Acids Res. 2018, 46, 10448–10459. [Google Scholar] [CrossRef] [PubMed]
  48. Sakamoto, H.; Maruyama, K.; Sakuma, Y.; Meshi, T.; Iwabuchi, M.; Shinozaki, K.; Yamaguchi-Shinozaki, K. Arabidopsis Cys2/His2-type zinc-finger proteins function as transcription repressors under drought, cold, and high-salinity stress conditions. Plant Physiol. 2004, 136, 2734–2746. [Google Scholar] [CrossRef] [PubMed]
  49. Schrank, B.R.; Aparicio, T.; Li, Y.; Chang, W.; Chait, B.T.; Gundersen, G.G.; Gottesman, M.E.; Gautier, J. Nuclear ARP2/3 drives DNA break clustering for homology-directed repair. Nature 2018, 559, 61–66. [Google Scholar] [CrossRef]
  50. Welch, M.D.; DePace, A.H.; Verma, S.; Iwamatsu, A.; Mitchison, T.J. The human Arp2/3 complex is composed of evolutionarily conserved Subunits is localized to cellular regions of dynamic actin filament assembly. J. Cell Biol. 1997, 138, 375–384. [Google Scholar] [CrossRef]
  51. Small, I.D.; Peeters, N. The PPR motif—A TPR-related motif prevalent in plant organellar proteins. Trends Biochem. Sci. 2000, 25, 46–47. [Google Scholar] [CrossRef] [PubMed]
  52. Armes, N.; Gilley, J.; Fried, M. The comparative genomic structure and sequence of the surfeit gene homologs in the puffer fish Fugu rubripes and their association with CpG-rich islands. Genome Res. 2021, 7, 1138–1152. [Google Scholar] [CrossRef]
  53. Van Hove, J.; Fouquaert, E.; Smith, D.F.; Proost, P.; Van Damme, E.J. Lectin activity of the nucleocytoplasmic EUL protein from Arabidopsis thaliana. Biochem. Biophys. Res. Commun. 2011, 414, 101–105. [Google Scholar] [CrossRef]
  54. Chuck, G.; Muszynski, M.; Kellogg, E.; Hake, S.; Schmidt, R.J. The control of spikelet meristem identity by the branched silkless1 gene in maize. Science 2002, 298, 1238–1241. [Google Scholar] [CrossRef]
  55. Van Hove, J.; De Jaeger, G.; De Winne, N.; Guisez, Y.; Van Damme, E.J. The Arabidopsis lectin EULS3 is involved in stomatal closure. Plant Sci. 2015, 238, 312–322. [Google Scholar] [CrossRef]
  56. Churchill, G.; Doerge, R. Empirical threshold values for quantitative trait mapping. Genetics 1994, 138, 963–971. [Google Scholar] [CrossRef]
  57. Yang, Y.; Liang, T.; Zhang, L.; Shao, K.; Gu, X.; Shang, R.; Shi, N.; Li, X.; Zhang, P.; Liu, H. UVR8 interacts with WRKY36 to regulate HY5 transcription and hypocotyl elongation in Arabidopsis. Nat. Plants 2018, 4, 98–107. [Google Scholar] [CrossRef] [PubMed]
  58. Liang, T.; Mei, S.; Shi, C.; Yang, Y.; Peng, Y.; Ma, L.; Wang, F.; Li, X.; Huang, X.; Yin, Y.; et al. UVR8 Interacts with BES1 and BIM1 to Regulate Transcription and Photomorphogenesis in Arabidopsis. Dev. Cell 2018, 44, 512–523.e5. [Google Scholar] [CrossRef] [PubMed]
  59. Chen, T.W.; Yin, F.F.; Yuan, Y.M.; Guan, D.-X.; Zhang, E.; Zhang, F.-K.; Jiang, H.; Ma, N.; Wang, J.-J.; Ni, Q.-Z.; et al. CHML promotes liver cancer metastasis by facilitating Rab14 recycle. Nat. Commun. 2019, 10, 2510. [Google Scholar] [CrossRef] [PubMed]
  60. Fan, J.D.; Wang, W.; Liang, A.H. Structure, Function, and Evolution of Rab Proteins. Chem. Life 2004, 24, 31–33. [Google Scholar] [CrossRef]
  61. Bommert, P.; Nagasawa, N.S.; Jackson, D. Quantitative variation in maize kernel row number is controlled by the FASCIATED EAR2 locus. Nat. Genet. 2013, 45, 334–337. [Google Scholar] [CrossRef] [PubMed]
  62. Taguchi-Shiobara, F.; Yuan, Z.; Hake, S.; Jackson, D. The fasciated ear2 gene encodes a leucine-rich repeat receptor-like protein that regulates shoot meristem proliferation in maize. Genes Dev. 2001, 15, 2755–2766. [Google Scholar] [CrossRef]
  63. Huang, J.; Dai, W.; Xiao, D.; Xiong, Q.; Liu, C.; Hu, J.; Ge, F.; Yu, X.; Li, S. Acetylation-dependent SAGA complex dimerization promotes nucleosome acetylation and gene transcription. Nat. Struct. Mol. Biol. 2022, 29, 261–273. [Google Scholar] [CrossRef]
  64. Cao, Y.; Li, K.; Li, Y.; Zhao, X.; Wang, L. MYB Transcription Factors as Regulators of Secondary Metabolism in Plants. Biology 2020, 9, 61. [Google Scholar] [CrossRef]
  65. Hu, Y.; Wang, Z.; Cao, X. Progress of functional studies on plant MYB transcription factors. Anhui Agron. Bull. 2023, 29, 60–62, Mol. Plant Breed.2011, 9, 1835–1842. [Google Scholar] [CrossRef]
  66. Brown, P.J.; Upadyayula, N.; Mahone, G.S.; Tian, F.; Bradbury, P.J.; Myles, S.; Holland, J.B.; Flint-Garcia, S.; McMullen, M.D.; Buckler, E.S.; et al. Distinct genetic architectures for male and female inflorescence traits of maize. PLoS Genet. 2011, 7, e1002383. [Google Scholar] [CrossRef]
  67. Buckler, E.S.; Holland, J.B.; Bradbury, P.J.; Acharya, C.B.; Brown, P.J.; Browne, C.; Ersoz, E.; Flint-Garcia, S.; Garcia, A.; Glaubitz, J.C.; et al. The genetic architecture of maize flowering time. Science 2009, 325, 714–718. [Google Scholar] [CrossRef]
  68. Chen, Q.; Tang, H.R.; Dong, X.L.; Hou, Y.X.; Luo, Y.; Jiang, Y.; Huang, Q.Y. Research Progress on Plant Myb Transcription Factors. Genom. Appl. Biol. 2009, 28, 365–372. [Google Scholar]
  69. Yang, F.; Li, T.; Peng, Z.; Liu, Y.; Guo, Y. The amphipathic helices of Arfrp1 and Arl14 are sufficient to determine Subcellular localization. J. Biol. Chem. 2020, 295, 16643–16654. [Google Scholar] [CrossRef]
  70. Li, Y.; Du, Y.; Huai, J.; Jing, Y.; Lin, R. The RNA helicase UAP56 and the E3 ubiquitin ligase COP1 coordinately regulate alternative splicing to repress photomorphogenesis in Arabidopsis. Plant Cell. 2022, 34, 4191–4212. [Google Scholar] [CrossRef] [PubMed]
  71. Ohno, Y.; Kashio, A.; Ogata, R.; Ishitomi, A.; Yamazaki, Y.; Kihara, A. Analysis of Substrate specificity of human DHHC protein acyltransferases using a yeast expression system. Mol. Biol. Cell 2012, 23, 4543–4551. [Google Scholar] [CrossRef] [PubMed]
  72. Gonzalez Montoro, A.; Chumpen Ramirez, S.; Valdez Taubas, J. The canonical DHHC motif is not absolutely required for the activity of the yeast S-acyltransferases Swf1 and Pfa4. J. Biol. Chem. 2015, 290, 22448–22459. [Google Scholar] [CrossRef] [PubMed]
  73. Hemsley, P.A.; Kemp, A.C.; Grierson, C.S. The TIP GROWTH DEFECTIVE1 S-acyl transferase regulates plant cell growth in Arabidopsis. Plant Cell 2005, 17, 2554–2563. [Google Scholar] [CrossRef]
  74. Maiden, M.C.; Davis, E.O.; Baldwin, S.A.; Moore, D.C.; Henderson, P.J. Mammalian and bacterial sugar transport proteins are homologous. Nature 1987, 325, 641–643. [Google Scholar] [CrossRef]
  75. Mueckler, M.; Caruso, C.; Baldwin, S.A.; Panico, M.; Blench, I.; Morris, H.R.; Allard, W.J.; Lienhard, G.E.; Lodish, H.F. Sequence and structure of a human glucose transporter. Science 1985, 229, 941–945. [Google Scholar] [CrossRef]
  76. Bao, H.Y. Structural Basis for the Selective Recognition of RhoA by Human Protein ARAP3 and the Specific Recognition of 18S Ribosomal RNA by Arabidopsis Protein APUM23. Ph.D. Thesis, University of Science and Technology of China, Anhui, China, 2017. [Google Scholar]
  77. Wang, Y.; Shao, L.; Shi, S.; Harris, R.J.; Spellman, M.W.; Stanley, P.; Haltiwanger, R.S. Modification of epidermal growth factor-like repeats with O-fucose Molecular cloning and expression of a novel GDP-fucose protein O-fucosyltransferase. J. Biol. Chem. 2001, 276, 40338–40345. [Google Scholar] [CrossRef]
  78. Engh, R.A.; Bossemeyer, D. Structural aspects of protein kinase control-role of conformational flexibility. Pharmacol. Ther. 2002, 93, 99–111. [Google Scholar] [CrossRef]
  79. Shen, Q.; Guo, W.; Jiang, L. Protein Phosphatase B Subunit CBL and Its Regulatory Kinase Protein CIPK. Plant Res. 2012, 1, 9–12. [Google Scholar]
  80. Han, X.; Huang, X.; Deng, X.W. The Photomorphogenic Central Repressor COP1: Conservation and Functional Diversification during Evolution. Plant Commun. 2020, 1, 100044. [Google Scholar] [CrossRef] [PubMed]
  81. Liu, X.; Xu, X.; Li, B.; Wang, X.; Wang, G.; Li, M. RNA-Seq transcriptome analysis of maize inbred carrying nicosulfuron-tolerant and nicosulfuron-susceptible alleles. Int. J. Mol. Sci. 2015, 16, 5975–5989. [Google Scholar] [CrossRef] [PubMed]
  82. He, W.; Zhu, Y.; Leng, Y.; Yang, L.; Zhang, B.; Yang, J.; Zhang, X.; Lan, H.; Tang, H.; Chen, J.; et al. Transcriptomic Analysis Reveals Candidate Genes Responding Maize Gray Leaf Spot Caused by Cercospora Zeina. Plants 2021, 10, 2257. [Google Scholar] [CrossRef] [PubMed]
  83. Liu, F.X.; Yang, W.G.; Sun, Q.H. Research on Transcriptome Sequencing Data Analysis and High-Throughput Gene Ontology Annotation Theory. Anhui Agric. Sci. 2018, 46, 88–91. [Google Scholar]
  84. Liu, F.P. Meta-QTL Analysis of Lodging-Related Traits and Candidate Gene Identification in Maize. Ph.D. Thesis, Yanbian University, Jilin, China, 2019. [Google Scholar]
  85. Li, K.; Tassinari, A.; Giuliani, S.; Rosignoli, S.; Urbany, C.; Tuberosa, R.; Salvi, S. QTL mapping identifies novel major loci for kernel row number-associated ear fasciation, ear prolificacy and tillering in maize (Zea mays L.). Front. Plant Sci. 2023, 13, 1017983. [Google Scholar] [CrossRef] [PubMed]
  86. An, Y.X.; Chen, L.; Li, Y.X.; Li, C.; Shi, Y.; Zhang, D.; Li, Y.; Wang, T. Fine mapping qKRN5.04 provides a functional gene negatively regulating maize kernel row number. Theor. Appl. Genet. 2022, 135, 1997–2007. [Google Scholar] [CrossRef]
  87. Tang, B. Fine Mapping and Candidate Gene Identification for Maize Kernel Length qKL1.07 and Kernel Width qKW7b. Ph.D. Thesis, Chinese Academy of Agricultural Sciences, Beijing, China, 2020. [Google Scholar]
  88. Dubois, P.G.; Olsefski, G.T.; Flint-Garcia, S.; Setter, T.L.; Hoekenga, O.A.; Brutnell, T.P. Physiological and genetic characterization of end-of-day far-red light response in maize seedlings. Plant Physiol. 2010, 154, 173. [Google Scholar] [CrossRef]
  89. Shi, J.; Zhao, X.Q.; Niu, Y.N.; Chen, X.J.; Ren, X.W. Brassinosteroid Affects the Elongation of Mesocotyl and Coleoptile in Zea mays L. by Regulating the Network of Circadian Rhythm under Severe Stress. Russ. J. Plant Physiol. 2022, 69, 90. [Google Scholar] [CrossRef]
  90. Guo, K.; Hou, L.D.; Zhang, Y.Y.; Zhou, K.M.; Zhang, B.L.; Zhou, W.H. Research Progress on the Plant MYB Gene Family. J. Yangtze Univ. (Nat. Sci. Ed.) 2020, 17, 93–98. [Google Scholar] [CrossRef]
  91. Liu, M.; Sun, W.; Ma, Z.; Guo, C.; Chen, J.; Wu, Q.; Wang, X.; Chen, H. Integrated network analyses identify MYB4R1 neofunctionalization in the UV-B adaptation of Tartary buckwheat. Plant Commun. 2022, 3, 100414. [Google Scholar] [CrossRef]
  92. Xu, Y.; Zhu, Z. UV-B Response: When UVR8 Meets MYBs. Trends Plant Sci. 2020, 25, 515–517. [Google Scholar] [CrossRef] [PubMed]
  93. Zhang, X.; Guan, Z.; Li, Z.; Liu, P.; Ma, L.; Zhang, Y.; Pan, L.; He, S.; Zhang, Y.; Li, P.; et al. A combination of linkage mapping and GWAS brings new elements on the genetic basis of yield-related traits in maize across multiple environments. Theor. Appl. Genet. 2020, 133, 2881–2895. [Google Scholar] [CrossRef]
  94. Lin, F.; Zhou, L.; He, B.; Zhang, X.; Dai, H.; Qian, Y.; Ruan, L.; Zhao, H. QTL mapping for maize starch content and candidate gene prediction comBined with co-expression network analysis. Theor. Appl. Genet. 2019, 132, 1931–1941. [Google Scholar] [CrossRef] [PubMed]
  95. Song, J.; Liu, H.; Zhuang, H.; Zhao, C.; Xu, Y.; Wu, S.; Qi, J.; Li, J.; Hettenhausen, C.; Wu, J. Transcriptomics and Alternative Splicing Analyses Reveal Large Differences between Maize Lines B73 and Mo17 in Response to Aphid Rhopalosiphum padi Infestation. Front. Plant Sci. 2017, 8, 1738. [Google Scholar] [CrossRef]
  96. Piazza, P.; Procissi, A.; Jenkins, G.; Tonelli, C. Members of the c1/pl1 regulatory gene family mediate the response of maize aleurone and mesocotyl to different qualities and cytokinins. Plant Physiol. 2002, 128, 1077. [Google Scholar] [CrossRef]
  97. Knapp, S.J. Confidence intervals for heritability for two-factor mating design single environment linear models. Theor. Appl. Genet. 1986, 72, 587–591. [Google Scholar] [CrossRef] [PubMed]
  98. Elshire, R.J.; Glaubitz, J.C.; Sun, Q.; Poland, J.A.; Kawamoto, K.; Buckler, E.S.; Mitchell, S.E. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 2011, 6, e19379. [Google Scholar] [CrossRef] [PubMed]
  99. Zhou, X.; Stephens, M. Genome-wide efficient mixed-model analysis for association studies. Nat. Genet. 2012, 44, 821–824. [Google Scholar] [CrossRef]
  100. Delannoy, E.; Stanley, W.A.; Bond, C.S.; Small, I.D. Pentatricopeptide repeat (PPR) proteins as sequence-specificity factors in post-transcriptional processes in organelles. Biochem. Soc. Trans. 2007, 35, 1643–1647. [Google Scholar] [CrossRef]
Figure 1. KRN variations in the multi-parent population in the five environments. 19BS for Baoshan in 2019; 19DH for Dehong in 2019; 21JH for Jinghong in 2021; 21YS for Yanshan in 2021; 22YS for Yanshan in 2022. In Figure 1, in the boxplot, the horizontal line in the middle represents the median, the upper and lower lines of the box refer to 75th and 25th quartiles, respectively, and the whiskers for each box indicate the minimum and maximum values within the box plot range (whiskers). Significance levels are denoted as follows: “***” ≤ 0.0001, “*” ≤ 0.01, “NS” ≥ 0.05.
Figure 1. KRN variations in the multi-parent population in the five environments. 19BS for Baoshan in 2019; 19DH for Dehong in 2019; 21JH for Jinghong in 2021; 21YS for Yanshan in 2021; 22YS for Yanshan in 2022. In Figure 1, in the boxplot, the horizontal line in the middle represents the median, the upper and lower lines of the box refer to 75th and 25th quartiles, respectively, and the whiskers for each box indicate the minimum and maximum values within the box plot range (whiskers). Significance levels are denoted as follows: “***” ≤ 0.0001, “*” ≤ 0.01, “NS” ≥ 0.05.
Ijms 25 03377 g001
Figure 2. UPGMA analysis of 655 RILs. The color indicates different sub-populations: green: YML32 × Ye107 (sub-pop1); yellow: YML46 × Ye107 (sub-pop2); blue: CML312 × Ye107 (sub-pop3); red: CML444 × Ye107 (sub-pop4).
Figure 2. UPGMA analysis of 655 RILs. The color indicates different sub-populations: green: YML32 × Ye107 (sub-pop1); yellow: YML46 × Ye107 (sub-pop2); blue: CML312 × Ye107 (sub-pop3); red: CML444 × Ye107 (sub-pop4).
Ijms 25 03377 g002
Figure 3. PCA of 655 RILs. The sub-populations are identified as follows: sub-pop1: YML32 × Ye107; sub-pop2: YML46 × Ye107; sub-pop3: CML312 × Ye107; sub-pop4: CML444 × Ye107.
Figure 3. PCA of 655 RILs. The sub-populations are identified as follows: sub-pop1: YML32 × Ye107; sub-pop2: YML46 × Ye107; sub-pop3: CML312 × Ye107; sub-pop4: CML444 × Ye107.
Ijms 25 03377 g003
Figure 4. Manhattan plots: (a) Dehong 2019 (19DH), (b) Baoshan 2019 (19BS), (c) Jinghong 2021 (21JH), (d) Yanshan 2021 (21YS), and (e) Yanshan 2022 (22YS).
Figure 4. Manhattan plots: (a) Dehong 2019 (19DH), (b) Baoshan 2019 (19BS), (c) Jinghong 2021 (21JH), (d) Yanshan 2021 (21YS), and (e) Yanshan 2022 (22YS).
Ijms 25 03377 g004aIjms 25 03377 g004b
Figure 5. Haplotype analysis of Zm00001d022420 and Zm00001d022421. (a) The position of the SNPs associated with Zm00001d022420 identified by GWAS. (b) The differences in KRN between the two haplotypes (GG, AA) in the 19DH environments. (c) Haplotype analysis of Zm00001d022421 and the SNP position on chromosome 7. (d) Differences in KRN between the two haplotypes (CGC, TAA) in the 19 DH environment. “***”: p ≤ 0.001. (e) The locations of candidate genes associated with KRN.
Figure 5. Haplotype analysis of Zm00001d022420 and Zm00001d022421. (a) The position of the SNPs associated with Zm00001d022420 identified by GWAS. (b) The differences in KRN between the two haplotypes (GG, AA) in the 19DH environments. (c) Haplotype analysis of Zm00001d022421 and the SNP position on chromosome 7. (d) Differences in KRN between the two haplotypes (CGC, TAA) in the 19 DH environment. “***”: p ≤ 0.001. (e) The locations of candidate genes associated with KRN.
Ijms 25 03377 g005aIjms 25 03377 g005b
Figure 6. Genetic linkage maps of sub-pop3 (a) and sub-pop4 (b). Chromosomes 1 to 10 are labeled as LG1 to LG10, respectively. The blue lines represent the bins, indicating marker segments spaced every 15 markers without linkage, while the green spaces between bins represent the distances between adjacent bins.
Figure 6. Genetic linkage maps of sub-pop3 (a) and sub-pop4 (b). Chromosomes 1 to 10 are labeled as LG1 to LG10, respectively. The blue lines represent the bins, indicating marker segments spaced every 15 markers without linkage, while the green spaces between bins represent the distances between adjacent bins.
Ijms 25 03377 g006
Figure 7. QTL analysis of two subpopulations in five different environments and BLUP. (a1–a6) QTL mapping in sub-pop3 in five different environments and BLUP. (b1–b6) QTL mapping in sub-pop4 in five different environments and BLUP. The red line represents the LOD threshold in subfigures (a,b) The best linear unbiased prediction over all environments (BLUP), merged value of Yanshan in 2022 (22YS), Yanshan in 2021 (21YS), Dehong in 2019 (19DH), Baoshan in 2019 (19BS), and Jinghong in 2021 (21JH).
Figure 7. QTL analysis of two subpopulations in five different environments and BLUP. (a1–a6) QTL mapping in sub-pop3 in five different environments and BLUP. (b1–b6) QTL mapping in sub-pop4 in five different environments and BLUP. The red line represents the LOD threshold in subfigures (a,b) The best linear unbiased prediction over all environments (BLUP), merged value of Yanshan in 2022 (22YS), Yanshan in 2021 (21YS), Dehong in 2019 (19DH), Baoshan in 2019 (19BS), and Jinghong in 2021 (21JH).
Ijms 25 03377 g007aIjms 25 03377 g007bIjms 25 03377 g007c
Figure 8. Haplotype analysis of Zm00001d050984, Zm00001d050985, and Zm00001d016000. (a) Positions of the SNPs in the candidate gene associated with Zm00001d016000 on chromosome 5; (b) Differential effects of three haplotypes (GGAGG, CGAGG, and GACAA) of Zm00001d016000 on KRN; (c) Positions of the SNPs in the candidate genes associated with Zm00001d050984 on chromosome 4; (d) Differential effects of CC/TT haplotypes of Zm00001d050984 on KRN; (e) Positions of the SNPs in the candidate genes associated with Zm00001d050985 on chromosome 4; (f) Differential effects of CC/TT haplotypes of Zm00001d050985 on KRN; (g) Candidate gene positions associated with KRN.
Figure 8. Haplotype analysis of Zm00001d050984, Zm00001d050985, and Zm00001d016000. (a) Positions of the SNPs in the candidate gene associated with Zm00001d016000 on chromosome 5; (b) Differential effects of three haplotypes (GGAGG, CGAGG, and GACAA) of Zm00001d016000 on KRN; (c) Positions of the SNPs in the candidate genes associated with Zm00001d050984 on chromosome 4; (d) Differential effects of CC/TT haplotypes of Zm00001d050984 on KRN; (e) Positions of the SNPs in the candidate genes associated with Zm00001d050985 on chromosome 4; (f) Differential effects of CC/TT haplotypes of Zm00001d050985 on KRN; (g) Candidate gene positions associated with KRN.
Ijms 25 03377 g008
Table 1. Phenotypic variation in KRN across five environments in the multi-parent population .
Table 1. Phenotypic variation in KRN across five environments in the multi-parent population .
EnvironmentMinMaxAVESDCV(%)H2
(%)
19DH6.516.6712.16471.4691212.0769110676.8
19BS5.3316.6711.82471.5452613.06806938
21YS61611.93131.8393915.41650951
21JH82013.23781.5910412.01891552
22YS919.512.76301.7071013.37538196
, CV: coefficient of variation; H2: broad-sense heritability; SD: standard deviation.
Table 2. Genes associated with SNPs explaining >10% KRN variation and their expression in tissues relevant to KRN [40,41,42,43,44,45,46,47,48,49,50,51].
Table 2. Genes associated with SNPs explaining >10% KRN variation and their expression in tissues relevant to KRN [40,41,42,43,44,45,46,47,48,49,50,51].
Gene IDEnvSNPChrPVE
(%)
FunctionExpression (FPKM)
Meristem_16–19Ear_Primordium (2–4 mm)Ear_Primordium (6–8 mm)Female_Spikelet
Zm00001d00792421YS242858336217.74 Cytochrome P450 41 NANANANA
Zm00001d007925Cyclic nucleotide-gated ion channel 18NANANANA
Zm00001d02569422YS1265289981014.88 Surfeit locus protein 2 [52]58.573.96335.7
Zm00001d025695Tetratricopeptide-like helical domain superfamily 1.01.21.42.2
Zm00001d025696Cellulase (Glycosyl hydrolase family 5) Protein4.15.05.16.0
Zm00001d02970221JH83163645121.01 glutathione S-transferaseNANANANA
Zm00001d00673321JH215468689214.31 Transcription factor bHLH85NANANANA
Zm00001d006734Zinc finger protein AZF2Not reported
Zm00001d006735Zinc finger protein AZF1
Zm00001d01443521JH46725146510.57 D-xylose-proton symporter-like 358.549.84629.3
Zm00001d01620221JH150393177510.64 E3 ubiquitin-protein ligase COP10.10.20.2NA
Zm00001d016203Actin-related protein 2/3 complex subunit 320.311.911.611.1
Zm00001d016204Putative ubiquitin-like-specific protease 1BNot reported
Zm00001d02439021JH684470651011.15 Sec14p-like phosphatidylinositol transfer family protein3.68.89.1NA
, NA indicates that the gene is not expressed in the corresponding tissue. Env, environment; SNP, single nucleotide polymorphisms; Chr, chromosome; PVE, percentage of phenotypic variation explained.
Table 3. Genes corresponding to SNP-177304649 and their expression levels in KRN-related tissues .
Table 3. Genes corresponding to SNP-177304649 and their expression levels in KRN-related tissues .
GeneLocSNPChrPVE
(%)
FunctionExpression (FPKM)
Meristem_16–19_DayEar_Primordium (2–4 mm)Ear_Primordium (6–8 mm)Female_Spikelet
Zm00001d02242019DH17730464976.79 ricin B-like lectinsNANANA61.5
Zm00001d022421122.7156124.5464.8
, NA indicates that the gene is not expressed in the corresponding tissue. Meristem_16–19_Day: Meristem during days 16–19.
Table 4. Number of genes repeatedly identified across different environments .
Table 4. Number of genes repeatedly identified across different environments .
LocChrStartEndNumber of Repeated Genes
sub-pop3/21JH475,627,569151,339,536648
sub-pop4/22YS446,403,895172,368,609
sub-pop3/22YS48,043,73918,953,65929
sub-pop3/BLUP48,043,73910,683,376
sub-pop4/21JH431,671,02131,671,0200
sub-Pop4/21YS431,409,14031,409,140
Total 677
, Loc: location. Chr: chromosomes. Start and end: Starting and ending position of the QTL.
Table 5. Co-localized genes identified through GWAS and QTL mapping .
Table 5. Co-localized genes identified through GWAS and QTL mapping .
Gene IDPVE
(%)
GWAS-LocStartEndSNPChrQTL-Loc
Zm00001d0316667.0221JH198,109,347198,149,34731312/19DH
Zm00001d031667
Zm00001d031668
Zm00001d031669
Zm00001d0317096.1421JH199,086,836199,126,83641312/19DH
Zm00001d0317138.4121JH199,202,030199,242,03051312/19DH
Zm00001d031715
Zm00001eb0368706.4821JH199,294,176199,334,17661312/19DH
Zm00001eb0368904.7321JH199,315,283199,355,28371312/19DH
Zm00001d031648
Zm00001d0317725.5721JH201,335,723201,375,82481312/19DH
Zm00001d0509846.7021JH135,263,055135,303,05514312/21JH444/22YS
Zm00001d050985
Zm00001d0160008.4021JH138,534,600138,574,63025444/21YS
Zm00001d01620210.6421JH150,373,177150,413,17715444/21YS
Zm00001d016203
Zm00001d016204
Zm00014a0129298.8319DH120,35,0778120,390,77815312/19BS444/21YS
, PVE: phenotypic variance explained. GWAS-Loc: environment in GWAS. Start and end: Starting and ending position of the QTL. SNP: Number of Single Nucleotide Polymorphism in the interval from start to end. Chr: chromosome. QTL-Loc: Environment in QTL.
Table 6. Functional annotation and expression levels of co-located genes [35,40,42,67,68,71,72,73,74,75,76,77,78,79,80].
Table 6. Functional annotation and expression levels of co-located genes [35,40,42,67,68,71,72,73,74,75,76,77,78,79,80].
GeneChrFunctionExpression (FPKM)
Meristem_16–19Ear_Primordium (2–4 mm)Ear_Primordium (6–8 mm)Female_Spikelet
Zm00001d0316661Probable aldo-keto reductase 22.14.34.56.2
Zm00001d0316671Discolored-paralog213.312.315.27.8
Zm00001d0316681Ubiquitin thioesterase otubain-like8.95.26.47.3
Zm00001d0316691O-fucosyltransferase family protein12.38.98.78.8
Zm00001d0317091Replication protein A 70 kDa DNA-binding subunit BNANANANA
Zm00001d0317131CBL-interacting protein kinase 19NANANANA
Zm00001d0317151CBL-interacting serine/threonine-protein kinase 9NA1.31.8NA
Zm00001eb0368701UnknownUnreported
Zm00001eb0368901Protein S-acyltransferase4
Zm00001d0316481Protein S-acyltransferase447.526.626.264.8
Zm00001d0317721Protein NtpR NANANANA
Zm00001d0509844Rab-proteins geranylgeranyltransferase component A NA0.20.2NA
Zm00001d0509854Ultraviolet-B receptor UVR88.718.818.115.7
Zm00001d0160005Myb-related protein 3R-152.355.559.32.6
Zm00001d0162025E3 ubiquitin-protein ligase COP1NANANANA
Zm00001d0162035Actin-related protein 2/3 complex subunit 320.311.911.611.1
Zm00001d0162045Putative ubiquitin-like-specific protease 1B NANANANA
Zm00014a012929 (mo17)5UnknownUnreported
“NA” in the candidate genes column indicates currently unidentified genes.
Table 7. The highly correlated Gene Ontology (GO) terms .
Table 7. The highly correlated Gene Ontology (GO) terms .
LOCCategoryGO IDDescriptionGene Ratiop Value
Sub-pop3/21JH and sub-pop4/22YSBPGO:0015780Nucleotide-sugar transmembrane transport2/1700.011210655
BPGO:0090481Pyrimidine nucleotide-sugar transmembrane transport2/1700.011210655
BPGO:0006914Autophagy2/1700.013554188
BPGO:0061919Process utilizing autophagic mechanism2/1700.013554188
BPGO:0006325Chromatin organization4/1700.020186478
BPGO:1901264Carbohydrate derivative transport2/1700.021710384
BPGO:0015698Inorganic anion transport3/1700.024541102
BPGO:0006821Chloride transport2/1700.034964547
BPGO:0006497Protein lipidation2/1700.054807613
BPGO:0006505GPI anchor metabolic process2/1700.054807613
CCGO:0000139Golgi membrane2/590.02194791
CCGO:0098791Golgi subcompartment2/590.025312478
CCGO:0031984Organelle subcompartment3/590.0600554
CCGO:1902494Catalytic complex7/590.103515469
CCGO:0005794Golgi apparatus2/590.155306753
CCGO:0044431Golgi apparatus part2/590.155306753
CCGO:0000151Ubiquitin ligase complex1/590.166579053
CCGO:0030286Dynein complex1/590.166579053
CCGO:0098803Respiratory chain complex1/590.166579053
CCGO:0070461SAGA-type complex1/590.181653191
MFGO:0140103Catalytic activity, acting on a glycoprotein3/2390.001729669
MFGO:0005338Nucleotide-sugar transmembrane transporter activity2/2390.009752157
MFGO:0015165Pyrimidine nucleotide-sugar transmembrane transporter activity2/2390.009752157
MFGO:0033926Glycopeptide alpha-N-acetylgalactosaminidase activity2/2390.011799431
MFGO:0015103Inorganic anion transmembrane transporter activity3/2390.016322483
MFGO:0008509Anion transmembrane transporter activity3/2390.023303348
MFGO:0005244Voltage-gated ion channel activity2/2390.030592768
MFGO:0005247Voltage-gated chloride channel activity2/2390.030592768
MFGO:0005253Anion channel activity2/2390.030592768
MFGO:0005254Chloride channel activity2/2390.030592768
, Category: categories of GO annotations, including molecular function (MF), cellular component (CC), and biological process (BP). Gene Ratio: a fraction where the numerator is the number of genes enriched in this GO term, and the denominator is the total number of genes used for enrichment analysis. LOC: location. Sub-pop3/21JH: pop3 subpopulation for the 21JH environment. Sub-pop4/22YS: pop4 subpopulation for the 22YS environment.
Table 8. The Gene Ontology (GO) terms associated with Zm00001d050984 .
Table 8. The Gene Ontology (GO) terms associated with Zm00001d050984 .
GeneGWAS-LocQTL LocDescriptionGO IDCategory
Zm00001d05098421JHSub-pop3/21JH and sub-pop4/22YSintracellular signal transductionGO:0035556BP
signal transductionGO:0007165BP
signalingGO:0023052BP
small GTPase mediated signal transductionGO:0007264BP
cell communicationGO:0007154BP
nucleoside-triphosphatase regulator activityGO:0060589MF
GTPase regulator activityGO:0030695MF
GDP-dissociation inhibitor activityGO:0005092MF
enzyme regulator activityGO:0030234MF
molecular function regulatorGO:0098772MF
, Category: categories of GO Annotations, including molecular function (MF) and biological process (BP). 21JH: Jinghong in 2021. Sub-pop3/21JH: the pop3 subpopulation for the 21JH environment. Sub-pop4/22YS: the pop4 subpopulation for the 22YS environment.
Table 9. The Kyoto Encyclopedia of Genes and Genomes (KEGG) associated with Zm00001d050984 .
Table 9. The Kyoto Encyclopedia of Genes and Genomes (KEGG) associated with Zm00001d050984 .
LocKEGGIDDescriptionBg Ratiop Value
Sub-Pop3/21JH and sub-pop4/22YSzma00650Butanoate metabolism30/54930.010066583
zma00902Monoterpenoid biosynthesis12/54930.013495559
zma00770Pantothenate and CoA biosynthesis40/54930.021995592
zma00020Citrate cycle (TCA cycle)72/54930.022923456
zma00290Valine, leucine and isoleucine biosynthesis22/54930.042859117
zma00563Glycosylphosphatidylinositol (GPI)-anchor biosynthesis24/54930.050229291
zma04712Circadian rhythm—plant58/54930.056694817
zma012102-Oxocarboxylic acid metabolism61/54930.064061405
zma01200Carbon metabolism290/54930.070207933
zma01040Biosynthesis of unsaturated fatty acids29/54930.070421369
Sub-pop3/22YS and sub-pop4/BLUPzma00062Fatty acid elongation40/54930.028819143
zma00053Ascorbate and aldarate metabolism60/54930.042992849
zma00040Pentose and glucuronate interconversions77/54930.054918027
zma01250Biosynthesis of nucleotide sugars105/54930.074316114
zma00480Glutathione metabolism111/54930.07843369
zma00520Amino sugar and nucleotide sugar metabolism160/54930.111548551
zma04626Plant-pathogen interaction228/54930.156015338
zma01240Biosynthesis of cofactors265/54930.179497585
, Sub-pop3/21JH: the pop3 subpopulation for the 21JH environment; sub-pop4/22YS: the pop4 subpopulation for the 22YS environment; sub-pop3/22YS and sub-pop3/BLUP: the pop3 subpopulation for the BLUP and 22YS. BgRatio: The denominator is the total number of genes in the genes encoding proteins of the species wherever they carry GO annotations (estimated to be the total GO gene pool), and the numerator is the number of genes within the total number that carry GO functional annotations on the current entry.
Table 10. Parental lines used to develop multi-parent population.
Table 10. Parental lines used to develop multi-parent population.
Parental LinesPedigreeHeterotic GroupEcotype
Ye107From US hybrid DeKalb XL8ReidTemperate
CML312S89500-F2-2-2-1-1-BNon-ReidTropical
CML444P43-C9-1-1-1-1-1Non-ReidTropical
YML46Selected from Suwan1SuwanTropical
YML32Selected from Suwan1SuwanTropical
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, Y.; Ran, F.; Yin, X.; Jiang, F.; Bi, Y.; Shaw, R.K.; Fan, X. Genome-Wide Association Studies on the Kernel Row Number in a Multi-Parent Maize Population. Int. J. Mol. Sci. 2024, 25, 3377. https://doi.org/10.3390/ijms25063377

AMA Style

Wang Y, Ran F, Yin X, Jiang F, Bi Y, Shaw RK, Fan X. Genome-Wide Association Studies on the Kernel Row Number in a Multi-Parent Maize Population. International Journal of Molecular Sciences. 2024; 25(6):3377. https://doi.org/10.3390/ijms25063377

Chicago/Turabian Style

Wang, Yizhu, Fengyun Ran, Xingfu Yin, Fuyan Jiang, Yaqi Bi, Ranjan K. Shaw, and Xingming Fan. 2024. "Genome-Wide Association Studies on the Kernel Row Number in a Multi-Parent Maize Population" International Journal of Molecular Sciences 25, no. 6: 3377. https://doi.org/10.3390/ijms25063377

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop