Next Article in Journal
Effects of Different Mulching Practices on Soil Environment and Fruit Quality in Peach Orchards
Previous Article in Journal
Genome-Wide Comparative Analysis of Five Amaranthaceae Species Reveals a Large Amount of Repeat Content
Previous Article in Special Issue
Rice (Oryza sativa L.) Grain Size, Shape, and Weight-Related QTLs Identified Using GWAS with Multiple GAPIT Models and High-Density SNP Chip DNA Markers
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genetic Diversity and Population Structure of Maize (Zea mays L.) Inbred Lines in Association with Phenotypic and Grain Qualitative Traits Using SSR Genotyping

1
Department of Agricultural Biotechnology, Anand Agricultural University, Anand 388110, India
2
Department of Genetics and Plant Breeding, B. A. College of Agriculture, Anand Agricultural University, Anand 388110, India
3
Main Maize Research Station, Anand Agricultural University, Godhra 389001, India
4
ICAR—Indian Institute of Maize Research, PAU Campus, Ludhiana 141004, India
5
Department of Genetics and Plant Breeding, N. M. College of Agriculture, Navsari Agricultural University, Navsari 396450, India
6
Department of Plant Pathology and Weed Research, Institute of Plant Protection, Agricultural Research Organization (ARO)—Volcani Center, Rishon Lezion 7505101, Israel
*
Authors to whom correspondence should be addressed.
Plants 2024, 13(6), 823; https://doi.org/10.3390/plants13060823
Submission received: 27 January 2024 / Revised: 26 February 2024 / Accepted: 11 March 2024 / Published: 13 March 2024
(This article belongs to the Special Issue Advances in Genetics and Breeding of Grain Crops)

Abstract

:
Maize (Zea mays L.) is an important cereal and is affected by climate change. Therefore, the production of climate-smart maize is urgently needed by preserving diverse genetic backgrounds through the exploration of their genetic diversity. To achieve this, 96 maize inbred lines were used to screen for phenotypic yield-associated traits and grain quality parameters. These traits were studied across two different environments (Anand and Godhra) and polymorphic simple sequence repeat (SSR) markers were employed to investigate the genetic diversity, population structure, and trait-linked association. Genotype–environment interaction (GEI) reveals that most of the phenotypic traits were governed by the genotype itself across the environments, except for plant and ear height, which largely interact with the environment. The genotypic correlation was found to be positive and significant among protein, lysine and tryptophan content. Similarly, yield-attributing traits like ear girth, kernel rows ear−1, kernels row−1 and number of kernels ear−1 were strongly correlated to each other. Pair-wise genetic distance ranged from 0.0983 (1820194/T1 and 1820192/4-20) to 0.7377 (IGI-1101 and 1820168/T1). The SSRs can discriminate the maize population into three distinct groups and shortlisted two genotypes (IGI-1101 and 1820168/T1) as highly diverse lines. Out of the studied 136 SSRs, 61 were polymorphic to amplify a total of 131 alleles (2–3 per loci) with 0.46 average gene diversity. The Polymorphism Information Content (PIC) ranged from 0.24 (umc1578) to 0.58 (umc2252). Similarly, population structure analysis revealed three distinct groups with 19.79% admixture among the genotypes. Genome-wide scanning through a mixed linear model identifies the stable association of the markers umc2038, umc2050 and umc2296 with protein, umc2296 and umc2252 with tryptophan, and umc1535 and umc1303 with total soluble sugar. The obtained maize lines and SSRs can be utilized in future maize breeding programs in relation to other trait characterizations, developments, and subsequent molecular breeding performances for trait introgression into elite genotypes.

1. Introduction

Maize (Zea mays L., 2n = 20) is the most widely cultivated cereal in the world and is thus generally recognized as the ‘Queen of Cereals’ due to its high production potential. It is also renowned as a “model crop”, due to its higher genetic diversity and incorporation in basic and advanced breeding programs for genetic improvement [1]. Being the primary staple food for millions, maize is a key component of animal feed and a significant source of bioenergy. It is a high biological-yield-producing crop with remarkable photosynthetic ability due to the foliar ‘Kranz anatomy’ [2]. Maize is used in the production of biofuel, fabric starch, high-quality corn oil and alcoholic quenchers, and raw material in the pharmaceutical and cosmetic industries [2]. However, the increased human population and climate change in this present era combined with limited production resulted in the scarcity of this crop as a raw material. However, climate change associated obstacles such as yield stagnation, grain quality improvement hurdles, growing disease and pest pressures, and harsh weather circumstances have created significant quantitative and qualitative losses of maize production [3]. As a result, immediate action is essential to support genetic research to improve maize breeding efficiency. It is believed that maize can act as a climate-resilient crop in rainfed areas of tropical and subtropical climatic regions by developing specific hybrids and varieties against changing climate [4]. Maize reproduces sexually through allogamy by protandry condition, which gives the chance to create an enormous amount of genetic variability in diverse morphotype like plant height, flowering (silking and tasseling) time, maturation period, ear and kernel features, as well as diversity in its response to climate change. For many years, growers and breeders have used this variation to create cultivars with specific characteristics [5,6]. The new varieties replaced the old traditional landraces and became the cause of genetic erosion [1]. To prevent the genetic loss, continued sourcing, systematic recording, and in situ and/or ex situ storages are necessary, which also aids plant-breeding programs by providing varied maize genetic stocks. Such maize stocks can be used to create new varieties and deploy a new beneficial diversity for long-term advances in agricultural productivity and genetic advantages [7]. These efforts can be greatly aided by modern genetic improvement tools [8,9].
According to Van Inghelandt et al. [10], a basic knowledge about population structure and the genetic diversity of maize germplasm is a prerequisite for the success of a breeding program. This knowledge enables the tracking of the pattern of inheritance and the amount of polymorphism and heterozygosity [10]. Marker-assisted approaches like phenological, morphological, biochemical and molecular (DNA-based) markers are used to examine maize genetic diversity and population structure among germplasms [10]. Morphological, yield-associated components and grain quality traits of maize are frequently employed as the major breeding parameters for determining population genetic diversity. However, such traits are heavily influenced by environmental interaction. To overcome such limitations, DNA-based markers are introduced to identify cultivars and accelerate breeding programs through early selection of maize lines or segregants with the traits of interest [11,12]. A polymerase chain reaction (PCR) leveraging efficient molecular markers, particularly the simple sequence repeats (SSRs) marker, has long been pivotal in maize genetic research and breeding endeavors [13,14,15]. SSR markers are more informative than biallelic Single Nucleotide Polymorphism (SNP) since they detect multiple alleles per genomic loci [16,17]. Van Inghelandt et al. [10] reported that SSRs are 7 to 11 times more accurate than SNPs. SSRs are broadly scattered throughout the genome in both coding and non-coding regions as a small simple repeat motif [18], which makes them more effective in assessing the genetic diversity and population structure of maize landraces and improved lines [19].
The relationship between morphological data and molecular marker patterns offered important information on population structure and genetic diversity. Identification of these relations is popularly known as association mapping. Association mapping takes advantage of historical and evolutionary crossing-overs. It is based on SSR genotyping revealed linkage disequilibrium (LD) and has a greater mapping resolution than the linkage mapping approach. The statistical relation between SSRs and traits is often utilized to identify a quantitative trait loci (QTL) [3]. Instead of creating a mapping population using linkage mapping, association mapping may be carried out on natural maize populations and inbred lines. This mapping technique allows for the simultaneous evaluation of several alleles as well as the assessment of numerous recombination events, which took place across a number of generations [20].
Genetic diversity and variation analysis are essential for maize inbred lines to facilitate the development of improved varieties with desirable traits. Maize breeders rely on genetic diversity within inbred lines to select and combine alleles that confer traits such as high yield, disease resistance, abiotic stress tolerance, and nutritional quality [21]. Understanding the genetic diversity among such lines is crucial for exploiting heterosis, where hybrids exhibit superior performance compared to their parental lines. Moreover, genetic variation analysis allows breeders to identify and introgress resistance genes against diseases, pests, and environmental stresses, thereby enhancing the durability and resilience of maize cultivars. Furthermore, genetic diversity analysis aids in the improvement of nutritional quality and adaptation to changing climatic conditions, ensuring the sustainability and productivity of maize production systems [22]. In essence, genetic diversity and variation analysis serve as fundamental tools for maize breeders to develop cultivars that meet the diverse needs of farmers and consumers while addressing the challenges of modern agriculture [23].
In light of the foregoing explanation, the aim of this present study is (i) to evaluate the variability of yield-associated morphological traits and grain quality parameters, (ii) to reveal the genetic diversity and population structure of the maize inbred population using genome wide distributed SSR markers, and (iii) to validate the association between SSR markers and traits. The findings of the potential maize lines with marker-trait association will be the outcome to broadening the genetic base, creating new gene combinations and identifying markers for achieving higher genetic gain in future maize breeding programs.

2. Materials and Methods

2.1. Plant Materials and Experimental Locations

In this study, a set of 96 genetically diverse maize (Zea mays L.) inbred lines were used for the assessment of genetic diversity, population structure and marker-trait association using some potent yield-associated morphological traits and grain quality parameters. Among those, 36 lines belong to the sweet corn group and were obtained from the Indian Institute of Maize Research (IIMR, Ludhiana, India). The remaining 60 maize lines belong to the flour corn group and were obtained from the Main Maize Research Station (MMRS, Godhra, India). The details of studied germplasm are presented in Table S1. As we aimed to analyze the diversity and genetic variation in the maize lines, no check varieties were not included for the purpose of trait comparison. The inbred lines were field evaluated at (i) ‘Anand’, the Experimental Farm (22.54° N, 72.98° E, 45.10 m above mean sea level) of Anand Agriculture University, Gujarat, India, where its soil was sandy loam and had a poor water holding capacity with poor organic carbon content and (ii) ‘Godhra’, MMRS (22.45° N, 40° E, 119.40 m above mean sea level), Gujarat, India, where soil was sandy with medium water holding capacity and contained good organic carbon. Detailed meteorological parameters were shown in Table S2. Randomized Complete Block Design (RCBD) with two replications in each site was employed in this experiment. Each maize line was sown in a 6 m row (20 plants per row) with 30 cm intra-row and 60 cm inter-row spacing during the winter season of 2020.

2.2. Phenotypic Characterization

The phenological data were recorded from the sowing date and expressed in days. Silking was recorded as the date when 50% of the plants in each plot had visible styles; tasseling happened immediately preceding this growth stage, but essentially, these phases must coincide for successful pollination to occur [24]. The morphological traits like plant height (PH in cm), ear height (EH in cm), ears per plant (EPP), ear length (EL in cm), ear girth (EG in cm), number of kernel rows per ear (KRP), number of kernels per row (KPR) and number of kernels per ear (NKE) were phenotyped at maturity from five randomly selected plants of each maize line for each replication per location as per previous established by the way of Patel et al. [25]. The PH was measured the aboveground distance between ground and tassel. Ear traits (EH, EPP, EL, EG) were evaluated manually after harvesting the matured cob. After the separation of ears from the cob, yield parameters (KRP, KPR and NKE) were recorded.

2.3. Grain Qualititative Parameters

For the biochemical analysis of the maize grain, three randomly selected plants were harvested together, and grains were separated from cob. Then, the grains were cleaned manually, followed by sun-drying for 7 days and subsequent crushing to make corn flour. This flour was stored in a plastic bag for a further estimation of protein content (Pro%) as per Bressani et al. [26], β-carotene (Car in ppm) according to Sathya et al. [27], and lysin and tryptophan content (as µg per mL) following the protocol of Lane [28]. Total soluble sugar (TSS%) was estimated from the cobs at the dough stage following the methods of Dubious [29].

2.4. DNA Extraction, PCR Amplification and Gel Electrophoresis

The total genomic DNA of each field-evaluated maize genotype was isolated from the 15-days-old leaves as per the protocol described by Doyle [30]. The DNA quality was checked through nanodrop (ND-1000, Thermo Fisher Scientific, Waltham, MA, USA) and 1% (w/v) agarose gel electrophoresis. A total of 136 SSR markers distributed across the maize genome were randomly selected from the MaizeGBD database (https://maizegdb.org/data_center/ssrid (accessed on 1 June 2023)) and employed to evaluate genetic variations and association analysis (Table S3). PCR amplification was carried out using Emerald 2× PCR master mix (Takara, Tokyo, Japan). The PCR reaction consisted of 5 μL of PCR master mix, 2 μL of 50 ng DNA, 1 μL of 10 pmole forward and reverse primers in a total volume reaction of 10 μL. The volume was adjusted with nuclease-free water. The thermal cycling conditions were as follows: an initial denaturation at 94 °C for 5 min, followed by 35 cycles of denaturation at 94 °C for 45 s, annealing at ΔT °C (primer specific) for 45 s and extension at 72 °C for 45 s; one cycle of final extension was performed at 72 °C for 7 min. The PCR reactions were conducted in Applied Biosystems® Veriti® 96-Well Thermal Cycler (Thermo Fisher Scientific, Waltham, MA, USA). A 3% agarose gel was used to resolve PCR products with a 100 bp DNA ladder. Allele scoring was performed using the software AlphaEaseFC 4.0 and allele size was determined for downstream data processing and statistical genetic analysis.

2.5. Statistical Analysis

The phenotypic field data and biochemical estimations were examined through combined analysis of variance (ANOVA) by assuming genotypic effects as fixed and environment effects as random. To determine the association between traits, the analysis of correlation was performed using experimental means for each variety from each location. The descriptive statistical analysis (mean values, correlation, regression, t-test) was carried out using JMP software v16.0 [31]. The genotypic correlation coefficient was calculated for both the environments using R v4.2.3. package “variability” [32].

2.6. Analysis of Genetic Diversity, Population Structure and Marker-Trait Association

The PowerMarker 3.25 program was used to calculate the allele number, allelic frequency, gene diversity (GD), and polymorphic information content (PIC) for each used SSR as suggested by Lui [33]. The genetic dissimilarities were estimated for each pair of accessions using the dissimilarity index. The inter-genotype genetic distances were generated using DARwin 6.0 with the Neighbor-Joining (NJ) method and the dendrogram was created. To infer the population structure, the Bayesian clustering approach was used through STRUCTURE 2.3.4 software as advised by Pritchard et al. [34]. For population structure, 5 runs for each ‘K’ value from 2 to 10 were performed with 500,000 burn-in and 500,000 MCMC (Markov Chain Monte Carlo). The structure harvester online program (http://taylor0.biology.ucla.edu (accessed on 23 September 2023)) was used to obtain the final ‘K’ value based on the lowest negative number of Ln (the log-likelihood of the data) and the lowest standard deviation was found during analysis, according to the protocol of Earl and VonHoldt [35]. The association of the trait of interest with each SSR allelic locus was tested through a Mixed Linear Model (MLM) using TASSEL 3.0 software [36]. The expected value under the null hypothesis of no relationship with the trait was displayed against the negative logarithms of the p-values from the models used in the GWAS in this graph. The quantile–quantile (QQ) plot is an important tool for evaluating the efficiency of the model employed in GWAS for structuring the population and recognizing familial relatedness. The QQ plot was derived from the p-value of MLM through the R package v4.2.3. “CMplot” [37]. The Manhattan scatter plot was developed through the R package “qqman” to summarize GWAS results according to the procedure explained by Turner [38]. The Y-axis is the negative logarithm of the p-value derived from the GWAS model (particularly, from the F-test for testing H0: No connection between the SSR and trait) and the X-axis is the genomic location of each SSR. Large peaks in the Manhattan plot indicated a substantial relationship between the characteristic and the adjacent genetic region.

3. Results

3.1. Phenotypic Evaluation and Genotypic Mean Performance in Two Environments

The combined analysis of variance indicated that all the characters were found to be significant for the genotype, implying the presence of significant variability for all characters of the panel under study (Table 1). This is the prerequisite for a successful breeding program. It also showed a significant effect of the environment which indicated a significant influence of environment on the expression of the particular traits. According to Table 1, plant height (37.92%) and lowest ear placement height (48.57%) had the highest influence on the environment indicating that differences in a microenvironment like soil fertility play a significant role in the expression of these characteristics.
Similarly, ears per plant showed the highest influence on the interaction of genotype, with the environment or total variance towards overall variability (54.83%) followed by kernel rows per ear (52.70%) and ear girth (44.91%). In the rest of the characteristics, genotype shared maximum variation followed by GEI effect across all locations (Table 1; for the mean differences see Table S4). Figure 1 depicted the mean over the environment and mean of the site, respectively. The overall mean of the studied characters was 69.38 (TA), 75.73 (SI), 144.12 (PH), 65.45 (EH), 1.46 (EPP), 14.38 (EL), 12.08 (EG), 13.99 (KRP), 27.22 (KPR), 381.86 (NKE), 12.33 (Pro), 1.24 (TSS), 6.46 (Car), 1.94 (Lys) and 0.46 (Trp). According to the boxplot, most of the characters showed a similar performance across the environment except for the plant height, ear height, ear length and ear girth (Figure 1). This result was also confirmed using the combined ANOVA.

3.2. Genotypic Correlation between the Studied Traits

The genotypic correlation among 14 characters of 96 genotypes evaluated at the Anand (lower side of the diagonal value) and Godhra (upper side of the diagonal value) locations are shown in Table 2. Days to 50% tasseling and silking were negatively associated with the plant height and ear height at both locations. This indicates the selection of the early flowering genotype can be carried out through selecting a dwarf line, and this could lead to early maturity. Similarly, yield-attributing traits like ear length were highly correlated with the number of kernels per row, ear girth and number of kernels per ear at both locations (Table 2). Likewise, in this observation, all yield-attributing characters were also associated positively with each other at both sites.

3.3. Genetic Diversity among Maize Genotypes

The genetic diversity of the 96 genotypes of maize was evaluated using SSR markers. In the current study, 136 SSR markers were screened for polymorphism but among them 61 markers showed effective polymorphism (Figure S1). A total number of 131 alleles were perceived ranging from 2 alleles per loci to 3 alleles per loci with an average of 2.14 alleles per loci. The molecular weight of the markers ranged from 100 bp (umc2298 and umc2226) to 930 bp (umc1480) (Table 3). The major allele frequency ranged from 0.41 (umc2252) to 0.83 (umc2282) with an average of 0.63. Also, the average gene diversity was recorded 0.46 with the range of 0.28 (umc1578) to 0.65 (umc2252). Heterozygosity varied widely from 0.00 to 0.82 (umc2252) with an average of 0.04. The informativeness in form of the strength as a marker for the polymorphism-based genotypic discrimination ability is measured using the Polymorphism Information Content (PIC) value. The PIC ranged from 0.24 (umc1578) to 0.58 (umc2252) with an average of 0.36 (Table 3). In the present study, umc2252 (0.58), umc1060 (0.51) and umc1552 (0.54) were highly informative, while the rest of the markers were moderately informative (Table 3).

3.4. Inter-Genotypic Genetic Relationship and Clustering

A total of 96 maize genotypes were clustered into three foremost clusters A, B and C with 47, 40 and 9 genotypic entries, respectively, based on the pair-wise comparison values of Nei’s genetic distance (Figure 2). The maximum number of maize lines grouped in cluster A showing those lines are genetically similar. The main cluster A was further separated into two sub clusters A1 (27 lines) and A2 (20 lines). All 27 lines grouped in sub-cluster A1 comprised only sweet corn type. Sub-cluster A2 had 20 lines, which comprised flour corn as well as sweet corn type. Cluster B was further separated into two sub clusters B1 (31 lines) and B2 (9 lines). The maximum number of accessions were clustered in sub-cluster B1 belonging to flour corn type of inbreds, showing high genetic resemblance among genotypes of this group. Pair-wise genetic distance values among 96 maize lines ranged from 0.0983 (1820194/T1 and 1820192/4-20) to 0.7377 (IGI-1101 and 1820168/T1) (Table S5).

3.5. Population Structure Analysis

STRUCTURE 2.3.4 was used to minimize type I errors during the genome-wide association study. In this study, a total of 61 polymorphic SSR markers were accessed to determine the population structure. Admixture model-based simulations were performed by varying the value of K from 2 to 10 for each K with five repetitions. A sharp pick with the maximum value of delta K was obtained at K = 3 with the lowest standard deviation (Figure 3 and Figure 4, Table S6), confirming the three sub-populations in the present mapping panel using the Evanno method (Figure 3).
Three (K = 3) optimum numbers of sub-populations were estimated which potentially explains the population structure of the accessions (Figure 4). Using this approach, 96 genotypes of maize were assigned to three sub-populations representing the 17.70% with 17 genotypes (sub-population I), 37.50% with 36 genotypes (sub-population II) and 25.00% with 24 genotypes (sub-population III) (Table 4). But based on the membership probability threshold of 0.80, 19 inbred lines were assigned to an admixed group with the representation of 19.79% of the whole panel (Figure 4, Table S7). Furthermore, the net nucleotide distance among the sub-populations was computed using point estimates of P (Table 4). This distance showed that sub-population I and III showed the maximum net nucleotide distance while sub-population I and II showed the lowest distance. Similarly, sub-population II and III showed a moderate distance. There was no observed difference found in the expected heterozygosity in the populations. Sub-population III was less heterozygous with an expected heterozygosity value of 0.3111, followed by subpopulation I with a value of 0.3729 (Table 4). The highest expected heterozygosity was shown by sub-population II. However, sub-population II reported less diversity with the lowest Fst value (0.1368), while subpopulation III reported a maximum Fst value (0.3591) with great diversity among sub-populations. But in the present study, sub-population II implies the genotypes from cluster B and C of Nei’s genetic distance-based dendrogram which is genetically more diverse (Figure 2).

3.6. Trait-Marker Association

A combined model is used for mixed model frameworks (Q + K) for marker-trait associations. The data obtained from population structure analysis (Q-matrix) and kinship (K-matrix) are used for the mixed model framework (Q + K) for marker-trait associations. In this study, the marker-trait association was performed for all recorded traits at both the locations and all polymorphic markers using the Q + K MLM approach through TASSEL. Significant marker-trait association was identified by using the Bonferroni threshold [39]. By performing genome scanning through the Q + K MLM approach, marker-trait associations were found for both locations. QQ plots were used to visualize the result of the Q + MLM approach (Figure 5, Figure 6 and Figures S2 and S3). This observed the significant deviation from the diagonal line due to population structure and familial relatedness for the traits. According to the QQ plot for the Anand environment, the number of kernels per row, number of kernels per ear, protein content, total soluble sugar content and tryptophan content showed significant deviation (Figure 5 and Figure S2).
Similarly, all these characters also surpassed the threshold in the Manhattan plot (Figure 7 and Figure 8). According to the QQ plot and Manhattan plot, 10 markers showed association with the four yields attributing and the 4-grain biochemical traits with an average of 25.46% phenotypic variance in the Anand location (Figure 7). Table 5 indicates that umc2038 is strongly associated with the protein content, with 13.09% phenotypic variance, followed by umc2050 with 12.74% phenotypic variance. umc2296 and umc2136 showed a strong association with tryptophan content. Yield-attributing traits like the number of kernels per row and number of kernels per ear were strongly associated with the umc1303 with 9.57% and 9.85% phenotypic variance, respectively (Figure 7 and Figure 8).
Similarly, the QQ plot (Figure 6 and Figure S3) and Manhattan plot (Figure 8) for the Godhra location indicated the significance of markers for the days up to 50% silking, ear height, ears per plant, protein content, total soluble sugar content, lysine and tryptophan content. In this location, yield-attributing traits like the number of kernels per ear, ear length and ears per plant were associated with umc1941, umc2384 and umc2380 (Table 5). Additionally, protein content was strongly associated with the umc2038 with 17.28% phenotypic variance. This was followed by the umc2050 and umc2296 with 15.09% and 13.83% phenotypic variance, respectively (Figure 7 and Figure 8, Table 5). An umc2296 was associated with lysine and tryptophan content with 9.18 and 12.85% phenotypic variance, respectively. In this study, umc2038, umc2050 and umc2296 were found to be strongly associated with protein content in both environments, indicating a stable association. Similarly, umc2296 and umc2252 showed a significant and strong association with tryptophan content at both locations. The total soluble sugars also established a significant association with umc1535 and umc1303 at both locations (Table 5).

4. Discussion

The present study reveals the significant diversity of maize inbred lines for yield-associated morphological and grain biochemical traits supported using the combined ANOVA (Table 1). Within the genotype, the very least significant variation has been observed. As the studied population are inbred lines, it was quite expected due to their homozygous genetic constituents. Those lines were maintained through continuous selfing in the field condition by restricting the open pollination through bagging the tassel and cobs. During the season, no such morphological visible variation was observed. It is very prominent from Table 1 that the highest variation comes from GEI, which is in fact the consequence of genotype–environment interaction (G × E). Similar trait diversity was previously reported in other maize lines [25,40]. Patel et al. [41] showed the environment shared the least variation but its significance indicated the variability in the test sites. In the present study, grain quality parameters shared maximum variability through genotypic effects, proving its stability across the two environments in Anand and Godhra.
In our study, genotypic correlation analysis expresses the relatedness among the traits from their genetic counterparts from the total variance. The significant and negative correlation among days up to 50% tasseling and silking with plant height helps to select early maturing genotypes [25]. Patel et al. [25] recorded a significant and positive correlation between the ear length to number of kernels per row (0.64) and the number of kernels per ear (0.54). Patel et al. [42] also recorded the same trend with ear length to number of kernels per row (0.67) and number of kernels per ear (0.47). Protein content is significantly associated with the lysine and tryptophan content in a positive direction at both locations. This significant correlation is observed because protein is an important structural composition of maize grain while lysine and tryptophan are contributing amino acids in the development of quality protein maize. Diversity based on phenotypic performance gives the idea about the potential of genotype, but it is highly vulnerable to environmental conditions. Changing the environment leads to the biased estimation of population diversity because of the G × E interaction.
To overcome the environment and stage specificity effect, molecular markers are introduced in the plant breeding research. SSR-based genetic diversity shows the markers’ efficiency of polymorphism is less than 50%. However, the amplified alleles are in a moderate range (2–3 alleles per locus). This is why total gene diversity and PIC values are not much higher than moderate. Based on these parameters, three SSRs (umc2252, umc1060 and umc1552) are found to be more informative to discriminate the maize lines. All these three SSRs amplify three alleles per locus and are positioned in chr. 2 and 5. Similarly, Sa et al. [43] reported 3 to 15 alleles per loci with an average of 7.6 during the genetic diversity analysis of Canadian maize inbred lines. Similarly, Adu et al. [44] reported 4 to 17 alleles per loci with an average of 9.60 in the maize populations. The difference in allele numbers in the present study and the previous study is because of the difference in the methodology of polymorphism detection and plant genetic material used in the study [44]. SSR markers showed 0.46 average gene diversity with a range of 0.28 to 0.65. This result is in accordance with the Sa et al. [43] in maize. However, Park et al. [45] noticed that gene diversity values varied from 0.20 to 0.92 with an average of 0.65, which was higher than the present study. Many of the markers show a unique inbreeding coefficient which indicates less or no heterozygosity. Because maize is a highly cross-pollinated crop, in the present study, inbreds were used and were maintained through continuous inbreeding. This process prevents the gene flow between two populations that may prevent blending of two gene pools thereby decreasing the genetic variation and increasing the homozygosity. However, the true value of markers can be predicted using their PIC value. As per the Botstein et al. [13], the PIC value of a marker is classified into three main categories viz. high, moderate and less informative. The highly informative SSRs have a PIC value greater than 0.5, the moderately informative have a PIC within the range of 0.25–0.5 and the slightly or less informative marker have a PIC below 0.25. In the present study, umc2252, umc1060 and umc1552 were found to be highly informative and the rest of the markers were moderately informative. A high to moderate PIC value indicated the multi-allelic behavior of the SSR markers. Adu et al. [44] concluded that the high magnitude of the polymorphism of SSR markers supports its application in various genetic studies like genetic diversity, QTL mapping and Genome Wide Association Studies (GWAS). Wasala et al. [46] revealed heterozygosity values ranging from 0.13 to 0.75 with an average of 0.63, and polymorphic information content values ranging from 0.20 to 0.90 with a mean of 0.60, which were comparable to the current findings. The SSR-based genotyping enabled the grouping of the studied maize lines into three foremost clusters, which can easily classify the maize lines into sweet corn and flour corn types. The pair-wise comparison based on Nei’s genetic distance ranged to high, from 0.0983 (1820194/T1 and 1820192/4-20) to 0.7377 (IGI-1101 and 1820168/T1), which is clearly indicated in the population structure (Figure 4). The inter-genotype genetic relationship indicated that the genotype IGI-1101 and 1820168/T1 were extremely diverged at the genomic level and can be utilized to create biparental mapping populations as well as in a maize improvement program to broaden the genetic make-up of various maize genotypes. This is in much accordance with the findings of Ganguniya et al. 2023 on another cereal relative pearl millet [47]. Similar results were also inferred by Saiyad et al. 2018 [48] in fodder maize with a maximum Nei’s genetic distance of 0.16 between ‘IC-130791’ and ‘IC-130759’, with the minimum distance of 0.75 found between ‘Origin Mexico-6386’ and ‘Mexico genotype-3970’.
Marker information and cluster analysis help to select a genotype to broaden the genetic base of a crop, but it cannot offer an idea about how genetic variation in population is distributed among and within the population. Population structure analysis is used to infer information regarding genetic variations in and among populations. It is a prerequisite for GWAS. Without the population structure analysis, population stratification underwrites to false positive results in the GWAS [20]. Sometimes pseudo-associations are also found because the population was structured according to the geographical origin of the genotype. Therefore, to avoid the high rate of type I error, a suitable and effective method is implemented to control the effect of population structure during association analysis using STRUCTURE [20]. Population structure analysis divided 96 genotypes into three subpopulations with 17.70% (subpopulation I), 37.50% (subpopulation II) and 25.00% (subpopulation III) representation. Among them, 19.79% of genotypes were assigned to admixed groups based on the membership probability threshold value of 0.80. Admixtures in the populations can be ascribed mostly owing to the allogamous nature of crop and gene flow from the adjacent genotype. The knowledge gained from population structure and ancestry would help with germplasm management and conservation strategies, as well as the choice of suitable parents in maize breeding programs. Sub-population I and II showed minimum net nucleotide distance indicating minimal changes in allele frequencies between these sub-populations and vice versa [44]. The study revealed that sub-population II was more genetically diverse and more heterozygous compared to other sub-populations as per Holsinger et al. [49] based on expected heterozygosity. The fixation index (Fst) is a measure of population differentiation due to genetic structure which ranges from 0 to 1. According to Holsinger et al. [49], a zero value implies that the population can complete random matting. Fst equals one, implying that all genetic variation is explained through the population structure and that the two populations do not differ in their genetic composition. In this study, the fixation index found 0.35 for sub-population III, which was the highest, while 0.13 was found for sub-population II, which was the lowest. A similar conclusion was drawn by Adu et al. [44] during the genetic characterization and population structure of maize populations using SSR markers. They reported two subpopulations with a moderate Fst value of 0.18 and 0.013. They also reported less expected heterozygosity of 0.44 between populations.
Genetic diversity is the prerequisite for the beginning of any breeding program. Higher diversity in any breeding population leads to the discovery of new effective loci and improves the understanding of the genetic architecture and behavior of different loci in the population. The result of the population structure analysis supported the performance of GWAS. It helps to discover numerous genetic variations which are connected to complex characteristics. Compared to linkage mapping, association analysis gives more options to plant breeders for choosing a genotype for crop improvement [43]. In this study, none of the marker-trait associations were supported by the previous study because of the different markers and traits used in the study.

5. Conclusions

This study reported trait diversity, genetic structure and marker-trait association among 96 maize genotypes, through which we are able to identify potential lines for the improved agronomic trait with the best grain biochemical parameters. The multiple allelism of the SSR marker in this study proved its efficiency over the SNP marker and identified two highly diverse genotypes IGI-1101 and 1820168/T1 at the genomic level. This can be further utilized to broaden the genetic base of the maize populations. Population structure analysis divides these lines into three distinct groups, with additional admixed groups with moderate Fst values indicating the free flow of genes among groups. This revealed the absence of population structure, which opens the doors for GWAS analysis. Marker trait association through the MLM approach identified that umc2038, umc2050 and umc2296 were strongly associated with the protein content; markers umc2296 and umc2252 were associated with tryptophan content and markers umc1535 and umc1303 were strongly associated with total soluble sugar at both the locations. Surprisingly, protein, lysine and tryptophan contents are highly correlated at the genotypic level across the environment, helping to improve multiple characters simultaneously through marker-assisted breeding because as per the analysis of variance, these traits also had a good buffering capacity across the wide range of environments as these biochemical traits were potentially expressed through the genotype.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants13060823/s1. Figure S1. Agarose gel picture under UV trans-illuminant showing a model fingerprinting profile using SSRs (A) umc2101 and (B) umc2252. Figure S2. QQ plots for MLM of the different traits evaluated at Anand environment. TA: days to 50% tasseling, SI: days to 50% silking, PH: plant height, EH: ear height, EPP: ears per plant, EL: ear length, EG: ear girth, KRP: kernel rows per ear, KPR: kernels per row, NKE: number of kernels per ear, Pro: protein content, TSS: total soluble sugar, Car: carotene content, Lys: lysine content, Trp: tryptophan content. Figure S3. QQ plots for MLM of the different traits evaluated at Godhra environment. TA: days to 50% tasseling, SI: days to 50% silking, PH: plant height, EH: ear height, EPP: ears per plant, EL: ear length, EG: ear girth, KRP: kernel rows per ear, KPR: kernels per row, NKE: number of kernels per ear, Pro: protein content, TSS: total soluble sugar, Car: carotene content, Lys: lysine content, Trp: tryptophan content. Table S1. Name and sources of studied maize inbreed lines. Table S2. Meteorological data for the experimental field (E1: Anand, E2: Godhra) across the cropping season from November 2020 to March 2021. Table S3. List of polymorphic SSR markers and their primer details for PCR amplification. Table S4. Mean values of the studied phenotypic and grain quality traits of 96 maize lines in two environments-Anand and Godhra. Table S5. Nei genetic distance matrix between studied maize lines. Table S6. The Evanno table output from STRUCTURE HARVESTER. Table S7. Q table derived from STRUCTURE analysis.

Author Contributions

Conceptualization and methodology, R.P., J.M., D.A.P., A.D., R.R. and S.K.; field trial and data collection, R.P., J.M., S.P. and U.P.; germplasm procurement, M.B.P. and C.G.K.; maintaining field experiments, M.B.P. and A.D.; biochemical analysis, A.A.S., S.K., R.P., J.M. and S.P.; molecular data analysis, R.P., J.M., S.P. and A.D.; data curation and analysis, software, visualization, writing and editing of manuscript, R.P., S.K., D.A.P., J.M. and R.R.; funding acquisition and supervision, S.K. and R.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding. APC was funded by Plants-MDPI to R.R.

Data Availability Statement

Data will be made available upon request to the first author (R.P.).

Acknowledgments

R.P. would like to express gratitude to the Anand Agricultural University (AAU) at Anand, Gujarat, India, for granting them permission for doctoral research and allowing them utilize the necessary tools and infrastructure for the study. ICAR-IIMR, Ludhiana (Punjab, India) is also acknowledged for providing the seed material used in this present investigation. R.R. is thankful to the Agricultural Research Organization (ARO-Volcani Center), Israel for Overseas Visiting Scientist fellowship. The authors are also thankful to the editors and anonymous reviewers for their critical comments to improvise the manuscript to its current form.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Prasanna, B.M. Diversity in Global Maize Germplasm: Characterization and Utilization. J. Biosci. 2012, 37, 843–855. [Google Scholar] [CrossRef]
  2. Slafer, G.A.; Otegui, M.E. Physiological Bases for Maize Improvement; CRC Press: Boca Raton, FL, USA, 2000; ISBN 156022889X. [Google Scholar]
  3. Roychowdhury, R.; Taoutaou, A.; Hakeem, K.R.; Gawwad, M.R.A.; Tah, J. Molecular Marker-Assisted Technologies for Crop Improvement. In Crop Improvement in the Era of Climate Change; IK International Publishing House: New Delhi, India, 2014; pp. 241–258. [Google Scholar]
  4. Prasanna, B.M.; Cairns, J.E.; Zaidi, P.H.; Beyene, Y.; Makumbi, D.; Gowda, M.; Magorokosho, C.; Zaman-Allah, M.; Olsen, M.; Das, A. Beat the Stress: Breeding for Climate Resilience in Maize for the Tropical Rainfed Environments. Theor. Appl. Genet. 2021, 134, 1729–1752. [Google Scholar] [CrossRef]
  5. Roychowdhury, R.; Ballen-Taborda, C.; Chaturvedi, P. Characterizing and Improving Traits for Resilient Crop Development. Front. Plant Sci. 2023, 14, 1307327. [Google Scholar] [CrossRef] [PubMed]
  6. Yan, J.; Warburton, M.; Crouch, J. Association Mapping for Enhancing Maize (Zea mays L.) Genetic Improvement. Crop Sci. 2011, 51, 433–449. [Google Scholar] [CrossRef]
  7. Smith, S.; Bubeck, D.; Nelson, B.; Stanek, J.; Gerke, J. Genetic Diversity and Modern Plant Breeding. In Genetic Diversity and Erosion in Plants: Indicators and Prevention; Springer: Berlin/Heidelberg, Germany, 2015; pp. 55–88. [Google Scholar]
  8. Anumalla, M.; Roychowdhury, R.; Geda, C.K.; Mazid, M.; Rathoure, A.K. Utilization of Plant Genetic Resources and Diversity Analysis Tools for Sustainable Crop Improvement with Special Emphasis on Rice. Int. J. Adv. Res. 2015, 3, 1155–1175. [Google Scholar]
  9. Mollier, M.; Roychowdhury, R.; Tzudir, L.; Sharma, R.; Barua, U.; Rahman, N.; Pal, S.; Gogoi, B.; Kalita, P.; Jain, D. Evaluation of Morpho-Physiological and Yield-Associated Traits of Rice (Oryza sativa L.) Landraces Combined with Marker-Assisted Selection under High-Temperature Stress and Elevated Atmospheric CO2 Levels. Plants 2023, 12, 3655. [Google Scholar] [CrossRef] [PubMed]
  10. Van Inghelandt, D.; Melchinger, A.E.; Lebreton, C.; Stich, B. Population Structure and Genetic Diversity in a Commercial Maize Breeding Program Assessed with SSR and SNP Markers. Theor. Appl. Genet. 2010, 120, 1289–1299. [Google Scholar] [CrossRef] [PubMed]
  11. Govindaraj, M.; Vetriventhan, M.; Srinivasan, M. Importance of Genetic Diversity Assessment in Crop Plants and Its Recent Advances: An Overview of Its Analytical Perspectives. Genet. Res. Int. 2015, 2015, 431487. [Google Scholar] [CrossRef] [PubMed]
  12. Roychowdhury, R.; Das, S.P.; Gupta, A.; Parihar, P.; Chandrasekhar, K.; Sarker, U.; Kumar, A.; Ramrao, D.P.; Sudhakar, C. Multi-Omics Pipeline and Omics-Integration Approach to Decipher Plant’s Abiotic Stress Tolerance Responses. Genes 2023, 14, 1281. [Google Scholar] [CrossRef]
  13. Botstein, D.; White, R.L.; Skolnick, M.; Davis, R.W. Construction of a Genetic Linkage Map in Man Using Restriction Fragment Length Polymorphisms. Am. J. Hum. Genet. 1980, 32, 314. [Google Scholar]
  14. Roychowdhury, R.; Karmakar, J.; Karmakar, J.; Adak, M.K.; Dey, N. Physio-Biochemical and Microsatellite Based Profiling of Lowland Rice (Oryza sativa L.) Landraces for Osmotic Stress Tolerance. Am. J. Plant Sci. 2013, 4, 41040. [Google Scholar] [CrossRef]
  15. Williams, J.G.K.; Kubelik, A.R.; Livak, K.J.; Rafalski, J.A.; Tingey, S. V DNA Polymorphisms Amplified by Arbitrary Primers Are Useful as Genetic Markers. Nucleic Acids Res. 1990, 18, 6531–6535. [Google Scholar] [CrossRef]
  16. Karmakar, J.; Roychowdhury, R.; Kar, R.K.; Deb, D.; Dey, N. Profiling of Selected Indigenous Rice (Oryza sativa L.) Landraces of Rarh Bengal in Relation to Osmotic Stress Tolerance. Physiol. Mol. Biol. Plants 2012, 18, 125–132. [Google Scholar] [CrossRef] [PubMed]
  17. Ganie, S.A.; Karmakar, J.; Roychowdhury, R.; Mondal, T.K.; Dey, N. Assessment of Genetic Diversity in Salt-Tolerant Rice and Its Wild Relatives for Ten SSR Loci and One Allele Mining Primer of SalT Gene Located on 1st Chromosome. Plant Syst. Evol. 2014, 300, 1741–1747. [Google Scholar] [CrossRef]
  18. Hasanuzzaman, M.; Roychowdhury, R.; Karmakar, J.; Dey, N.; Nahar, K.; Fujita, M. Recent Advances in Biotechnology and Genomic Approaches for Abiotic Stress Tolerance in Crop Plants. In Genomics and Proteomics: Concepts, Technologies and Applications; Apple Academic Press: Burlington, ON, Canada, 2015; pp. 333–366. [Google Scholar]
  19. Santos, L.F.C.; Andueza-Noh, R.H.; Ruíz, E.S.; Moreno, L.L.; Garruña, R.; Mijangos-Cortes, J.O.; Martínez-Castillo, J. Characterization of the Genetic Structure and Diversity of Maize (Zea mays L) Landrace Populations from Mexico. Maydica 2018, 62, 7. [Google Scholar]
  20. Yu, J.; Buckler, E.S. Genetic Association Mapping and Genome Organization of Maize. Curr. Opin. Biotechnol. 2006, 17, 155–160. [Google Scholar] [CrossRef] [PubMed]
  21. Shikha, K.; Shahi, J.P.; Vinayan, M.T.; Zaidi, P.H.; Singh, A.K.; Sinha, B. Genome-Wide Association Mapping in Maize: Status and Prospects. 3 Biotech 2021, 11, 244. [Google Scholar] [CrossRef] [PubMed]
  22. Shiferaw, B.; Prasanna, B.M.; Hellin, J.; Bänziger, M. Crops That Feed the World 6. Past Successes and Future Challenges to the Role Played by Maize in Global Food Security. Food Secur. 2011, 3, 307–327. [Google Scholar] [CrossRef]
  23. Singh, V.K.; Pundir, S.; Chaturvedi, D.; Kaur, A.; Pandey, A.; Mandal, S.; Kumar, R.; Singh, R.K.; Bhati, H.P.; Dhanda, P.S. Enhancing Maize (Zea mays L.) Crop through Advanced Techniques: A Comprehensive Approach. In New Prospects of Maize; IntechOpen: London, UK, 2023; ISBN 1837686327. [Google Scholar]
  24. Borrás, L.; Westgate, M.E.; Astini, J.P.; Echarte, L. Coupling Time to Silking with Plant Growth Rate in Maize. Field Crops Res. 2007, 102, 73–85. [Google Scholar] [CrossRef]
  25. Patel, R.; Parmar, D.J.; Kumar, S.; Patel, D.A.; Memon, J.; Patel, M.B.; Patel, J.K. Dissection of Genotype× Environment Interaction for Green Cob Yield Using AMMI and GGE Biplot with MTSI for Selection of Elite Genotype of Sweet Corn (Zea mays Conva. Saccharata Var. Rugosa). Indian J. Genet. Plant Breed. 2023, 83, 59–68. [Google Scholar]
  26. Bressani, R.; Benavides, V.; Acevedo, E.; Ortiz, M.A. Changes in Selected Nutrient Contents and in Protein Quality of Common and Quality Protein Maize during Rural Tortilla Preparation. Cereal Chem 1990, 67, 515–518. [Google Scholar]
  27. Sathya, M.; Sumathi, P.; Joel, A.J. A Simple and Rapid Screening Technique for Grain β Carotene Content in Pearl Millet through Spectrophotometric Method. Afr. J. Agric. Res. 2014, 9, 572–576. [Google Scholar]
  28. Lane, C.S. Rapid LC-MS/MS Analysis of Free Amino Acids in Extracellular Matrix; SCIEX: Framingham, MA, USA, 2016. [Google Scholar]
  29. Dubious, M. Colorimetric Method for Determination of Sugars and Related Substances. Anal. Chem. 1956, 28, 350–366. [Google Scholar] [CrossRef]
  30. Doyle, J. DNA Protocols for Plants. In Molecular Techniques in Taxonomy; Springer: Berlin/Heidelberg, Germany, 1991; pp. 283–293. [Google Scholar]
  31. Roychowdhury, R.; Zilberman, O.; Chandrasekhar, K.; Curzon, A.Y.; Nashef, K.; Abbo, S.; Slafer, G.A.; Bonfil, D.J.; Ben-David, R. Pre-Anthesis Spike Growth Dynamics and Its Association to Yield Components among Elite Bread Wheat Cultivars (Triticum aestivum L. Spp.) under Mediterranean Climate. Field Crops Res. 2023, 298, 108948. [Google Scholar] [CrossRef]
  32. Popat, R.; Patel, R.; Parmar, D. Variability: Genetic Variability Analysis for Plant Breeding Research, R Package Version 0.1.0; R Project for Statistical Computing: Vienna, Austria, 2020.
  33. Lui, K. PowerMarker: Integrated Analysis Environment for Genetic Marker Data. Bioinformatics 2005, 21, 2128–2129. [Google Scholar]
  34. Pritchard, J.K.; Stephens, M.; Donnelly, P. Inference of Population Structure Using Multilocus Genotype Data. Genetics 2000, 155, 945–959. [Google Scholar] [CrossRef]
  35. Earl, D.A.; VonHoldt, B.M. STRUCTURE HARVESTER: A Website and Program for Visualizing STRUCTURE Output and Implementing the Evanno Method. Conserv. Genet. Resour. 2012, 4, 359–361. [Google Scholar] [CrossRef]
  36. Bradbury, P.J.; Zhang, Z.; Kroon, D.E.; Casstevens, T.M.; Ramdoss, Y.; Buckler, E.S. TASSEL: Software for Association Mapping of Complex Traits in Diverse Samples. Bioinformatics 2007, 23, 2633–2635. [Google Scholar] [CrossRef]
  37. Xiong, X.; Li, J.; Su, P.; Duan, H.; Sun, L.; Xu, S.; Sun, Y.; Zhao, H.; Chen, X.; Ding, D. Genetic Dissection of Maize (Zea mays L.) Chlorophyll Content Using Multi-Locus Genome-Wide Association Studies. BMC Genomics 2023, 24, 384. [Google Scholar] [CrossRef]
  38. Turner, S.D. Qqman: An R Package for Visualizing GWAS Results Using QQ and Manhattan Plots. BioRxiv 2014. [Google Scholar] [CrossRef]
  39. Zhang, P.; Liu, X.; Tong, H.; Lu, Y.; Li, J. Association mapping for important agronomic traits in core collection of rice (Oryza sativa L.) with SSR markers. PloS ONE 2014, 9, e111508. [Google Scholar]
  40. Mebratu, A.; Wegary, D.; Mohammed, W.; Teklewold, A.; Tarekegne, A. Genotype× Environment Interaction of Quality Protein Maize Hybrids under Contrasting Management Conditions in Eastern and Southern Africa. Crop Sci. 2019, 59, 1576–1589. [Google Scholar] [CrossRef]
  41. Patel, R.; Patel, D.A.; Das, A.; Borkhatariya, T. Estimation of Association among Yield and Yield Attributing Traits of Maize (Zea mays L.). Int. J. Curr. Microbiol. Appl. Sci. 2022, 11, 361–367. [Google Scholar] [CrossRef]
  42. Patel, R.; Patel, D.A.; Parmar, D.J.; Sondarava, P. Genotypic and Phenotypic Correlation Coefficient Study to Identify Best Yield and Yield Attributing Trait for Selection in Maize (Zea mays L.). Int. J. Curr. Microbiol. Appl. Sci. 2022, 11, 368–374. [Google Scholar] [CrossRef]
  43. Sa, K.J.; Hong, T.K.; Lee, J.K. Genetic Diversity and Association Analyses of Canadian Maize Inbred Lines with Agronomic Traits and Simple Sequence Repeat Markers. Plant Breed. Biotechnol. 2018, 6, 159–169. [Google Scholar] [CrossRef]
  44. Adu, G.B.; Awuku, F.J.; Amegbor, I.K.; Haruna, A.; Manigben, K.A.; Aboyadana, P.A. Genetic Characterization and Population Structure of Maize Populations Using SSR Markers. Ann. Agric. Sci. 2019, 64, 47–54. [Google Scholar] [CrossRef]
  45. Park, J.Y.; Ramekar, R.V.; Sa, K.J.; Lee, J.K. Genetic Diversity, Population Structure, and Association Mapping of Biomass Traits in Maize with Simple Sequence Repeat Markers. Genes Genom. 2015, 37, 725–735. [Google Scholar] [CrossRef]
  46. Wasala, S.K.; Prasanna, B.M. Microsatellite Marker-Based Diversity and Population Genetic Analysis of Selected Lowland and Mid-Altitude Maize Landrace Accessions of India. J. Plant Biochem. Biotechnol. 2013, 22, 392–400. [Google Scholar] [CrossRef]
  47. Gunguniya, D.F.; Kumar, S.; Patel, M.P.; Sakure, A.A.; Patel, R.; Kumar, D.; Khandelwal, V. Morpho-Biochemical Characterization and Molecular Marker Based Genetic Diversity of Pearl Millet (Pennisetum glaucum (L.) R. Br.). PeerJ 2023, 11, e15403. [Google Scholar] [CrossRef]
  48. Saiyad, M.M.; Kumar, S. Evaluation of Maize Genotypes for Fodder Quality Traits and SSR Diversity. J. Plant Biochem. Biotechnol. 2018, 27, 78–89. [Google Scholar] [CrossRef]
  49. Holsinger, K.E.; Weir, B.S. Genetics in Geographically Structured Populations: Defining, Estimating and Interpreting F ST. Nat. Rev. Genet. 2009, 10, 639–650. [Google Scholar] [CrossRef]
Figure 1. Box plots showing mean performance of the studied traits across two test sites during 2020. TA: days to 50% tasseling, SI: days to 50% silking, PH: plant height, EH: ear height, EPP: ears per plant, EL: ear length, EG: ear girth, KRP: kernel rows per ear, KPR: kernels per row, NKE: number of kernels per ear, Pro: protein content, TSS: total soluble sugar, Car: carotene content, Lys: lysine content, Trp: tryptophan content.
Figure 1. Box plots showing mean performance of the studied traits across two test sites during 2020. TA: days to 50% tasseling, SI: days to 50% silking, PH: plant height, EH: ear height, EPP: ears per plant, EL: ear length, EG: ear girth, KRP: kernel rows per ear, KPR: kernels per row, NKE: number of kernels per ear, Pro: protein content, TSS: total soluble sugar, Car: carotene content, Lys: lysine content, Trp: tryptophan content.
Plants 13 00823 g001
Figure 2. Dendrogram showing the relationship among 96 maize genotypes using molecular marker data (name of genotypes corresponds to the numbers are mentioned in Table S1).
Figure 2. Dendrogram showing the relationship among 96 maize genotypes using molecular marker data (name of genotypes corresponds to the numbers are mentioned in Table S1).
Plants 13 00823 g002
Figure 3. Estimation of hypothetical sub-populations using ΔK-values.
Figure 3. Estimation of hypothetical sub-populations using ΔK-values.
Plants 13 00823 g003
Figure 4. Model-based population structure plot for each genotype with K = 3 in STRUCTURE software using 61 polymorphic SSRs. Color codes are as follows: sub-population I red, sub-population II green and sub-population III blue. The single vertical line represents an individual genotype and different segments of each vertical line show extent of admixture in an individual genotype.
Figure 4. Model-based population structure plot for each genotype with K = 3 in STRUCTURE software using 61 polymorphic SSRs. Color codes are as follows: sub-population I red, sub-population II green and sub-population III blue. The single vertical line represents an individual genotype and different segments of each vertical line show extent of admixture in an individual genotype.
Plants 13 00823 g004
Figure 5. QQ plot for MLM of the studied traits in Anand environment.
Figure 5. QQ plot for MLM of the studied traits in Anand environment.
Plants 13 00823 g005
Figure 6. QQ plot for MLM of the studied traits in Godhra environment.
Figure 6. QQ plot for MLM of the studied traits in Godhra environment.
Plants 13 00823 g006
Figure 7. Manhattan plot for MLM of the studied traits under Anand environment. The horizontal red line indicates the threshold value. TA: days to 50% tasseling, SI: days to 50% silking, PH: plant height, EH: ear height, EPP: ears per plant, EL: ear length, EG: ear girth, KRP: kernel rows per ear, KPR: kernels per row, NKE: number of kernels per ear, Pro: protein content, TSS: total soluble sugar, Car: carotene content, Lys: lysine content, Trp: tryptophan content.
Figure 7. Manhattan plot for MLM of the studied traits under Anand environment. The horizontal red line indicates the threshold value. TA: days to 50% tasseling, SI: days to 50% silking, PH: plant height, EH: ear height, EPP: ears per plant, EL: ear length, EG: ear girth, KRP: kernel rows per ear, KPR: kernels per row, NKE: number of kernels per ear, Pro: protein content, TSS: total soluble sugar, Car: carotene content, Lys: lysine content, Trp: tryptophan content.
Plants 13 00823 g007
Figure 8. Manhattan plot for MLM of the studied traits under Godhra environment. The horizontal red line indicates the threshold value. TA: days to 50% tasseling, SI: days to 50% silking, PH: plant height, EH: ear height, EPP: ears per plant, EL: ear length, EG: ear girth, KRP: kernel rows per ear, KPR: kernels per row, NKE: number of kernels per ear, Pro: protein content, TSS: total soluble sugar, Car: carotene content, Lys: lysine content, Trp: tryptophan content.
Figure 8. Manhattan plot for MLM of the studied traits under Godhra environment. The horizontal red line indicates the threshold value. TA: days to 50% tasseling, SI: days to 50% silking, PH: plant height, EH: ear height, EPP: ears per plant, EL: ear length, EG: ear girth, KRP: kernel rows per ear, KPR: kernels per row, NKE: number of kernels per ear, Pro: protein content, TSS: total soluble sugar, Car: carotene content, Lys: lysine content, Trp: tryptophan content.
Plants 13 00823 g008
Table 1. Combined analysis of variance (ANOVA) for the studied phenotypic and grain quality traits under Anand and Godhra environments.
Table 1. Combined analysis of variance (ANOVA) for the studied phenotypic and grain quality traits under Anand and Godhra environments.
Source of VariationEnvironment
(df = 1)
Genotype
(df = 95)
GEI
(df = 95)
Residual (df = 190)CV (%)Mean over Environment
TraitMean Squares(G + E + GEI) %Mean Squares(G + E + GEI)%Mean Squares(G + E + GEI)%Mean
Squares
TA301.04 **3.1992.24 **92.883.903.934.733.1469.38
SI229.71 **2.5886.62 **92.544.564.875.072.9775.73
PH219,940.00 **37.922038.00 **33.381752.00 **28.70141.008.24144.12
EH134,578.50 **48.57777.90 **26.67721.90 **24.7550.8010.8965.45
EPP0.35 **0.760.22 **44.400.27 **54.830.0717.551.46
EL213.68 **7.6217.03 **57.6910.24 **34.691.257.7714.38
EG55.88 **4.876.06 **50.215.42 **44.911.199.0112.08
KRP24.82 **1.507.96 **45.809.16 **52.701.207.8213.99
KPR226.37 **1.7183.67 **60.0153.38 **38.283.947.3027.22
NKE108,631.00 **2.6224,231.00 **55.5118,280.00 **41.871793.0011.09381.86
Pro11.31 **0.4823.78 **95.910.90 **3.610.646.4912.33
TSS0.01 **0.003.34 **99.290.02 **0.710.016.501.24
Car7.55 **0.06132.19 **97.822.86 **2.120.8414.176.46
Lys0.08 **0.061.42 **98.810.02 **1.130.015.501.94
Trp0.01 **0.050.12 **98.170.002 **1.780.0015.960.46
** Significant at p < 0.01; df: degrees of freedom, TA: days to 50% tasseling, SI: days to 50% silking, PH: plant height, EH: ear height, EPP: ears per plant, EL: ear length, EG: ear girth, KRP: kernel rows per ear, KPR: kernels per row, NKE: number of kernels per ear, Pro: protein content, TSS: total soluble sugar, Car: carotene content, Lys: lysine content, Trp: tryptophan content, G: genotype sum of square, E: environment sum of square, GEI: genotype–environment interaction sum of square.
Table 2. Correlation coefficient of the studied phenotypic and grain quality traits for the evaluated maize lines across environments: Anand (lower side of the diagonal values in Italics) and Godhra (upper side of the diagonal value).
Table 2. Correlation coefficient of the studied phenotypic and grain quality traits for the evaluated maize lines across environments: Anand (lower side of the diagonal values in Italics) and Godhra (upper side of the diagonal value).
TASIPHEHEPPELKPREGKRPNKEProTSSCarLysTrp
TA1.000.99 **−0.090.04−0.24 *−0.01−0.08−0.21 *−0.05−0.08−0.070.060.20 *−0.11−0.12
SI0.98 **1.00−0.090.04−0.26 **−0.02−0.08−0.20 *−0.04−0.08−0.050.060.19−0.10−0.11
PH−0.18−0.22 *1.000.87 **0.26 **0.63 **0.41 **0.55 **0.040.34 **−0.16−0.130.02−0.13−0.15
EH−0.12−0.130.76 **1.000.21 *0.56 **0.37 **0.48 **0.010.29 **−0.20 *−0.150.01−0.21 *−0.22 *
EPP−0.09−0.090.00−0.121.000.07−0.01−0.02−0.05−0.05−0.31 **−0.39 **−0.16−0.06−0.10
EL−0.10−0.090.34 **0.29 **−0.101.000.73 **0.62 **0.000.57 **0.02−0.09−0.010.060.11
KPR0.020.040.38 **0.29 **−0.110.66 **1.000.50 **0.090.82 **0.040.040.160.040.03
EG0.29 **0.31 **0.070.190.070.24 *0.35 **1.000.35 **0.58 **0.040.01−0.08−0.040.04
KRP0.29 **0.30 **0.070.030.05−0.100.28 **0.61 **1.000.63 **0.110.02−0.02−0.020.04
NKE0.110.120.32 **0.24 *−0.090.52 **0.93 **0.51 **0.58 **1.000.100.020.100.010.04
Pro0.020.01−0.15−0.04−0.150.110.29 **0.140.080.27 **1.000.60 **0.170.35 **0.51 **
TSS0.190.21 *−0.23 *−0.24 *0.00−0.020.170.35 **0.33 **0.26 **0.70 **1.000.33 **0.39 **0.39 **
Car0.23 *0.26 **−0.18−0.33 **0.18−0.040.180.150.180.21 *0.120.24 *1.000.02−0.03
Lys−0.01−0.01−0.02−0.05−0.170.080.21 *0.01−0.110.120.33 **0.37 **0.011.000.91 **
Trp−0.04−0.03−0.06−0.05−0.130.060.21 *−0.06−0.140.120.51 **0.44 **−0.090.84 **1.00
*, ** Significant at p < 0.05 and p < 0.01, respectively. TA: days to 50% tasseling, SI: days to 50% silking, PH: plant height, EH: ear height, EPP: ears per plant, EL: ear length, EG: ear girth, KRP: kernel rows per ear, KPR: kernels per row, NKE: number of kernels per ear, Pro: protein content, TSS: total soluble sugar, Car: carotene content, Lys: lysine content, Trp: tryptophan content.
Table 3. Genotyping data using SSR markers across the studied maize lines.
Table 3. Genotyping data using SSR markers across the studied maize lines.
SSR MarkersChromosome NumberAmplicon Size (bp)Number of AllelesMajor Allele FrequencyGene DiversityHeterozygosityPolymorphism Information ContentInbreeding Coefficient
umc22231166–18020.640.460.000.351.00
umc11221150–17020.790.330.000.281.00
umc21511125–14020.550.490.000.371.00
umc20831122–14320.550.490.130.370.74
umc14461156–16820.640.460.000.361.00
umc15581123–15120.700.420.020.330.94
umc13531232–26020.600.480.000.371.00
umc22041149–16020.640.460.000.351.00
umc22261100–11520.510.500.070.370.87
umc21891163–17520.580.490.000.371.00
umc14521118–12420.640.460.000.361.00
umc22401160–16520.530.500.000.371.00
umc17562164–17030.780.350.020.290.94
umc15522130–17030.540.600.040.540.93
umc12622157–17020.650.460.000.351.00
umc22522112–14530.410.650.820.58−0.25
umc15352155–17020.530.500.000.371.00
umc23802121–13230.630.530.020.480.96
umc21292100–16630.530.530.080.430.84
umc12562173–20020.510.500.000.371.00
umc22202114–12720.670.440.000.351.00
umc17463105–11220.730.400.070.320.81
umc16413111–13420.630.470.090.360.80
umc10103116–13020.680.440.040.340.90
umc18133123–13420.690.430.010.340.98
umc11363145–16820.600.480.040.360.91
umc14893128–14620.540.500.070.370.85
umc16903110–11620.520.500.130.370.74
umc22583151–16020.650.460.010.350.98
umc21013157–18020.640.460.100.360.78
umc21033156–17520.590.480.020.370.96
umc15783168–18520.830.280.000.241.00
umc20503131–14020.700.420.060.330.85
umc21043126–13020.680.430.000.341.00
phi1932253145–15320.590.480.020.370.96
umc17574147–16520.700.420.040.330.90
umc12944150–17520.770.350.010.290.97
umc22824335–42020.830.290.070.250.75
umc20384150–16020.510.500.060.370.89
dupssr344177–23320.700.420.160.330.63
umc23844100–11730.560.540.050.450.90
umc12284141–15620.600.480.000.361.00
umc21374138–15620.580.490.000.371.00
umc13034113–12020.650.460.000.351.00
umc23885257–40020.780.350.140.290.60
umc22965133–15520.670.440.130.340.71
umc10605167–24630.450.600.000.511.00
umc22015190–21020.740.390.010.310.97
umc19415106–13130.560.550.000.461.00
phi0875116–12920.760.370.020.300.94
umc21365149–18120.660.450.040.350.91
umc22955129–13620.560.490.000.371.00
umc22985100–11020.800.320.060.270.80
umc14295235–26020.600.480.000.361.00
umc11535117–12820.630.460.010.360.98
umc16295120–12520.700.420.000.331.00
umc21435156–16620.520.500.000.371.00
umc17966127–14120.640.460.000.351.00
umc18317184–20620.600.480.000.371.00
umc14807100–93030.660.490.020.430.96
umc23927250–27020.570.490.000.371.00
Minimum20.410.280.000.24−0.25
Maximum30.830.650.820.581.00
Average2.140.630.460.040.360.90
Table 4. Significant divergence (Fst) between groups (sub-populations) and average distances (expected heterozygosity) among maize populations and net nucleotide distance among sub-populations.
Table 4. Significant divergence (Fst) between groups (sub-populations) and average distances (expected heterozygosity) among maize populations and net nucleotide distance among sub-populations.
Sub-PopulationFst ValuePercent Mean Value of
Fst within Population
Expected HeterozygosityNumber of GenotypesNet Nucleotide Distance
Sub-Population ISub-Population II
I0.269317.700.3729170.000-
II0.136837.500.4497360.0980.000
III0.359125.000.3111240.1250.113
Admixture-19.79-19--
Table 5. Marker-trait association among the studied maize lines using mixed linear model (MLM) approach for the studied traits under Anand and Godhra environments.
Table 5. Marker-trait association among the studied maize lines using mixed linear model (MLM) approach for the studied traits under Anand and Godhra environments.
Sl. No.MarkersTraitsChromosomep-ValueR2
Environmental location: Anand
1umc1429Days to 50% tasseling50.012830.06847
2umc1429Days to 50% silking50.01530.06507
3umc1303Number of kernels per row40.003290.09578
4umc1303Number of kernels per ear40.002890.09853
5umc2296 *Protein content50.003470.12996
6umc2050 *30.003830.12748
7umc2038 *40.003870.13098
8umc130340.004310.08984
9umc1535 *Total soluble sugars20.004820.08726
10umc1303 *40.006510.08107
11umc2296Carotene content50.012020.1014
12umc153520.014420.06541
13umc2296 *Tryptophan content50.005510.11606
14umc213650.007320.10908
15umc2252 *20.008880.10466
Average0.2546
Environmental location: Godhra
1phi193225Days to 50% silking30.00710.1136
2umc2384Ear height40.00590.1435
3umc2380Ears per plants20.00870.1583
4umc194150.01000.1017
5umc2384Ear length40.01550.1192
6umc1941Number of kernels per ear50.01080.1000
7umc2038 *Protein content49.20 × 10−40.1728
8umc2050 *30.00160.1509
9umc2296 *50.00230.1383
10umc238020.01140.1463
11umc130340.01370.0662
12umc1535 *Total soluble sugars20.00580.0831
13umc130340.00920.0739
14umc2296Lysine content50.01550.0918
15umc2296 *Tryptophan content50.00320.1285
16umc2252 *20.00960.1027
Average0.1182
* Common marker-trait association over both locations.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Patel, R.; Memon, J.; Kumar, S.; Patel, D.A.; Sakure, A.A.; Patel, M.B.; Das, A.; Karjagi, C.G.; Patel, S.; Patel, U.; et al. Genetic Diversity and Population Structure of Maize (Zea mays L.) Inbred Lines in Association with Phenotypic and Grain Qualitative Traits Using SSR Genotyping. Plants 2024, 13, 823. https://doi.org/10.3390/plants13060823

AMA Style

Patel R, Memon J, Kumar S, Patel DA, Sakure AA, Patel MB, Das A, Karjagi CG, Patel S, Patel U, et al. Genetic Diversity and Population Structure of Maize (Zea mays L.) Inbred Lines in Association with Phenotypic and Grain Qualitative Traits Using SSR Genotyping. Plants. 2024; 13(6):823. https://doi.org/10.3390/plants13060823

Chicago/Turabian Style

Patel, Rumit, Juned Memon, Sushil Kumar, Dipak A. Patel, Amar A. Sakure, Manish B. Patel, Arna Das, Chikkappa G. Karjagi, Swati Patel, Ujjaval Patel, and et al. 2024. "Genetic Diversity and Population Structure of Maize (Zea mays L.) Inbred Lines in Association with Phenotypic and Grain Qualitative Traits Using SSR Genotyping" Plants 13, no. 6: 823. https://doi.org/10.3390/plants13060823

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop