Next Article in Journal
Synthesis of Shape-Memory Polyurethanes: Combined Experimental and Simulation Studies
Next Article in Special Issue
Isolation of a Novel QTL, qSCM4, Associated with Strong Culm Affects Lodging Resistance and Panicle Branch Number in Rice
Previous Article in Journal
Involvement of the PI3K/AKT Intracellular Signaling Pathway in the AntiCancer Activity of Hydroxytyrosol, a Polyphenol from Olea europaea, in Hematological Cells and Implication of HSP60 Levels in Its Anti-Inflammatory Activity
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome Wide Identification and Annotation of NGATHA Transcription Factor Family in Crop Plants

by
Hymavathi Salava
1,
Sravankumar Thula
2,3,
Adrià Sans Sánchez
2,3,
Tomasz Nodzyński
2 and
Fatemeh Maghuly
1,*
1
Plant Functional Genomics, Institute of Molecular Biotechnology, Department of Biotechnology, BOKU-VIBT, University of Natural Resources and Life Sciences, 1190 Vienna, Austria
2
Mendel Centre for Plant Genomics and Proteomics, Central European Institute of Technology (CEITEC), Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic
3
National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2022, 23(13), 7063; https://doi.org/10.3390/ijms23137063
Submission received: 6 May 2022 / Revised: 18 June 2022 / Accepted: 23 June 2022 / Published: 25 June 2022
(This article belongs to the Special Issue Functional Genomics for Plant Breeding 3.0)

Abstract

:
The NGATHA (NGA) transcription factor (TF) belongs to the ABI3/VP1 (RAV) transcriptional subfamily, a subgroup of the B3 superfamily, which is relatively well-studied in Arabidopsis. However, limited data are available on the contributions of NGA TF in other plant species. In this study, 207 NGA gene family members were identified from a genome-wide search against Arabidopsis thaliana in the genome data of 18 dicots and seven monocots. The phylogenetic and sequence alignment analyses divided NGA genes into different clusters and revealed that the numbers of genes varied depending on the species. The phylogeny was followed by the characterization of the Solanaceae (tomato, potato, capsicum, tobacco) and Poaceae (Brachypodium distachyon, Oryza sativa L. japonica, and Sorghum bicolor) family members in comparison with A. thaliana. The gene and protein structures revealed a similar pattern for NGA and NGA-like sequences, suggesting that both are conserved during evolution. Promoter cis-element analysis showed that phytohormones such as abscisic acid, auxin, and gibberellins play a crucial role in regulating the NGA gene family. Gene ontology analysis revealed that the NGA gene family participates in diverse biological processes such as flower development, leaf morphogenesis, and the regulation of transcription. The gene duplication analysis indicates that most of the genes are evolved due to segmental duplications and have undergone purifying selection pressure. Finally, the gene expression analysis implicated that the NGA genes are abundantly expressed in lateral organs and flowers. This analysis has presented a detailed and comprehensive study of the NGA gene family, providing basic knowledge of the gene, protein structure, function, and evolution. These results will lay the foundation for further understanding of the role of the NGA gene family in various plant developmental processes.

1. Introduction

Plant growth and development require numerous rigorous regulatory processes, and therefore, transcriptional regulation plays an important role in every stage of plant growth and development. TFs bind to their target genes or adjacent regions and control gene expression by turning them on and off as needed [1,2], and therefore, they play crucial roles in regulating various plant processes and stress responses. To date, many TF families and their binding sites have been reported [3]. One such family of TFs is the B3 superfamily, which regulates the expression of various genes. B3 proteins are expressed in various plant tissues, suggesting a role for B3 proteins in different plant processes [4]. NGA belongs to the RELATED TO ABI3/VP1 (RAV) transcriptional subfamily, forming a subgroup of the B3 superfamily. The RAV subfamily is further divided into two categories: Class-I (proteins with a B3 domain and an AP2 domain) and Class-II (proteins have a B3 domain but no AP2 domain) [5]. Exceptionally, Merchantia Polymorpha has two B3 domains in NGA3 [4]. Although the NGA family has been studied in A. thaliana and Brassica napus, it has been poorly explored in other plant species.
In Arabidopsis, four NGA genes (AtNGA1, AtNGA2, AtNGA3, AtNGA4) and three NGA-LIKE genes (AtNGA-LIKE1, AtNGA-LIKE2, AtNGA-LIKE3) have been reported [6,7,8,9,10,11,12]. NGA TFs are mainly involved in developing pistils; they are also involved in regulating the shape and size of lateral organs such as leaves and petals and the regulation of seed size [6,8,9,13,14,15,16,17,18]. Alvarez et al. [19] used synthetic miRNAs against A. thaliana NGA genes, showing that single NGA mutants exhibited mild phenotypic changes in lateral organs, while quadruple mutants exhibited defective pistils as well as small, broad leaves with the broad perianth [19]. Later, in 2009, the same group published a detailed report on the NGA genes, AtNGA1 and AtNGA4, expressed in the lateral organs, especially the distal parts [6,20]. The gene STYLISH1 (AtSTY1), known to be involved in carpel development, activates the NGA gene that indirectly regulates gynoecium development. STY1 and NGA co-regulate the YUCCA2 (AtYUC2) and AtYUC4 genes that promote auxin biosynthesis, thereby regulating the auxin gradient in pistils. In addition, STY1 is known to directly activate AtYUC4, suggesting a direct link to auxin biosynthesis [6,20]. This was further confirmed by Trigueros et al. [12]. The same group identified a tower of pisa-1 (top1) mutant by activation markers in Arabidopsis and observed that this mutant was impaired in silique development, with enlarged patterns and reduced fruit size. The top1 mutant was found to overexpress AtNGA3 (due to random T-DNA insertions containing the 4x 35S promoter). As a result, the TOP1/NGA3 mutant showed an elongated style [12]. VIGS mediated silencing of EcNGA in Eschscholzia californica resulted in pistils with impaired style and stigma, but the other parts of the flower such as the petals, sepals, and stamens were unaffected, indicating that NGA is redundant in pistil development. Similarly, NbNGAa and NbNGAb were downregulated in Nicotiana benthamiana using VIGS and it was observed that the length of the style was significantly reduced with improperly fused stigma. It has been documented that the YUCCA (YUC) genes are involved in auxin biosynthesis and its accumulation in the gynoecium. In Arabidopsis, reduced expression of YUC genes was reported in nga mutants, resulting in reduced auxin accumulation in the pistils of nga mutants [12]. Furthermore, the expression of NbYUC2 and NbYUC6 in the apical portion of the gynoecium was decreased in the silenced NbNGAa and NbNGAb lines, suggesting that the role of NGA genes in activating YUC genes is involved in promoting auxin gradient across the pistil [7].
NGA TFs also play critical roles in lateral organ growth and development. Lee et al. [8] measured the cell proliferation activity in the lateral organs of overexpression lines and loss-of-function mutants of the NGA family in Arabidopsis, and they observed small, narrow leaves in the overexpression lines and large, wide lateral organs in the quadruple mutants. These results suggest that the NGA family negatively controls cell proliferation in lateral organs [8]. Similar results were observed when BrNGA1 from Brassica was overexpressed in Arabidopsis, suggesting that BrNGA1 regulates cell numbers in the lateral organs and roots [21]. This was further supported by the study of Lee et al. [9]. They expressed AtNGA1 in the presence of domains such as CLAVATA3 (CLV3), Meristem layer1 (ML1), WUSCHEL (WUS), SHOOTMERISTEMLESS (STM), and AINTEGUMENTA (ANT) in Arabidopsis. Their results showed that NGA expression in meristems incapacitated pluripotent cells, rendering them incapable of cell differentiation, suggesting an important role for NGA TFs as general differentiation and pistil group identity factors [9].
Despite the limited literature available, NGA has also been involved in various stress responses [10,22]. Sato et al. [10] overexpressed AtNGA1-GFP under the influence of the 35S promoter, examined the two-week old seedlings subjected to water stress under the confocal microscope and observed increased protein accumulation of AtNGA1-GFP under drought stress compared to the control conditions. They further showed that NGA1 binds to the G-box of the AtNCED3 promoter under water-deficit conditions to induce ABA biosynthesis. The study also confirmed that the binding of AtNGA1 to the promoter of AtNCED3 increased under drought stress. A similar pattern was seen in ABA-deficient mutants of Arabidopsis, where NGA induced AtNCED3 to synthesize ABA in response to stress. In addition, a study by Guo et al. [22] showed that the overexpression of MtNGA1 from Medicago truncatula in A. thaliana exhibited increased tolerance to high salt stress. They also exhibited a reduction in the number of branches in the overexpressed lines along with delayed flowering, indicating the importance of NGA as key players in crucial aspects of plant development as well as stress responses. They also examined the reduced shoot branching by analyzing the transcript levels of SMXL genes in the MtNGA1 overexpression lines to observe that the transcript levels of AtSMXL6, AtSMXL7, and AtSMXL8 were downregulated while the expression of AtMAX1/2, AtBRC1, and AtBRC2 were up-regulated. The repressed shoot branching in the transgenic lines provides important evidence that NGA not only influences ABA, but also regulates strigolactones [22].
To date, phylogenetic analyses of the NGA family of a few plant species such as A. thaliana, B. napus, G. max, B. distachyon, O. sativa, P. patens, and M. truncatula have been reported in the literature [10,21]. Furthermore, Pfannebecker et al. [23] combined the phylogeny of members of the NGA family of cruciferous, nightshade, and grass families. Their study concluded that each gene family evolved independently through several rounds of gene duplication events.
In this study, we performed a detailed analysis of the NGA family in higher plant species, focusing on Solanaceae and Poaceae. Phylogenetic reconstruction of the gene family was followed by the characterization of the Solanaceae NGA gene family compared to the monocot members of Poaceae. The characterization included gene and protein structure, protein motifs, promoter analysis, Gene Ontology, and quantitative RT-PCR analysis of the NGA genes. Our obtained data provide a comprehensive understanding of the NGA gene family in higher plants and facilitate further research related to crop plant development and new control methods.

2. Results

2.1. Identification and Characterization of NGA Genes

We used four NGA and three NGA-Like sequences from A. thaliana as the query to identify the NGA sequences in different plant species. An initial search was started with the BLASTP search in phytozome and Ensemble Plants. Databases such as the Sol Genomics Network and Rice Genome Annotation Project were also used to search for NGA family members. Altogether, 460 sequences were retrieved, which were subjected to reciprocal BLASTP against the NGA sequences of A. thaliana in the NCBI. The obtained sequences were checked for the presence of the B3 domain (PF02362.21) using the HMM profile of the Pfam and SMART databases. The validated sequences were further assessed using CD-HIT with a threshold of a ≥90% cut-off to eliminate redundant sequences. After the filtering process, 207 sequences of monocots and dicots were obtained to characterize the gene family (Table S1). The genes were named according to the homology to the Arabidopsis NGA family and the previous literature [6,7,8,9,10,11,12,14,16,18,23].

2.2. Phylogenetic Analysis of NGA Family

The evolutionary history of the NGA TF family was investigated by assessing the phylogenetic relationship to classify the NGA proteins. The phylogenetic tree was constructed based on the obtained protein sequences of 18 dicots and 7 monocots (Figure 1; Table S1). The phylogenetic analysis demonstrated that both the NGA and NGA-Like sequences of dicots and monocots diversified into different clades for (Figures S1 and S2). Furthermore, the NGA and NGA-Like sequences of dicots were grouped based on families such as Brassicaceae (A. thaliana, B. rapa, Camelina sativa), Solanaceae (S. lycopersicum, S. tuberosum, C. annum, N. tabacum), and other species such as Populus trichocarpa, Phaseolus vulgaris, Medicago truncatula, etc. (Figure S1).
Among the dicots, the number of NGA and NGA-Like protein sequences varied within species, while among the selected monocots, the number of NGA and NGA-Like protein sequences was almost the same; five and two, respectively. However, there were some exceptions: one interesting feature observed here was that T. aestivum possessed the highest number of proteins (26), followed by C. sativa and Musa accuminata (banana) with 20 and 17 sequences, respectively. Furthermore, NGA sequences of banana were grouped separately from other monocots, indicating that banana evolution is independent of other monocots (as showed in Figure 1 and Figure S2). Altogether, our results indicate that the evolution of NGA and NGA-Like sequences have followed divergent lineages. The similarity of protein sequences of A. thaliana, S. lycopersicum, and O. sativa L. japonica was examined to observe that the proteins were 47.63% similar on average (Table S2).

2.3. Physical and Chemical Properties of NGA Family

The physical and chemical properties of the NGA family of Solanaceae (tomato, potato, capsicum, and tobacco) along with Arabidopsis, and Poaceae (rice, sorghum, and Brachypodium) are outlined in Table 1 and Table 2. The table shows the details of the NGA family such as gene ID, chromosome locations, length of the gene, complete coding sequence (CDS), protein molecular weight (MW), and isoelectric point (PI) as well as the predicted location of the signal peptide of the respective proteins. The length of the proteins was between 176 amino acids (CaNGA-Like1-1) to 477 amino acids (CaNGA3), with an average of 324 amino acids. The MW of the proteins ranged from 20.31 kDa (CaNGA-Like1-1) to 52.49 kDa (CaNGA3), with an average of 35.84 kDa (SbNGA1). The predicted IP varied from 4.66 (SbNGA4) to 10.45 (BdNGA5). The IP of twenty proteins was below pH 7, while the IP of twenty-seven proteins was above pH 7. The predicted signal peptides showed that the majority of the NGA proteins were located in the nucleus. In contrast, OsNGA2 and OsNGA4 were located in chloroplast and cytoplasm, respectively (Table 1 and Table 2).

2.4. Gene Structure and Protein Motifs Analyses

The gene structure analysis of Solanaceae members followed a similar pattern as that of A. thaliana (Figure 2a–c). All the NGA genes possessed single introns with some genes possessing untranslated regions (UTRs) and some without UTRs. Among all of the NGAs, StNGA3 included the longest 3′ UTR of 4154 bp. Similarly, the NGA-LIKE genes contained three exons and two introns with an exception for AtNGA-LIKE3 and NtNGA-LIKE with two exons and one intron as well as CaNGA-LIKE1-1 with only a single exon and no UTRs. Poaceae members such as rice, sorghum, and Brachypodium also followed a similar fashion as above-mentioned (Figure 2d–f). The NGA-LIKE gene in Poaceae with two exons and a single intron was observed in OsNGA-LIKE1-2. The number of exons (triple exons and double exons) in a gene is not consistent; however, the results suggest that these genes are conserved in gene structure.
All of the NGA proteins included in the study possessed only one B3 domain (PF02362.21/CL0405). The NGA proteins contained a repressor motif (R/KLFGV) that is responsible for regulating heat stress-related genes (Figure S3) [16,23,24,25,26]. Most of the NGA and NGA-Like proteins possessed the repressor motif except for OsNGA3, cCaNGA-Like1-1, and StNGA-Like 1-1 (Figure S3c,d). These results indicate that NGAs and NGA-Like proteins play an essential role in combating heat stress. We also looked for other protein motifs in the NGAs and NGA-Like sequences using MEME 2.0, represented in Figure 2c,f.
Furthermore, we considered four species of the Solanaceae family (tomato, potato, capsicum, and tobacco), named as “Solanaceae members” in the current study. Three common motifs in the NGA and NGA-Like proteins of the Arabidopsis and Solanaceae members are 1, 2, and 4, representing the conserved B3 domain in these species (Figure 2c and Figure S4a). Motifs 7 and 8 are found distributed among the NGA proteins of tomato, potato, capsicum, tobacco, and A. thaliana, representing conserved domains specific to NGA proteins, named as the NGA-I and NGA-II domains [7,12]. The repressor motif RLFGV is represented in motif 3, and is involved in various stress responses such as heat, salt, and drought [10,22,24,27]. Other motifs such as motif 16 is unique to NGA3 of N. tabacum, while motif 19 is unique to the NGA3 proteins of C. annum and N. tabacum, indicating the importance of these motifs in plant development processes. Similarly, the unique motifs in CaNGA3, NtNGA3-1, and NtNGA3-2 might take part in various stress responses.
Similarly, motif analysis was also performed with NGA protein sequences in members of Poaceae such as O. sativa L. japonica, B. distachyon, and S. bicolor (Figure 2f and Figure S4b), which are named members of Poaceae in this paper. It was observed that three motifs (i.e., motif 1, 2, and 4) are common in both the Solanaceae and Poaceae family members, Arabidopsis and monocots, indicating the conserved B3 domain. Similarly, in Poaceae members, motifs 9 and 10 were conserved only in the NGA sequences, which are named as NGA-II and NGA-I motifs, as described in [7,12]. The repressor motif RLFGV in these monocot species is represented as motif 3 in Figure 2f and Figure S4b.
The conserved motifs in the Solanaceae and Poaceae species indicate the perpetuated function of NGA proteins in the plant kingdom. Some conserved motifs such as 11, 12, 14, 16, and 18 are only present in the NGA2 and NGA3 proteins of three monocots, suggesting their conserved role in plant growth and development. Similarly, motifs 6, 7, 8, 13, and 19 are found in the NGA-Like sequences, indicating a conserved function of NGA-Like proteins, possibly in carpel development, and other possible roles. The proteins were also checked for the presence of transmembrane domains using TMHMM-2.0 (https://services.healthtech.dtu.dk/service.php?TMHMM-2.0; accessed on 13 October 2022) and it was predicted that the NGA proteins are not embedded in the transmembranes (Figure S5) [28].
Our results reveal that both Solanaceae and Poaceae members share similar motifs as in Arabidopsis, suggesting a common role of NGAs and NGA-Like sequences. Further experimental analysis would provide more knowledge on the possible roles of these proteins, especially in stress responses.

2.5. Promoter cis-Element Analysis of NGA Genes

The promoter cis-element analysis was performed as followed by Wei et al. [29]. We considered the 1500 bp upstream region of the NGA genes and looked for the presence of various cis-elements that include light, hormones (abscisic acid [ABA], gibberellic acid [GA], auxin, methyl jasmonic acid [MeJA], and salicylic acid [SA]), stress-responsive elements (drought inducibility, defense and stress response), and other cis-elements related to anaerobic induction, circadian control, meristem development, flavonoid biosynthesis, low-temperature responsive, zein metabolism, seed-specific regulation, At-rich DNA binding protein, endosperm expression, anoxic specific inducibility, and cell cycle regulation. The details of the promoter cis-elements of NGA and NGA-LIKE are represented in Figure 3.
Since the STYLISH1 (STY1) transcription factor is known to bind and induce the NGA gene expression via the DNA binding site ACTCTA(C/A) [31,32,33], we observed the presence of ACTCTAC in the upstream region of the genes, namely, AtNGA2, SlNGA1, SbNGA2, OsNGA-LIKE1-2, SbNGA-LIKE1-1, SbNGA-LIKE1-2, and BdNGA-LIKE1-2. At the same time, we identified ACTCTAA in the promoters of AtNGA-LIKE3, SlNGA-LIKE1-3, NtNGA-LIKE1, and OsNGA1. We also checked for other regulatory elements such as CpG islands and tandem repeats using PlantPan (http://plantpan.itps.ncku.edu.tw/; accessed on 22 January 2022) [34]; however, we could not identify either tandem repeats or CpG islands.

2.6. Three-Dimensional Structure of NGA Proteins

We analyzed the 3D structure of NGA proteins in Arabidopsis, tomato, and rice (Figure 4 and Figure S6). In Arabidopsis and tomato, we observed that the protein structure had a conserved B3 domain and no AP2 domain, which shows that the NGA structure and function might be conserved in these two species (Figure S3). Conservation of the B3 domain in NGA proteins is a notable feature in the RAV subclade, except for the presence of the AP2 domain, specifically in RAV proteins. Multi-alignment of NGA proteins also confirmed the conserved nature of the B3 domain in Arabidopsis, tomato, and rice, suggesting the functional conservation of these proteins during evolution (Figure S3). Furthermore, another five amino acid motifs (RLFGV, green box in Figure S3) seemed to be present in all of the above three species (i.e., RLFGV), indicating an essential role of this motif in plant development.

2.7. Synteny or Gene Duplication Analysis

Even though the NGA family is relatively well-studied in A. thaliana compared to other species such as B. rapa, the evolution history of the NGA family is not yet understood. In this study, we investigated the evolution and origin of the NGA genes of S. lycopersicum in comparison with A. thaliana (Figure 5).
We identified five pairs of the syntenic relationship between A. thaliana and S. lycopersicum, where AtNGA-LIKE1 and AtNGA-LIKE2 are both linked to SlNGA-LIKE1-1 as well as SlNGA-LIKE1-3. AtNGA-LIKE3 is linked to SlNGA-LIKE1-1, forming the fifth syntenic pair. Although five genes are paired between A. thaliana and S. lycopersicum, the number of synteny events suggests the distant evolutionary relationship between these two species (Figure 5a). However, the gene duplication event was also evaluated between O. sativa L. japonica and A. thaliana, where only one gene pair was observed between OsNGA-LIKE1-2 and AtNGA-LIKE2 (Figure 5c). The syntenic relationship between O. sativa L. japonica and A. thaliana further suggests a distant relationship between these two species. Moreover, despite the distant syntenic relation among the genomes of Arabidopsis, tomato and Arabidopsis, rice, the genes belonging to the same subfamily were linked in each syntenic block, suggesting that these species have evolved from the same ancestor (Figure 5). The syntenic relationship was also assessed in other species such as B. rapa, where 32 syntenic pairs were observed, indicating that B. rapa is the closest relative of A. thaliana (Figure 5b). In addition, gene duplication analysis was also investigated in other dicots such as P. trichocarpa, Vitis vinifera, S. tuberosum and monocots such as B. distachyon with 14, 11, 9, 6, and 1 syntenic pairs, respectively (Figure S7). These results show that P. trichocarpa is closely related to A. thaliana while B. distachyon seems to be a distant relative to A. thaliana with only one syntenic pair.
We further assessed the association (Ka) and dissociation (Ks) constant of NGA genes to understand the evolutionary rates (Table 3 and Table 4). The Ka/Ks ratio of most of the genes (is less than 1) indicated that the majority of them have evolved slowly under purifying selection pressure. These results indicate that genes have evolved under stringent conditions, thus maintaining the conserved nature of the NGA family during evolution. However, gene pairs of the sorghum NGA gene family, namely, SbNGA-LIKE1-2 paired with other members of the sorghum NGA family, have resulted from positive Darwinian selection, where the Ka/Ks ratio is greater than 1 (Table 4).

2.8. Functional Annotation of the NGA Gene Family

According to the GO analysis, NGA family is predicted to be involved in BPs including leaf shaping (GO:0010358), flower development (GO:0009908), response to karrikin (GO:0080167), meristem maintenance (GO:0010073), seed growth regulation (GO:0080113), glucosinolate metabolic processes (GO:0019760), regulation of leaf morphogenesis (GO:1901371), and the regulation of transcription (GO:0006355) (Figure 6; Table 5). However, MFs include DNA binding (GO:0003677) and protein binding (GO:0005515) while CCs include the nucleus, suggesting that the NGA family transcription factors reside in the nucleus (GO:0005634). The obtained GO data represent that NGA genes play an essential role in regulating lateral organs and the development of gynoecium, and participate in various gene regulations involved in plant development and stress responses.

2.9. Expression Analysis of NGA Genes by qPCR

We undertook qPCR to understand the gene expression pattern of the NGA family in Arabidopsis and tomato (Figure 7). AtNGA1 was highly expressed in cotyledons followed by flowers, where the expression was reduced to half of that in the cotyledons in A. thaliana. AtNGA2 was highly expressed in the rosetta leaf, and the expression was reduced to half in cotyledon, while very minimal expression was observed in the mature leaf. AtNGA3 expression was almost similar in the cotyledon and flower, and there was a gradual decrease in the AtNGA3 expression in mature leaf followed by the rosetta leaf. AtNGA4 shows the highest expression in mature leaf while the expressions were significantly lower in the cotyledon and flower. Similar to AtNGA4, AtNGA-LIKE1, and AtNGA-LIKE3 expression was significantly higher in the mature leaf while drastically reduced in the cotyledons, flower, and rosetta leaf (Figure 7).
The SlNGA and SlNGA-LIKE expression in young tomato leaf was significantly high compared to the cotyledons, flower, and mature leaf (Figure 7). In the cotyledons and mature leaf, SlNGA and SlNGA-LIKE expression was consistently reduced except for SlNGA-LIKE1-1 in cotyledon and SlNGA-LIKE1-3 in mature leaf, where the expression levels were significantly higher. SlNGA2 showed the highest expression in tomato flower, followed by SlNGA1 and SlNGA-LIKE1-3 (Figure 7).

3. Discussion

The NGA family belonging to the RAV subfamily of the B3 superfamily is relatively well-characterized in A. thaliana compared to other plant species [6,8,9,12,21]. In Arabidopsis, the NGA family is known to be involved in the development of gynoecium and the regulation of lateral organs. However, functional annotation of the NGA family is still an area of limited knowledge. In this study, we performed phylogenetic reconstruction of the NGA family using several dicots (Solanaceae) and monocots (Poaceae) (Figure 1).
The NGA phylogenetic tree has a peculiar feature (i.e., the NGA and NGA-LIKE sequences are very well distinguished, suggesting that these genes have evolved separately with well-demarcated evolution in dicots and monocots (Figure 1)). Furthermore, NGA and NGA-LIKE sequences are defined based on the plant families where members of the Brassicaceae, Solanaceae, and Poaceae are phylogenetically well separated, suggesting that these sequences have resulted from multiple duplication events from the most recent common ancestor. Based on the phylogeny analysis, the NGA sequences from different subfamilies and the number of genes in each species vary. For example, in B. rapa, ten NGAs and seven NGA-LIKE genes were present, while in B. vulgaris, only one NGA and one NGA-LIKE gene were identified. The highest number of genes were identified in C. sativa with 14 NGAs and seven NGA-LIKEs, followed by T. aestivum with 18 NGAs and eight NGA-LIKEs (Figure 1; Figure S2). These results indicate that the NGA genes have evolved due to multiple rounds of duplications leading to the expansion of the gene family. Furthermore, among the monocots, banana forms a distinct clade with respect to both the NGA and NGA-LIKE genes, revealing that the genes within this species might have resulted from repeated segmental duplications (Figure 1 and Figure S2).
Furthermore, the gene structure analysis gives a framework of gene duplications and the functional relationship among the gene families. The exon–intron structures of the NGA family in our analysis revealed that the numbers of exons and introns were conserved among subfamilies, indicating the conserved function of the genes within subfamilies (Figure 2). The same trend has been observed among the protein structures where the NGA and NGA-Like proteins share some common motifs; however, few unique motifs are only present within the subfamilies or unique to species. For example, motifs 9, 11, 13, 14, 15, 17, 18, and 19 were acquired during evolution in the NGA and NGA-Like proteins of the Solanaceae species such as S. lycopersicum, S. tuberosum, C. annuum, and N. tabacum, indicating novel functions of the proteins. Similarly, monocots such as O. sativa L. japonica, B. distachyon, and S. bicolor possess common motifs that are also present in Solanaceae members, suggesting a conserved function of the NGA proteins. Consistent with these results, the three-dimensional structure of the proteins was conserved in these species; however, minor alterations in the amino acid sequences contribute to the functional variations among the NGA proteins (Figure 4). The presence of protein motif (RLFGV) in the NGA proteins of A. thaliana, S. lycopersicum, and O. sativa implicates that this motif plays an essential role in plant development (Figure S4). Consistent with this, it has been observed that AtNGA1 possessing the RLFGV motif directly binds to the promoter of AtNCED3, thereby inducing ABA biosynthesis in Arabidopsis in response to drought stress (Figure S3) [10]. The presence of the repressor motif is also reported in N. benthamiana, Amborella trichopoda, and Aquilegia caerulea in their respective NGA protein sequences [7]. In addition, this repressor motif is reported to be involved in regulating heat stress in the Heat shock factor B family [7,24,25,27]. These findings indicate the significance of NGA proteins in many aspects of plant development, which is yet to be explored.
The analysis of cis-elements in the promoter region of the genes would provide clues into the transcriptional regulation of the respective genes. NGA genes are also looked for in the upstream cis-regulatory elements. It has been observed that light-responsive elements are present in the promoters of the genes, suggesting that light plays an important role in regulating these genes (Figure 3). Almost half of the genes were observed to be involved in stress-related responses such as drought inducibility and defense, suggesting that these genes play a role in stress response. The NGA genes also possess hormone response elements such as ABA, GA, MeJA, SA, and auxin. ABA and SA are known to participate in plant stress, and the cis-elements analysis indicates that NGA genes might be involved in defense response [36,37,38]. The presence of auxin-responsive elements in the promoters of the NGA genes is an interesting feature. As discussed above, the NGA family regulates the AtYUC2 and AtYUC4 genes involved in auxin biosynthesis, especially in carpel development [6,12,16]. However, the direct link of auxin responsive elements with NGA regulation is yet to be discovered. Furthermore, some of these genes also implicate their role in gibberellin signaling and methyl jasmonate pathways. In addition, phytohormone ABA seems to play a major part in carpel development [39,40], and the roles of other hormones such as GA, SA, and MeJA in NGA regulation are still not understood. Among the other cis-elements, anaerobic induction and meristem development seem to be majorly involved in the regulating of NGA genes.
Gene duplications are the main source of evolution of gene families, predominantly tandem and segmental duplication events [41]. The synteny analysis of NGA genes of Arabidopsis with tomato, potato, and other species such as P. trichocapra, M. truncatula, and O. Sativa L. japonica showed that most of them have evolved through segmental duplications. However, these duplications are followed by the diversification of gene functions during evolution. In addition, tandem duplications are also not uncommon, as can be seen in the phylogeny with genes or proteins co-existing, resembling their similarities in terms of sequence and functions [42]. The nucleotide variations are the key to evolution within gene families. The Ka/Ks ratio tells us about the synonymous and non-synonymous changes in the gene sequences acquired during evolution and measures the evolutionary pressure of the nucleotide variations within the sequence of the genes [43,44]. The Ka/Ks ratio is assessed in NGA genes. Most of the genes have evolved under negative selection pressure, thereby screening random deleterious mutations, whereas, in S. bicolor, each gene pair with SbNGA-LIKE1-2 showed positive Darwinian selection. Our study revealed that the NGA family has evolved under stringent selection pressure, resulting in the conservation of the gene family.
GO analysis revealed the possible roles of NGA genes in Arabidopsis and tomato. Being derived from the B3 superfamily, NGA is primarily involved in gene regulation by sequence-specific DNA binding activity (including cis-elements) and is predicted to be localized in the nucleus (Figure 6; Table 5). As evident from the previous literature on the NGA family in A. thaliana and S. lycopersicum, the AtNGAs are involved in regulating leaf morphogenesis and flower development [6,9,12,21,45]. In addition, AtNGA-LIKE1 is thought to be responsive to karrikins, indicating that this gene has a role in Strigolactone signaling. Other functions of NGA-LIKE genes include negative regulation of transcription, seed growth regulation, leaf shaping, and meristem maintenance. The possible roles of the NGA gene family based on the Gene Ontology results implicate the potential role of the genes in plant growth, development, and defense. Consistent with the Gene Ontology results, the gene expression of the NGA family in Arabidopsis and tomato reflected the importance of the genes in regulating leaf morphogenesis and flower development (Figure 7). However, the localization of NGA proteins would provide better evidence for protein expression in different cell types rather than gene expression studies. These results correlate with the expression of NGA genes in Arabidopsis and B. rapa, affecting the development of lateral organs and floral development [8,9,11,21,46].

4. Materials and Methods

4.1. Identification of NGA Genes

The amino acid sequences of NGA proteins of A. thaliana were obtained from the TAIR website (https://www.arabidopsis.org/download_files/Proteins/TAIR10_protein_lists/TAIR10_pep_20101214; accessed on 19 July 2021). These sequences were used to derive the sequences of tomato (S. lycopersicum), potato (Solanum tuberosum), capsicum (Capsicum annum), and tobacco (Nicotiana tabacum) from the Sol Genomics Network (https://solgenomics.net/tools/blast/; accessed on 22 July 2021) using BLASTP (https://blast.ncbi.nlm.nih.gov/Blast.cgi; accessed on 19 July 2021). The sequences in rice (O. sativa L. japonica) were retrieved from the Rice Genome Annotation Project (http://rice.uga.edu/analyses_search_blast.shtml; accessed on 24 July 2021). The sequences for other dicots (Brassica rapa, Beta vulgaris, Glycine max, Helianthus annus, Phaseolus vulagaris, etc.), monocots (S. bicolor, Hordeum vulgare, B. distachyon, Triticum aestivum, etc.) and other crop plants were extracted from phytozome (https://phytozome-next.jgi.doe.gov/; accessed on 25 July 2021) and Ensembl Plants (https://plants.ensembl.org/index.html; accessed on 28 July 2021). These sequences were checked for the presence of the B3 domain using the Hidden Markov Model (HMM) profile of the Pfam database (http://pfam.xfam.org/; accessed on 30 July 2021) [47]. The sequences were then subjected to CD-HIT (http://weizhong-lab.ucsd.edu/cd-hit/; accessed on 7 August 2021) and highly similar sequences were filtered from the other low similar sequences. The redundant sequences were filtered from the rest of the sequences. The molecular weights and the isoelectric points were calculated using ProtParam tool-Expasy (https://web.expasy.org/protparam/; accessed on 18 September 2021).

4.2. Multiple Sequence Alignment and Phylogenetic Tree Construction

The 207 sequences were aligned by MAFFT using default parameters (https://mafft.cbrc.jp/alignment/server/; accessed on 19 August 2021) [48]. The aligned sequences were used for phylogeny using the neighbor-joining (NJ) method and the bootstrap method with 1000 replicates in MEGA 11 [26].

4.3. Exon–Intron Structure and Protein Motif and Structure Analysis

The Gene Structure Display Server 2.0 tool was used to illustrate the location and length of the exons and intron within the respective genes (http://gsds.gao-lab.org/; accessed on 13 September 2021) [49]. The protein motifs were predicted using MEME suite version 5.4.1 (https://meme-suite.org/meme/tools/meme; accessed on 16 September 2021) [50]. The protein length was restricted from 6 to 20 amino acids long, and a maximum of 20 motifs was set.

4.4. Three-Dimensional (3D) Structure of NGA Proteins

The 3D structure of the full-length proteins was generated using I-TASSER (https://zhanggroup.org/I-TASSER/; accessed on 18 January 2022). The best threading template close to the target protein was chosen based on the C-score and TM-score [35].

4.5. Subcellular Localization of Proteins

The signal peptides of the protein sequences were analyzed by using WoLF PSORT (https://wolfpsort.hgc.jp/; accessed on 20 January 2022) [51] and TargetP-2.0 (https://services.healthtech.dtu.dk/service.php?TargetP-2.0; accessed on 25 January 2022) to estimate the subcellular localization of proteins [52].

4.6. Promoter Analysis

The 1500 bp sequence upstream was downloaded for the respective genes. Using PlantCARE (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/; accessed on 30 January 2022) [30], these sequences were examined for the presence of cis-elements related to phytohormones, stress, light, and other biological activities. The sequences were also investigated for the presence of CpG islands and tandem repeats using PlantPAN 3.0 (http://plantpan.itps.ncku.edu.tw/; accessed on 22 January 2022) [34].

4.7. Chromosomal Location and Gene Duplication and Ontology Analysis

The complete genome, gene, and protein sequences were downloaded from the respective databases for the synteny analysis. The Multiple Collinearity Scan Toolkit (MCScanX) was used to scan the genome to identify the gene duplicated gene pairs [53]. Finally, the orthologous gene pairs were identified using a Dual synteny plotter in TBtools (https://github.com/CJ-Chen/TBtools; accessed on 18 January 2022) [54]. The association and dissociation constants (Ka and Ks) were assessed using the Ka/Ks calculation tool (http://services.cbu.uib.no/tools/kaks; accessed on 22 Fabruary 2022) [55].

4.8. Gene Ontology (GO) Analysis

GO enrichment analysis was performed using the Blast2GO tool (https://www.biobam.com/blast2go-basic/; accessed on 25 February 2022) [56] and were categorized into three parts: Molecular Function (MF), Biological Processes (BP), and Cellular Components (CC).

4.9. Plant Material

The seeds were surface-sterilized using chlorine gas for four hours and plated on half-strength Murashige and Skoog medium (MS) with 1% sucrose. After 3-day stratification, seedlings were transferred to normal growth condition (150 µmol/m2/s, 16/8 h photoperiod, 21 °C and 60% relative humidity). In Arabidopsis, cotyledons, rosetta leaf, mature leaf, and flowers were collected from 7, 21, 28, and 32 day old plants, respectively. In tomatoes, cotyledons, developing young leaves (meristem)m and developed mature leaves (second node from the ground) were collected from 12, 28, and 35 day old plants, respectively. The flowers were obtained from the first set of flowers in both Arabidopsis and tomato.

4.10. RNA Extraction and Quantitative (q)PCR Analysis

The RNA from the respective samples was extracted using the Trizol method. First, the contaminating DNA was removed from the extracted RNA using DNase as per the manufacturer’s protocol. The RNA integrity was assessed using the RNA 6000 Pico Kit and Agilent 2100 Bioanalyzer. Finally, one µg of RNA was reverse transcribed to cDNA using the iScript™ cDNA Synthesis Kit (BIO-RAD, CZ). This cDNA was used for qPCR analysis. Quantitative PCR was performed using the SYBR Green PCR Master Mix (Agilent Technologies, Santa Clara, CA, USA) on a 7300 Fast Real-Time PCR system (Applied Biosystems, CA, USA). The primer sequences used for Real-Time PCR were designed using Primer3 software (Table S3). Ubiquitin and RNaseH were used as the internal controls for Arabidopsis while actin and ubiquitin were used for tomato. The relative expression was calculated using the formula 2(−ΔΔCT), where ΔCt = (Ct value of target gene) − (Ct value of actin) and ΔΔCT = ΔCt of accession −ΔCt of reference.

5. Conclusions

The comprehensive analysis of the NGA gene family identified 207 sequences that were classified into different gene families according to species. The identified genes from the selected dicot species (Arabidopsis, tomato, potato, capsicum, and tobacco) and monocot species (rice, sorghum and brachypodium) were characterized for gene structure, protein motif, the 3D structure of proteins, gene duplications, Gene Ontology, and expression studies. The gene structure and protein 3D structure revealed the conserved nature of the gene families across different species. Furthermore, Gene Ontology studies implicated the possible roles of the gene families in various aspects of plant development and stress or defense responses. This is in concordance with the gene expression of the NGA genes, suggesting that the NGA genes are mainly involved in the regulation of lateral organs such as the development of the leaves and flowers. Therefore, the detailed characterization of NGA genes in different species is required for further understanding the gene family in various plant developmental processes.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms23137063/s1.

Author Contributions

H.S., S.T. and F.M. conceived and designed the manuscript. H.S. and S.T. performed the experiments and participated in the bioinformatics and data analysis. A.S.S. and T.N. contributed to performing the qPCR experiments. H.S. wrote the manuscript. F.M. and S.T. revised and proofread the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

The Czech Science Foundation [GA 20-20860Y] support the A.S.S.

Acknowledgments

We especially thank Marta Zweiwka. The Plant Sciences Core Facility of CEITEC Masaryk University is acknowledged for the technical support. The authors also wish to thank Gorji Marzban.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

ABAAbscisic acid
BPBiological processes
CCCellular components
CDSCoding DNA sequence
GAGibberellic acid
GFPGreen fluorescent protein
GOGene Ontology
HMMHidden Markov Model
kDaKilo Dalton
MeJAMethyl jasmonic acid
MFMolecular function
MWMolecular weight
NCED9-cis-epoxycarotenoid dioxygenase
NGANGATHA
NGA-LIKENGATHA-LIKE
pIIsoelectric point
qPCRQuantitative PCR
RAVRELATED TO ABI3/VP1
RLFGVArg-Lue-Phe-Gly-Val
SASalicylic acid
STYSTYLISH
TAIRThe Arabidopsis Information Resource
TFTranscription factor
top1tower of pisa-1
UTRUntranslated region
VIGSVirus induced gene silencing
YUCYUCCA

References

  1. Omidbakhshfard, M.A.; Proost, S.; Fujikura, U.; Mueller-Roeber, B. Growth-Regulating Factors (GRFs): A Small Transcription Factor Family with Important Functions in Plant Biology. Mol. Plant 2015, 8, 998–1010. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Schwechheimer, C.; Bevan, M. The regulation of transcription factor activity in plants. Trends Plant Sci. 1998, 3, 378–383. [Google Scholar] [CrossRef]
  3. Riaño-Pachón, D.M.; Ruzicic, S.; Dreyer, I.; Mueller-Roeber, B. PlnTFDB: An integrative plant transcription factor database. BMC Bioinform. 2007, 8, 42. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Swaminathan, K.; Peterson, K.; Jack, T. The plant B3 superfamily. Trends Plant Sci. 2008, 13, 647–655. [Google Scholar] [CrossRef]
  5. Romanel, E.A.C.; Schrago, C.G.; Couñago, R.M.; Russo, C.A.M.; Alves-Ferreira, M. Evolution of the B3 DNA Binding Superfamily: New Insights into REM Family Gene Diversification. PLoS ONE 2009, 4, e5791. [Google Scholar] [CrossRef] [Green Version]
  6. Alvarez, J.P.; Goldshmidt, A.; Efroni, I.; Bowman, J.L.; Eshed, Y. The NGATHA distal organ development genes are essential for style specification in Arabidopsis. Plant Cell 2009, 21, 1373–1393. [Google Scholar] [CrossRef] [Green Version]
  7. Fourquin, C.; Ferrándiz, C. The essential role of NGATHA genes in style and stigma specification is widely conserved across eudicots. New Phytol. 2014, 202, 1001–1013. [Google Scholar] [CrossRef]
  8. Lee, B.H.; Kwon, S.H.; Lee, S.-J.; Park, S.K.; Song, J.T.; Lee, S.; Lee, M.M.; Hwang, Y.-S.; Kim, J.H. The Arabidopsis thaliana NGATHA transcription factors negatively regulate cell proliferation of lateral organs. Plant Mol. Biol. 2015, 89, 529–538. [Google Scholar] [CrossRef]
  9. Lee, B.H.; Mai, T.T.; Song, J.T.; Kim, J.H. The Arabidopsis thaliana NGATHA1 transcription factor acts as a promoter of a general differentiation program and a carpel identity factor. J. Plant Biol. 2017, 60, 352–357. [Google Scholar] [CrossRef]
  10. Sato, H.; Takasaki, H.; Takahashi, F.; Suzuki, T.; Iuchi, S.; Mitsuda, N.; Ohme-Takagi, M.; Ikeda, M.; Seo, M.; Yamaguchi-Shinozaki, K. Arabidopsis thaliana NGATHA1 transcription factor induces ABA biosynthesis by activating NCED3 gene during dehydration stress. Proc. Natl. Acad. Sci. USA 2018, 115, E11178–E11187. [Google Scholar] [CrossRef] [Green Version]
  11. Shao, J.; Meng, J.; Wang, F.; Shou, B.; Chen, Y.; Xue, H.; Zhao, J.; Qi, Y.; An, L.; Yu, F.; et al. NGATHA-LIKEs Control Leaf Margin Development by Repressing CUP-SHAPED COTYLEDON2 Transcription. Plant Physiol. 2020, 184, 345–358. [Google Scholar] [CrossRef]
  12. Trigueros, M.; Navarrete-Gómez, M.; Sato, S.; Christensen, S.K.; Pelaz, S.; Weigel, D.; Yanofsky, M.F.; Ferrándiz, C. The NGATHA genes direct style development in the Arabidopsis gynoecium. Plant Cell 2009, 21, 1394–1409. [Google Scholar] [CrossRef] [Green Version]
  13. Alvarez, J.P.; Furumizu, C.; Efroni, I.; Eshed, Y.; Bowman, J.L. Active suppression of a leaf meristem orchestrates determinate leaf growth. eLife 2016, 5, e15023. [Google Scholar] [CrossRef]
  14. Ballester, P.; Martínez-Godoy, M.A.; Ezquerro, M.; Navarrete-Gómez, M.; Trigueros, M.; Rodríguez-Concepción, M.; Ferrándiz, C. A transcriptional complex of NGATHA and bHLH transcription factors directs stigma development in Arabidopsis. Plant Cell 2021, 33, 3645–3657. [Google Scholar] [CrossRef]
  15. Li, N.; Li, Y. Signaling pathways of seed size control in plants. Curr. Opin. Plant Biol. 2016, 33, 23–32. [Google Scholar] [CrossRef]
  16. Martínez-Fernández, I.; Sanchís, S.; Marini, N.; Balanzá, V.; Ballester, P.; Navarrete-GóMez, M.; Oliveira, A.C.; Colombo, L.; Ferrándiz, C. The effect of NGATHA altered activity on auxin signaling pathways within the Arabidopsis gynoecium. Front. Plant Sci. 2014, 5, 210. [Google Scholar] [CrossRef] [Green Version]
  17. Nicolas, A.; Maugarny-Calès, A.; Adroher, B.; Chelysheva, L.; Li, Y.; Burguet, J.; Bågman, A.-M.; Smit, M.E.; Brady, S.M.; Li, Y. De novo stem cell establishment in meristems requires repression of organ boundary cell fate. bioRxiv 2022, 1, 1. [Google Scholar]
  18. Zhang, Y.; Du, L.; Xu, R.; Cui, R.; Hao, J.; Sun, C.; Li, Y. Transcription factors SOD7/NGAL2 and DPA4/NGAL3 act redundantly to regulate seed size by directly repressing KLU expression in Arabidopsis thaliana. Plant Cell 2015, 27, 620–632. [Google Scholar] [CrossRef] [Green Version]
  19. Alvarez, J.P.; Pekker, I.; Goldshmidt, A.; Blum, E.; Amsellem, Z.; Eshed, Y. Endogenous and Synthetic MicroRNAs Stimulate Simultaneous, Efficient, and Localized Regulation of Multiple Targets in Diverse Species. Plant Cell 2006, 18, 1134–1151. [Google Scholar] [CrossRef] [Green Version]
  20. Sohlberg, J.J.; Myrenås, M.; Kuusk, S.; Lagercrantz, U.; Kowalczyk, M.; Sandberg, G.; Sundberg, E. STY1 regulates auxin homeostasis and affects apical-basal patterning of the Arabidopsis gynoecium. Plant J. 2006, 47, 112–123. [Google Scholar] [CrossRef]
  21. Kwon, S.H.; Lee, B.H.; Kim, E.Y.; Seo, Y.S.; Lee, S.; Kim, W.T.; Song, J.T.; Kim, J.H. Overexpression of a Brassica rapa NGATHA Gene in Arabidopsis thaliana Negatively Affects Cell Proliferation During Lateral Organ and Root Growth. Plant Cell Physiol. 2009, 50, 2162–2173. [Google Scholar] [CrossRef] [Green Version]
  22. Guo, T.; Wang, S.; Li, Y.; Yuan, J.; Xu, L.; Zhang, T.; Chao, Y.; Han, L. Expression of a NGATHA1 Gene from Medicago truncatula Delays Flowering Time and Enhances Stress Tolerance. Int. J. Mol. Sci. 2020, 21, 2384. [Google Scholar] [CrossRef] [Green Version]
  23. Pfannebecker, K.C.; Lange, M.; Rupp, O.; Becker, A. Seed plant specific gene lineages involved in carpel development. Mol. Biol. Evol. 2017, 34, 925–942. [Google Scholar] [CrossRef] [Green Version]
  24. Mistry, J.; Chuguransky, S.; Williams, L.; Qureshi, M.; Salazar, G.A.; Sonnhammer, E.L.L.; Tosatto, S.C.E.; Paladin, L.; Raj, S.; Richardson, L.J.; et al. Pfam: The protein families database in 2021. Nucleic Acids Res. 2021, 49, D412–D419. [Google Scholar] [CrossRef]
  25. Katoh, K.; Rozewicki, J.; Yamada, K.D. MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform. 2017, 20, 1160–1166. [Google Scholar] [CrossRef] [Green Version]
  26. Tamura, K.; Stecher, G.; Kumar, S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol. Biol. Evol. 2021, 38, 3022–3027. [Google Scholar] [CrossRef]
  27. Hu, B.; Jin, J.; Guo, A.-Y.; Zhang, H.; Luo, J.; Gao, G. GSDS 2.0: An upgraded gene feature visualization server. Bioinformatics 2015, 31, 1296–1297. [Google Scholar] [CrossRef] [Green Version]
  28. Bailey, T.L.; Johnson, J.; Grant, C.E.; Noble, W.S. The MEME Suite. Nucleic Acids Res. 2015, 43, W39–W49. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Yang, J.; Yan, R.; Roy, A.; Xu, D.; Poisson, J.; Zhang, Y. The I-TASSER Suite: Protein structure and function prediction. Nat. Methods 2015, 12, 7–8. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  30. Horton, P.; Park, K.-J.; Obayashi, T.; Fujita, N.; Harada, H.; Adams-Collier, C.J.; Nakai, K. WoLF PSORT: Protein localization predictor. Nucleic Acids Res. 2007, 35, W585–W587. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  31. Bannai, H.; Tamada, Y.; Maruyama, O.; Nakai, K.; Miyano, S. Extensive feature detection of N-terminal protein sorting signals. Bioinformatics 2002, 18, 298–305. [Google Scholar] [CrossRef] [Green Version]
  32. Lescot, M.; Déhais, P.; Thijs, G.; Marchal, K.; Moreau, Y.; Van de Peer, Y.; Rouzé, P.; Rombauts, S. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 2002, 30, 325–327. [Google Scholar] [CrossRef]
  33. Chow, C.-N.; Lee, T.-Y.; Hung, Y.-C.; Li, G.-Z.; Tseng, K.-C.; Liu, Y.-H.; Kuo, P.-L.; Zheng, H.-Q.; Chang, W.-C. PlantPAN3. 0: A new and updated resource for reconstructing transcriptional regulatory networks from ChIP-seq experiments in plants. Nucleic Acids Res. 2019, 47, D1155–D1163. [Google Scholar] [CrossRef] [Green Version]
  34. Wang, Y.; Tang, H.; DeBarry, J.D.; Tan, X.; Li, J.; Wang, X.; Lee, T.-H.; Jin, H.; Marler, B.; Guo, H.; et al. MCScanX: A toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012, 40, e49. [Google Scholar] [CrossRef] [Green Version]
  35. Chen, C.; Chen, H.; Zhang, Y.; Thomas, H.R.; Frank, M.H.; He, Y.; Xia, R. TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data. Mol. Plant 2020, 13, 1194–1202. [Google Scholar] [CrossRef]
  36. Wang, D.; Zhang, Y.; Zhang, Z.; Zhu, J.; Yu, J. KaKs_Calculator 2.0: A Toolkit Incorporating Gamma-Series Methods and Sliding Window Strategies. Genom. Proteom. Bioinform. 2010, 8, 77–80. [Google Scholar] [CrossRef] [Green Version]
  37. Conesa, A.; Götz, S. Blast2GO: A Comprehensive Suite for Functional Analysis in Plant Genomics. Int. J. Plant Genom. 2008, 2008, 619832. [Google Scholar] [CrossRef]
  38. Ikeda, M.; Mitsuda, N.; Ohme-Takagi, M. Arabidopsis HsfB1 and HsfB2b Act as Repressors of the Expression of Heat-Inducible Hsfs but Positively Regulate the Acquired Thermotolerance. Plant Physiol. 2011, 157, 1243–1254. [Google Scholar] [CrossRef] [Green Version]
  39. Ikeda, M.; Ohme-Takagi, M. A Novel Group of Transcriptional Repressors in Arabidopsis. Plant Cell Physiol. 2009, 50, 970–975. [Google Scholar] [CrossRef] [Green Version]
  40. Bharti, K.; von Koskull-Döring, P.; Bharti, S.; Kumar, P.; Tintschl-Korbitzer, A.; Treuter, E.; Nover, L. Tomato Heat Stress Transcription Factor HsfB1 Represents a Novel Type of General Transcription Coactivator with a Histone-Like Motif Interacting with the Plant CREB Binding Protein Ortholog HAC1. Plant Cell 2004, 16, 1521–1535. [Google Scholar] [CrossRef] [Green Version]
  41. Krogh, A.; Larsson, B.; von Heijne, G.; Sonnhammer, E.L. Predicting transmembrane protein topology with a hidden markov model: Application to complete genomes. J. Mol. Biol. 2001, 305, 567–580. [Google Scholar] [CrossRef] [Green Version]
  42. Wei, Y.; Wan, H.; Wu, Z.; Wang, R.; Ruan, M.; Ye, Q.; Li, Z.; Zhou, G.; Yao, Z.; Yang, Y. A Comprehensive Analysis of Carotenoid Cleavage Dioxygenases Genes in Solanum Lycopersicum. Plant Mol. Biol. Rep. 2015, 34, 512–523. [Google Scholar] [CrossRef]
  43. Eklund, D.M.; Cierlik, I.; Ståldal, V.; Claes, A.R.; Vestman, D.; Chandler, J.; Sundberg, E. Expression of Arabidopsis SHORT INTERNODES/STYLISH family genes in auxin biosynthesis zones of aerial organs is dependent on a GCC box-like regulatory element. Plant Physiol. 2011, 157, 2069–2080. [Google Scholar] [CrossRef] [Green Version]
  44. Eklund, D.M.; Ståldal, V.; Valsecchi, I.; Cierlik, I.; Eriksson, C.; Hiratsu, K.; Ohme-Takagi, M.; Sundström, J.F.; Thelander, M.; Ezcurra, I. The Arabidopsis thaliana STYLISH1 protein acts as a transcriptional activator regulating auxin biosynthesis. Plant Cell 2010, 22, 349–363. [Google Scholar] [CrossRef] [Green Version]
  45. Ståldal, V.; Cierlik, I.; Chen, S.; Landberg, K.; Baylis, T.; Myrenås, M.; Sundström, J.F.; Eklund, D.M.; Ljung, K.; Sundberg, E. The Arabidopsis thaliana transcriptional activator STYLISH1 regulates genes affecting stamen development, cell expansion and timing of flowering. Plant Mol. Biol. 2012, 78, 545–559. [Google Scholar] [CrossRef]
  46. Ding, P.; Ding, Y. Stories of Salicylic Acid: A Plant Defense Hormone. Trends Plant Sci. 2020, 25, 549–565. [Google Scholar] [CrossRef]
  47. Shah, J. The salicylic acid loop in plant defense. Curr. Opin. Plant Biol. 2003, 6, 365–371. [Google Scholar] [CrossRef]
  48. Chen, K.; Li, G.J.; Bressan, R.A.; Song, C.P.; Zhu, J.K.; Zhao, Y. Abscisic acid dynamics, signaling, and functions in plants. J. Integr. Plant Biol. 2020, 62, 25–54. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  49. Fitzpatrick, A.H.; Shrestha, N.; Bhandari, J.; Crowell, D.N. Roles for farnesol and ABA in Arabidopsis flower development. Plant Signal. Behav. 2011, 6, 1189–1191. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  50. Kai, W.; Fu, Y.; Wang, J.; Liang, B.; Li, Q.; Leng, P. Functional analysis of SlNCED1 in pistil development and fruit set in tomato (Solanum lycopersicum L.). Sci. Rep. 2019, 9, 16943. [Google Scholar] [CrossRef] [PubMed]
  51. Cannon, S.B.; Mitra, A.; Baumgarten, A.; Young, N.D.; May, G. The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biol. 2004, 4, 10. [Google Scholar] [CrossRef] [Green Version]
  52. Verma, S.; Bhatia, S. A comprehensive analysis of the B3 superfamily identifies tissue-specific and stress-responsive genes in chickpea (Cicer arietinum L.). 3 Biotech 2019, 9, 346. [Google Scholar] [CrossRef]
  53. Sironi, M.; Cagliani, R.; Forni, D.; Clerici, M. Evolutionary insights into host–pathogen interactions from mammalian sequence data. Nat. Rev. Genet. 2015, 16, 224–236. [Google Scholar] [CrossRef]
  54. Zhou, T.; Gu, W.; Wilke, C.O. Detecting Positive and Purifying Selection at Synonymous Sites in Yeast and Worm. Mol. Biol. Evol. 2010, 27, 1912–1922. [Google Scholar] [CrossRef] [Green Version]
  55. Osnato, M. Making the connection: Gene regulatory networks reveal a new regulator for graft formation. Plant Cell 2021, 33, 3604–3605. [Google Scholar] [CrossRef]
  56. Shao, J.; Liu, X.; Wang, R.; Zhang, G.; Yu, F. The Over-Expression of an Arabidopsis B3 Transcription Factor, ABS2/NGAL1, Leads to the Loss of Flower Petals. PLoS ONE 2012, 7, e49861. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Phylogenetic analysis of the NGATHA family in crop plants. The phylogenetic tree was constructed using the NJ method with 1000 bootstrap replications in MEGA11. The NGATHA proteins were divided into distinct subfamilies with five NGA proteins and three NGA-Like proteins. The symbols are represented to each of the NGATHA proteins as follows. Ijms 23 07063 i001 NGA1, Ijms 23 07063 i002 NGA2, Ijms 23 07063 i003 NGA3, Ijms 23 07063 i004 NGA4, Ijms 23 07063 i005 NGA5, Ijms 23 07063 i006 NGA-Like1, Ijms 23 07063 i007 NGA-Like2, Ijms 23 07063 i008 NGA-Like3.
Figure 1. Phylogenetic analysis of the NGATHA family in crop plants. The phylogenetic tree was constructed using the NJ method with 1000 bootstrap replications in MEGA11. The NGATHA proteins were divided into distinct subfamilies with five NGA proteins and three NGA-Like proteins. The symbols are represented to each of the NGATHA proteins as follows. Ijms 23 07063 i001 NGA1, Ijms 23 07063 i002 NGA2, Ijms 23 07063 i003 NGA3, Ijms 23 07063 i004 NGA4, Ijms 23 07063 i005 NGA5, Ijms 23 07063 i006 NGA-Like1, Ijms 23 07063 i007 NGA-Like2, Ijms 23 07063 i008 NGA-Like3.
Ijms 23 07063 g001
Figure 2. The structural analysis of NGATHA genes and their conserved protein motifs. (a,d) The phylogenetic tree and classification of NGATHA genes, (b,e) Exon–intron structures (where exons, introns and untranslated regions are represented by yellow bars, blue bars and black lines, respectively) and (c,f) protein motifs. (ac) represent the gene structures and protein motifs of A. thaliana, S. lycopersicum, S. tuberosum, C. annuum, and N. tabacum, (df) represent the gene structures and protein motifs of A. thaliana, O. sativa, S. bicolor, and B. distachyon.
Figure 2. The structural analysis of NGATHA genes and their conserved protein motifs. (a,d) The phylogenetic tree and classification of NGATHA genes, (b,e) Exon–intron structures (where exons, introns and untranslated regions are represented by yellow bars, blue bars and black lines, respectively) and (c,f) protein motifs. (ac) represent the gene structures and protein motifs of A. thaliana, S. lycopersicum, S. tuberosum, C. annuum, and N. tabacum, (df) represent the gene structures and protein motifs of A. thaliana, O. sativa, S. bicolor, and B. distachyon.
Ijms 23 07063 g002
Figure 3. The analysis of cis-acting elements in the promoters of NGATHA genes. The X-axis represents the NGATHA genes, and the Y-axis represents the number of cis-elements in the promoter of each gene. The cis-elements were predicted in the 1500 bp upstream regions using PlantCARE (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/; accessed on 30 January 2022) [30]. “Others” include promoter elements related to anaerobic induction, circadian control, meristem development, flavonoid biosynthesis, zein metabolism, seed specific regulation, endosperm expression and cell cycle regulation.
Figure 3. The analysis of cis-acting elements in the promoters of NGATHA genes. The X-axis represents the NGATHA genes, and the Y-axis represents the number of cis-elements in the promoter of each gene. The cis-elements were predicted in the 1500 bp upstream regions using PlantCARE (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/; accessed on 30 January 2022) [30]. “Others” include promoter elements related to anaerobic induction, circadian control, meristem development, flavonoid biosynthesis, zein metabolism, seed specific regulation, endosperm expression and cell cycle regulation.
Ijms 23 07063 g003
Figure 4. The 3D-structure of the NGATHA proteins in (a) A. thaliana, (b) S. lycopersicum, and (c) O. sativa L. Japonica. The three-dimensional structures were acquired using I-TASSER (https://zhanggroup.org/I-TASSER/; accessed on 18 January 2022) [35].
Figure 4. The 3D-structure of the NGATHA proteins in (a) A. thaliana, (b) S. lycopersicum, and (c) O. sativa L. Japonica. The three-dimensional structures were acquired using I-TASSER (https://zhanggroup.org/I-TASSER/; accessed on 18 January 2022) [35].
Ijms 23 07063 g004
Figure 5. The gene duplication or synteny analysis of the NGATHA genes of A. thaliana with (a) S. lycopersicum; (b) B. rapa, and (c) O. sativa L. Japonica. The grey lines (in the background) represent collinear blocks between the respective genomes. The red lines indicate the syntenic gene pairs of S. lycopersicum, B. rapa, and O. sativa L. Japonica with A. thaliana.
Figure 5. The gene duplication or synteny analysis of the NGATHA genes of A. thaliana with (a) S. lycopersicum; (b) B. rapa, and (c) O. sativa L. Japonica. The grey lines (in the background) represent collinear blocks between the respective genomes. The red lines indicate the syntenic gene pairs of S. lycopersicum, B. rapa, and O. sativa L. Japonica with A. thaliana.
Ijms 23 07063 g005
Figure 6. The GO annotation and classification of NGATHA proteins using Blast2GO. Results are summarized for the three main GO categories (BP, MF, and CC). The X-axis includes the most abundant GO terms. The Y-axis represents the number of NGATHA proteins.
Figure 6. The GO annotation and classification of NGATHA proteins using Blast2GO. Results are summarized for the three main GO categories (BP, MF, and CC). The X-axis includes the most abundant GO terms. The Y-axis represents the number of NGATHA proteins.
Ijms 23 07063 g006
Figure 7. The expression patterns of the NGATHA genes in Arabidopsis (af) and tomato (gl) in different plant organs were verified by qPCR. The X-axis represents different plant organs: CL—cotyledon, RL—rosetta leaf, FL—flower, YL—young leaf, ML—mature leaf. The Y-axis represents the relative transcript abundance of the respective genes.
Figure 7. The expression patterns of the NGATHA genes in Arabidopsis (af) and tomato (gl) in different plant organs were verified by qPCR. The X-axis represents different plant organs: CL—cotyledon, RL—rosetta leaf, FL—flower, YL—young leaf, ML—mature leaf. The Y-axis represents the relative transcript abundance of the respective genes.
Ijms 23 07063 g007
Table 1. The list of NGATHA genes in Arabidopsis and Solanaceae (tomato, potato, capsicum, and tobacco) and the features of each gene and protein.
Table 1. The list of NGATHA genes in Arabidopsis and Solanaceae (tomato, potato, capsicum, and tobacco) and the features of each gene and protein.
Transcript IDGene NameChromosomeChromosome LocusStrandGene Length (bp)CDS Length (bp)Protein Length (aa)Protein Weight (kDa)pISignal Peptide
AT2G46870.1AtNGA12Chr2:19260906..19262533F162893331034.886.75Nucleus
AT3G61970.1AtNGA23Chr3:22951323..22953265F194390029934.278.94Nucleus
AT1G01030.1AtNGA31Chr1:11649..13714R2066107735840.286.24Nucleus
AT4G01500.1AtNGA44Chr4:639247..640976F1730100233338.518.52Nucleus
AT2G36080.1AtNGA-LIKE12Chr2:15148259..15151634R337673524428.436.80Nucleus
AT3G11580.1AtNGA-LIKE23Chr3:3648902..3651726R282580426730.217.33Nucleus
AT5G06250.1AtNGA-LIKE35Chr5:1891880..1894353R216284928231.646.97Nucleus
Solyc08g013700.1.1SlNGA18SL4.0ch08:3090386..3091270F88588529433.708.36Nucleus
Solyc08g013690.1.1SlNGA28SL4.0ch08:3081975..3082949F97597532436.677.72Nucleus
Solyc05g004000.1.1SlNGA35SL4.0ch05:12418..14381F1964116437842.496.50Nucleus
Solyc10g083210.2.1SlNGA-LIKE1-110SL4.0ch10:62238789..62243955F516792731636.156.59Nucleus
Solyc09g010230.2.1SlNGA-LIKE1-39SL4.0ch09:3659577..3663391F3815103834539.117.02Nucleus
PGSC0003DMP400010383StNGA18ST4.03ch08:5461993..5463392R140090330034.279.01Nucleus
PGSC0003DMP400010384StNGA28ST4.03ch08:5469187..5470896R171094231335.747.16Nucleus
PGSC0003DMP400021619StNGA38ST4.03ch08:56844094..56849780F5687122140645.646.27Nucleus
PGSC0003DMP400048918StNGA-LIKE1-110ST4.03ch10:55899618..55904610R499386718921.286.97Nucleus
PGSC0003DMP400048917StNGA-LIKE1-210ST4.03ch10:55899618..55904610R499386728832.646.39Nucleus
PGSC0003DMP400015721StNGA-LIKE1-39ST4.03ch09:2309350..2314043R2774100833537.226.81Nucleus
PGSC0003DMP400015722StNGA-LIKE1-49ST4.03ch09:2309350..2314043R2774100832536.296.52Nucleus
CA01g34800CaNGA11Pepper1.55ch01:272613133..272614266F1134113437742.487.79Nucleus
CA01g00060CaNGA31Pepper1.55ch01:132987..134420R1434143447752.496.16Nucleus
CA10g17390CaNGA-LIKE1-110Pepper1.55ch10:223101144..223101674R53153117620.316.12Nucleus
CA09g00050CaNGA-LIKE1-29Pepper1.55ch09:519882..521660F177993331035.275.91Nucleus
Nitab4.5_0000799g0050.1NtNGA1-1Nt15Nitab4.5_0000799:325787..327070F1284128442747.398.21Nucleus
Nitab4.5_0005519g0040.1NtNGA1-2--Nitab4.5_0005519:96109..97242R1134113437742.007.32Nucleus
Nitab4.5_0015619g0010.1NtNGA3-1--Nitab4.5_0015619:684..1886R1203120340044.916.36Nucleus
Nitab4.5_0012411g0010.1NtNGA3-2--Nitab4.5_0012411:19665..20864R1200120039944.716.33Nucleus
Nitab4.5_0000573g0030.1NtNGA-LIKENt10Nitab4.5_0000573:193812..195562R1751102934238.895.96Nucleus
Table 2. The list of NGATHA genes in Poaceae (O. sativa L. Japonica, B. distachyon, and S. bicolor) and the features of each gene and protein.
Table 2. The list of NGATHA genes in Poaceae (O. sativa L. Japonica, B. distachyon, and S. bicolor) and the features of each gene and protein.
Transcript IDGene NameChromosomeStartEndStrandGene Length (bp)CDS Length (bp)Protein Length (aa)Protein Weight (kD)pISignal Peptide
LOC_Os03g02900.1OsNGA1311529181154876F195993631233.7710.28Nucleus
LOC_Os02g45850.1OsNGA222793468927932831R1859123941343.948.13Chloroplast
LOC_Os04g49230.1OsNGA342936851529366456R206095131734.508.72Nucleus
LOC_Os08g06120.1OsNGA4833774693378332F86486428830.714.80Cytoplasm
LOC_Os10g39190.1OsNGA5102091709920916161R93993931333.1210.03Nucleus
LOC_Os11g05740.1OsNGA-LIKE1-11126335632631707R185784028030.967.19Nucleus
LOC_Os12g06080.1OsNGA-LIKE1-21228369952836021R185783427831.077.78Nucleus
SORBI_3001G528200SbNGA117921653379218457R476799333035.849.83Nucleus
SORBI_3004G280500SbNGA246229175762293326F1570130543445.779.08Nucleus
SORBI_3006G190400SbNGA365445842154463486R5066126342044.766.31Nucleus
SORBI_3007G047500SbNGA4747615224763508F287476225327.374.66Nucleus
SORBI_3001G313800SbNGA516009659360097417F82582527429.6210.44Nucleus
SORBI_3005G041400SbNGA-LIKE1-1538286893830488R227782527430.787.20Nucleus
SORBI_3008G041100SbNGA-LIKE1-2839742053975994R179096332135.338.64Nucleus
BRADI_1g77150v3BdNGA117375539473760932R620793631233.849.87Nucleus
BRADI_3g51840v3BdNGA235260653252612120R6309128142645.399.91Nucleus
BRADI_5g19260v3BdNGA352242452022430521R6393124241344.036.50Nucleus
BRADI_3g16500v3BdNGA431467772014679110R176065121623.825.16Nucleus
BRADI_3g32140v3BdNGA533414865534150224R193780426729.1710.45Nucleus
BRADI_4g25170v3BdNGA-LIKE1-143032751430329491F197883427731.137.34Nucleus
BRADI_4g42167v3BdNGA-LIKE1-244619423246196263R2031181227330.477.21Nucleus
Table 3. Synonymous and non-synonymous substitutions in Arabidopsis and Solanaceae members (tomato, potato, and capsicum).
Table 3. Synonymous and non-synonymous substitutions in Arabidopsis and Solanaceae members (tomato, potato, and capsicum).
Seq_1Seq_2KaKsKa/Ks
AtNGA2AtNGA10.2394631.0697870.223842
AtNGA3AtNGA20.4315932.0612620.209383
AtNGA3AtNGA10.3801091.7358170.21898
AtNGA4AtNGA20.4334252.5960130.166958
AtNGA4AtNGA10.5814861.8061790.321943
AtNGA4AtNGA30.3255080.9554950.340669
AtNGA-LIKE1AtNGA10.5254092.7170220.193377
AtNGA-LIKE1AtNGA30.5835342.7055680.215679
AtNGA-LIKE2AtNGA-LIKE10.3128771.7013280.183901
AtNGA-LIKE2AtNGA40.5826292.7283110.213549
AtNGA-LIKE2AtNGA30.5991512.2701560.263925
AtNGA-LIKE2AtNGA10.6169172.137340.288638
AtNGA-LIKE3AtNGA-LIKE10.3175191.95840.162132
AtNGA-LIKE3AtNGA-LIKE20.2788150.9975630.279496
AtNGA-LIKE3AtNGA30.6457782.2833920.282815
AtNGA-LIKE3AtNGA40.7822032.7407710.285395
AtNGA-LIKE3AtNGA10.6059382.0438460.29647
AtNGA-LIKE3AtNGA20.6488931.9393740.334589
SlNGA1SlNGA3-20.3779511.7284170.218669
SlNGA2SlNGA10.1065660.1868940.570193
SlNGA2SlNGA3-20.383811.8798750.204168
SlNGA3-1SlNGA10.4078721.6560930.246286
SlNGA3-1SlNGA20.4012011.7905130.224071
StNGA2StNGA10.0981840.2167780.452926
StNGA3StNGA20.337041.2774730.263833
StNGA3StNGA10.3214761.5660630.205276
CaNGA1CaNGA-LIKE1-10.434432.624650.165519
CaNGA3CaNGA10.2895841.438280.201341
CaNGA-LIKE1-2CaNGA-LIKE1-10.2887393.1389350.091986
Table 4. Synonymous and non-synonymous substitutions in Poaceae members (rice, sorghum, and Brachypodium).
Table 4. Synonymous and non-synonymous substitutions in Poaceae members (rice, sorghum, and Brachypodium).
Seq_1Seq_2KaKsKa/Ks
OsNGA1OsNGA20.3054940.6288930.485764
OsNGA1OsNGA-LIKE1-20.547030.8390350.651975
OsNGA1OsNGA40.5043840.6322140.797806
OsNGA2OsNGA-LIKE1-10.5649571.0406720.542877
OsNGA2OsNGA40.4537420.7168750.632945
OsNGA3OsNGA20.265930.8094530.32853
OsNGA3OsNGA50.4206660.8879460.473751
OsNGA3OsNGA10.3324060.6663170.498871
OsNGA3OsNGA-LIKE1-10.5826860.8117970.717773
OsNGA3OsNGA-LIKE1-20.6818180.7365760.925658
OsNGA4OsNGA30.5513080.8292740.664809
OsNGA4OsNGA-LIKE1-20.6094990.8033160.758729
OsNGA5OsNGA10.2577660.5114550.503987
OsNGA5OsNGA40.4751880.8574410.554194
OsNGA5OsNGA20.3895990.6785080.574199
OsNGA5OsNGA-LIKE1-20.4939710.6717410.73536
OsNGA-LIKE1-1OsNGA-LIKE1-20.1245960.4093620.304365
OsNGA-LIKE1-1OsNGA40.5763451.0954550.526124
OsNGA-LIKE1-1OsNGA10.530160.9937640.533487
OsNGA-LIKE1-1OsNGA50.6056990.7628450.794
OsNGA-LIKE1-2OsNGA20.5163860.7300130.707365
SbNGA1SbNGA50.2731970.7997840.341589
SbNGA1SbNGA-LIKE1-10.5613221.4132830.397176
SbNGA1SbNGA40.4862560.9002320.540145
SbNGA2SbNGA30.2117130.6485960.326418
SbNGA2SbNGA-LIKE1-10.69451.6822760.412833
SbNGA2SbNGA10.3573620.7335810.487147
SbNGA2SbNGA40.5335681.0838990.492267
SbNGA2SbNGA-LIKE1-22.1512210.9831692.188048
SbNGA3SbNGA10.3461820.6221910.556391
SbNGA3SbNGA-LIKE1-22.2971871.006792.281695
SbNGA4SbNGA-LIKE1-10.5786961.4947520.387152
SbNGA4SbNGA50.5045081.2600050.400402
SbNGA4SbNGA30.5249430.9341370.561955
SbNGA5SbNGA20.3547551.0710840.331211
SbNGA5SbNGA30.3502020.8383950.417705
SbNGA-LIKE1-1SbNGA30.5977741.5474760.38629
SbNGA-LIKE1-1SbNGA50.5763391.475950.390487
SbNGA-LIKE1-2SbNGA-LIKE1-12.4710221.7849321.384379
SbNGA-LIKE1-2SbNGA42.4767921.3591581.8223
SbNGA-LIKE1-2SbNGA52.5653041.1293592.271469
SbNGA-LIKE1-2SbNGA12.7990220.9993042.800971
BdNGA1BdNGA30.3461190.9557670.362138
BdNGA1BdNGA40.5144321.0058480.511441
BdNGA1BdNGA50.3775610.6298520.599444
BdNGA1BdNGA-LIKE1-10.5778210.9282360.622493
BdNGA2BdNGA10.3579560.7058820.507104
BdNGA2BdNGA-LIKE1-10.6018211.1577220.519832
BdNGA3BdNGA20.2530510.6057260.417765
BdNGA4BdNGA-LIKE1-20.5707691.4559960.392013
BdNGA4BdNGA20.4376181.0533230.415464
BdNGA4BdNGA30.4614560.9780350.47182
BdNGA4BdNGA-LIKE1-10.5186731.0801270.480196
BdNGA4BdNGA50.4774010.8657850.551408
BdNGA5BdNGA30.4111250.9379380.438328
BdNGA5BdNGA-LIKE1-20.6507761.2438350.523201
BdNGA5BdNGA20.4178420.6462290.646585
BdNGA-LIKE1-1BdNGA-LIKE1-20.1119050.6564720.170464
BdNGA-LIKE1-1BdNGA30.6102941.1098970.549865
BdNGA-LIKE1-1BdNGA50.6416990.8864950.72386
BdNGA-LIKE1-2BdNGA30.5447771.6341450.333371
BdNGA-LIKE1-2BdNGA10.6115411.299770.470499
BdNGA-LIKE1-2BdNGA20.5935261.2370820.479779
Table 5. The GO classification of the annotated NGATHA genes in Arabidopsis and tomato.
Table 5. The GO classification of the annotated NGATHA genes in Arabidopsis and tomato.
GO TermAnnotationInvolved Genes
BPGO:0006355Regulation of transcription, DNA-templatedAtNGA1, AtNGA2, AtNGA3, AtNGA4, AtNGA-LIKE2
GO:0009908Flower developmentAtNGA1, AtNGA2, AtNGA3, AtNGA4
GO:1901371Regulation of leaf morphogenesisAtNGA1, AtNGA2, AtNGA3, AtNGA4
GO:0045892Negative regulation of transcription, DNA-templatedAtNGA-LIKE1, AtNGA-LIKE3
GO:0080167Response to karrikinAtNGA-LIKE1
GO:0019760Glucosinolate metabolic processAtNGA-LIKE2
GO:0080113Regulation of seed growthAtNGA-LIKE2
GO:0010073Meristem maintenanceAtNGA-LIKE3
GO:0010358Leaf shapingAtNGA-LIKE3
MFGO:0003700DNA-binding transcription factor activityAtNGA1, AtNGA2, AtNGA3, AtNGA4, AtNGA-LIKE1, AtNGA-LIKE2, AtNGA-LIKE3
GO:0005515Protein bindingAtNGA1, AtNGA3, AtNGA4
GO:0043565Sequence-specific DNA bindingAtNGA1
GO:0003677DNA bindingAtNGA2, AtNGA3, AtNGA4, AtNGA-LIKE3
GO:0000976Transcription cis-regulatory region bindingAtNGA-LIKE1, AtNGA-LIKE2
CCGO:0005634NucleusAtNGA1, AtNGA2, AtNGA3, AtNGA4, AtNGA-LIKE1, AtNGA-LIKE2, AtNGA-LIKE3
GO:0016021Integral component of membraneAtNGA-LIKE2
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Salava, H.; Thula, S.; Sánchez, A.S.; Nodzyński, T.; Maghuly, F. Genome Wide Identification and Annotation of NGATHA Transcription Factor Family in Crop Plants. Int. J. Mol. Sci. 2022, 23, 7063. https://doi.org/10.3390/ijms23137063

AMA Style

Salava H, Thula S, Sánchez AS, Nodzyński T, Maghuly F. Genome Wide Identification and Annotation of NGATHA Transcription Factor Family in Crop Plants. International Journal of Molecular Sciences. 2022; 23(13):7063. https://doi.org/10.3390/ijms23137063

Chicago/Turabian Style

Salava, Hymavathi, Sravankumar Thula, Adrià Sans Sánchez, Tomasz Nodzyński, and Fatemeh Maghuly. 2022. "Genome Wide Identification and Annotation of NGATHA Transcription Factor Family in Crop Plants" International Journal of Molecular Sciences 23, no. 13: 7063. https://doi.org/10.3390/ijms23137063

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop