Genome-wide identification of TGAs in wheat and the two progenitors
As described in the method, a total of 38, 12, and 26 TGA transcription factors were identified in wheat (TaTGA1-TaTGA38) and its two progenitors-wild emmer (T. dicoccoides) (TdTGA1-TdTGA26) and Aegilops tauschii(AeTGA1-AeTGA12), respectively. According to the published papers, these identified genes were named based on their chromosome locations. In terms of quantity, these identified TGAs unevenly distributed in each chromosome of their genomes varies from 1 to 3 differently (Fig. 1). In wheat, chromosome group 3 possessed the most abundant TaTGAs, with each containing 3 TaTGA transcription factors, and group 1, 2 and 4 with each containing 2 TaTGA transcription factors in the chromosome. While groups 6 and 7 only contained 1 TaTGA transcription factor in each chromosome. It's different from the other cases in that A, B, and D chromosomes had the same TGA transcription numbers. There are two transcription factors on both chromosomes 5A and 5D, but one transcription factor on chromosome 5B. Compared to the sequence and location of TGAs in the wild emmer (T. dicoccoides) genome, which offered the genome B of wheat, the TaTGAs genes in the B sub-genome may be lost during evolution. This lost TGA gene in 5B may have had a redundant function with the TGA genes on 5A or 5D, which were eventually lost during evolution due to low purifying selection.
[Figure 1]
In the perspective of A, B and D sub-genomes, 13, 12 and 13 TaTGA transcription factors were identified, suggesting no significant variation in the TaTGA transcription factor abundance. The sequence characteristics of these proteins were investigated and compared (Table 1). The amino acid length ranged from 157 aa(TaTGA22) to 580 aa(TaTGA15), averaging 438 aa. In comparison, the size of TdTGAs ranged from 203 to 526 aa with an average of 371aa, and AeTGAs ranged from 111 to 477 aa with an average of 346 aa. Accordingly, the molecular weight (Mw) ranged from 32.75(TaTGA19) to 62.18(TaTGA25) kDa, 17.20 to 58.03 kDa, and 12.78 to 52.04 kDa for wheat, T. dicoccoides and A. tauschii, respectively.
[Table 1]
Tandem and segmental duplications are the most common driving forces for gene family expansions during evolution. From these identified proteins above, TaTGA proteins have longer amino acids than its two progenitors. The segmental duplications are the main event responsible for the evolution of the TaTGA gene family. Therefore, we speculate that the TGA gene families in higher plants expanded primarily because of segmental duplications.
Phylogenetic and molecular evolution analysis of TGA genes
The TGA transcription factors were well studied in the model plant Arabidopsis. To investigate the evolutionary relationships of TGA genes, we built a phylogenetic tree of these identified TGAs between species, including two model plants and two wheat progenitors. We aligned the protein sequences of all these TGAs, and a phylogenetic tree was constructed using their conserved domains. As shown in Fig. 2a, The evolutionary relationships of these TGA transcription factors were determined. The phylogenetic tree was classified into four groups, with each clade consisting of 21–59 members. 8, 3, 6, and 21 TaTGAs are clustered in clades I, II, III, and IV, respectively.
We examined the synteny between wheat and its two progenitors to determine the phylogenetic mechanisms of these TGA genes. A total of 28 and 53 orthologous gene pairs between hexaploid wheat (T. aestivum) and its two progenitors (Ae. tauschii, T. dicoccoides) were identified (Figure 2b). The collinear pairs were identified, suggesting these orthologous pairs were relatively well conserved during the evolution. Except for a few TGA genes, almost all TGA genes in its two progenitors showed synteny to A, B and D sub-genomes in hexaploid wheat. However, some orthologous gene pairs were only identified between chromosome 5B of T. dicoccoides and 5A of hexaploid wheat, but not 5B of hexaploid wheat. They combined the chromosome location and sequence similarity, possibly due to gene loss or chromosomal recombination during evolution and polyploidization.
[Figure 2]
Gene structure and conserved motifs analysis
The genes in the same subfamily showed similar gene structures. The closely related homologous TaTGA genes in the A, B, or D sub-genomes also have similar structures and encoded motifs, suggesting TGA genes were highly conserved in wheat and its progenitor during evolution. Exon-intron structure is also an essential evolutionary feature of genes, and this kind of evolution can lead to the functional diversification of genes. A total of 9 conversed motifs were identified (motifs 1-9) (Fig. 3). Motif 1 is a conserved region in all TGA proteins located in the N-terminus. It covers the bZIP domain, which is responsible for DNA binding and contains nuclear localization signals (NLS). This motif is usually rich in glutamine (Q), traditionally associated with the dimerization of proteins. Motif-3 is the typical signature of the TGA members in the C terminal, Yx2RL[RQ]ALSS[LS]W. Motif-3 and Motif-4 were also rich in Q, and Motif-4 was the Q-richest motif. The TaTGAs possess Q-rich motifs, suggesting that these proteins can interact with a variety of other proteins, making these TaTGAs have the potential to integrate various biological pathways in wheat. Motif-6 and motif-9 were highly variable among the TaTGA proteins, suggesting a wide range of functional diversity for TaTGAs.
[Figure 3]
Cis-Element Analysis of TaTGAs
The 1500bp sequence upstream of the promoter was selected to analyze and search for the regulatory element upstream of TaTGA. A single element involves multiple processes of multiple genes. Therefore, we mainly focused on the most representative genes and functions here. The results (Fig. 4) showed that, in addition to some common regulatory elements (CAAT-box, TATA-box), The promoter part of TaTGAs mainly contain elements related to light response, hormone regulation, biological and abiotic stress, injury stress and other reaction. Among them, the kinds of light-responsive elements are the most.
We pay attention to hormone regulatory elements like abscisic acid (ABRE), methyl jasmonate (CGTCA-motif, TGACG-motif), gibberellin (GARE-motif, P-box), salicylic acid (as-1, TCA-box) and biotic and abiotic stress-related regulatory elements (DRE, STRE, MBS, CCAAT-box, W-box, F-box etc.). According to the reported results, TGA transcription factors can interact with WRKY proteins to regulate plant resistance in the MeJA pathway[26]. In addition, EMSA experiments have verified that TGAs can bind ROXY proteins to form a complex to the as-1-like element[27]. These results validated that the predicted cis-acting elements are valid, which helps to identify more upstream genes regulating TGA transcription factors.
[Figure 4]
Expression profiles of TaTGAs under biotic stress
TGA transcription factors play essential roles in stress response and floral development. To further evaluate the potential functions of TaTGAs in response to biotic stress, the TaTGAs expression patterns under biotic stress were analyzed from transcriptome data based on the fragment per kilobase of transcript per million reads mapped (FPKM). The heat map showed that most TaTGA genes were expressed differently after pathogen inoculation. Results showed that TaTGAs had differential expression under different treatments. The expression of TaTGA was induced significantly by the stripe rust pathogen CYR31 infection. TaTGAs 7/13/25 were suppressed, while the other, like TaTGA8, was up-regulated under the stripe rust infection(Figure 5). Homologous genes in A, B and D sub-genomes showed the same pattern of expression, which suggested that there may be functional redundancy between them. These results indicate that TaTGAs contribute to the stripe rust resistance in wheat.
[Figure 5]
Validation of the expression of wheat TGA genes using qRT-PCR
To validate the expression patterns of these selected TaTGAs in response to biotic stresses, we collected the samples treated with stripe rust at the specified time points and performed qRT-PCR(Figure 6). The TaTGAs genes, which were significantly induced by the stripe rust, were selected to validate. TaTGA7, TaTGA8, TaTGA13, and TaTGA16 significantly upgraded after stripe rust infection. They showed several types of trends of incompatible interaction, some reached a peak at 24h and were significantly inhibited at 72h post-inoculation (hpi), and then the expression level rose again. Some genes reached the peak at 48hpi or 120hpi. The expression of TaTGA24 significantly decreases at 12hpi. The other TaTGA genes showed no obvious response to stripe rust. These results matched the results of the RNA-seq results.
[Figure 6]
Interaction analysis of TaTGAs
To better understand the biological function and the regulatory network of TaTGAs, we submitted the protein sequences to the STRING database to predict the protein-protein interaction (PPI) in the TaTGA genes family. A limited number of proteins have been reported to interact with TGA transcription factors. However, according to the extensive regulation of TGA transcription factors and the genetic structure of TaTGAs, we still found some interaction proteins worthy of attention in biotic resistance(Figure 7). As expected, some proteins interacting with TaTGAs were SA signalling pathway components, such as NPR1 or NPR1-like proteins. Combined with the previous cis-acting element analysis and published research, TaTGAs can interact with themselves to form a dimer or polymer. Different TaTGAs can also interact with each other. These kinds of binding can influence the ability of DNA binding and the affinity of other transcription factors to the promoter elements. The TaTGAs may interact with stress-related transcription factors such as WRKY. In addition, some research has already shown that different types of ROXY bind to TGA to regulate the activity of TGA transcription factors. The hexaploid wheat has a larger and more complex genome and possesses unique genes compared to other plants. Identification of new interacting proteins of TaTGAs requires the combination of Chip-seq or DAP-seq experiments, and the bioinformatics predictions alone are limited.
[Figure 7]
Transient Silencing of TaTGA13 Enhances Wheat resistance to Pst
TaTGA13, which significantly responds to stripe rust inoculation, was chosen to implement the known down experiment. Firstly, we used BSMV-VIGS (virus-induced gene silencing) system to analyze the function of the TaTGA13. At ten days post-inoculation, mosaic symptoms of the virus were observed in virus-inoculated plants, but further leaf growth was not affected. In contrast, γ-PDS inoculated plants gradually displayed photobleaching symptoms (Figure 8), suggesting that the virus-induced gene silencing assay was successful. qRT-PCR results revealed that the transcript levels of endogenous TaTGA13 of the transformants were significantly reduced during compatible interactions. After inoculation with the compatible race V26, fungal growth was slightly decreased on γ-TaTGA13 plants compared to the control γ.
[Figure 8]