Identification of 56 TaSPL genes in wheat
Programs of HMMER, SMART and Interpro were adopted to identify TaSPL genes in wheat genome [56-58]. After removing the redundant sequences, total 56 presumed SBP-box genes with E-values less than 0.01 found in the wheat genome. Meanwhile, 18 AetSPL, 10 TuSPL, and 16 BdSPL genes in Aegilops tauschii Coss., Triticum urartu, and Brachypodium distachyon were respectively identified and named with reference to the OsSPL genes. The relevant information including gene name, gene accession number, amino acid sequence and amino acid length was provided in Additional file 1: Table S1.Program of ExPASy server was used to calculate the physical parameters of each TaSPL protein. As shown in Additional file 2: Table S2, Additional file 3: Table S3 and Additional file 4: Table S4, the cDNA length of TaSPL genes ranged from 1,062 (TaSPL-7-B) to 4,510 bp (TaSPL-1-B), the full-length coding sequences (CDS) ranged from 579 (TaSPL-13-A, -B, -D) to 3,390 bp (TaSPL-15-D), and the protein sequence included 192-1,129 amino acid resides. The isoelectric point ranged from 5.54 to 9.90, with the range of molecular weight, 20.11-123.69. Phylogenetic analysis of TaSPL genesTo examine the phylogenetic relationship among SPL genes of plant species, and 136 putative SPL proteins from six species were selected, including 56 from the wheat, 18 from Ae.tauschii, 10 from T. urartu, 16 from B.distachyon, 19 from rice, and remaining 17 from Arabidopsis. The full length protein sequences were used for phylogenetic analysis using the neighbour-joining (N-J) method by MEGA X [59].The 136 SPL proteins were aggregated into eight distinct groups (G1-G8) (Fig. 1). TaSPL proteins were distributed in all eight groups, containing 3, 9, 3, 17, 6, 3, 6 and 9 proteins, respectively. Groups 3-8 included short TaSPL proteins with no more than 473 (TaSPL-3-B, -3-D) amino acid residues, whereas the members of group 1 and 2 were longer and vary from 822(TaSPL-1-D)to 1,129 (TaSPL-15-B, -15-D) amino acid residues (Additional file 2: Table S2). According to phylogenetic tree analysis, some TaSPL proteins clustered closer to Ae. tauschii, T. urartu, B. distachyon and rice than Arabidopsis, indicating the majority of wheat TaSPL genes were more closely related to monocots. Thus, although the plant SBP-box genes may have originated from a common ancestor, it appears to be a unique pattern of differentiation among many family members after the separation of each lineage. Taken together, our results suggest that the TaSPL genes may have profound evolutionary origin and diversity of biological functions. Sequence alignments and SBP domain of TaSPLs in wheatThe full length protein sequences of 19 OsSPL and 56 TaSPL proteins were used for multiple sequence alignment, and the ClustalW2.0 software was selected to determine the domain structures of the TaSPL proteins in detail [60]. The results showed that there was only one very conservative SBP domain, containing approximately 78 amino acid residues, found to be shared by all TaSPL and OsSPL proteins (Additional file 5: Figure S1). Sequence alignments of the SBP domain showed that zinc-binding sites, zinc finger 1 and zinc finger 2, located in SBP domain of all TaSPL proteins. A conservative NLS appeared at the C-terminus of the SBP domain. The zinc finger 2 partially overlapped with the NLS (Additional file 5: Figure S1a), as previously reported [5]. For 56 TaSPL proteins, the type of zinc finger 1 in the SBP domain was Cys-Cys-Cys-His (CCCH), except for TaSPL-9-A, -9-B and -9-D with Cys-Cys-Cys-Cys (CCCC), and the zinc finger 2 of all the proteins is Cys-Cys-His-Cys (CCHC) (Additional file 5: Figure S1b).
Analysis of gene structure and conservative motif
For gene structure, number and distribution of introns and exons were performed in 56 TaSPL genes (Fig. 2a). The results suggested that introns were existed in 56 TaSPL genes and the number of exons varies from 2 (TaSPL-13-A, -B, -D) to 12 (TaSPL-6-D). Most TaSPL genes harboured a similar gene structure in the same group. Such as, genes for each group (1, 3, 4, 5, 6, 7 and 8) were made up of equal number of introns or exons but with different intron or exon lengths, whereas those within most of group 2 consisted of 11 exons, except that TaSPL-15 contained only 10 exons, and TaSPL-6-D had 12 exons.
The conservative motifs among the TaSPL proteins were examined using MEME motif program. Ten motifs were identified, and the distributions and sequences of these motifs in TaSPL proteins were showed in Fig. 2b-c. Motif 1 and motif 2, which together constituted the SBP-domain, were found in all TaSPL proteins. Protein sequence alignment (Additional file 5: Figure S1b) supported the result, which showed that the SBP domain was highly conserved in SPL proteins, consistent with previous reports [12]. In addition to the conservative SBP domain, other motifs could also play important role in the functions of TaSPL genes [12]. For example, motif 6 was encoded by a conservative sequence that was complementary to the mature sequence of miR156 (Fig. 2c). Some motifs (5, 6, 7, 8, 9 and 10) were present in most TaSPL proteins, while motifs (3 and 4) only were present in fewer TaSPL proteins, and these motifs in gene functions need further study.
Chromosome localization, segmentally duplication and syntenic analysis
Corresponding maps were constructed for the chromosomal localization of 56 TaSPL genes (Fig. 3). A total of 56 TaSPL genes were distributed on 19 wheat chromosomes except for 4B and 4D, which 19, 18 and 19 TaSPL genes were located in the A, B and D sub-genomes, respectively. The wheat chromosome 7 was the most abundant, containing 23 of the 56 TaSPL genes, followed by chromosome 6. By contrast, only one TaSPL gene was found on chromosome 4A, which contained the minimum number of TaSPL genes (Fig. 3a). The relative positions of the 56 TaSPL genes on the chromosomes were shown in Fig. 3b.Notably, some genes were tightly distributed across wheat chromosomes, for example, TaSPL22-A and 21-A, TaSPL20-A and 10-A genes. Within this type of genes, the gene structures and motifs were quite similar and conservative (Fig. 2). Therefore, we also examined gene segmental duplication events to explore the expansion patterns of the TaSPL genes in wheat. In this study, there are 63 gene pairs identified cross on 21 chromosomes (Additional file 6: Table S5 and Fig. 4). In addition, by calculating the ratio of nonsynonymous substitutions (Ka) to synonymous substitutions (Ks) of each gene pair, the selection constraints of duplicated TaSPL genes were investigated. In general, duplicated genes with a high Ka/Ks (>1) are indicated to be evolving under positive selection, Ka/Ks =1 manifested neutral selection, while Ka/Ks (<1) indicated negative or purifying selection [61]. The values of Ka/Ks for the 63 TaSPL gene pairs were shown in Additional file 6: Table S5. The results showed that all of the calculated genes pair values were Ka/Ks (<1), and the ratio of segmental duplications ranged from 0.05 (TaSPL-8-B/TaSPL-8-D) to 0.62 (TaSPL-22-A/TaSPL-22-D), which implied that TaSPL genes have mainly undergone purifying selection or negative selection after the segmental duplication events with limited functional divergence.Subsequently, to understand the original, evolutionary history, and supposed function of the TaSPL genes, a syntenic map were made between wheat and rice. The 55 gene pairs syntenic relationships between 19 OsSPL and 56 TaSPL genes were analysed (Additional file 7: Table S6 and Fig 5). The results indicated that some TaSPL genes and their OsSPL counterparts appeared to be derived from a common ancestor, suggesting that the function of some TaSPL genes could be inferred from their OsSPL homolog, which could facilitate research on the roles of the TaSPL genes in wheat.
The miR156 may regulate the expression of TaSPL genes
According to the available annotation information, the 11 OsSPL genes and 10 AtSPL genes are targets by miR156 respectively, and the complementary sites of miR156 are located in the coding regions or 3' UTRs of SPL genes in rice and Arabidopsis [12, 62]. To understand the post-transcriptional regulation of miR156-mediated TaSPL genes in wheat, miRBase database was used to predict putative members of the miR156 in wheat. Tae-miR156 (MI0016450) precursor in the public database and the psRNATarget server can be used to search for targets of miR156 according to cDNA region. The results showed that 28 TaSPL genes had targets for miR156, which divided into six groups (I-VI), according to the different complementary sequences existing on the target genes (Fig. 6). Group I contained 15 TaSPL genes, the mature sequence of tae-miR156 was completely complementary to target sites in the TaSPL genes, while other groups showed different degree of nucleotide mismatch. Interestingly, there were 11 target sites that were conserved across 28 TaSPL genes, showing that a functional studies of these genes could be performed by mutating miR156 target sites.
The miR156-target sites of 24 TaSPL genes were located in the coding region, and the other four TaSPL genes target sites were existed in the 3' UTR, near the termination codon (Additional file 8: Table S7 and Additional file 9: Figure S2), almost the same to miR156 targets of SPL genes in Arabidopsis and rice. The results indicated that miR156-mediated regulation of the SPL genes were conserved in diverse plant species. Further analysis of SPL genes in wheat showed the miR156-targeted SPL genes were distributed in six subgroups (G3, G4, G5, G6, G7 and G8) but except other two subgroups (G1 and G2).
The multiple alignments of the miR156 mature sequences from wheat, Ae. tauschii, B. distachyon, rice and Arabidopsis, revealed few polymorphism of nucleotides existed in the complementary region of the mature miR156 sequence, indicating the precursor sequences of miR156 family is relatively conserved in various plant species (Additional file 10: Table S8 and Fig. 7).
Tissue-specific expression analysis TaSPL genes
Although the TaSPL genes have different gene size, structure, location and segmental duplication, their functions in wheat remain unclear. The expression patterns of TaSPL genes were analysed by RNA-seq data (Fig. 8). The results showed that 56 TaSPL genes exhibited an extensive range of expression in different tissues, while homologous genes across the A, B, and D subgenomes revealed similar expression patterns. Remarkably, TaSPL13 and TaSPL15 were highly expressed in spikes, suggesting that they might be regulate spike development, consistent with our previous study. In addition to the expression patterns of the six genes previously analysed, we randomly selected another eight genes for tissue expression analysis by qRT-PCR method in this study [63].
The tissues-specific expression profiles of eight TaSPL genes were examined by qRT-PCR in different vegetative and reproductive tissues and organs. Our results showed that the transcript levels of eight TaSPL genes were varied greatly (Fig. 9). At the vegetative growth stage (Z00-Z49), the eight TaSPL genes were constitutively expressed in all detected tissues, while TaSPL-2 exhibited a high level of expression in root, which was associated with RNA-seq data showing that TaSPL-2 was involved in drought stress (Additional file 11: Figure S3a). The expression level of TaSPL-3 was highly detected in tiller base and axillary bud (Z20-Z29), implying their specific roles in tiller base and axillary bud development. TaSPL-4 was expressed at high levels in coleoptile and tiller base (Z00-Z09, Z20-Z29), indicating the may be involvement in coleoptile and tiller base development. TaSPL-6 was highly expressed in root and axillary bud (Z10-Z19, Z20-Z29), which might be involve in root and axillary bud development. TaSPL-8 exhibit relatively high expression levels in radicle, tiller base and ligule (Z00-Z09, Z20-Z29 and Z40-Z49). TaSPL-10 was highly expressed in coleoptile and axillary bud (Z00-Z09, Z20-Z29), TaSPL-14 at high levels in coleoptile and flag leaf (Z00-Z09, Z30-Z39) and TaSPL-18 at high levels at coleoptile and node (Z00-Z09, Z30-Z39). At the reproductive stage, TaSPL-2, -3, -4, -6, -10, -14 and -18 genes had high expression level during spike developmental stage (Z50-Z69), while the transcript of TaSPL-8 exhibit low expression level. Besides, TaSPL-2, -8 and -18 genes showed relatively higher expression levels in the grains (Z70-Z79), except for TaSPL-3, -4, -6, -10 and -14. These results indicated that some TaSPL genes might be involved in the development of axillary bud, spike and grain, and provide a theoretical basis for our future research on the functional mechanism of genes.
Expression of TaSPL genes of wheat under abiotic stresses and plant hormones
To better understand the responses of TaSPL genes to various stresses, RNA-seq data of three abiotic stresses (drought, heat, and cold) and two biotic stresses (powdery mildew pathogen and stripe rust pathogen infection) treatments were acquired and the results were performed by row clustering (Additional file 11: Figure S3). The expression of TaSPL-2-B, -2-D and -6-D genes were significantly up-regulated after 1 h drought treatment (Additional file 10: Figure S3a and Additional file 12: Table S9a). Meanwhile, TaSPL-2-A, -2-B and -2-D genes were also up-regulated after 6 h of combined drought and heat stresses (Additional file 11: Figure S3a and Additional file 12: Table S9c). There are seven TaSPL genes may be respond to cold stress (Additional file 11: Figure S3b and Additional file 12: Table S9d). Besides, TaSPL-6-A, -6-D and -15-B genes were distinctly up-regulated with prolonged injection of powdery mildew pathogen and stripe rust pathogen from 24 h to 72 h (Additional file 11: Figure S3c and Additional file 12: Table S9e).
Previous studies have shown that SPL genes could response to drought [39], heat [64], auxin (IAA) and brassinolide signal pathways [55], biotic and abiotic tresses [37, 39]. To examine the effects of different abiotic stresses and plant hormone on TaSPL gene expression, wheat cultivar Chinese Spring seedlings were treated with drought, PEG6000, NaCl, Cold, ABA, IAA, GA, GR24, MeJA and BR, and the qRT-PCR method also used to investigate the expression levels of the eight TaSPL genes (Fig. 10).
Under drought treatment, the expression of these eight genes was up-regulated or down-regulated at different time points. TaSPL-6 was down-regulated during the whole treatment period, and TaSPL-10 was up-regulated by more than seven-fold after 3 h treatment. After PEG treatment, the expression of TaSPL-2, -4, -6, -8 and -18 genes peaked at 1 h, TaSPL-10 peaked at 12 h, while TaSPL-3 was down-regulated at all treatment time points. Under NaCl stress treatment, the expression level of TaSPL-2 was up-regulated, TaSPL-3, -4 and -14 genes were significantly down-regulated throughout the treatment time, and the expression levels of TaSPL-6 and -10 genes peaked at 1 h. After cold stress treatment, the expression level TaSPL-2 was up-regulated distinctly at different time points, the expression of TaSPL-4, -6, -10, -14 and -18 genes peaked at 1h, while TaSPL-8 peaked at 24 h.
Under ABA treatment, the expression levels of these eight genes showed a trend of up-regulation or down-regulation at the whole time point of treatment, the expression of TaSPL-2, -3, -8 and -18 genes peaked at 3 h, TaSPL-4, -6 and -14 genes peaked at 6 h, and TaSPL-10 peaked at 1 h. After IAA treatment, the expression levels of the TaSPL-2, -3, -4, -6, -14 and -18 genes were distinctly up-regulated at the whole treatment process, more remarkably for TaSPL-2 and TaSPL-18 and TaSPL-10 peaked at 12 h. Under GA treatment, the expression levels of the TaSPL-3 and -6 genes were down-regulated, TaSPL-10 was obviously up-regulated at each point in time of treatments, and TaSPL-2 and -4 genes peaked at 24 h, TaSPL-8, -14 and -18 genes peaked at 12 h. Under GR24 treatment, the expression levels of the TaSPL-2, -3, -6 and -10 genes were up-regulated at each treatment time points, and TaSPL-4 and -8 genes peaked at 12 h, TaSPL-14 and -18 genes peaked at 24 h. Under MeJA treatment, TaSPL-3, -8 and -18 were down-regulated during the whole treatment time points, TaSPL-2 and -10 peaked at 3 h, TaSPL-4 peaked at 1h, and TaSPL-6 and -14 genes peaked at 6 h. Under BR treatment, the expression levels of the TaSPL-3 and -4 genes were up-regulated at the each time point of treatment, and TaSPL-2 and -14 peaked at 12 h, TaSPL-6 peaked at 3 h and TaSPL-8, -10 and -18 genes peaked at 24 h. These results showed that some TaSPL genes could respond to at least one hormone and participate in the abiotic stress process. For example, the TaSPL-3 was up-regulated under IAA treatment and down-regulated under GA treatment. TaSPL-10 was up-regulated under GA and GR24 treatments.