Determination of drought tolerance in linseed varieties
In this study, we measured three drought-tolerant related phenotypic traits of Z141 and NY-17 (Supplementary Table S1). The result showed that Z141 consistently performed better than NY-17 under drought stress (Fig. 2a-d). In addition, we observed that under drought stress Z141 had lower plant height and biomass reduction rate compared with NY-17 under drought stress (Fig. 2e, f; Supplementary Table S2). The biomass reduction rate under drought stress was 30% and 46% in Z141 and NY-17 respectively. The LAWC and LRWC which are key components for drought-tolerance were significantly higher in Z141 than NY-17 under drought stress, thus suggesting Z141 can hold more water when it under drought stress. (Fig. 2g, h; Supplementary Table S3)
Table 1
Two-way ANOVAs to test the effects of different drought stress (Fixed effect), two linseed biotypes (random effects), and their interaction on plant height, biomass, leaf absolute water content (LAWC) and leaf relative water content (LRWC).
Trait | Drought | Linseed | D x L |
df | F | p | df | F | p | df | F | p |
Plant height | 1 | 709.13 | 0.000 | 1 | 479.90 | 0.000 | 1 | 26.36 | 0.001 |
Biomass | 1 | 108.03 | 0.000 | 1 | 302.29 | 0.000 | 1 | 41.29 | 0.000 |
LAWC | 1 | 314.44 | 0.000 | 1 | 1.27 | 0.292 | 1 | 36.88 | 0.000 |
LRWC | 1 | 949.63 | 0.000 | 1 | 5.22 | 0.05 | 1 | 6.48 | 0.03 |
The drought (D) had two levels (drought stress or non-drought stress) and linseed biotype (L) had two levels too. |
Two-way ANOVA result showed significant effects of the different varieties, different drought level treatments and their effects on plant height, biomass LAWC and LRWC (Table 1). NY-17 did not reveal any significant evidence of drought tolerance.
Analysis of linseed transcriptome by PacBio Iso-seq
The total RNA of Z141 and NY-17 was isolated from the control, DS, RW and RD treatments groups and quality checked. A total of 16 RNA samples were sent to Wuhan Frasergen Bioinformatics Co.,Ltd. Genomic Service for sequencing using PacBio Sequel platform. This platform can generate sufficiently longer read lengths which covers most RNA transcripts in full length thus ensuring accurate reconstructed full-length (FL) splice variants are obtained. Over 2 million polymerase reads with mean length of ~30,000bp were generated after quality checking by Frasergen (Supplementary Table S5). After processing raw data we obtained more than 33 million filtered subreads with mean length ~2000bp (Supplementary Table S6). In addition we obtained 1,599,415 CCS reads which included 1,293,134 FL reads (Supplementary Table S7). De novo transcriptome reconstruction of the data was done using RNA-Seq reads and publically available flax sequences. To evaluate the density and length of isoforms, we compared the loci coverage of PacBio FLNC and swine SSC 10.2 annotation. In PacBio data set, a total of 1,093,282 high-quality FLNCs covered 108,579 isoforms and were allocated to 28,686 loci (Supplementary Table S9). Due to high base error of SMRT sequencing technology, a high-quality Illumina short reads was done using Provverad software to correct the error (Supplementary Table S10). In this study the before and after correction FLNC sequences were aligned to linseed genome sequence respectively through GMAP, finally we obtained 1,093,282 high-quality FLNC for further study (Supplementary Table S8).
Global comparisons of DS and RD related transcriptomes reveal their gene expression and functional groups difference
The mRNA populations were compared using principal component analysis (PCA) to provide a framework for understanding how linseed genes are regulated to respond to the drought stress. Transcriptomes of Z141 and NY-17 under DS, RW or RD were likely to share a great similarity in gene expression respectively, which variations forming three groups that were far deviated from the control (Figure 3a). The transcriptomes of DS exhibited distinct relationship from that of RD, suggesting the gene expression of transcriptome has a major shift between DS and RD.
Cluster analysis of differentially expressed genes further supported our observed results in PCA (Figure 3b). The ratio of up- or down-regulated genes overlap between Z141-RD and NY-17-RD was significantly higher than that of Z141-DS and Z141-RD, with 62.1% compared to 47.8 and 70.7% compared to 60.6% respectively (Figure 3c, d). In addition, in Z141 and NY-17 approximately 52.2% and 65.6% of up-regulated genes were uniquely responsive to RD respectively, 29.9% and 43.6% of up-regulated genes were uniquely responsive to DS. Specifically, in Z141 and NY-17 has 8005 (including 3245 for DS and 4760 for RD) and 6285 (including 2381 for DS and 3904 for RD) genes up-regulated under drought stress respectively. About 9104 (including 4041 for DS and 5063 for RD) and 7908 (3515 for DS and 4393 for RD) genes were down-regulated under drought stress in Z141 and NY-17. We also observed a higher proportion of stress responsive genes under RD compared to that under DS. In this study, 2275, 1343, 3067 and 2154 genes were up- and down-regulated when Z141 or NY-17 under drought stress respectively. Thereinto, 1007 and 1686 genes were significantly up- and down-regulated when Z141 and NY-17 under DS and RD (Figure 3c, d). Taken together, this result suggest that the transcriptomes of DS and RD has fundamentally different.
Gene Ontology (GO) enrichment analysis was conducted to examine the functional distribution of the drought stress-related candidate genes identified in our study. A serial of GO categories exhibited significantly higher enrichments in the overlapped or unique up-regulated gene sets under DS, and RD treatments compared to the background. The GO terms of overlapped up-regulated genes between DS and RD in Z141 and NY-17 were mainly enriched in “proline biosynthetic process (GO: 0006561)” and “proline metabolic process (GO: 0006560)” (Figure 4a, b). Moreover, except for the amino acids biosynthesis and metabolism, more abiotic stress related GO terms e.g. “response to stress (GO: 0009650)” and “response to desiccation (GO: 0009269)” exhibited significant enrichment in Z141 up-regulated genes (Fig. 4A). Interestingly, GO terms related to flower development (GO: 0009908) were only significantly enriched in Z141 up-regulated genes which indicates that the drought avoidance mechanism of Z141 has been activated (Figure 4a). Drought stress inhibits plant photosynthesis, under DS and RD, in this study we observed the GO terms of photosynthesis (GO: 0015979) significantly enriched in down-regulated genes in Z141 and NY-17 (Supplementary Figure S1 a, b). Although the molecular mechanism between proline and plant drought tolerance is still unproven, our results suggest that proline may play an important role in drought tolerance of linseed.
The difference of linseed gene regulation pattern under DS and RD, suggest that under repeated drought stress linseed may have different molecular mechanisms for drought tolerance. Of the stress responsive GO terms, two distinct functional categories of DS specifically up-regulated genes inZ141 exhibited significantly higher enrichments, namely methylation and negative regulation. The first group included “histone H3-K36 demethylation (GO: 0070544)” and “macromolecule methylation (GO: 0043414)”, whereas the second group contained “negative regulation of biological process (GO: 0048519)” and “negative regulation of macromolecule metabolic process (GO: 0010605)” (Supplementary Figure S1c). The GO terms of up-regulated genes in Z141 under RD mainly enriched in “fatty acid oxidation (GO: 0019395)”, “fatty acid biosynthetic process (GO: 0006633)”, “fatty acid m metabolic process (GO: 0006631)” and “lipid metabolic process (GO: 0006629)” (Supplementary Figure S1d). The GO terms of unique down-regulated genes in Z141 under DS were mainly enriched in “carbohydrate metabolic process (GO: 0005975)”, “lignin biosynthetic process (GO:0009809)” and “lignin metabolic process (GO:0009808)”, whereas under RD the GO terms of unique down-regulated genes in Z141 were mainly enriched in “amide biosynthetic process (GO:0043604)” and “cellular amide metabolic process (GO:0043603)” (Supplementary Figure S1e, f). Overall, these functional categories indicated that epigenetic modifications might play a crucial role in the DS responsive process, although the exact functions of these genes remain to be elucidated. Meanwhile, drought stress may induced Z141 growth state from vegetative growth to reproductive growth.
The GO terms of unique up-regulated genes in NY-17 under DS were mainly enriched in RNA regulation including “RNA modification (GO: 0009451)”, “RNA processing (GO: 0006396)” and “ncRNA processing (GO: 0034470)”. Under RD, the GO terms of unique up-regulated genes were mainly enriched in “transmembrane transport (GO: 0055085)” (Supplementary Figure S1g, h). The GO terms of down-regulated genes in NY-17 under DS were mainly enriched in flavonoid biosynthetic (GO: 0009813). Interestingly, The GO terms of down-regulated genes in NY-17 under RD were similarity with that in Z141 mainly enriched in “amide biosynthetic process (GO: 0043604)” and “cellular amide metabolic process (GO: 0043603)” (Supplementary Figure S1i, j).
Comparison of Z141 and NY-17 related transcriptomes reveals molecular mechanism of linseed drought tolerance
Although the transcriptomes of Z141 and NY-17 are very similar in overall gene expression, a set of stress responsive genes exhibited altered expression patterns specific to Z141 or NY-17 under drought stress, indicating distinguished functional categories could be impact on the drought tolerance of linseed. The GO terms of overlapped up-regulated genes between Z141 and NY-17 under DS were mainly enriched in two distinct functional categories, including proline biosynthesis and reproductive development. The proline biosynthesis category included “proline biosynthetic process (GO: 0006561)”, “proline metabolic process (GO: 0006560)”, “glutamine family amino acid biosynthetic process (GO: 0009084)” and “glutamine family amino acid metabolic process (GO:0009064)”, whereas the abiotic stress response category contained “reproductive system development (GO: 0061458) and “reproductive structure development (GO:0048608)” (Figure 5a). Under RD treatment, the GO terms of overlapped up-regulated genes between Z141 and NY-17 were also mainly enriched in proline biosynthesis category with “proline biosynthetic process (GO: 0006561)” and “proline metabolic process (GO: 0006560)” , and in abiotic stress response category with “response to abscisic acid (GO: 0009737)”, “response to desiccation (GO: 00009269)”, “response to acid chemical (GO: 0001101)” (Figure 5b). Without suspense, The GO terms of overlapped down-regulated genes between Z141 and NY-17 under DS and RD were mainly enriched in functional categories related to photosynthesis (Supplementary Figure S2a, b)
The GO terms for unique up-regulated genes in Z141 under DS were mainly enriched in “abscission (GO: 0009838)”, “defense response (GO: 0006952)” and “NADP biosynthetic process (GO: 0006741)” (Supplementary Figure S2c), whereas under RD, the GO terms were mainly enriched in “jasmonic acid biosynthetic process (GO: 0009695)” and “jasmonic acid metabolic process (GO: 0009694)” (Supplementary Figure S2d). The unique up-regulated genes showed more enrichment on pathways which closely related to plant drought resistance such as jamonic acid biosynthesis, abscission and NADP biosynthetic [30, 31]. In contrast, in NY-17 under DS the GO terms for unique up-regulated genes were mainly enriched in RNA regulation functional category with “ncRNA metabolic process (GO: 0034660)”, “ncRNA processing (GO: 0034470)”, “tRNA processing (GO: 0008033)” and so on (Supplementary Figure S2g). Under RD, the GO terms for unique up-regulated genes in NY-17 were mainly enriched in “phenylpropanoid biosynthetic process (GO: 0009699)” and “phenylpropanoid metabolic process (GO: 0009698)” (Supplementary Figure S2h).
Identification of temporarily up- and down-regulated transcription factors (TFs) in response to DS and RD
TFs have been proven to play irreplaceable roles in response to various abiotic stresses by modulating target gene expression [32]. To understand the essence of regulatory processes during DS and RD treatment, a domain searching method [33] was used to first predict transcription factors in Z141 and NY-17 on a whole-genome scale based on our identified non-redundant linseed unigenes. A total of 4,936 linseed TF genes distributed among 50 families were identified (Supplementary Table S11).
To profile stress-responsive TFome under DS and RD, we focused on TF genes exhibiting diverse expression patterns with stress changes, including continuous up-regulation, continuous down-regulation an early peak of expression and the late peak of expression patterns. AS a result, 1190 TFs distributed in 50 families were found to be differentially regulated in response to at least one stress. (Fold change ≥ 2 and FDR adjusted p < 0.01). Among which, eleven TF families accounted for approximately half of stress-responsive TF genes, including bHLH (9%), C2H2 (8%), NAC (8%), MYB (6%), ERF (6%), bZIP (5%), WRKY (5%) and MYB-related (4%) (Figure 6a)
Moreover, the 1190 TFs were further classified into 15 clusters according to their expression patterns by performing Mfuzz program analysis in R software [34]. Cluster5, 8,11 and 13 consists of 387 TFs mainly up-regulated by DS and RD, including DREB, HSF and NF-YA10 which have been confirmed to be key regulator of plant abiotic resistance pathways [35-37] (Figure 6b and Supplementary Table S11).
Validation of isoforms by RT-PCR
Expression analysis of differentially expressed functional candidate genes, associated with DNA repair, MAPK signaling pathway, proline biosynthetic, and photosynthesis, selected from transcriptome data, and validated by RT-PCR. The results (Figure 7a-d) demonstrated that transcript abundances of selected genes were consistent with transcriptome analysis thereby validating the reliability of our annotated transcriptome data for future study.