RNA-seq and de novo transcriptome assembly
To obtain an overall view of the nitrogen deficiency transcriptome in the two provenances,
RNA samples were prepared from the roots and foliage of both provenances on 30 days
after nitrogen deficiency treatment. RIN values indicate that RNA quality can be used
for cDNA library construction (Additional file 2). Gene expression profiles of the
high carbon fixation Fraxinus mandshurica provenance (WC) and low carbon fixation Fraxinus mandshurica provenance(HL) seedlings roots and foliage under both normal nitrogen and nitrogen
deficiency conditions were analyzed. For the eight samples, three biological replicates
were performed in sequencing. In total, 1,235,174,984 clean reads originated from
the 24 Fraxinus mandshurica cDNA libraries were obtained following removal of adaptors and low-quality reads
(Additional file 3). The percent G+ C and fraction of bases with quality scores of
Q30 for the 24 libraries averaged 45.2% and 94.2%, respectively. These numbers showed
that the data was of high quality. Trinity software was then used to assemble the
clean reads into 88,655 unigenes with N50 length of 1259 bp. The relative length of
the assembled sequences is one standard by which a transcriptome assembly can be assessed
and the summary statistics are shown in Fig.1. In the final assemblies there were
48,754 unigenes ranging from 200 to 500 bp in length, which accounted for 55% of the
total. There were 19,220 (21.68%) and 20,681 (23.33%) unigenes from 500 to 1000 bp
and > 1000 bp in length, respectively, and assembled sequences < 200 bp were not taken
into account.
Annotation and functional classification of unigenes
The assembled sequences were used as queries in BLAST searches (E ≤ 10−5) against the NR, KOG, Swissprot and KEGG databases, and a total of 55,537 were annotated
(Fig. 2a). The majority of the unigenes (53,603; 60.5%) were annotated from the NR
database. In contrast, only 24,959 (28.2%), 35,879 (40.5%) and 42,650 (48.1%) annotated
unigenes were matched to the KEGG, KOG and Swissprot database, respectively. As shown
in Fig.2a, a total of 21,084 unigenes were annotated with all four databases. To further
utilize the transcriptome data, the KOG, GO, and KEGG databases were used to identify
the molecular processes that occur during nitrogen deficiency of Fraxinus mandshurica seedlings roots and folium. The annotated unigenes were initially classified into 25 KOG categories
with the largest group being “General function prediction only” (10,345; 28.83%) (Fig.2b).
The Gene Ontology (GO), which is an internationally standardized classification system,
was used to assign 15,827 unigenes to the three principal GO domains: “cellular component”,
“molecular function” and “biological process” (Fig.2c). In the “cellular component”
category, “cell” and “cell part” were the most abundant terms. For the category of
“molecular function”, “catalytic activity” and “binding” were the most prominent.
“Metabolic process” and “cellular process” were the most abundant terms in the “biological
process” category.
Identification of differentially expressed genes (DEGs)
The transcriptional levels were normalized using the FPKM method. Meanwhile, FDR<0.05
was used as screening thresholds to test the significance of difference in transcript
abundance. From eight comparisons, including CP1 (WCKF vs. HCKF), CP2 (WCKR vs. HCKR),
CP3 (WCKF vs. WCTF), CP4 (WCKR vs. WTR), CP5 (HCKR vs. HTR), CP6 (HCKF vs. HTF), CP7
(WTF vs. HTF), CP8 (WTR vs. HTR), a large number of differentially expressed genes
(DEGs) were identified (table1). The numbers of DEGs detected were as follows: CP1
669 (285 up- and 384 down-regulated), CP2 149 (77 and 72), CP3 305 (165 and 140),
CP4 4225 (1907 and 2318), CP5 4531 (2386 and 2145), CP6 1414 (991 and 423), CP7 141
(67 and 74), CP8 178 (44 and 134). More down-regulated DEGs were found in CP1 and
CP8, which indicated that the numbers of DEGs up-regulated in WC were 34.74% and 204.55%
greater than that in HL under normal nitrogen for foliage and nitrogen deficiency
for roots. In order to study the difference in DEGs between two provenances, the DEGs
in CP1, CP2 and CP7, CP8 were further analyzed together. Consequently, 1057 DEGs were
identified using pair-wise comparison of each accession between WC and HL under normal
condition and nitrogen deficiency (Additional file 4). 783 DEGs were identified between WC and HL under normal condition, including both
up-regulated (345) and down-regulated (438) genes (fig.3a and fig.3b). Interestingly,
DEGs in the CP1 (669) were nearly 4.5 times as much as those in the CP2 (149) (table1),
which indicated that the number of DEGs in foliage was significantly higher than that
in roots under normal nitrogen. Similarly, 300 DEGs were identified between WC and
HL under nitrogen deficiency, including both up-regulated (102) and down-regulated
(198) genes (fig.3a and fig.3b). And down-regulated DEGs in the CP8 (134) were nearly
twice as much as those in the CP7 (74) (table1), which indicated that the number of
DEGs in roots was significantly higher than that in foliage under nitrogen deficiency.
Moreover, analysis of DEGs in CP3, CP4 and CP5, CP6 can be used to identify genes
that respond to nitrogen deficiency in WC and HL, and then the differences in response
to nitrogen deficiency between the two provenances can be further analyzed. Consequently,
8173 DEGs under nitrogen deficiency were identified using pair-wise comparison of
each accession between normal and nitrogen deficiency in WC and HL (Additional file 4: Figure S2). However, there were 1959 DEGs commonly found in both HL and WC. 4447 DEGs were
identified between normal nitrogen and nitrogen deficiency in WC, including both up-regulated
(2019) and down-regulated (2431) genes (fig.3c and fig.3d). Interestingly, DEGs in
the CP4 (4225) were nearly 14 times as much as those in the CP3 (305), which indicated
that the number of DEGs in roots was significantly higher than that in foliage in
WC. Similarly, 5685 DEGs were identified between normal and nitrogen deficiency in
HL, including both up-regulated (3210) and down-regulated (2518) genes (fig.3c and
fig.3d). And DEGs in the CP5 (4531) were nearly 3.2 times as much as those in the
CP6 (1414), which indicated that the number of DEGs in roots was significantly higher
than that in foliage in HL. In addition, the transcriptional expression of DEGs in
the two provenances foliage was significantly different in response to nitrogen deficiency.
The two provenances displayed dissimilar expression patterns, in which the mount of
total (1414), up-regulated (991) and down-regulated (423) DEGs in the CP6 were 4.6
times, 6.0 times and 3.0 times as much as those (305,165 and 140) in the CP3(table1),
respectively, which indicated that the foliage of HL provenance responded to nitrogen
deficiency more strongly than that of WC provenance.
To validate the gene expression results derived from the transcriptome data, nine
DEGs related to transcription factors in CP1 and six DEGs involved in the phenylpropanoid
pathway for the biosynthesis of lignin in CP4 and CP5 were selected for qRT-PCR. The
results of experiment showed a strong correlation with the RNA-Seq data (Fig.4), which
indicates that the results of the transcriptomic analysis are reliable.
GO annotation for DEGs
Gene annotation of DEGs was using GO database. Annotated genes were grouped into three
major functional categories: cellular component, molecular function and biological
process, and then were divided into subcategories (Fig.5; Additional file 5). For
ease of presentation, data was expressed as the average of genes in each subcategory
in WCKF vs. the average in HCKF. Within the category of biological process in WCKF
vs. HCKF (CP1), genes involved in single-organism process, cellular process, metabolic
process, localization, biological regulation, response to stimulus and developmental
process were 11.32%,11.80%,13.24%, 3.03%, 3.03%, 2.87% and 1.28% of the total, respectively.
In the cellular component category, 5.10%, 4.47%, 3.83%, 3.83%, 1.28% and 1.12% genes
were related to cell, membrane, organelle, membrane part, organelle part and cell
junction, respectively. In the molecular function category, 14.99%, 9.73% and 1.75%
genes were involved in catalytic activity, binding and transporter activity, respectively
(Fig.5). Annotation for DEGs in CP2, CP3, CP4, CP5, CP6, CP7 and CP8 pairwise comparisons
was also carried out. In all three functional categories, genes belonging to the above
three groupings showed a similar distribution pattern to those of CP1. However, macromolecular
complex showed higher levels in CP3, CP4, CP5 and CP6, which suggested that macromolecular
complex might play an important role in responding to nitrogen deficiency (Fig.5;
Additional file 5).
KEGG pathway analysis for DEGs
Transcriptome sequencing can also be taken to identify a variety of metabolic processes.
In this study, a total of 1057 DEGs were confirmed in CP1, CP2, CP7 and CP8 and 8173
DEGs between normal and nitrogen deficiency were identified in CP3, CP4, CP5 and CP6.
After identifying DEGs, KEGG annotation was used for DEGs from the above comparisons.
In CP1, CP2, CP3, CP4, CP5, CP6, CP7 and CP8,126, 19, 63, 1298, 1165, 232, 21 and
30 DEGs were assigned to 73, 24, 47, 111, 117, 90, 20 and 30 pathways of the KEGG
database, respectively (Additional file 6). The most reliable significantly (p<0.05) enriched pathways of DEGs were represented as Additional file 3. Integrate
CP3, CP4, CP5 and CP6 comparisons, the major pathways involved in the mechanism for
responding to nitrogen deficiency were “Phenylpropanoid biosynthesis”, “Nitrogen metabolism”,
“Diterpenoid biosynthesis”, “Isoquinoline alkaloid biosynthesis”. In addition, in
the roots that responded most to nitrogen deficiency, “Ribosome” pathway had the highest
number of DEGs in both provenances. “Oxidative phosphorylation” and “Starch and sucrose
metabolism” pathways were specifically enriched in the roots of WC and HL, respectively.
Similarly,because only 2 DEGs in CP2 are annotated by KEGG, we used the DEGs in CP1
to analyze the difference between two provenances. In CP1, there were 18 pathways
which could be divided into eight types involving in the growth mechanism: “Carbohydrate
metabolism”, “Energy metabolism”, “Lipid metabolism”, “Amino acid metabolism”, “Metabolism
of other amino acids”, “Metabolism of terpenoids and polyketides”, “Biosynthesis of
other secondary metabolites” and “Signal transduction” (Additional file 6).
DEGs involved in carbon and energy metabolism
The expression levels of many genes involved in carbohydrate metabolism, lipid metabolism
and energy metabolism were differentially between HL and WC. For example, six genes
involved in “starch and sucrose metabolism” pathway in CP1 showed increased transcript
abundance in WC (Additional file 6, Additional file8). These genes encoded polygalacturonase
(Unigene0056877), beta-glucosidase (Unigene0064953, Unigene0055864, Unigene0054701,
Unigene0040728) and trehalose-phosphate phosphatase J(Unigene0037044) were identified
and their expressions were increased by 2.3 to 42-fold. Five DEGs involved in “Nitrogen
metabolism” pathway in CP1 showed increased transcript abundance in WC (Additional
file 6). These genes encoded carbonic anhydrase (Unigene0063117, Unigene0054409, Unigene0063118),
glutamine synthetase (Unigene0053381), high affinity nitrate transporter 2.5 (Unigene0050395)
were identified and their expressions were increased by 3.4 to 16.7-fold. Nine DEGs
involved in Lipid metabolism in CP1 showed increased transcript abundance in WC (Additional
file 6). These genes encoded lipoxygenase (Unigene0062168, Unigene0059365), 3-ketoacyl-CoA
synthase (Unigene0041866, Unigene0041868, Unigene0030198), 4-coumarate--CoA ligase
(4CL6, Unigene0050299), 3-ketoacyl-CoA thiolase 2 (Unigene0043651), Jasmonic acid
carboxyl methyltransferase (Unigene0030475), salicylic acid methyltransferase (SAMT,
Unigene0060674) were identified and their expressions were increased by 2.2 to 13.0-fold.
DEGs related to hormone signaling
A total of 71 DEGs were involved in several plant hormone signal transduction pathways
in our study, such as jasmonic acid(JA), auxin, cytikinine(CK), ethylene, salicylic
acid(SA), gibberellin(GA), brassinosteroid and abscisic acid(ABA) (Additional file
7). Among them, auxin, JA and SA pathways were predominant. According to the above analysis, many genes involved in plant hormone signal transduction
pathway showed increased transcript abundance in WC. Ten genes involved in “Plant
hormone signal transduction” and encoded Zinc-finger protein (ZIM) precursor (TIFY)
(Unigene0007822, Unigene0058809, Unigene0052515, Unigene0052516, Unigene0058810),
ethylene-responsive transcription factor 1B-like (Unigene0033087), auxin-induced protein
15A-like (Unigene0043970),auxin response factor 9-like (Unigene0064310), Auxin-responsive
GH3 family protein (Unigene0056132), histidine-containing phosphotransfer protein
4-like (Unigene0081003) were identified and their expressions were increased by 2.3
to 13.9-fold (Additional file 7).
Other differentially regulated genes
Many genes involved in pathways of “Phenylpropanoid biosynthesis”, “Glucosinolate
biosynthesis” “Terpenoid backbone biosynthesis”, “Monoterpenoid biosynthesis”, “Diterpenoid
biosynthesis” and “Cyanoamino acid metabolism” showed increased transcript abundance
in WC under normal nitrogen in CP1 (Additional file 6). For example, seven genes involved
in “Phenylpropanoid biosynthesis” pathway which encoded beta-glucosidase (Unigene0055864,
Unigene0040728, Unigene0064953 and Unigene0054701), cinnamyl alcohol dehydrogenase
6 (Unigene0045855), peroxidase 19 (Unigene0066438), caffeic acid 3-O-methyltransferase
(Unigene0061140) were identified and their expressions were increased by 2.3 to 40.8-fold
(Additional file 6). Nine genes involved in “Terpenoid backbone biosynthesis”, “Monoterpenoid
biosynthesis” and “Diterpenoid biosynthesis” pathways which encoded cytochrome P450
CYP82U4 (Unigene0059965),terpene synthase (Unigene0055252, Unigene0055251, Unigene0062513,
Unigene0009791), geraniol synthase (Unigene0059790),1-deoxy-D-xylulose-5-phosphate
reductoisomerase (Unigene0042530), heterodimeric geranylgeranyl pyrophosphate synthase
small subunit (Unigene0040100), 1-deoxy-D-xylulose-5-phosphate synthase 2 (Unigene0033472)
were identified and their expressions were increased by 2.4 to 19.7-fold (Additional
file 7). Five genes involved in “Cyanoamino acid metabolism” pathways which encoded
beta-glucosidase (Unigene0054701, Unigene0064953, Unigene0040728, Unigene0055864),
isoleucine N-monooxygenase 1-like (Unigene0035112) and their expressions were increased
by 2.3 to 40.8-fold (Additional file 6, Additional file 8).
In addition, protein kinases and transcription factors (TFs) under normal nitrogen
and nitrogen deficiency were also analyzed (Additional file 9). In CP1, CP2, CP7 and
CP8, a total of 23, 0, 8 and 0 DEGs encoding protein kinases and a total of 28, 2,
2 and 2 DEGs encoding transcription factors were found, respectively (Additional file
10 and Additional file 11). This indicated that the transcription levels of protein
kinases and transcription factors in foliage were different between the two provenances
under both nitrogen deficiency conditions and normal nitrogen. Therefore, the transcriptional
expression of protein kinases and transcription factors in foliage (CP1) was further
analyzed. A total of 23(3up- and 20down-regulated) and 28(11up- and 17down-regulated)
DEGs encoding protein kinases and transcription factors were found in CP1, respectively
(Additional file 10 and Additional file 11). Among DEGs encoding protein kinases,
16 (2 up- and 14 down-regulated) receptor-like serine/threonine-protein kinase (RLKs),
2 (0 and 2) CBL-interacting serine/threonine-protein kinase 14-like (CIPKs) and 3
(1 and 2) Serine/threonine-protein kinase (STPKs) were identified, respectively (Additional
file 10). Among DEGs encoding transcription factors, 7 (2 up- and 5 down-regulated)
AP2/ERF, 9 (3 and 6) bHLH, 3 (3 and 0) MYB, 2 (2 and 0) NAC, 2 (0 and 2) TCP and 2(0
and 2) WRKY were identified, respectively (Additional file 11, Additional file 8).
The results indicated that the protein kinases and transcription factors were more
active in WC rather than in HL.
DEGs between normal nitrogen and nitrogen deficiency in Fraxinus mandshurica
In order to study the two Fraxinus mandshurica provenances transcriptional expression difference in response to nitrogen deficiency,
DEGs between normal condition and nitrogen deficiency from roots and foliage of WC
and HL were obtained, respectively. KEGG enrichment analysis showed that the transcriptional
expression of foliage from two provenances was significantly different in response
to nitrogen deficiency. For example, the number of KEGG pathways which were significantly
enriched was 8 in WC foliage (CP3), less than that (14) in HL foliage (CP6). The pathways
with a larger number of genes enriched in WC foliage (CP3) were “Sulfur metabolism”
and “Nitrogen metabolism”, however, that in HL foliage (CP6) were “Plant-pathogen
interaction”, “Plant hormone signal transduction”, “Protein processing in endoplasmic
reticulum”, “Phenylpropanoid biosynthesis” and “Starch and sucrose metabolism” (Fig6d).
This indicated that the response patterns of WC and HL to nitrogen deficiency were
different in the foliage. Similarly, the DEGs of the roots were analyzed. For example,
the number of KEGG pathways which were significantly enriched was 8 in WC roots (CP4),
less than that (17) in HL roots (CP5). The pathways with a larger number of genes
enriched in WC roots (CP4) were “Ribosome”, “Oxidative phosphorylation”, “Phenylpropanoid
biosynthesis”, “Cyanoamino acid metabolism” and “Diterpenoid biosynthesis”, however,
that in HL roots (CP5) were “Ribosome”, “Starch and sucrose metabolism”, “Oxidative
phosphorylation” , “Plant hormone signal transduction”, “Pentose and glucuronate interconversions”,
“Tyrosine metabolism” and “Phenylpropanoid biosynthesis” (Fig.6c). This indicated
the response patterns of WC and HL to nitrogen deficiency were also different in the
roots.
Protein kinases and transcription factors (TFs) responding to nitrogen deficiency
A total of 14, 92, 132 and 152 DEGs encoding protein kinases were found in CP3, CP4,
CP5 and CP6, respectively (Additional file 10, Additional file 8). Among them, 14
(11 up- and 3 down-regulated), 55 (52 and 3), 86 (80 and 6) and 118 (115 and 3) receptor-like
kinases (RLKs) were identified. And there were 0 (0 up- and 0 down-regulated), 3 (3
and 0), 5 (4 and 1) and 5 (5 and 0) genes of CP3, CP4, CP5 and CP6 involved in the
mitogen-activated protein kinases cascade. We found 0, 1, 2 and 2 DEGs of calcium-dependent
protein kinases family, 0, 1, 2 and 1 DEGs of CBL-interacting protein kinases (CIPK)
family and 0, 1, 5 and 2 protein phosphatase 2C were all up regulated in CP3, CP4,
CP5 and CP6. Moreover, we identified 0, 12, 18 and 10 DEGs of Serine/threonine-protein
kinase (STPK) in CP3, CP4, CP5 and CP6. The details of these PK and protein phosphatase
genes are shown in Additional file 10. These results indicated that protein phosphorylation
may be induced by nitrogen deficiency in Fraxinus mandshurica. In addition, it was interesting that there were more up-regulated PK in HL(CP5)
than WC(CP4) (Fig.7a and Fig.7b).
In this study, totally 4 (1 up- and 3 down-regulated), 52 (34 and 18), 84 (63 and
21) and 60 (52 and 8) transcription factors (TF) genes were identified in CP3, CP4,
CP5 and CP6 (Additional file 11). Obviously, the number of up-regulated expressed
transcription TFs in CP5 (CP6) was greater than that of CP4 (CP3) (Fig.7c-f), which
indicated HL is stronger than WC in response to nitrogen deficiency. Generally, the
major TF families were AP2/ERF, WRKY, bHLH, MYB, MYB-related, NAC and bZIP. In the
CP4, the MYB family was the largest group (21%), followed by the AP/ERF (17%) and
bHLH (13%) family (Additional file 9: Figure S4); for CP5, the top three family were
AP/ERF (21%), bHLH (21%) and MYB (19%) (Additional file 9: Figure S5); and for CP6,
the top three were WRKY (35%), bHLH (27%) and AP/ERF (17%) (Additional file 9: Figure
S6); however, for CP3, only 4 differentially expressed transcription factor genes
were identified (Fig.7c). Of AP2/ERF genes, 3, 14 and 10 were up-regulated in CP4,
CP5 and CP6, respectively; of bHLH genes, 6, 17 and 13; of MYB genes, 6, 10 and 2;
of WRKY genes, 2, 4 and 20. The results indicated that in the response to nitrogen
deficiency, bHLH, MYB and AP2/ERF play a major role in roots, and WRKY plays a major
role in foliage (Fig.7c-f).
DEGs related to hormone signaling responding to nitrogen deficiency
In addition to the basic roles in growth and development, phytohormones are also involved
in various environmental responses, such as light, salt and drought. It has been proposed
that some hormones coordinate demand and acquisition of nitrogen. In this study, the
number of up-regulated DEGs related to hormone signaling was greater than that of
down-regulated DEGs in respond to nitrogen deficiency. Totally 0 (0 up- and 0 down-regulated),
22 (17 and 5), 40 (34 and 6) and 15 (14 and 1) DEGs were identified in CP3, CP4, CP5
and CP6 (Additional file 7). Obviously, the number of up-regulated DEGs related to
hormone signaling in CP5 (CP6) was greater than that of CP4 (CP3), which indicated
the response to nitrogen deficiency, HL is stronger than WC. Among them, auxin, ABA,
SA, JA and CK pathways were predominatly induced. In the auxin signaling pathway,
there were 8, 19 and 2 genes up-regulated in CP4, CP5 and CP6; in the SA pathway,
the number of up-regulated genes was 5, 5 and 1; in the ABA pathway, the number of
up-regulated genes was 0, 5 and 3; in the JA pathway, the number of up-regulated genes
was 0, 1 and 7; in the CK pathway, the number of up-regulated genes was 2, 5 and 0
(Additional file 7). In addition, the response patterns of DEGs related to hormones
under nitrogen deficiency in roots and foliage were different. For example, the hormones
that predominantly played a role in the foliage were JA and ABA but in the roots were
auxin, SA, CK and ABA.
DEGs related to nitrogen assimilation responding to nitrogen deficiency
Many genes involved in nitrogen absorption and assimilation were differentially expressed
under nitrogen deficiency relative to the normal condition. In the current study,
10 DEGs encoding nitrate transporters were detected (Additional file 12, Additional
file 8). Compared to normal nitrogen, 7 DEGs were up-regulated and 3 DEGs were down-regulated
under nitrogen deficiency. Whereas, the number of these DEGs in WC was less than that
in HL, and there were 2 up-regulated DEGs (Unigene0034112 and Unigene0050395) which
encoded NRT3.2 and NRT2.5 and 2 down-regulated DEGs (Unigene0046155 and Unigene0047301)
which encode NRT2.7 and NRT in both WC and HL. In addition, four DEGs (Unigene0018686,
Unigene0022313, Unigene0026134, Unigene0050965) which encoded NRT3.1 and NRT2.1 were
up-regulated and one DEG (Unigene0039451) which encoded NRT3.1 was down-regulated
only in HL, and one DEG (Unigene0071953) which encoded NRT was up-regulated in WC,
respectively. Four nitrogen deficiency responsive genes encoding ammonium transporters
(Unigene0034277, Unigene0045039, Unigene0045040, Unigene0054529) were identified (Additional
file 12), in which, Unigene0054529 was responsive to nitrogen deficiency only in HL.
Moreover, fourteen, six and four DEGs encoding amino acid, lysine histidine and oligopeptide
transporters were found (Additional file 12), respectively. There were also three,
six, two and two DEGs encoding glutamine synthetase (GS), glutamate dehydrogenase
(GDH), nitrite reductases (NiR) and nitrate reductase (NR) which were the key enzymes
in nitrate assimilation and their transcription expression were all down-regulated
except two GS unigenes (Unigene0009471, Unigene0053381).
In this study, there were also one, one, three and one DEGs encoding Dof zinc finger
protein DOF1.5-like (CDF4), Pyruvate kinase (PYK1), Ribulose-1,5-bisphosphate carboxylase
(RuBP) and neutral invertase A (INVA) which were the key enzymes in carbon assimilation
and their transcription expression were all down-regulated except one INVA unigene
(Additional file 12). Meanwhile, expression of many genes associated with absorption
or translocation of other nutrients changed under nitrogen deficiency, such as phosphate
(9), potassium (3), sulfate (5), Zinc (4), iron (3) and molybdate transporter (2),
indicating that uptake of these nutrients in Fraxinus mandshurica is affected by nitrogen metabolism under cross-talking regulation.