Profiling of cinnamaldehyde in different tissues of C. cassia
In order to investigate the accumulation of cinnamaldehyde in various tissues of C. cassia, HPLC was utilized to analyze the methanol extracts obtained from leaf buds, young leaves, mature leaves, branch barks, and barks. Our findings revealed that barks exhibited the highest levels of cinnamaldehyde accumulation, while branch barks demonstrated the second highest levels (Fig. 1). These levels were found to be twice as high compared to those detected in young leaves, mature leaves, and leaf buds. These results suggest that the expression of cinnamaldehyde biosynthetic genes differs among these tissues.
Sequencing and de novo assembly of the C. cassia transcriptome
In order to identify key genes associated with cinnamaldehyde production, a total of fifteen RNA-seq libraries were constructed from tissues including young leaves, mature leaves, leaf buds, branch barks, and barks. Subsequently, these libraries were sequenced using the Illumina platform. De novo assembly of the obtained data resulted in 21,456,754 transcripts, with an average length of 48 bp. These transcripts were further assembled into 249,510 unigenes, possessing an average length of 467.33 bp and an N50 of 738 bp (Table 1). Although the N50 value obtained in this study was slightly lower than the previously reported N50 of 1248 bp (Gao et al. 2020), it sufficiently meets the requirements for the discovery of metabolite-related genes associated with cinnamaldehyde production.
Table 1
Throughput and quality of RNA-seq.
Type | Sequences | Bases | Min | Max | Average | N50 | (A + T) % | (C + G) % |
All_Contigs | 21,456,754 | 1.25E + 09 | 25 | 16889 | 58.23 | 48 | 54.67 | 45.33 |
All_Unigenes | 249,510 | 1.17E + 08 | 201 | 15761 | 467.33 | 738 | 56.34 | 43.66 |
Functional annotation of C.cassia unigenes
C. cassia unigenes were subjected to BLAST searches against five public databases to obtain annotations. Among the entire set of unigenes, 61,849 were successfully annotated in the GO database, 62,805 in the Swiss-Prot database, 89,467 in the Nr database, 49,232 in the COG database, and 13,441 in the KEGG database (Fig. 2).
Firstly, among the biological process subcategories in the GO annotation, the subcategory with the second highest number of annotated unigenes was metabolic process (24,620). Additionally, within the molecular function subcategories, the most frequently matched subcategory was catalytic activity (33,124) (SFig. 1).
Secondly, in the COG annotation, the largest category observed was general function prediction only (6,486). Moreover, this study specifically focused on the categories of secondary metabolite biosynthesis, transport, and catabolism, with 2,188 unigenes being identified in the COG database. This finding highlights the complexity of secondary metabolism in C. cassia.
Thirdly, Blastx results for C. cassia were utilized to assign associated KEGG pathways to all unigenes, resulting in 13,441 unigenes being assigned to 148 pathways. Key secondary metabolic pathways related to phenylpropanoid biosynthesis were also assigned to 203 unigenes (SFig. 3).
In conclusion, the gene annotations obtained in this study sufficiently meet the requirements for the discovery of metabolite-related genes associated with cinnamaldehyde production.
Putative genes involved in cinnamaldehyde biosynthetic pathway and sequence analysis of CcCCR1
In this study, a heat map depicting the expression profiles of genes involved in the phenylalanine to cinnamaldehyde conversion pathway was generated using RPKM reads of unigenes across various tissues. Figure 3A presents the putative pathway of cinnamaldehyde biosynthesis. The RPKM value of CcCCR1, which is the first gene in the cinnamaldehyde pathway, showed higher expression in the branch bark and bark compared to other tissues (Fig. 3A). For phylogenetic tree analysis, CCRs (cinnamoyl-CoA reductase) from monocot and dicot plants, including CcCCR1, were utilized. CcCCR1 was categorized into the dicot subfamily along with other CCRs from different plant species (Fig. 3B). Sequence alignment of CcCCR1 revealed the presence of the KNWYCYGK motif responsible for CoA-binding and NADPH binding, whereas other CCRs do not possess this motif. These two motifs have been reported to exist in all confirmed CCR sequences (Lacombe et al. 1997) (Fig. 3C), indicating that CcCCR1 plays a pivotal role in the cinnamaldehyde biosynthetic pathway.
Isolation and functional characterization of the CcCCR1 gene involved in the synthesis of cinnamaldehyde in C. cassia
The full-length cDNA of CcCCR1 was identified to be 1581bp in length, which included a 993 bp open reading frame (ORF) (GenBank ID: OR416486). The ORF encoded a protein consisting of 330 amino acids, with a calculated molecular weight of 36.3 kDa and an isoelectric point of 6.40. To investigate the biochemical functions of CcCCR1, the cDNA containing the 993-bp ORF was successfully cloned. Recombinant CcCCR1 protein production was achieved using an His-tagged vector (pET-30a). This vector facilitated the expression of soluble proteins in Rosetta (DE3) cells. Subsequently, the proteins were purified via Ni affinity chromatography using Dextrin Beads 6FF. On SDS-PAGE, the CcCCR1 protein displayed a molecular mass of approximately 42.9 kDa (Fig. 4A). Upon incubation of the recombinant CcCCR1 protein with cinnamoyl-CoA, HPLC analysis revealed that the protein effectively converted the substrate cinnamoyl-CoA into the corresponding product cinnamaldehyde (Fig. 4B).
Enzymatic kinetic parameters
The spectrophotometric NADPH consumption assays were conducted to determine the kinetic parameters of the recombinant proteins CcCCR1 and AtCCR1. It was observed that both CcCCR1 and AtCCR1 had a similar affinity towards cinnamoyl-CoA. However, CcCCR1 exhibited an approximately 17-fold higher catalytic efficiency (kcat/Km) compared to that of AtCCR1. This finding suggests that CcCCR1 possesses superior catalytic activity and holds promise as a more favorable enzyme for future engineering of biosynthetic pathways targeting cinnamaldehyde production. Detailed results can be found in Table 2 and Supplementary Fig. S4.
Table 2
Kinetic parameters of AtCCR1 and CcCCR1.
Protein ID | Vmax (nkat mg− 1) | Km(µM) | kcat(min− 1) | Km/kcat(µM− 1 min− 1) |
AtCCR1 | 0.498617985 | 248.2 | 1.283186 | 0.00517 |
CcCCR1 | 6.63287069 | 245.5 | 17.66643 | 0.071961 |
Molecular docking
To elucidate the catalytic activities of AtCCR1 and CcCCR1, a molecular docking approach was employed. The overall structure (SFig. S5) of the two CCRs showed significant resemblance to the crystal structures of PhCCR1. When aligning PhCCR1 with AtCCR1, a higher r.m.s.d. value of 0.495 Å was obtained compared to the alignment between PhCCR1 and CcCCR1, which yielded a higher r.m.s.d. value of 0.488 Å. The r.m.s.d. value for the alignment between CcCCR1 and AtCCR1 is 0.351 Å. The molecular docking analysis also involved the utilization of NADPH and cinnamoyl-CoA as small ligand molecules. The binding pocket of NADPH and cinnamoyl-CoA is consistent with previous studies (Fig. 5A & 5B) (Pan et al. 2014, Sattler et al. 2017). When NADPH was used as a ligand, the docking scores of CcCCR1 (-10.7638) and AtCCR1 (-10.9801) were found to be similar, suggesting possible differences in cinnamoyl-CoA binding. Following ligand docking using Schrodinger, CcCCR1 exhibited a higher docking score than AtCCR1 (-11.0371 < -9.11643). The greater binding capability of CcCCR1 might account for the higher cinnamoyl-CoA production in C. cassia compared to A. thaliana.