Changes in seed water absorption and germination rate
The germination rate of CL-2 seeds over a 48-hour period (Fig. 2) displayed a gradual increase from 0 to 20 h, with no statistically significant changes observed thereafter until 48 h (p > 0.05). As a result, the optimal time for sampling biochemical indicators in the later stage was determined to be at the 20-hour mark. The germination rate exhibited relatively low levels during the initial 0–8 h, referred to as the "germination preparation stage", followed by the 8–20 h characterized as the "germination stage", and the subsequent 8–16 h designated as the "rapid germination stage". After 20 h, the average germination rate of seeds reached 95%, suggesting optimal seed viability. The findings regarding seed water absorption rate (Fig. 2) demonstrated a swift uptake of water by the seeds within the initial 4 hours, labeled as the "rapid water absorption stage", followed by a gradual and consistent water absorption starting at 8 h.
Changes of storage compounds during seed germination
A reduction in starch content was observed during the germination process of seeds, as illustrated in Fig. 3. The highest starch content, recorded at 519.85 ± 13.14 mg/g, was observed in 0-hour. Following 20 h of germination, the starch content decreased to its lowest point, measuring at 208.66 ± 40.19 mg/g. Changes in total amylase activity were also depicted in Fig. 3. During the rapid water absorption stage (0–4 h), amylase activity peaked at 6.70 ± 0.62 U/L. Subsequently, there was a decline in amylase activity, with the enzyme activity reaching its minimum at 2.12 ± 0.42 U/L after 20 h. The soluble protein content demonstrated a decrease from 0 h to 12 h, followed by a subsequent overall increase in the later stages. Specifically, at the 12-hour mark, the minimum value observed was 8.13 ± 2.62 mg/g, which subsequently peaked at 39.96 ± 2.79 mg/g by the 16-hour mark. In contrast, the soluble sugar content exhibited a decline during the initial 0–8 h period, followed by a rapid increase from 8 h to16 h, and a subsequent decrease after 16 h. The minimum and maximum soluble sugar content values were recorded at 14.86 ± 1.12 mg/g and 32.64 ± 4.85 mg/g, respectively, at 4 and 16 h.
Changes of small molecule sugar content during seed germination
The results of analyzing fructose, glucose, sucrose, and maltose levels during the germination process of quinoa seeds are presented in Fig. 4. In terms of the sugars analyzed, fructose displayed the lowest content, fluctuating within a narrow range of 0.335 to 0.358 mg/g. In contrast, glucose levels showed an upward trend during the initial stages of germination (0–8 h), indicating a consistent increase in concentration over time. When compared to monosaccharides, disaccharides demonstrated higher levels, specifically maltose with a distribution spanning from 2.52 to 3.474 mg/g and sucrose distribution ranging from 1.177 to 3.22 mg/g. The maximum value of maltose content was observed at 8 h. Sucrose, on the other hand, was higher in dry seeds but underwent a marked decline throughout the germination process.
Changes in ABA and GA3 during germination process
The results of ABA and GA3 content within 20 h of seed germination are depicted in Fig. 5. A notable pattern was observed in the ABA content, showing an increase followed by a decline at the 12-hour mark. The lowest ABA content was detected in the dry seeds, measuring at 2.33 ± 0.27 ng/g, whereas the peak level was recorded at 12 h, reaching 6.20 ± 0.26 ng/g. Statistical analysis indicated a significant difference in ABA content at 0 h, 4 h, and 8 h (p < 0.05), with no significant disparity observed between the subsequent time intervals from 8 to 20 hours (p > 0.05). On the contrary, the GA3 content in the dry seeds was observed to be at its lowest level (0.05 ± 0.02 ng/g), during the stage of rapid water absorption by the seeds, at the 4-hour mark, the GA3 concentration reached its peak at 1.64 ± 0.27 ng/g before experiencing a sharp decline. Between the 8 to 20-hour time interval, the GA3 content fluctuated within the range of 0.04 to 0.65 ng/g.
Changes in ABA and GAs-related enzyme activity
The results of hormone-related enzyme activity are illustrated in Fig. 6. The enzyme activity linked to ABA synthesis exhibited the lowest value in the dry seeds, with ZEP, NCED, and AAO showing minimum enzyme activities of 998.91 ± 20.01 U/L, 491.92 ± 6.81 U/L, and 40.43 ± 0.87 U/L, respectively. Subsequently, during the rapid germination stage of seeds, the enzyme activity peaked at 16 h, 16 h, and 12 h, respectively, with values of 1194.87 ± 13.33 U/L, 625.77 ± 10.11 U/L, and 44.93 ± 0.52 U/L. The ABA synthase ZEP demonstrated the highest level of activity, ranging from 979.39 to 1210.35 U/L, while the AAO enzyme exhibited the lowest activity, with a range of 39.49 to 45.68 U/L. The ABA8'-H enzyme, which mediates ABA decomposition, exhibited consistent activity during the germination preparation phase, followed by a rapid increase and subsequent decline around the 16-hour mark.
The expression levels of GA20ox and GA3ox, key enzymes involved in GAs synthesis, exhibited a consistent upward trajectory during seed germination, as depicted in Fig. 7. In the dry seeds, the enzymatic activities of GA20ox and GA3ox were relatively low at 194.33 ± 4.66 U/L and 47.97 ± 0.85 U/L, respectively. Subsequently, from 8 h to 20 h post-germination, the enzymatic activities of both synthases increased steadily, reaching maximum values of 222.94 ± 3.70 U/L and 61.46 ± 0.91 U/L for GA20ox and GA3ox, respectively. The activity of GAs metabolic enzyme GA2ox displayed a biphasic pattern, with an initial increase followed by a decline, reaching a maximum value of 21.55 ± 0.34 U/L at 16 h. The lowest activity of GA2ox was observed in the dry seeds, registering at 17.33 ± 0.09 U/L.
Illumina sequencing and quality control
Based on the experimental results of germination rates, we observed that 8 h marked a turning point in germination. Prior to 8 h, the germination rate was relatively low. However, once exceeding this point, the seed germination rate accelerated significantly. Based on this finding, time points before and after 8 h (specifically, 4 h and 12 h) were decided to be selected as comparisons for pre- and post-germination stages. Quinoa seeds underwent a germination period of 4 h for the control group and 12 h for the experimental group. The specifics of the sequencing samples after QC are presented in Table 1. Following QC, the data volume of clean data per sample ranged from 6.04 GB to 6.47 GB, with an overall comparison rate of 96.70–97.50%. The Q30 ratio ranged from 92.44–93.12%, and the GC content varied from 43.69–44.85%. These results confirm the reliability of the transcript data and the use of clean data for further analyses.
Table 1
Sequence information after quality control
Sample ID | Before QC | After QC | Q20 rate | Q30 rate | GC content | Overall mapping rate |
4h-01 | 6.78G | 6.47G | 97.71% | 93.12% | 44.78% | 97.50% |
4h-02 | 6.42G | 6.12G | 97.57% | 92.86% | 44.48% | 97.40% |
4h-03 | 6.6G | 6.27G | 97.38% | 92.44% | 44.85% | 97.20% |
12h-01 | 6.32G | 6.04G | 97.55% | 92.80% | 43.69% | 97.40% |
12h-02 | 6.35G | 6.07G | 97.61% | 92.96% | 44.07% | 96.70% |
12h-03 | 6.43G | 6.13G | 97.47% | 92.62% | 44.07% | 97.20% |
Furthermore, by assessing the correlation between samples using TPM data, the reproducibility of biological experiments within the sample group was evaluated. The Spearman correlation coefficient analysis revealed that coefficients greater than 0.92 indicate a strong correlation between samples, as depicted in Fig. 8.
DEGs analysis by GO annotation
By utilizing the criteria of |log2(FoldChange)| > 1 and padj < 0.05 to identify DEGs, a total of 3349 DEGs were identified in the comparison between 4 h and 12 h; of these, 2113 DEGs exhibiting upregulation and 1236 DEGs showing downregulation (Fig. 9). The functional annotation of DEGs within specific pathways was performed using GO, resulting in a total of 3030 GO entries. These entries encompassed 2132 biological processes, 595 molecular functions, and 303 cellular components. A total of 15686 genes were associated with these GO entries, specifically including 1588 DEGs included. The p-values associated with these annotations were adjusted using the false discovery rate (FDR) method.
In the top 20 most significant entries of GO analysis (Fig. 10A), the focus is on Response, Hydrolase, Circadian, and Biosynthetic. The top 5 significant biological processes include Seed maturation (0010431), Sterol metabolic process (00016125), Multicellular organismal homeostasis (00048871), Sterol biosynthetic process (0016126), and Response to water (00009415). In terms of molecular functions, the most significant entries are Cysteine-type endopeptidase activity (0004197), Hydrolase activity and acting on glycosyl bonds (0016798), Hydrolase activity and hydrolyzing O-glycosyl compounds (0004553), and Acid phosphatase activity (0003993).
Among statistically significant GO terms (padj < 0.05), which pertaining to seed germination and maturation, entries included Seed maturation (0010431), Regulation of seed germination (0010029), Rhythmic process (0048511), Positive regulation of seed germination (0010030), and Regulation of seedling development (1900140). Notably, only the "Response to gibberellin" (0009739) within the GAs and ABA relative GO entries demonstrated statistical entries showed significance, with an adjusted p-value of 0.0543. In the carbohydrate-related GO entries, Carbohydrate catabolic process (0016052) exhibited statistical significance with an adjusted p-value of 0.0447.
DEGs analysis by KEGG annotation
To further investigate the DEGs expression profiles, the DEGs were annotated by the KEGG database. A total of 13242 genes were enriched in 365 KEGG pathways, including 1639 DEGs. The findings presented in Fig. 10 (B) illustrate the top 20 pathways with the lowest adjusted p-values, suggesting a notable enrichment in pathways associated with carbon metabolism, plant hormone signaling, biosynthesis of biologically active compounds, and lipid-related processes. To further elucidate the alterations in starch, sugar, ABA, and GA3 levels, as well as signaling pathways related to ABA and GAs during quinoa seed germination, an extensive annotation analysis of the KEGG pathways was conducted.
The result revealed that the ABA synthesis metabolic pathway (M00372) did not show statistically significant enrichment (p > 0.05), while the Diterpenoid biosynthesis pathway (ko00904), including the GAs biosynthetic pathways M00927, M00928, and M00929, displayed significant enrichment (p < 0.05). Furthermore, the Plant hormone signal transduction pathway (ko04075) exhibited highly significant enrichment (p < 0.001). The expression levels of DEGs, as measured by Fragments Per Kilobase Million (FPKM) and Fold Change (FC), related to the ABA and GAs pathways are illustrated in Fig. 11.
In the realm of hormone synthesis metabolism, the ABA pathway consists of a total of 21 genes, among which 5 genes, namely ZEP_1, ZEP_2, CYP707A, ABA2, and LUT5, were identified as DEGs (The full names and IDs corresponding to the abbreviations of genes are detailed in Appendix B.). The FC for ABA biosynthesis genes varied from − 1.68 to 1.22 overall, with ZEP and LUT5 showing downregulation in expression, while ABA2 exhibited upregulation. Notably, the metabolic gene CYP707A displayed significant downregulation, with a FC of -3.25. Meanwhile, the pathways related to ABA and GAs were exhibited in Fig. 12.
Furthermore, a total of 21 genes have been identified as being associated with GAs, with 7 of these genes identified as DEGs: CYP701_1, CYP701_2, GA3ox, KAO, GA20ox, GA2ox_1, and GA2ox_2. Specifically, genes GA2ox_1, GA2ox_2, and KAO exhibited significant upregulation during germination, while CYP701_1, CYP701_2, GA3ox, and GA20ox showed significant downregulation. The observed upregulation of the upstream GAs synthesis gene KAO (FC = 2.96) and the downregulation of CYP701_1, CYP701_2, GA20ox, and GA3ox (with FC ranging from − 1.60 to -1.01) suggest a regulatory role in GAs synthesis. Noteworthy is the differential expression of DEGs of GA2ox in quinoa, specifically GA2ox_1 and GA2ox_2, with FC of 4.69 and 4.85, respectively.
Additionally, 48 genes are involved in the ABA signaling pathway and 13 genes in the GAs signaling pathway within the broader context of plant hormone signaling pathways. Among these, 14 DEGs were identified as ABA signaling genes, including SnRK2_1, SnRK2_2, PP2C_1, PP2C_2, PP2C_3, PP2C_4, ABF_1, ABF_2, ABF_3, ABF_4, ABF_5, PYL_1, PYL_2, and PYL_3. Significantly, a total of 10 genes, including ABF, PYL, and SnRK2, showed upregulation, while PP2C displayed downregulation. Moreover, two DEGs, namely GID1_1 and GID1_2, were identified within the GAs signaling pathways, with GID1_2 being upregulated and GID1_1 being downregulated.
The starch and sucrose metabolism pathway (ko0050) exhibited significant enrichment, encompassing a total of 236 genes, of which 57 were identified as DEGs (Fig. 13). The schematic representation of the starch and sucrose metabolism process is depicted in Fig. 14, demonstrating the breakdown of sucrose into D-fructose and D-glucose by INV and malZ, as well as the interconversion of sucrose with UDP-glucose by SUS genes. Four homologous SUS genes were identified in quinoa, with SUS_1 and SUS_2 showing upregulation by 4.27 and 1.69 times, respectively, while SUS_3 and SUS_4 downregulated, with FC of -1.56 and − 1.84, respectively. In quinoa, four homologous genes of INV were identified among DEGs, with the highest FC of 4.12. The expression levels of the genes involved in D-fructose degradation metabolism, such as HK and scrk were concurrently upregulated.
In addition to the breakdown of sucrose into glucose and maltose by malZ, and cellulose into glucose by EGLU and β-glu, a total of 8 EGLU homologous genes and 8 β-glu homologous genes were identified in quinoa. Among these, only EGLU_8 showed downregulation with an FC of -1.93. Quinoa also possesses 6 homologous genes of β-glu, all of which were upregulated.
Simultaneously, starch undergoes enzymatic catalysis by AMY to produce maltose, while AMY is also involved in the hydrolysis of starch into dextrin. The levels of expression for both beta AMY and alpha AMY were elevated. In the process of starch biosynthesis, D-glucose-1P is converted into ADP glucose by glgC, subsequently leading to the formation of amylose through the activity of the starch synthase gene glgA, resulting in starch formation under the influence of GBE1. The expression levels of glgC and glgA increased during germination, whereas the expression of GBE1 decreased with an FC of -1.16.
The analysis revealed that 161 genes were involved in the glycolysis/gluconeogenesis pathway (ko00010), with 34 DEGs identified (Fig. 15). The genes involved in the glycolysis/gluconeogenesis pathway are presented in Fig. 16. In the initial phase of glycolysis, commonly known as the "energy consumption phase," glucose phosphorylation facilitated by GALM and HK results in the production of α-D-glucose-6-phosphate. Subsequently, α-D-glucose-6P was isomerized to β-D-fructose-6P and further phosphorylated to generate β-D-fructose 1,6-bisphosphate through the actions of PFKA and PFP. PFKA encodes a crucial enzyme that regulates the rate of glycolysis, and in quinoa, 4 homologous genes (AUR62031952, AUR62033274, AUR62039818, AUR62005870) were identified. Among these, only AUR62005870 exhibited downregulation, with a FC of -1.2.
The enzyme ALDO facilitated the condensation of a six-carbon molecule into two three-carbon compounds, glycerone-P and D-glyceraldehyde 3-P. These three-carbon compounds can be interconverted by the gene TPI. In the second stage of glycolysis, referred to as the "energy -yielding stage," the conversion of three-carbon compounds into 3-phospho-D-glyceroyl phosphate by the gene gapA. Subsequently, the enzymes MINPP1, PK_1, and PK_2 facilitated the production of pyruvate, ultimately completing the glycolytic pathway.
Pyruvate was further metabolized through anaerobic oxidization to lactate by LDH, or alternatively underwent decarboxylation to generate Acetyl CoA and ethanol, among others byproducts. Acetyl CoA then entered the Tricarboxylic acid cycle (TCA cycle) for further energy production. The process of pyruvate salt formation was facilitated by enzymes including PDHB, NAPH+, FRMA, ACSS1_2, and ALDH. Notably, the expression levels of ACSS1_2, FRMA, and PDHB were increased, whereas NADPH + levels were decreased.
Results of Key genes
Utilizing the R language and Cytoscape, betweenness centrality was computed to identify pivotal genes within the target pathway. Out of the 28 nodes encompassed in the ABA and GAs-related pathways, the top 3 genes were identified as SnRK2_1, GA2ox_2, ABF_2 (Fig. 17). Additionally, the associations between starch and sucrose metabolism pathways and glycolysis pathways individually assessed, leading to the establishment of networks for the top 30 correlated nodes within these pathways (Fig. 18 and Fig. 19). The findings revealed that malZ_2, glgA_2, and SUS_3 were the top 3 ranked genes in the starch and sucrose metabolism pathway, while PDHB_2, ALDH_1, and HK were the top 3 in the glycolysis pathway.
Based on the rankings provided by 9 different computation methods in Cytohubba, the intersection of genes ranked by these diverse methods was chosen, and these genes were designated as the central genes calculated by Cytohubba's computation. The results showed that GA3ox, GA20ox, ABA2, CYP707A, and ZEP as central genes in the ABA and GAs-related gene pathways. However, in the realm of starch and sucrose metabolism, a total of 5 central genes were identified: SUS, PYG, GBE1, PGM, and HK. In the context of glycolysis, the genes LDH, PGM, ALDO, PFKA, NAPH+, PK, and TPI were identified. The specific computational findings are outlined in Appendix C.
The key parameters for central network analysis were determined by the MCODE algorithm. Subsequent analysis revealed the significance of PP2C, ZEP, ABA2, CYP707A, and GA2ox in the ABA and GAs-related pathway, as illustrated in Fig. 20. Similarly, SUS, scrk, PGM, alpha AMY, HK, ostB, GBE1, INV, and PYG were identified as central network genes in the starch and sucrose metabolism pathway. In glycolysis, the genes HK, PGM, NAPH+, PFKA, LDH, PK, ALDO, and TPI were identified as central networks.
By examining DEGs functions, pre- and post-germination FC, gene correlations, and the central networks extraction, key genes from various pathways were identified. Subsequent analysis revealed key genes involved in the ABA and GAs pathways, including CYP707A (AUR62001756), ABA2 (AUR62021168), GA2ox_1 (AUR62024597), and GA2ox_2 (AUR62011753). In the pathway of starch and sucrose metabolism, pivotal genes were identified, including INV_1 (AUR62009834), INV_2 (AUR62039932), malZ_1 (AUR62021097), beta glu_6 (AUR62029347), EGLU_1 (AUR62000202), EGLU_2 (AUR62006550), scrk_1 (AUR62002325), HK (AUR62031934), and alpha AMY (AUR62012986). Similarly, essential genes involved in glycolysis were HK (AUR62031934), ACSS_1 (AUR62032527), and ACSS_2 (AUR62017459).
RT-qPCR validation
Based on the results of Cytohubba calculations, gene correlation analysis, gene variation multiples, and a thorough literature review, a total of 15 DEGs were chosen for further validation through RT-qPCR analysis in pathways associated with diverse biological processes, with the primer sequences detailed in Appendix D. (Fig. 21). The results demonstrated that the expression trends observed in the RNA-seq results were consistent with the calculated expression levels obtained from RT-qPCR.