Plant materials
P. ginseng seeds used in the present study were harvested in July 2016 and were provided by Gyeonggi Eastern Ginseng Nonghyup (Incheon, Korea). Seeds were kept at 14°C for 16 weeks for seed coat dehiscence. Seeds were checked every week, and wholly dehisced seeds were stored at -80°C until analyses.
Chemical reagents
Chloroform, methanol, water, methylene chloride, and methyl tert-butyl ether were purchased from JT Baker (Philipsburg, NJ, USA). Campesterol, β-sitosterol, clerosterol, Δ5-avenasterol, lupeol, and 24-methylenecycloartanol were obtained from ChemFaces Biochemical CO., Ltd. (Wuhan, Hubei, China). Benzoic acid (d5) was purchased from Cambridge Isotope Laboratory (Andover, MA, USA). All other chemical reagents were obtained from Sigma Aldrich (St. Louis, MO, USA).
Untargeted metabolomics
Seed coats were removed to prevent potential contamination. Afterward, the embryos and the endosperms were ground using a mortar and pestle under liquid nitrogen and lyophilized, and then 35 mg of the samples were dissolved in 1 mL of a chloroform:methanol:water (2:5:2) mixture. Benzoic acid (d5) was spiked as an internal standard. The samples were vortexed and ultra-sonicated (Sonics & Material, Inc., Newtown, CT, USA) for 30 min at 40°C, followed by centrifugation at 16,000 × g for 5 min. The supernatants (600 μL) were obtained for use in analyses, and the remaining 300 μL supernatant were used as QC samples. The samples were dried under nitrogen purge at 50°C. Thereafter, 100 μL of anhydrous toluene was added, and the samples were dried again under nitrogen purge to eliminate all solvent, followed by derivatization. First, 100 μL of methoxyamine hydrochloride in pyridine (20 mg/mL) was added, and samples were incubated for 90 min at 30°C. Afterward, 100 μL of N,O-bis(trimethylsilyl)trifluoroacetamide with trimethylchlorosilane (1% TMCS) was added, and samples were incubated again for 15 min at 70°C. The final 200 μL of the samples were analyzed using GC-MS.
The extracted samples were analyzed using a gas chromatography-mass spectrometer (Shimadzu-QP2010, Kyoto, Japan) with a DB-5MS column (30 m × 0.25 mm, 0.25 μm, Agilent, Santa Clara, CA, USA) to profile ginseng seed metabolites. The injection volume was 1 μL, and split mode with a ratio 5:1 was used as the injection mode. The injection temperature was 300°C. Helium was used as a carrier gas with a flow rate of 1 mL/min. The initial oven temperature was maintained at 70°C, increased to 150°C at a rate of 5°C/min, to 250°C at a rate of 8°C/min, to 300°C at a rate of 5°C/min, and then held at 300°C for 10 min. The electron impact ionization method was used as a mass spectrometry detector with 70 eV of electron energy, and mass data were collected in a scan mode at 30–600 m/z.
Phytosterol content analysis
Oil extraction and preparation of unsaponifiable fractions were performed according to previously described methods [24]. In brief, 5 g of ginseng seed power was boiled under reflux in 50 mL of methylene chloride for 2 h three times. After filtering, the extract was concentrated by rotary vacuum evaporation. One gram of the obtained oil was added into 20 mL of 1 M KOH in methanol, and 1 mL of 0.1% (w/v) 5α-cholestane in methyl tert-butyl ether added as an internal standard, and then stirred overnight. Subsequently, 40 mL of distilled water was added to dilute the mixture, and then extracted three times with 30 mL methyl tert-butyl ether. Afterward, 15 mL of 0.5 M KOH in methanol was added into the collected organic extract, and the solution was washed with 30 mL of distilled water until the pH became equivalent to the pH of the water, followed by the last wash with 15 mL of saturated sodium chloride solution. The solvent was eliminated by rotary vacuum evaporation.
Approximately 15 mg of the unsaponifiable fraction was subjected to the above derivatization step and analyzed with the same GC-MS instrument. The initial oven temperature was held at 190°C for 2 min, increased to 260°C at a rate of 5°C/min, to 275°C at a rate of 5°C/min, to 300°C at a rate of 5°C/min, and held at 300°C for 10 min. The other conditions were similar to those described above.
RNA extraction and real-time PCR analysis
Total RNA was extracted from ginseng seeds using the RNeasy RNA Plant Mini Kit (Qiagen, Germany) according to the manufacturer’s instructions. To eliminate genomic DNA, the samples were treated with RNase-Free DNase (Qiagen, Germany) before the final washing step. The quality and quantity of extracted RNA were measured using a NanoDrop ND-1000 spectrophotometer (Thermo Fisher Scientific, Wilmington, USA). For real-time PCR analysis, cDNA was synthesized from 1 μg of extracted RNA using M-MLV reverse transcriptase (Invitrogen, Carlsbad, CA, USA). The levels of expression of different genes were determined using an AB 7300 Real-time PCR system (Applied Biosystems, Foster City, CA, USA) with PowerUp SYBR green master mix (Applied Biosystems, Foster City, CA, USA) as the fluorescence reporter dye. The primer sequences of the target genes were SE (sense 5′-TCCTTTTTGGGTTCCTGCTC-3′, anti-sense 5′-GTATAAGCAAGAGCAGAGCCAGC-3′), cycloartenol synthase (CAS, sense 5′-GGGGAGTCTGCTTCACC-3′, anti-sense 5′-CTCCCCAACCACCAGAAG-3′), β-amyrin synthase (β-AS, sense 5′-TATCCTGGACACCGAAAGAAGG-3′, anti-sense 5′-GTAGTATGTCTTTCCAGCTGCCG-3′), dammarenediol synthase (DDS, sense 5′-GCATACCGCCGTTGAGATTA-3′, anti-sense 5′-TAGTGTCAATCGTTCCGCTG-3′), 24-methylenesterol C-methyltransferase (SMT2, sense 5'-CTCTGCGAGGTGGTTCG-3′, anti-sense 5′-GTCACCCACTCGTACGATAC-3′), 9-cis-epoxycarotenoid dioxygenase (NCED1, sense 5′-CTTGTATGACTCCAGCGGAC-3′, anti-sense 5′-TATTCACCATCCCTGCCTCT-3′), and glyceraldehyde 3-phosphate dehydrogenase (GAPDH, sense 5′-GTGGCCAGAGTGGCTTTG-3′, anti-sense 5′-GACCGTGACTGGCTTCTCAC-3′).
Data preprocessing and compound identification
Raw data were first converted to *.cdf files using GC-MS PostRun v.4.11 (Shimadzu). The *.cdf files were then converted into *.abf format files using the Reifycs Analysis Base File converter (Reifycs Inc., Tokyo, Japan). Afterward, the files were processed using MS-DIAL v.2.82), the universal and comprehensive software for metabolomics analysis [52]. The data collection parameters were as follows: Retention time begin of 5 min, retention time end of 48.50 min, mass range begin of 30 Da, and mass range end of 600 Da. For peak detection, linear weighted moving average with a level of 2 scans was applied for peak smoothing, average peak width was 5 scans, and minimum peak height was 2000 amplitude. For deconvolution parameters, sigma window value and electron ionization were 0.5 and 10 amplitudes, respectively. Peak alignment was processed using one QC file as a reference file, and other parameters were set as default.
The deconvoluted spectra were imported into MS Search (version 2.2) coupled with the NIST14 library for compound identification. Primarily, a match score of 700 was utilized as the threshold for putative identification. The retention index, the pattern of the peaks in the spectrum, and the reverse match score were also taken into consideration during the annotation process. The identities of differentially expressed metabolites, when applicable, were further confirmed by authentic commercial standards.
Exploration of data analysis and visualization
Processed spectral data were furthered log-transformed and scaled using by Pareto scaling method before actual analyses. PCA and heatmap analysis were mainly applied for data exploration and visualization. In addition, k-means clustering was used to obtained the first impression of the potential groupings of ginseng seeds before and after dehiscence.
Statistical analysis
In univariate analysis, a t-test was used to detect the differentially expressed metabolites between the non-dehisced and dehisced seed groups. In multivariate analysis, PLS-DA with cross-validation was conducted to develop the descriptive model of the separation between the two groups. All metabolites that were significantly different between two groups in t-test that also had a VIP score of 1.2 were selected for application in the development of a bio-signature of dehiscence. All metabolomics related analyses were complemented in MetaboAnalyst 4.0 [53]. Other statistical analyses were performed using IBM SPSS Statistics 25 (IBM Corp., Armonk, N.Y., USA). Continuous variables are presented as mean ± standard deviation. A strict cut-off of adjusted P-value, FDR 0.001, was utilized to be the significance level for metabolomics experiments. FDR was adopted considering the multiple comparisons, which is a characteristic of high-dimensional analysis. The discriminant capacity of the bio-signature was validated using an independent set of samples based on ROC curve analysis using the PLS-DA algorithm. A P-value of 0.05 was applied for all other analyses unless otherwise stated.
Functional enrichment analysis
Pathway enrichment analysis of the differentially expressed metabolites was performed in MBRole v.2.0. Functional data on compounds were obtained from the KEGG database [54].