Plant materials and Chemicals
The root samples of G. glabra L. and G. uralensis Fisch. were collected from Urumqi, Xinjiang province, China during the middle stage of root formation. The middle stage of root formation was two years. The root samples of G. glabra L. and G. uralensis Fisch. from the middle stage were set with three biological replicates (marked as G1, G2, G3 and W1, W2, W3, respectively). All the root samples were frozen at -80 ℃ for RNA extraction. The purity of all the chemical compounds, including liquiritigenin (CAS: 578-86-9), daidzein (CAS: 486-66-8), formononetin (CAS: 485-72-3), medicarpin (CAS: 32383-76-9), NADH (CAS: 606-68-8), and NADPH (CAS: 2646-71-1), are > 98%. All the chemical compounds were commercially available (Yuanye, Shanghai, China).
Deep Illumina sequencing and transcriptome analysis
Total RNAs were extracted from the root samples of G. uralensis Fisch. and G. glabra L. using the Column Plant RNAout2.0 (Tiandz Inc., Beijing, China). Approximately 100 mg of each sample was used to extract RNA. The RNA was evaluated using agarose gel electrophoresis, Nanodrop One (Nanodrop Technologies Inc., DE, USA), and Agilent 2100 (Agilent Technologies Inc., CA, USA) to confirm the purity, concentration, and integrity, respectively. The 260/280 nm ratios and 260/230 nm ratios of 1.8–2.2 and 1.4–1.8, respectively, from the Nanodrop were regarded as pure (Table S4). Next, the RNA library was constructed, and sequencing was performed by Genepioneer technologies corporation (Nanjing, China)[44]. Novaseq 6000 platform (Illumina Inc.) was used for high-throughput sequencing[45, 46] with pair-end 150 bp. The reference genome of G. uralensis (http://ngs-data-archive.psc.riken.jp/Gur-genome/download.pl.) was used for transcriptome analysis, and gene functions were comprehensively annotated based on the following databases of Nr (NCBI nonredundant protein sequences), Pfam (protein families), KOG/COG (clusters of orthologous groups of proteins), SwissProt (a manually annotated and reviewed protein sequence database), KEGG (Kyoto encyclopedia of genes and genomes database), GO (gene ontology), and KO (KEGG Orthology). Fragments per kilobase per million mapped reads (FPKM) value was used to estimate the expression level of the genes. DEGs between libraries were identified by DESeq2 (http://www.bioconductor.org/packages/release/bioc/html/DESeq.html). Fold change represents the ratio of expression quantity between two samples, and the Benjamini-Hochberg approach was used to adjust the P values for controlling the FDR. Unigenes with FDR < 0.05 and an absolute value log2 (Fold change) ≥ 1 were seen as differentially expressed[47]. The full-length cDNAs of the targeted genes were amplified by PCR using High-Fidelity PCR Master Mix (Tolo Biotech, shanghai, China), and the primers were designed based on the transcriptomic data (Table S1). The transcriptomic data used in this study were submitted to the Sequence Read Archive (SRA) of NCBI database with the accession numbers as SRR22859373, SRR22859372, SRR22859371 for three biological replicates from G. glabra L. and SRR22859370, SRR22859369, SRR22859368 for three biological replicates from G. uralensis Fisch., respectively.
Based on the annotation results, seventeen candidate genes involved in the medicarpin biosynthesis pathway, including the sequences which encoding PAL (phenylalanine ammonialyase), C4H (cinnamate 4-hydroxylase), 4CL (4-coumaroyl CoA ligase), TAL (tyrosine ammonialyase), PD (pyruvate dehydrogenase), ACC (acetyl-CoA carboxylase), CHS (chalcone synthase), CHR (chalcone reductase), CHI (chalcone isomerase), 2-HIS (2-hydroxyisoflavanone synthase), CPR (cytochrome P450 reductase), I4′OMT (isoflavone 4′-O-methyltransferase), HID (2-hydroxyisoflavanone dehydratase), I2′H (isoflavone 2′-hydroxylase), IFR (isoflavone reductase), VR (vestitone reductase), PTS (pterocarpan synthase) were highly expressed in the root of G. glabra L. Since the enzyme activities of CPR and PTS from G. glabra L. were weak, CPR of soybean[24] and PTS of G. pallidiflora Maxim[48]. were selected and used for the heterogenous biosynthesis of medicarpin.
Gene expression
The genes HID, I4′OMT, IFR and PTS were amplified from the cDNAs from the roots of G. glabra L., and G. pallidiflora Maxim, respectively, and then cloned into the pET-28a vector, and the gene VR was amplified from the cDNA from the root of G. glabra L. and cloned into the pGEX-4T-1 vector using the EasyGeno Assembly Cloning kit (Tolobio, Shanghai, China). The plasmids were transferred into Escherichia coli BL21 (DE3). All primers used in vector construction are listed (Table S1). Transformants were screened on solid culture medium (containing 100 µg/mL kanamycin sulfate or ampicillin sodium), and single colonies were picked for sequencing verification.
Recombinant cells were cultured in 1 L Luria-Bertani (LB) medium (37 ℃, 165 rpm) containing 100 µg/mL kanamycin sulfate or ampicillin sodium. When OD600 value of the reaches 0.6–0.8, isopropyl β-D-thiogalactoside (IPTG) with a final concentration of 0.5 mmol/L was added. The cultivation was further carried out at 18 ℃ for 12 hours. 1 L of bacterial suspension was centrifuged to obtain expression cells (5000 rpm, 15 min), and the precipitant was resuspended in 20 mL buffer C solution (10 mmol/L Tris-HCl, 5% glycerol, 0.2 mol/L NaCl). The cells were disrupted with a high-pressure cell crusher (ATS Engineering Limited, Suzhou, China), then centrifuged at 4 ℃ and 12000 rpm. The recombinant protein was purified using the His-Tagged Protein Purification Kit or the GST-Tagged Protein Purification Kit (Smart-Lifesciences, Changzhou, China), and the protein concentration was determined by ultraviolet spectrophotometer (Allsheng, Hangzhou, China).
Cytochrome P450 2-HIS and I2′H were first cloned from the cDNA from the roots of G. glabra L, and expressed in S. cerevisiae IMX581 using pESC-LEU vector. Cytochrome P450 reductase (CPR) was amplified from the cDNA of soybean, and expressed in IMX581 using the vector pESC-URA. The recombinant plasmid was transferred into IMX581 by electroporation, and transformants were grown on corresponding SC-Leu or -Ura media with 2% glucose at 30 ℃ for 3 days. The positive colonies were picked and shaken at 30 ℃ and 200 rpm until OD600 value reaches about 0.8. The 2% glucose medium was replaced with an induction medium (containing 2% galactose), and cultivated at 30 ℃ and 200 rpm for 72 h.
Characterization of enzymatic activity in vitro and in vivo
The enzymatic reaction systems using liquiritigenin as substrate contained 0.1 mol/L dipotassium hydrogen phosphate, 0.5 mol/L sucrose, 0.5 mmol/L glutathione, 1 mmol/L NADPH, 1 mmol/L NADH, 5 mmol/L magnesium chloride, 100 µmol/L substrate liquiritigenin and 0.5 mmol/L S-adenosyl-L-methionine (SAM), and 500 µL crude yeast extraction supernatant solution of strain DWY1, and 30 µL I4′OMT protein derived from E. coli expression. The enzymatic reaction systems[49] using formononetin as substrate contained 0.1 mol/L potassium phosphate (pH = 8.0), 0.4 mol/L sucrose, 0.5 mmol/L glutathione, 2 mmol/L NADPH, 2 mmol/L NADH, 5 mmol/L magnesium chloride, 50 µmol/L substrate formononetin, and 250 µL crude yeast extraction supernatant solution of strain DW08.
In vitro enzymatic activity measurement mixture was incubated at 30 ℃ for 12 h with gentle shaking, and the reaction mixture was extracted with an equal volume of ethyl acetate for three times. The extracted metabolites were dried and re-dissolved in methanol. The metabolites were further purified by passing through a 0.22 µm polytetrafluoroethylene (PTFE) filter and used for UPLC-ESI-Q-TOF-MS/MS analysis.
For in vivo analysis, the yeast strains were cultivated in medium containing 20 mg liquiritigenin or formononetin at 30 ℃ and 210 rpm for 24 hours; Then 20 mg of liquiritigenin or formononetin was added to the medium, and continued to cultivate for another 72 hours. The fermentation broth was extracted with an equal volume of ethyl acetate for three times, and the final products were redissolved in methanol and used for UPLC-ESI-Q-TOF-MS/MS analysis.
UPLC‒ESI-Q-TOF-MS/MS analysis of catalytic and final products
An Acquity UPLC system coupled with a Synapt mass spectrometer (Waters Corp., Milford, MA, USA) and equipped with an electrospray ionization (ESI) device were used to analyze the catalytic products. The separation was carried out on a Waters C18 column (100 mm × 2.1 mm, 1.7 µm). The column was eluted with a gradient mobile phase which consisted of 0.1% aqueous formic acid (solvent system A) and acetonitrile (B). The mobile phases during operation were: 0–1 min, 5% B; 1–20 min, 5%-100% B; 20–30 min, 100% B; 30–31 min, 100%-5% B; 31–33 min, 5% B. The injection volume was 2 µL, and the flow rate was 0.4 mL/min.
Extraction of medicarpin and other metabolites from G. uralensis Fisch. and G. glabra L.
The dried roots of G. uralensis Fisch. or G. glabra L., were pulverized into powder, and 4 g of the powder was dissolved in 400 mL of 70% methanol. The power in the methanol was treated with ultrasonic (KQ-500DE numerical control ultrasonic cleaner, working frequency: 40 kHz, power: 500 W, Kunshan Ultrasonic Instrument Co., Ltd.) at 30 ℃ for 60 min to extract metabolites in the roots. After centrifugation, the supernatant was filtered through a 0.22 µm PTFE filter and used for qualitative analysis.
Reconstitution of the medicarpin biosynthetic pathway in yeast
The chassis strain used in this experiment was S. cerevisiae IMX581 (MATa ura3-52 can1::cas9-natNT2 TRP1 LEU2 HIS3)[50]. Heterologous genes were integrated into the targeted genomic loci via the CRISPR/Cas9 system[23]. Plasmid pMEL10 was used as gRNA vector. Phanta Max Super-Fidelity DNA Polymerase (Vazyme Biotech Co. Ltd) was used for DNA fragment amplifications, and PrimeStar DNA polymerase (TaKaRa Bio) was used for in vitro fusion PCR. The ClonExpress MultiS One Step Cloning Kit (Vazyme Biotech Co.,Ltd) was used for in vitro fragment recombinant. FastPure Plasmid Mini Kit (Vazyme Biotech Co.,Ltd) was used for plasmid extraction. FastPure Gel DNA Extraction Mini Kit (Vazyme Biotech Co.,Ltd) was used for DNA extraction. Super Yeast Transformation Kit (Coolaber) was used for transformation. All primers, the plasmids, and the strains used in this study were listed (Table S1 -3). The genes used in this study were submitted to Genbank database with the accession numbers of OQ067102(CHS), OQ067258(CHI), OQ067259(CHR), OQ102530(2-HIS), OQ102529(HID), OQ067262(I2′H), OQ067263(IFR), OQ067264(VR), OQ067265(I4′OMT), OQ067266(C4H), OQ067267(4CL).
Fermentation and product analysis
S. cerevisiae strain DW11 was used for medicarpin biosynthesis in shake flask. DW11 was grown at 30 °C and 200 rpm for 12 h, and then transferred into 1 L of fresh medium containing 0.2 g/L liquiritigenin or formononetin. DW11 was further cultivated at 30 °C and 200 rpm for 5 days, and yeast sample was fetched every 24 h during cultivation. The collected yeast samples were disrupted with a high-pressure cell crusher, and then centrifuged at 10000 g for 20 min to collect the supernatant. The metabolites in the supernatant were extracted with the same volume of ethyl acetate (v/v = 1:1) for three times. The extraction was dried and redissolved with methanol for UPLC analysis.
Quantitative UPLC analysis
The yeast product was detected using a Waters C18 column (100 mm × 2.1 mm, 1.7 µm) on an Acquity UPLC system (Waters Corp., Milford, MA, USA) by gradient elution with a mobile phase comprising 0.1% (v/v) formic acid aqueous solution (A) and acetonitrile (B) at a flow rate of 0.4 mL/min. The mobile phases were programmed as: 0–1 min, 5% B; 1–10 min, 5%-100% B; 10–11 min, 100% B for 1 min; 11–12 min, 100%-5% B; 12–13 min, 5% B. The injection volume was 2 µL. The column temperature during operation was 25 °C. For the product vestitone with no available standard compound, the semi-preparative RP-HPLC (Shimadzu, Kyoto, Japan) was used to collect and concentrate, which was further dried to obtain solid powder by rotary evaporator (Buchi, Switzerland). The verstitone could be used as standard for quantification analysis.