Metabolic profiles of P. trichocarpa leaves
Metabolites present in P. trichocarpa leaves as determined by GC/MS analysis of extract are detailed in Supplementary Table 1. 176 metabolites were identified and semi-quantified relative to an internal standard by GC/MS analysis of extract from the training leaf set and of those metabolites, 80 were classified to be “aromatic” in nature (consisting of phenolics, benzoates, salicylic acid moieties, etc.). 115 metabolites were identified and semi-quantified in the test set of extracts based on GC/MS analysis, of these 49 were positively identified to be aromatic.
Py-MBMS analysis of P. trichocarpa leaves
Py-MBMS analysis of the training set of P. trichocarpa genotypes grown in Clatskanie, OR was used to analyze spectral features consistent with biomass composition that would enable estimation of the relative lignin content and syringyl/guaiacyl (S/G) ratio in lignin, as well as the relative abundance of specific metabolites present in the leaves. Table 1 shows the average lignin content and S/G ratio of the leaves determined by summation of mean-normalized ion intensities of m/z 120, 124 (G), 137 (G), 138 (G), 150 (G), 152, 154 (S), 164 (G), 167 (S), 168 (S), 178 (G), 180, 181, 182 (S), 194 (S), 208 (S) and 210 (S), where G denotes primarily guaiacyl-derived ions, S denotes primarily syringyl-derived ions, and other ions either derive from other lignin monomers or multiple sources. Lignin content was estimated by using a single point response factor relative to a representative NIST standard of known Klason lignin content using the same ions. Syringyl-to-guaiacyl (S/G) ratios were determined by dividing the sum of S-based ions by the sum of G-based ions using mean-normalized ion intensities.
Table 1
Lignin content and composition of P. trichocarpa leaf sets as determined by py-MBMS.
P. trichocarpa leaf set
|
Lignin Content (DW %)
|
S/G
|
Clatskanie Training (n = 219)
|
9.7 (± 0.7)
|
0.6 (± 0.1)
|
Boardman Test (n = 223)
|
9.4 (± 0.6)
|
0.8 (± 0.1)
|
The leaves in both sets exhibit significantly lower lignin content and S/G as is expected and compared to that typically seen in mature woody stem xylem tissue (9–12). Additionally, the variation in ions and principal component analysis (PCA) show that the lignin-derived ions are not the main source of spectral variance across the sets. Principal component analysis (PCA) loadings for the first principle component plotted in spectral format to demonstrate the variation in the test leaf set arises primarily from ions m/z 43, 57, 71, 85, 95, 97, which originate primarily from sugars, as well as m/z 77, 91, 94, 105, and 122, derived primarily from aromatics (13, 14), which were generally negatively correlated with the sugar-derived ions (Fig. 1a).
The relative lignin content and S/G determined for the test set of P. trichocarpa genotypes grown in Boardman, OR, were found to be similar to those in the training set from Clatskanie, OR (Table 1). Additionally, the majority of spectral variation originates from similar ions derived from aromatics, including m/z 68, 71, 91, 94, 105, 107, 108, and 122 (Fig. 1b). However, various minor differences from a number of sources (true biomass compositional variation, instrumental drift, etc.) make it difficult to directly compare the two sets and develop PLS modeling approaches that can be used across sets without including several of the same standards, samples, etc. in each set. In order to circumvent this without requiring a priori metabolomics by GC/MS, and because these differences may not be the same for any future sets being compared, and also to simplify future analyses, the main source of spectral variation (being ions derived from aromatics) was used as a rough estimate to compare samples within the sets in the same way lignin content has been compared using this method over the last decade (5, 9, 10, 12, 15–17).
Aromatic metabolite predictions using py-MBMS
Pearson correlation coefficients were determined for select ions in the MBMS spectra, lignin content and monolignol ratios (S/G), as well as the total fractional abundance of aromatics determined by GC/MS from the training set of leaves (Table 2). The select ions in the MBMS spectra (Table 2) and the ions chosen to represent the sum of aromatic metabolites were chosen based on the correlation coefficients with aromatic metabolites, the principal component analyses (Fig. 1), and the loadings of the ions in the PLS models constructed using this data (Fig. 2). The PLS models used to predict the relative aromatic composition of extractable metabolites were primarily driven by the ions known to derive from aromatic species (13), primarily m/z 68, 71, 77, 91, 94, 105, 107, 108, and 122, which were also the main source of variation in the spectra of the set and provided a reasonable estimation and validation of using py-MBMS to predict aromatic metabolite composition in poplar leaf extracts. The sum of the aromatic metabolite ion intensities in the MBMS spectra from the training set had a correlation coefficient with the sum of the aromatic metabolites determined by GC/MS of 0.87 and R2 of 0.76 (Fig. 2c). This simplified method of summing the ion intensities (m/z) 68, 71, 77, 91, 94, 105, 107, 108, and 122 to predict the relative abundance of aromatic metabolites in extract could theoretically be used in place of PLS modeling for ranking samples when limited a priori knowledge of metabolite information is available.
While many of the aromatic metabolites had a positive correlation with the sum of the aromatic metabolite intensities from the MBMS, they all differed in value and many of the signature ions produced from each metabolite overlap, making it difficult to differentiate and resolve the specific species (Table 2). Therefore, this method was developed to rapidly estimate the total relative abundance of aromatic metabolites present in poplar leaf extract samples. Further deconvolution and speciation of specific metabolites will be the subject of future investigations.
Table 2
Correlations between select ions from MBMS spectra of P. trichocarpa leaves in the training set from Clatskanie, OR and select metabolites as determined by GC/MS as well as lignin content, S/G ratios and the summation of metabolite-derived ions from MBMS spectra and the total aromatic metabolites determined by GC/MS. *metabolite ion intensities summed 68, 71, 77, 91, 94, 105, 107, 108, and 122
Trait
|
Lignin content
|
S/G
|
Sum metabolite ions in MBMS*
|
Aromatics by GC/MS
|
m/z 64
|
-0.34
|
0.02
|
0.68
|
0.65
|
m/z 65
|
-0.38
|
0.32
|
0.74
|
0.69
|
m/z 66
|
-0.54
|
0.32
|
0.87
|
0.80
|
m/z 68
|
-0.49
|
0.30
|
0.71
|
0.59
|
m/z 77
|
-0.64
|
0.38
|
0.98
|
0.85
|
m/z 78
|
-0.43
|
0.23
|
0.72
|
0.66
|
m/z 91
|
-0.39
|
0.25
|
0.68
|
0.59
|
m/z 92
|
-0.32
|
0.25
|
0.61
|
0.53
|
m/z 94
|
-0.55
|
0.35
|
0.93
|
0.85
|
m/z 105
|
-0.66
|
0.35
|
0.98
|
0.85
|
m/z 106
|
-0.52
|
0.40
|
0.77
|
0.68
|
m/z 107
|
-0.59
|
0.36
|
0.96
|
0.82
|
m/z 108
|
-0.65
|
0.45
|
0.95
|
0.84
|
m/z 110
|
-0.26
|
-0.06
|
0.61
|
0.60
|
m/z 122
|
-0.65
|
0.39
|
0.98
|
0.86
|
m/z 181
|
-0.19
|
0.38
|
0.59
|
0.55
|
2-O-salicyloylsalicin
|
-0.36
|
0.26
|
0.52
|
0.63
|
2,6-cyclohexadiene-1,2-diol
|
-0.45
|
0.29
|
0.69
|
0.55
|
6-hydroxy-2-cyclohexenone alcohol
|
-0.52
|
0.38
|
0.80
|
0.71
|
α-salicyloylsalicin
|
-0.55
|
0.40
|
0.87
|
0.88
|
benzoyl-gentisyl alcohol
|
-0.34
|
0.29
|
0.54
|
0.50
|
benzoyl-salicyloylsalicin
|
-0.44
|
0.33
|
0.62
|
0.72
|
benzyl-coumaroyl-glucoside
|
-0.53
|
0.31
|
0.76
|
0.74
|
catechol
|
-0.53
|
0.36
|
0.80
|
0.77
|
coumaroyl-tremuloidin
|
-0.45
|
0.25
|
0.67
|
0.65
|
phenethyl-tremuloidin
|
-0.45
|
0.23
|
0.65
|
0.63
|
salicortin
|
-0.50
|
0.33
|
0.83
|
0.81
|
salicyl-salicylic acid-2-O-glucoside
|
-0.40
|
0.30
|
0.66
|
0.75
|
salicylic acid
|
-0.45
|
0.25
|
0.68
|
0.61
|
salicyloyl-coumaroyl-glucoside
|
-0.58
|
0.37
|
0.85
|
0.86
|
salicyltremuloidin
|
-0.45
|
0.30
|
0.68
|
0.83
|
salireposide
|
-0.42
|
0.26
|
0.68
|
0.67
|
Sum metabolite ions in MBMS*
|
-0.66
|
0.39
|
1.00
|
0.87
|
Aromatics by GC/MS
|
-0.54
|
0.32
|
0.87
|
1.00
|
tremulacin
|
-0.52
|
0.38
|
0.77
|
0.85
|
tremuloidin
|
-0.40
|
0.24
|
0.56
|
0.55
|
trichocarpin
|
-0.43
|
0.32
|
0.68
|
0.67
|
The total aromatic fraction of secondary metabolites in the test set from Boardman, OR, were predicted based on the simplified py-MBMS ion summation method and validated by GC/MS analysis of the extract. After GC/MS metabolic profiles were generated, PLS models of the test set were also built to further validate the use of the simplified ion method. There is a correlation between the total abundance of the aromatic metabolites as determined by GC/MS and the sum of the aromatic metabolite ions determined by MBMS (Fig. 3a), with an R2 of 0.78 and Pearson correlation coefficient of 0.88 (Table 3) for the second test set of leaves, indicating that this simplified method is reasonably capable of predicting the relative abundance of aromatics in poplar leaf extracts across data sets.
Additionally, similarities in correlations between the two data sets for py-MBMS ions, metabolites and other compositional features include positive correlations between aromatic-derived ions, such as 105 and 122, with many salicylate metabolites (Table 3). The value of correlation coefficients for different traits differed to some degree between sets though, due, in part, to the differences in the actual metabolites detected and analyzed by GC/MS in the two sets. While these differences and metabolites themselves could not be speciated amongst these two sets, it was still possible to reasonably predict total aromatic metabolites in the two sets using the simplified py-MBMS ion summation method as demonstrated in Fig. 3a. Additionally, the relative abundance of aromatic metabolites based on ranking GC/MS abundance reasonably correlates with the ranking determined by py-MBMS (Fig. 3b).
The PLS model for the test set from Boardman, OR (Fig. 4) was driven by similar aromatic-derived ions as the training set, but was different enough that the PLS models developed by one set could not be used to accurately predict aromatics in the other set. Interestingly, the PLS model for the test performed better than the model developed for the training set. These results further validate that PLS models would be similar but not directly translatable between different data sets and hence the need for the simplified ion intensity summation approach for screening purposes.
Table 3
Correlations between select ions from MBMS spectra of P. trichocarpa leaves in the test set from Boardman, OR and select metabolites as determined by GC/MS as well as lignin content, S/G ratios and the summation of metabolite-derived ions from MBMS spectra and the total aromatic metabolites determined by GC/MS. *metabolite ion intensities summed 68, 71, 77, 91, 94, 105, 107, 108, and 122
Trait
|
Lignin Content
|
S/G
|
Sum metabolite ions in MBMS*
|
Aromatics by GC/MS
|
m/z 122
|
-0.15
|
0.10
|
0.96
|
0.87
|
m/z 108
|
-0.05
|
0.06
|
0.86
|
0.87
|
m/z 107
|
-0.03
|
0.00
|
0.93
|
0.87
|
m/z 106
|
-0.10
|
0.11
|
0.88
|
0.82
|
m/z 105
|
-0.15
|
0.08
|
0.95
|
0.81
|
m/z 94
|
-0.04
|
0.02
|
0.74
|
0.69
|
m/z 91
|
-0.04
|
-0.01
|
0.54
|
0.51
|
m/z 78
|
-0.07
|
0.01
|
0.79
|
0.68
|
m/z 77
|
-0.12
|
0.07
|
0.97
|
0.87
|
m/z 68
|
-0.07
|
-0.05
|
0.74
|
0.58
|
m/z 66
|
-0.07
|
0.01
|
0.72
|
0.63
|
1,2,3-benzenetriol
|
-0.11
|
0.06
|
0.43
|
0.50
|
1,2,4-benzenetriol
|
-0.09
|
0.05
|
0.55
|
0.64
|
2,5-dihydroxybenzoic acid-2-O-glucoside
|
-0.01
|
0.00
|
0.48
|
0.51
|
Aromatics by GC/MS
|
-0.13
|
0.09
|
0.88
|
1.00
|
benzoyl-salicyloylsalicin
|
-0.12
|
0.06
|
0.67
|
0.49
|
catechol
|
-0.03
|
0.02
|
0.52
|
0.62
|
phenol
|
-0.06
|
-0.01
|
0.64
|
0.68
|
salicin
|
-0.01
|
0.03
|
0.61
|
0.72
|
salicyl alcohol
|
-0.09
|
0.07
|
0.36
|
0.41
|
salicyl-coumaroyl-glucoside
|
0.04
|
-0.11
|
0.70
|
0.62
|
salicylic acid
|
-0.04
|
0.00
|
0.63
|
0.44
|
salicyltremuloidin
|
-0.10
|
0.05
|
0.68
|
0.46
|
Sum metabolite ions in MBMS*
|
-0.13
|
0.05
|
1.00
|
0.88
|
tremulacin
|
-0.12
|
0.03
|
0.63
|
0.47
|
tremuloidin
|
-0.18
|
0.16
|
0.55
|
0.78
|
trichocarpin
|
-0.06
|
-0.01
|
0.49
|
0.50
|
trichocarpin conjugate
|
-0.11
|
0.07
|
0.71
|
0.69
|
trichocarpinene
|
-0.09
|
0.00
|
0.50
|
0.48
|