Seed molecule-guided non-targeted oligomer screening framework.
A particular polymer can release a diverse range of oligomers sharing the same RU19. They were designated as oligo(RU)nα,ω, where "n" indicates the DP, and "α,ω" denotes the possible combinations of EGs (α and ω end structures). Based on whether the end structure was the same as that of the parent polymer, the oligomers were further divided into homologues and congeners (Fig. 1a). As shown in Fig. 1b, OLAnH,OH, a homolog of PLA, constitutes the predominant distributions of OLA. All these homologues are terminated with a hydroxyl group at the end. OLAnα,ω represents congeners of PLA with various end structures. Different congeners are composed of the same RU ([LA], C3H4O2).
We developed the Oligomer-Finder workflow to identify all oligomers in the samples. This strategy employed a seed-guided approach to expand and identify additional elements. The underlying principle was to first identify seed oligomers exhibiting repeated neutral losses (rNL) in tandem mass spectrometry (MS2) spectrum. The seed oligomer can provide RU and EG information regarding the parent polymer. As shown in Fig. 1c, all the oligomers derived from the same polymer were detected using seed oligomer guidance. Each seed oligomer was assigned a confidence degree ranging from Degree 4 to Degree 1, enabling the assessment of concern levels and prioritization of the related analysis (see Methods). The oligomers from each polymer were identified as the cycle continued until the last seed oligomer. Figure 1d summarizes the steps of Oligomer-Finder. Three fundamental script packages, namely "Seed oligomer-Finder," "Homologue-Finder," and "Congener-Finder," coupled with custom databases, enable comprehensive oligomer annotation through MS data. A graphical user interface (GUI) version of Oligomer-Finder was provided for easy use. The entire process diagram for oligomer screening is shown in Fig. 1e. MS data were obtained using LC-HRMS, and pre-analysis was performed to obtain the MS1 peak table and MS2 spectra. After the peak table and spectra were imported into the software, homologues and congeners were identified by analysing the mass differences between the peaks of the seed oligomer or RUs and all MS1 peaks (details in Supplementary Fig. S1 and Methods).
Oligomers with and without end-group modification share repeated neutral loss in MS fragmentation.
All oligomers exhibited a repeated neutral loss (rNL) pattern in the MS/MS analyses, generating a series of less-polymerized homologous ions within the collision cell (Fig. 2a). In the case of OLA6H,OH (HO-[LA]6-H), it exhibited a series of signal peaks according to [OLAnH,OH -H]− (n ≤ 6) which were associated with rNL of 72.0236 Da (mass of [LA]), corresponding to the RU of PLA shown in Fig. 2b. The spectral pattern of OLA4H,Meo with a methoxy end structure, was the same as those of OLA6H,OH in Supplementary Fig. S2. The MS2 patterns of the oligomers from different polymers were similar; a set of normally distributed signals representing low-DP fragments losing RUs was observed in the MS2 spectrum of each oligomer. The polymers, poly (hydroxybutyrate) (PHB) with an rNL of 86.0368 Da, corresponding to RU ([HB], C4H6O2), and PCL with an rNL of 114.0681 Da, corresponding to RU ([CL], C6H10O2), readily released a diverse range of oligomers (Supplementary Fig. S3a–c for oligohydroxybutyrates (OHBnH,OH, and HO-[C4H6O2]n-H) and Supplementary Fig. S3e–f for the oligocaprolactones (OCLnH,OH, and HO-[C6H10O2]n-H), respectively).
The number of NL occurrences quantified were designated as "Count," which were always less than DP but higher than 1. Notably, the most frequently occurring rNL had the maximum "Count" value (Fig. 2c). Oligomers possess distinctive structural characteristics and manifest specific patterns in MS2, thus presenting opportunities for non-targeted screening. The presence of rNL and Count serve as the "criteria" for oligomer screening. To identify potential oligomers, a custom R script termed "Seed oligomer-Finder" was developed (details in Supplementary Fig. S1 and Methods). The logic of the algorithm involves computing all NLs through pairwise subtraction of product ions in a single cycle and recording the NL with the highest count, which is represented as rNL.
The detection limit of "Seed oligomer-Finder" was determined by the Count value of rNL. The Count for a dimer was 1, indicating that its rNL cannot be discerned from a multitude of NLs. The theoretical maximum Count increased with increasing DPs. However, the quality of their MS2 spectra also imposed certain requirements, as evidenced by the Count. This was primarily influenced by the normalized collision energy (NCE) of MS2 and the intensity of the oligomers. For instance, OLA13H,OH exhibited a count of 12 with NCE set at 20%, whereas the count diminished to only 1 at NCE 60% (Supplementary Fig. S4a,b). High collision energies tend to generate fragment ions, primarily comprising monomers and dimers. A higher count signifies a superior spectral quality, which is advantageous for oligomer retrieval. The NCE was adjusted to optimize the rNL Count, as illustrated in Fig. 2d. Specifically, a stepped NCE approach of 10%-20%-30% was employed for the analysis of the oligomers.
The intensity of the MS1 peak of the oligomer significantly influenced the quality of the MS2 spectrum used for oligomer screening. At lower oligomer concentrations, the MS2 spectrum were incomplete. For instance, the spectrum of OLA5H,OH demonstrated a Count of 4 with an intensity of 1.09E7, whereas the Count decreased to only 1 at 7.58E4 (Supplementary Fig. S4c,d). The OLA solution was obtained from the PLA MP leachate by mimicking the leaching process (see Methods). The reconstituted OLA solution prior to dilution was labelled as "High, " while the solution diluted 5 times is labelled as "Medium," and the solution diluted ten times was labelled as "Low." The OLAnH,OH Counts at different concentrations are presented in Fig. 2e. Among these, oligomers with low abundance exhibited diminished MS2 spectrum quality at lower concentrations or may have even posed difficulties in obtaining MS2 spectra in data-dependent acquisition (DDA) mode. Hence, as relying solely on "Seed oligomer-Finder" proves insufficient in screening all oligomers, it is imperative to also consider oligomers in MS1.
Predictability of oligomer retention time.
Polymers and oligomers undergo depolymerization and often coexist with numerous homologues. Under our LC experimental conditions, the shorter the retention time (RT), the stronger the hydrophilicity of the substances. The RT is a reflection of chemical hydrophobicity21. Long-chain oligomers exhibit strong hydrophobicity22. This indicates that the long chains of the oligomers have a long RT. In the LC-MS analysis of the PLA leachate, OLA homologues were detected, and there was a robust logarithmic correlation between their DPs and LC RT (Fig. 3a and Supplementary Table S1). In the OLAnH,OH (n = 4–13) chromatograms, there was a notable association between the natural logarithm of DP (ln(DP)) and RT (Fig. 3b). Moreover, the correlation was steady, irrespective of the LC gradient method employed (Fig. 3c). Analogous correlations for broader DP ranges of OHBnH,OH (n = 2–12) and OCLnH,OH (n = 2–8) are depicted Supplementary Figure S5 ~ 6.
Seed oligomers, which are often polydispersible, coexist with their homologues, encompassing parent compounds with higher DP or degradation products with lower DP. The feasibility of using the RT of a seed oligomer and its homologue to predict the RT of other homologues with different DP was validated for OLAnH,OH. The straight line fitted to represent the relationship between RT and the ln(DP), based on any two DPs, generated predictions for other oligomer DPs, all within a deviation range of 0.32 minutes (Fig. 3d). For heightened prediction accuracy, it is imperative to discern the selection of two points that favour a wider DP disparity. This factor was integrated into the RT prediction algorithm in the Homologue Finder. It is anticipated that this retention time prediction method does not rely on specific chromatographic systems or standard substance calibrations.
The predicted RTs were used to correct the screening results of the homologues (see Methods). The false positive rate (FPR) and false negative rate (FNR) of OLA homologues, predicted on the seed oligomer ([OLA13H,OH -H]−, m/z 953.2823, exact m/z 953.2779694), are shown in Fig. 3e, along with the FPR after manual interference exclusion based on the predicted RT. This approach effectively mitigates the FPR induced by excessively large mass error settings and achieves a balance between the FPR and FNR in screening outcomes.
Oligomers with diverse end-group modifications.
To further elucidate the relationship between the oligomers and parent polymer and summarize their structure and MS information, we established a polymer oligomer database (PODB) of 171 pieces of polymer information based on existing polymer databases (Fig. 4a and Supplementary Fig. S7, details in Methods), which were also used for oligomer structure annotation by MS.
For congeners, a customized type of oligomer, the end structures differ from those of the parent polymer, suggesting modifications by other chemicals23. An oligomer end group database (OEGDB) was established through literature collection and OLA4α,ω spectra analysis, facilitating congener screening in Oligomer-Finder (Fig. 4b and Supplementary Fig. S8, details in the Methods section). Additional databases of potential oligomeric EGs in the environment (OEGDB-env) 24 and biology (OEGDB-bio) 25, 26 were established through a thorough literature review and analysis of additional experimental MS2 spectra for Degree 2 seed oligomer candidates (Fig. 4b and Methods). These included 15 and 30 pieces of EG information, respectively, providing the potential for the further identification of additional modified end structures. OEGDB facilitated the discovery of OLAnAc,H and its structure was confirmed through the fragment ion annotation of OLA4Ac,H in Fig. 4e. In the water-leached sample of PLA MPs, 29 OLA congeners encompassing six distinct types of end structures were screened, each displaying a special peak intensity and RT (Fig. 4c and Supplementary Table S2). The MS2 spectra of OLA4Ac,H confirmed the presence of RU and the structure of EG.
To further evaluate the effect of end-group modifications on the biological effects of these oligomers, we predicted their ADMET properties using ADMETlab 2.0 software, based on congener structure identification by MS/MS27. A significant difference in toxicity to Daphnia magna (D. magna) was observed between OLAnH,OH and OLAnα,ω, indicating that the end structures of the oligomers played a crucial role in determining toxicity. Specifically, congeners demonstrated a 48 h Daphnia magna 50 percent lethal concentration (LC50 D. magna) of 1 g/L or lower, which was significantly lower than that of homologues with concentrations higher than 2 g/L (Fig. 4e). For highly associated pathways, both the DPs and end structures of the oligomers were crucial in determining the probability of activating androgen receptor (AR) (Supplementary Fig. S9a). For all congeners (n = 3–6), the probability of activating nuclear peroxisome proliferator-activated receptor gamma (PPAR-γ) exceeded 50% (Supplementary Fig. S9b). Given the widespread use of PLA as a multifilm and compostable bag in agricultural fields, it has the potential to enter rivers after fragmentation. The degradation products may pose chemical disturbances to the ecosystem, exhibiting toxicity to D. magna and serving both as a predator and prey. Furthermore, it has been reported that oligomers with different end groups (OH or Ac group) exhibit varying levels of toxicity, particularly higher than that of PCL MPs28. End-capped oligomers are likely to be more significant than non-modified oligomers, suggesting higher toxicity for congeners than for homologues.
Microplastic breakdown products in environmental samples.
In theory, during the degradation process, polymers inevitably pass through a range of oligomer molecular weights before reaching monomers and complete mineralization1. The total amount of oligomers would far exceed that of the main additives that have garnered significant attention (ranging from 0.05 to 70 µg/L)29. In this study, we used the proposed Oligomer-Finder approach to screen oligomers in two sample types: MP water leachate and original landfill leachate. Five types of MPs, PLA, PHB, PCL, poly (butylene succinate) (PBS), and nylon6-poly (caprolactam) (PA6), were used in water leaching experiments. We selected a leaching time of 7 d to gain insights into the environmental consequences associated with the oligomer release of MPs into water. Details of sample preparation can be found in the Methods and Supplementary. A holistic workflow for the non-targeted analysis of oligomers is depicted in Supplementary Figure S10a.
The results from the "Seed oligomer-Finder" for the analysis of leachate revealed a rapid surge in the number of molecules with RUs within the first day, followed by a gradual decrease in the detection numbers over the subsequent days (Fig. 5b). With the aid of PODB and OEGDB, a diverse array of oligomers originating from five polymer types, various DPs, and EGs were successfully discovered. By exclusively relying on Degree 1 seed oligomers for screening, the analysis of homologues and congeners significantly broadened the spectrum of detectable oligomers (from 4 to 176) (Fig. 5c). The number of oligomers released from the five polymers rapidly increased in water within a day, followed by a gradual transformation into monomers or aggregation into minuscule particles through degradation out of the detection range. All MP inputs were detected on the third day of sampling, indicating that the ability of MPs to release oligomers depends on their type and morphology.
The landfill leachate revealed various types of seed oligomer candidates with different degrees of confidence, and the two candidates with Degree 1 were from PET (Fig. 5d, Supplementary Fig. S11). PET MPs were detected at an abundance of 1–2 items/g in the leachate30. Further oligomer analysis led to the additional identification of OETnH,OH (n = 2–5) (shown in Fig. 5e and Supplementary Table S3). OET is considered a NIAS and has been previously demonstrated to migrate from plastic materials into food32, 36. The in silico assessment raised concerns regarding the genotoxicity of OET and their hydrolysis products31. Given the extensive utilization of PET in beverage bottles and other food packaging materials, its oligomers could potentially be considered new pollutants.
In vitro degradation of degradable MPs to release oligomers with cysteine-modified end structures.
Modification of the end structures of the screened congeners revealed the involvement of nucleophilic chemicals in oligomer formation. Functional amino acids that play critical roles in catalysis and regulation display elevated nucleophilicity and can be selectively targeted for covalent modifications by reactive electrophiles32. When exposed to MPs in a biological environment, oligomers may conjugate with endogenous small molecules during in vivo release.
The utilization of OEGDB-bio (with Cys included for its sulfhydryl group is nucleophilic) in "Congener-Finder" mode of Oligomer-Finder proves beneficial for the discovery of oligomers modified by metabolites. In this study, incubation with simulated small intestine medium showed that four types of degradable polyesters (PLA, PHB, PCL, and PBS) could conjugate with Cys through end structure modification, releasing a series of oligomers (OLAnH,Cys, OHBnH,Cys, OCLnH,Cys and OBSnH,Cys, as illustrated in Fig. 6a and detailed in Supplementary Table S4. The MS2 spectrum of OLA6H,Cys shown in Fig. 6b contains the fragment \({y}_{4}^{-}\) conjectured as [OLA4H,Cys -H]−. Fewer OCL conjugates were detected, suggesting that the PCL MPs exhibited either no reactivity or low reactivity with Cys.
OLA covalently bind to cysteine residues of human proteins.
To determine whether oligomers can modify the cysteine residues of proteins and which proteins may be affected, proteins in liver-derived HepG2 cells exposed to OLA were profiled using an LC-MS-based shotgun proteome approach (Fig. 6c, details in Methods). HepG2 cells were selected because the liver is a major organ for the metabolism of exogenous substances, and our previous study found that OLA and their nanoparticles bioaccumulated in the liver15. Three possible OLAnH,OH (n = 2–4) modification patterns identified from the above results were added to the cysteine (C) residues in Proteome Discoverer 3.1 as variable modifications. The variable modification settings are shown in Supplementary Table S5. Twenty proteins were identified by oligomer modifications. These were assessed using experimental group-specific criteria, with consideration given to the presence of rNL in the spectra (Supplementary Fig. S13 and Supplementary Table S6). A network consisting of 14 OLA-modified proteins was established in the protein–protein interaction network within STRING33 (Fig. 6d). Taking VDAC2 as an example, the modified peptide 1MATHGQTCARPMCIPPSYADLGK23 produced a molecular ion of [M + 2H]3+ at m/z 826.3871, which increased by 144 Da. The fragment ion of \({b}_{12}^{2+}\) was also observed to increase by 144 Da, suggesting that the cysteine at position 8 (Cys8) should be modified by OLA2H,OH. Moreover, the other fragmentation ions of \({y}_{6}^{+}\) and \({y}_{1}^{+}\)~ \({y}_{9}^{+}\)had the same mass as the original peptide, indicating that the peptide belonged to MATHGQTCARPMCIPPSYADLGK (Supplementary Fig. S12). In addition, the fragment ion of \({y}_{16}^{+}\) and \({y}_{16}^{2+}\) was observed to increase by 72 Da, which suggested that the peptide modified by OLA2H,OH would also exhibit the rNL of OLA ([LA] in Fig. 6e). This result further confirms that oligomers can covalently modify the Cys residues of HUMAN_VADC2 and that RU can provide additional information for the quick screening of modified peptides. VADC2 forms a voltage-dependent anion channel through the mitochondrial outer membrane that allows the diffusion of small hydrophilic molecules34. OLA was reported for the first time, providing more possible modification patterns for subsequent identification of the modified protein. Protein adduct formation might give rise to toxicity by disrupting protein structures and/or functions or by provoking an immune response and hypersensitivity, followed by a series of downstream effects25.