A number of 694 DEGs and 6,099 key module genes were obtained
Totally 694 DEGs between PE and controls were acquired by differential expression analysis of the GSE190971 dataset, with 447 over-expressed and 247 down-regulated DEGs in the PE group (Fig. 1A-B). The AMS-RGS score significantly differed between PE and control groups, with PE patients exhibiting a higher MAMs-RGs score (p = 0.0235) (Fig. 1C). MAMs is involved in cellular processes such as Ca2+ signaling, inflammation, lipid metabolism, ER stress, autophagy, and apoptosis12. So we inferred that higher MAMs-RGs scores might reflect multiple pathological changes such as cellular stress, oxidative stress, metabolic disorders, and inflammatory responses in patients with PE. Clustering analysis manifested that there were no obvious outliers in the GSE190971 dataset, as the sample clustering dendrograms presented in Fig. 1D, included all samples within the clusters. The best β of 18 was selected when the R2 was 0.8 and mean connectivity neared zero (Fig. 1E). Then, five co-expressed modules were identified by merging genes into different modules (Fig. 1F). Of note, the MEyellow (cor = -0.450, p < 0.01), MEblue (cor = -0.80, p < 0.001) and MEturquoise (cor = -0.44, p < 0.01) modules were deemed as key modules as a result of their strong correlations with MAMs-RGs score (Fig. 1G). The genes in the three key modules were then combined, generating 6,099 key module genes.
Totally 43 candidate genes might be involved in transcriptional regulation, energy metabolism, and nutrient response
By intersecting 694 DEGs with 6,099 key module genes, 43 candidate genes were identified (Fig. 2A). Subsequent GO analysis revealed that these candidate genes were involved in 87 BPs, 14 CCs and 24 MFs. The top 5 items were shown in the bar chart. Obviously, BPs in which candidate genes were involved included regulation of GTPase activity, the cell growth and muscle tissue development. CC entries such as cell cortex, phosphatase complex and actin filament were also enriched by candidategenes. Additionally, candidate genes might also be involved in actin binding, transcription corepressor activity, GTPase activator activity and other MFs (Fig. 2B). Furthermore, in sum of 80 functional entries enriched by candidate genes were found by Metascape database. The pathway-pathway interaction network showed complex interactions between metabolic pathways, e.g. fatty acid metabolism, tryptophan metabolism and response to nutrient levels (Fig. 2C). In short, candidate genes might be involved in several important biological functions and pathways, including transcriptional regulation, regulation of cytoskeletal dynamics, energy metabolism, and nutrient response, which were essential for maintaining normal cellular function and physiological status. The PPI network containing 31 proteins encoded by candidate genes after removing the discrete protein was constructed. These proteins interacted with each other, forming 39 interaction pairs. Obviously, PDPR interacted more closely with other proteins, such as ACSS1, APMPA and FOXJ2. In addition, interaction pairs encompassed CAST-NEB and PAWR-TCHP (Fig. 2D).
ABCD3, CAST and PAWR were considered as latent diagnostic biomarkers for PE
Two machine learning (ML) algorithms were executed to screen for candidate biomarkers from candidate genes. These 43 candidate genes were downscaled by the LASSO regression model, and a total of 12 LASSO-featured genes were selected when the minimum lambda value was 0.009234441 (Fig. 3A). The SVM-RFE algorithm singled out 15 SVM-RFE-featured genes, at which time the model was most accurate (Fig. 3B). The same genes in the two models were selected, obtaining five candidate biomarkers (ABCD3, CAST, IFRD1, TXNRD1, and PAWR) (Fig. 3C). Gene expression analysis found that ABCD3, CAST and PAWR were down-regulated in PE, and their expression trends were consistent in GSE190971 and GSE24129 datasets (p < 0.01) (Fig. 3D). These three genes were considered as latent diagnostic biomarkers for PE due to their property of distinguishing PE and control samples. A receiver operating characteristic(ROC) analysis was further proceeded to appraise the diagnostic performance of biomarkers. The AUCs of ABCD3, CAST, and PAWR were 0.957, 0.973, and 0.884 in the GSE190971 dataset, singly. And they had an AUC of more than 0.9 in the GSE24129 datasets. This highlighted that all three biomarkers had the ability to precisely diagnose both PE and control samples (Fig. 3E-F).
Biomarkers might synergistically regulate cell metabolism
Gene set enrichment analysis (GSEA) was applied in this study to reveal the functions of biomarkers. In the first 10 pathways in which each biomarker was involved, biomarkers were co-enriched in spliceosome, proteasome and ubiquitin-mediated proteolysis. And biomarkers were positively correlated with these pathways (Fig. 4A-C). As demonstrated in the Sankey diagram, among the total pathways enriched by biomarkers, the top 5 identical pathways embraced endocytosis, ubiquitin mediated proteolysis, spliceosome, RNA degradation and proteasome (Fig. 4D). Taken together, the co-enrichment of biomarker pathways suggested that they might hold a synergistic part in the regulation of cellular metabolism, protein explication and gene expression regulation. A query of the GeneMANIA database revealed 20 genes maintaining similar biological functions to biomarkers. They were displayed in the GGI network. The majority of these genes exhibited a strong tendency for physical interactions with three biomarkers. Intriguingly, ABCD3, along with PEX19, ABCD1, ABCD2, and PEX3, might perform analogous functions, particularly in peroxisomal membrane transport and related cellular processes (Fig. 4E).
Biomarkers were negatively related to activated B cells, activated CD4 T cells and activated DCs
The immunoinfiltrating landscape of cells in PE was characterized. In the GSE190971 dataset, although activated B cells and activated CD4 T cells had negative ssGSEA scores in the samples, they had relatively high scores in PE (p < 0.05). Activated dendritic cells (DCs) scored significantly higher than the scores of activated B cells and activated CD4 T cells. Moreover, the score of activated DCs in the PE group was also markedly higher compared to the control samples (p < 0.001). This implied that B-cell activation, CD4 + T-cell activation as wellas DCs activation were significantly increased in PE (Fig. 5A-B). Activated DCs were positively correlated with activated B cells using Spearman's correlation analysis. (cor = 0.420, p < 0.05) and activated CD4 T cells (cor = 0.400, p < 0.05) (Fig. 5C). There were clear negative correlations between biomarkers and differentially infiltrated immune cells. Activated DCs showed the most negative correlations with ABCD3 (cor = -0.505, p < 0.001), CAST (cor = -0.636, p < 0.001) and PAWR (cor =-0.472, p < 0.01) (Fig. 5D).
Abrine, temozolomide and acetaminophen were co-acting compounds of biomarkers
A large number of compounds were screened using the CTD to identify potential drugs acting with biomarkers for the treatment of PE. A number of 149, 129 and 144 drugs interacting with ABCD3, CAST and PAWR, respectively were predicted in CTD. The drug-biomarker network displayed that a total of 34 drugs comprising abrine, temozolomide and acetaminophen were co-acting compounds of biomarkers (Fig. 5E).
Characterization of protein localization, structure, and disease relevance of biomarkers
Subcellular localisation unveiled that the proteins corresponding to ABCD3 and PAWR were predominantly found in the nucleus, whereas the proteins of CAST were more active in the cytoplasm (Fig. 6A). The human protein structures corresponding to biomarkers were further predicted. The structure of the human protein corresponding to ABCD3 was relatively complex compared to that of CAST and PAWR, due to its diverse disc region folding (Fig. 6B). Besides, the relevance of biomarkers with other pregnancy-related diseases was explored. Prenatal exposure delayed effects exhibited the highest interaction scores with each biomarker among the top five diseases concurrently associated with three biomarkers, whereas fetal death showed the lowest interaction scores with each biomarker (Fig. 6C).
Biomarker expression was regulated by multiple miRNAs and TFs
A total of 11, 30, and eight miRNAs acting with ABCD3, CAST and PAWR were identified in the miRWalk website. When clipExpNum was greater than 20, 11 and 3 lncRNAs targeting CAST and PAWR were obtained, severally. The lncRNA-miRNA-mRNA showing the interaction pairs formed by biomarkers, miRNAs and lncRNAs was synthesized. For instance, regulation of CAST and PAWR by hsa-miR-455-5p and hsa-miR-625-3p, respectively, might be simultaneously affected by MALAT1 (Fig. 6D). Using the NetworkAnalyst database, 21, 9 and 20 TFs that might regulate the level of ABCD3, CAST and PAWR were predicted, respectively. As demonstrated in the mRNA-TF network, ZNF610 might regulate the expression level of ABCD3 and CAST, and expressed content of ABCD3 and PAWR might be affected by FOXA3, CEBPG, MXD4 and ZNF76 (Fig. 6E).
Clinical validation of biomarkers in PE placenta
The placentas of the patients were tested for the expression of distinctive genes using real-time reverse transcriptase-polymerase chain reaction (RT-qPCR)The relative mRNA expression of of the biomarkers were down-regulation in PE patients than in control patients for PAWR and CAST (P = 0.02; P = 0.04).However,ABCD3 was up-regulation (P = 0.007) (Fig. 7A-C).