Initial identification of tDEGs, tDEMs and tDEMTGs in pancreatic cancer
To find key genes which expressed differentially between PC patients and healthy people,we decided to filter them out in two sides: mRNA and miRNA targeted method. Firstly, we selected three expression array profiling datasets (GSE62165, GSE15471 and GSE32676) and two non-coding RNA array profiling datasets (GSE24279 and GSE32678) from GEO database. As for mRNA, 1398, 607 and 1157 DEGs were extracted from GSE62165, GSE15471 and GSE32676. Among them, there were 997, 562, 669 (103 shared genes) up-regulated and 401, 45, 488 (11 shared genes) down-regulated DEGs were identified. Taking advantage of Venn analysis in up and down regulated DEGs respectively, we captured 114 total shared DEGs, which we termed as tDEGs in this study, where 103 were high-expression and 11 were low-expression genes (Fig. 1A&B). As for miRNA, the same operations were also performed. 417 and 569 DEMs were extracted from GSE24279 and GSE32678. Two miRNA datasets totally shared 66 DEMs, which we termed as tDEMs, including 24 were up-regulated and 42 were down-regulated miRNAs (Fig. 1C&D).
The target genes of tDEMs were predicted by miRTarBase, miRWalk and Diana Tools databases. And the 114 consistent expression genes, which termed as tDEMTGs, were found by Venn analysis (Fig. 1E).
GO and pathway enrichment analysis
The GO term and KEGG pathway analysis were performed via DAVID. Firstly, the results of GO term enrichment analysis varied a lot between tDEGs and tDEMTGs (Fig. 2A&B). Biological process (BP) analysis of GO showed tDEGs were significantly enriched in extracellular matrix associated part, such as cell adhesion, extracellular matrix organization and disassembly, collagen catabolism and organization. As for tDEMTGs, top 2 clusters of genes were enriched in the regulation of transcription and proliferation, which showed a similar result in tDEGs, but other genes were mainly responsible for the regulation of cell death associated part. For GO cell component (CC) analysis, the tDEGs were significantly enriched in extracellular part, such as extracellular region, extracellular exosome, extracellular space, extracellular matrix and proteinaceous extracellular matrix. However, the results of tDEMTGs were focused on plasma membrane part, cytosol, organelle lumen, membrane-enclosed lumen and intracellular organelle lumen. The same differences also showed in molecular function (MF) analysis, tDEGs were mainly enriched in protein binding, but tDEMTGs were enriched in nucleotide binding, transcription regulator activity, ribonucleotide binding, purine ribonucleotide binding and purine nucleotide binding.
Additionally, the differences between tDEGs and tDEMTGs were also confirmed KEGG pathway analysis. The result disclosed that tDEGs were involved in PI3K-Akt signaling pathway, focal adhesion, ECM-receptor interaction, amoebiasis and pathways in cancer (Fig. 2C). However, tDEMTGs were significantly enriched in pathways in cancer and partial enriched in pancreatic cancer (Fig. 2D).
Identification of key genes and miRNAs
To identify potential key genes, tDEMTGs were aligned with tDEGs to obtain the intersection part, which we termed as key genesfor further analysis (Fig. 1F). Importantly, the result showed runt related transcription factor 2 (RUNX2), laminin subunit gamma-2 (LAMC2) and F-box protein 32 (FBXO32) were commonly shared and listed in Table 1. Subsequently, we screened candidate miRNAs involved in the regulation of key genes from 66 tDEMs, and found 16 up-regulated and 19 down-regulated key miRNAs (Table 2).
Table 1
The differential expression of key genes in different GEO datasets.
Gene | Datasets | ID | LogFC | P.Val | Expression |
RUNX2 | GSE62165 | 11725180_a_at | 3.7 | 3.10E-29 | UP |
GSE15471 | 232231_at | 2.93 | 9.52E-11 | UP |
GSE32688(mRNA) | 232231_at | 1.69 | 0.1046045 | UP |
LAMC2 | GSE62165 | 11737325_a_at | 3.49 | 4.80E-25 | UP |
11743514_a_at | 4.09 | 7.56E-26 | UP |
11743516_s_at | 4.29 | 1.49E-26 | UP |
11743515_s_at | 4.35 | 1.06E-28 | UP |
11747923_s_at | 4.37 | 7.93E-31 | UP |
GSE15471 | 207517_at | 1.60 | 3.83E-09 | UP |
202267_at | 2.76 | 1.76E-13 | UP |
GSE32688(mRNA) | 202267_at | 4.93 | 0.0002728 | UP |
207517_at | 2.25 | 0.0038994 | UP |
FBXO32 | GSE62165 | 11719394_a_at | 2.88 | 3.83E-22 | UP |
GSE15471 | 225803_at | 1.72 | 9.65E-14 | UP |
241762_at | 1.83 | 2.64E-11 | UP |
225328_at | 1.91 | 5.28E-12 | UP |
GSE32688(mRNA) | 241763_s_at | 2.44 | 0.0095169 | UP |
241762_at | 2.32 | 0.0215962 | UP |
225803_at | 1.88 | 0.0176426 | UP |
225345_s_at | 1.56 | 0.0702839 | UP |
Table 2
The differential expression of candidate miRNAs in different GEO datasets.
Target gene | miRNA | Datasets | LogFC | Expression |
RUNX2 | hsa-miR-519c-3p | GSE32678 | 0.531577 | UP |
| GSE24279 | 3.2867924 | |
hsa-miR-876-3p | GSE32678 | 0.755198 | UP |
| GSE24279 | 9.9730133 | |
hsa-miR-1265 | GSE32678 | 0.883218 | UP |
| GSE24279 | 0.6795887 | |
hsa-miR-92a-2-5p | GSE32678 | -1.231335 | DOWN |
| GSE24279 | -0.7331099 | |
hsa-miR-193b-5p | GSE32678 | 0.6020578 | DOWN |
| GSE24279 | -1.218209 | |
hsa-miR-517-5p | GSE32678 | -0.606971 | DOWN |
| GSE24279 | -1.2242733 | |
hsa-miR-488-3p | GSE32678 | -1.145003 | DOWN |
| GSE24279 | -1.145003 | |
RUNX2/FBXO32 | hsa-miR-23b-3p | GSE32678 | 1.214636 | UP |
| GSE24279 | 0.7358448 | |
hsa-miR-221-5p | GSE32678 | 1.277623 | UP |
| GSE24279 | 1.8034373 | |
hsa-miR-199a-5p | GSE32678 | 1.240204 | UP |
| GSE24279 | 1.6889355 | |
hsa-miR-376a-5p | GSE32678 | 0.546421 | UP |
| GSE24279 | 2.3186641 | |
hsa-miR-1825 | GSE32678 | 1.225039 | UP |
| GSE24279 | 4.0684489 | |
hsa-miR-1227-3p | GSE32678 | 0.687478 | UP |
| GSE24279 | 3.0619077 | |
hsa-miR-198 | GSE32678 | -1.004687 | DOWN |
| GSE24279 | -0.6024842 | |
hsa-miR-937-3p | GSE32678 | -0.768939 | DOWN |
| GSE24279 | -1.2399531 | |
FBXO32 | hsa-miR-345-5p | GSE32678 | 0.821379 | UP |
| GSE24279 | 7.9474776 | |
hsa-miR-1233-3p | GSE32678 | 0.528843 | UP |
| GSE24279 | 1.0034567 | |
hsa-miR-1289 | GSE32678 | 0.583983 | UP |
| GSE24279 | 0.5373088 | |
hsa-miR-621 | GSE32678 | 0.962951 | UP |
| GSE24279 | 0.9769538 | |
hsa-miR-1248 | GSE32678 | 1.377747 | UP |
| GSE24279 | 1.0123159 | |
hsa-miR-30b-3p | GSE32678 | -0.705179 | DOWN |
| GSE24279 | -0.7012921 | |
hsa-miR-373-5p | GSE32678 | -0.943119 | DOWN |
| GSE24279 | -1.5962925 | |
hsa-miR-492 | GSE32678 | -0.665303 | DOWN |
| GSE24279 | -1.1011978 | |
hsa-miR-510-5p | GSE32678 | -1.069006 | DOWN |
| GSE24279 | -2.499187 | |
hsa-miR-575 | GSE32678 | -0.77259 | DOWN |
| GSE24279 | -0.5620953 | |
hsa-miR-604 | GSE32678 | -0.508662 | DOWN |
| GSE24279 | -1.2474484 | |
hsa-miR-611 | GSE32678 | -1.159661 | DOWN |
| GSE24279 | -1.6342346 | |
hsa-miR-617 | GSE32678 | -0.816815 | DOWN |
| GSE24279 | -1.5017728 | |
hsa-miR-921 | GSE32678 | -0.90506 | DOWN |
| GSE24279 | -1.9104782 | |
hsa-miR-520d-5p | GSE32678 | -0.660262 | DOWN |
| GSE24279 | -0.6523876 | |
FBOX32/LAMC2 | hsa-miR-199b-5p | GSE32678 | 1.544751 | UP |
| GSE24279 | 1.0106128 | |
LAMC2 | hsa-miR-588 | GSE32678 | 0.526075 | UP |
| GSE24279 | 4.3557666 | |
hsa-miR-891a-5p | GSE32678 | -0.713461 | DOWN |
| GSE24279 | -1.3790634 | |
hsa-miR-622 | GSE32678 | -0.961764 | DOWN |
| GSE24279 | -1.1362947 | |
LAMC2/RUNX2 | hsa-miR-628-3p | GSE32678 | -0.64033 | DOWN |
| | GSE24279 | -0.9205919 | |
Diagnostic and prognostic value of candidate miRNAs
To test the clinical applicability of the screening results, we explored the diagnostic significance of candidate miRNAs by ROC curve analysis (Fig. 3A). However, the mean AUC values of them were all lower than 0.85 (Fig. 3B). Then, to further evaluate the prognostic value, survival analysis was applied and the results showed 10 up-regulated key miRNAs indicated a significantly lower OS rate: hsa-miR-519c-3p, hsa-miR-1265, hsa-miR-1825, hsa-miR-1227-3p, hsa-miR-1233-3p, hsa-miR-1289, hsa-miR-621, hsa-miR-1248, hsa-miR-199b-5p and hsa-miR-588 (Fig. 4A). Correspondingly, three down-regulated candidate miRNAs were positively related with OS rate: hsa-miR-488-3p, hsa-miR-30b-3p and hsa-miR-628-3p (Fig. 4B). These data indicated that these 13 candidate miRNAs could be potential for clinical application as prognostic markers.
Diagnostic and prognostic value of key genes
We then compared the transcriptional levels of key genes in multiple cancers with those in normal samples by using TCGA database via GEPIA (Fig. 5A&B&C). The mRNA expression levels of RUNX2, LAMC2 and FBXO32 showed significantly differences between cancer and adjacent non-cancerous tissues especially in pancreatic adenocarcinoma (PAAD, Fig. 5.D). ROC and survival analysis were also performed to ensure the application prospect of key genes. All the three key genes indicated a good discriminating ability,of which the mean AUC values of each key gene in three GEO datasets were all greater than 0.85 (Fig. 5E). Furthermore, the mean AUC values of LAMC2 and FBXO32 both exceeded 0.95 which indicated that these two molecules might be promising biomarkers to PC diagnosis. The results of survival analysis showed only LAMC2 expression level affected OS rate significantly (HR = 3.06, P = 0.00011, Fig. 5F). It suggested that LAMC2 may be an important molecule which participate in the development of PC and also act as a potential prognostic biomarker.
Verification of key genes
In addition to high accuracy and specificity, the diagnostic methods should also be painless and woundless. Blood and saliva are easily obtained samples that we can get with little harm for the body. Thus, we verified the diagnostic possibility of key genes in 4 blood cells GEO datasets (GSE74629, GSE49641, GSE49515 and GSE15932) and 1 saliva cells GEO dataset (GSE14245). Interestingly, we found only RUNX2 had significant reduction compared with normal controls in saliva and blood cells (GSE14245 and GSE49641) which suggested that RUNX2 could be considered as one of promising candidate biomarkers of PC. However, LAMC2 and FBXO32 were not significantly different between the control and PC samples.
Then, to further clarify the expression of RUNX2 and LAMC2 in PC tissue, we studied the IHC experiments of these two genes in HPA (Fig. 6B, Supplementary Fig. 1). First, the result of LAMC2 was consistent with previous analysis. The staining of anti-LAMC2 antibodies were all above medium in PC patients, which were hardly detected in normal pancreas tissues. Although staining quantity were varying (5.5/12 were > 75%, 2.5/12 were 75%-25% and 4/12 were < 25% stained), almost all of their staining intensity were strong. Besides, LAMC2 were visualized in cytoplasmic/membranous while RUNX2 were not corroborating with predicted. Though a slight up-regulation of RUNX2 could be found in some samples (17%), most of them which just like normal samples could not be detected. Consistently, immune blot was performed to detect the level of RUNX2 which showed a similar result with IHC (Fig. 6C). These data indicated that LAMC2 may play a crucial role in the PC therapy.
In a second validation study, we examined RUNX2 and LAMC2 performance in different clinical stage, sex, age, tumor subtype and personal tumor status groups (Fig. 7). We found RUNX2 was only significantly up-regulated in different stage groups. However, LAMC2 were significantly high-expressed not only in advanced stage PC patients, but also in ductal type PC group (P = 0.03) and PC patients (P = 0.02).
According to above evidences, we further found LAMC2 might be the main factor in influencing median survival time of PC patients by combination analysis of RUNX2 and LAMC2 (Fig. 8).