Age and MYCN amplification are independent prognostic factors in pediatric neuroblastoma.
Previously, we had shown that neuroblastoma patients with MYCN amplification were associated with adverse prognosis and revealed the prognostic significance of MYCN target genes using TARGET dataset [7]. In the present study, we further tested the prognostic effects of age at initial neuroblastoma diagnosis in TARGET dataset.
According to the TARGET clinical data, the mean age at initial neuroblastoma diagnosis was 3.2 years old. However, the age of neuroblastoma patients were varied significantly. 11.8% (72 out of 608) neuroblastoma patients were younger than one year old, while, 3.3% (20 out of 608) patients were older than eight years old. Previous reports suggested that age was a prognostic factor in neuroblastoma [9]. Consistently, we found that old neuroblastoma patients had worse prognosis than young pediatric neuroblastoma patients in TARGET dataset (Fig. 1a).
The prognostic effects of age in pediatric neuroblastoma were further confirmed using GSE49710 and GSE85047 datasets. The mean age at initial neuroblastoma diagnosis was 2.08 years old in GSE49710 dataset and 2.19 years old in GSE85047 dataset. Similar to the results derived from TARGET dataset, we found that old pediatric neuroblastoma patients had adverse clinical outcomes in both GSE49710 and GSE85047 datasets (Fig. 1a).
MYCN amplification was also associated with the clinical outcomes of pediatric neuroblastoma [5]. So, next, we determined the relationships of age and MYCN amplification in neuroblastoma. We found that there was no significant difference of the mean age in pediatric neuroblastoma patients with or without MYCN amplification in TARGET and GSE85047 datasets (Fig. 1b). Furthermore, the multivariate cox regression assay suggested that age and MYCN amplification were independent prognostic factors in pediatric neuroblastoma in TARGET, GSE49710 and GSE85047 datasets (Fig. 1c).
MYCN non-amplified pediatric neuroblastoma is a heterogeneous disease and MYCN non-amplified young neuroblastoma patients have better prognosis.
Next, we determined the combination of age and MYCN amplification in the predication of the clinical outcomes of pediatric neuroblastoma. Pediatric neuroblastoma patients in TARGET, GSE49710 and GSE85047 datasets were divided into four sub-groups based on the mean age and MYCN amplification status. The Kaplan-Meier plots showed that MYCN amplified old patients and MYCN amplified young patients had not different clinical outcomes (Fig. 2a). However, MYCN non-amplified pediatric neuroblastoma was divided into two different sub-groups. MYCN non-amplified young neuroblastoma patients had significant favorable prognosis than MYCN non-amplified old neuroblastoma patients in TARGET, GSE49710 and GSE85047 datasets (Fig. 2a). Those results suggested that MYCN non-amplified pediatric neuroblastoma was heterogeneous, and could be divided into old and young pediatric neuroblastoma sub-groups.
Moreover, we showed that all the MYCN amplified pediatric neuroblastoma patients and MYCN non-amplified old pediatric neuroblastoma patients were with histological unfavorable outcomes (Fig. 2b). However, only 55% MYCN non-amplified young pediatric neuroblastoma patients were with histological unfavorable outcomes in TARGET dataset (Fig. 2b). Similarly, 25% MYCN non-amplified young pediatric neuroblastoma patients were in low risk sub-group, while, only one MYCN non-amplified old pediatric neuroblastoma patients were in low risk sub-group in TARGET dataset (Fig. 2b). Those results further confirmed the existing of different sub-groups in MYCN non-amplified pediatric neuroblastoma patients.
Three sub-consensuses of MYCN non-amplified neuroblastoma patients are with different clinical overall survival.
Non-negative matrix factorization (NMF) sub-consensus is a robust cancer classification system based globe gene expression levels [15, 16]. To further address the inner sub-groups of MYCN non-amplified pediatric neuroblastoma patients, we divided the MYCN non-amplified neuroblastoma patients in TARGET dataset into two sub-consensuses (Fig. 3a) or three sub-consensuses (Fig. 3b) using NMF classification. We then tested the clinical overall survival of different sub-consensuses. When the MYCN non-amplified neuroblastoma patients were divided into two sub-consensuses, we found that there was no significant difference in the overall survival between sub-consensus 1 and sub-consensus 2 (Fig. 3a). When divided the MYCN non-amplified neuroblastoma patients into three sub-consensuses, we found that, compared with sub-consensus 1 or sub-consensus 2, MYCN non-amplified neuroblastoma patients in sub-consensus 3 were with significant low overall survival in TAEGET dataset (Fig. 3b).
The three sub-consensuses classification of MYCN non-amplified neuroblastoma was further validated in GSE49710 and GSE85047 datasets. Similarly, MYCN non-amplified neuroblastoma patients in GSE49710 and GSE85047 datasets were divided into three sub-consensuses using NMF (Fig. 3c and 3d). Although not significantly, MYCN non-amplified neuroblastoma patients in sub-consensus 3 were with low overall survival in GSE49710 dataset (P = 0.09) (Fig. 3c). In GSE85047 dataset, MYCN non-amplified neuroblastoma patients in sub-consensus 1 were with favorable overall survival (Fig. 3d). All those results suggested there were three sub-consensuses in MYCN non-amplified neuroblastoma patients.
MYCN amplified pediatric neuroblastoma is a relatively homogeneous disease.
Previously, we had shown that age alone could not divide the MYCN amplified pediatric neuroblastoma patients into two different sub-groups with different clinical outcomes (Fig. 2a and 2b). Further, we studied the inner sub-groups of MYCN amplified pediatric neuroblastoma patients using NMF assay. MYCN amplified neuroblastoma patients in TARGET dataset were divided into two sub-consensuses (Fig. 4a). We found that, the overall survival between sub-consensus 1 and sub-consensus 2 was no significantly different (Fig. 4a). Even, in the three sub-consensuses classification, the overall survival of each sub-consensus was also not significantly different (Fig. 4b).
Next, we tried to determine whether there were some genes could divide the MYCN non-amplified or MYCN amplified pediatric neuroblastoma into two different sub-groups with different overall survival. To achieve this study, all the prognostic genes in MYCN non-amplified neuroblastoma patients in each TARGET, GSE49710 or GSE85047 dataset were identified, respectively. We found that 226 genes shared common prognostic significance in MYCN non-amplified neuroblastoma patients in TARGET, GSE49710 and GSE85047 datasets (Fig. 4c). On the contrary, there was only one gene CSNK1G2 had prognostic significance in MYCN amplified neuroblastoma patients in TARGET, GSE49710 and GSE85047 datasets (Fig. 4d). Those results further highlighted that MYCN non-amplified pediatric neuroblastoma was a heterogeneous disease, while, MYCN amplified pediatric neuroblastoma was a relatively homogeneous disease.
Using Kaplan-Meier plots, we demonstrated the prognostic effects of CSNK1G2 in MYCN amplified pediatric neuroblastoma. We found that MYCN amplified neuroblastoma patients with high expression levels of CSNK1G2 were with low overall survival in TARGET, GSE49710 and GSE85047 datasets (Fig. 4e).
Identification of the transcriptional profiling in MYCN non-amplified young neuroblastoma patients
Previously, we had shown that MYCN non-amplified pediatric neuroblastoma was included different sub-groups and MYCN non-amplified young neuroblastoma patients had better prognosis (Fig. 2a). Next, we tried to identify the differentially expressed genes in MYCN non-amplified young neuroblastoma patients in TARGET dataset. Compared with MYCN amplified neuroblastoma patients and MYCN non-amplified old neuroblastoma patients, 64 genes were highly expressed in MYCN non-amplified young neuroblastoma patients in TARGET dataset (Fig. 5a). However, there were only 12 genes were down-regulated in MYCN non-amplified young neuroblastoma patients (Fig. 5a). Those differentially expressed genes were involved in the regulation of neuron projection, regulation of GTPase activity and regulation of cell-cell adhesion (Fig. 5b).
The differentially expressed genes in MYCN non-amplified young neuroblastoma patients in GSE49710 and GSE85047 datasets were also identified. There were 2952 and 612 genes were differentially expressed in MYCN non-amplified young neuroblastoma patients GSE49710 and GSE85047 datasets, respectively (Fig. 5a). Among them, ALCAM, BTBD9, CACNA2D3, DST, EPB41L4A, FGD6, GMEB1, IGSF3 and KIFIB were commonly changed in MYCN non-amplified young neuroblastoma patients in TARGET, GSE49710 and GSE85047 datasets (Fig. 5c). Using Heatmaps, we showed that those genes were all highly expressed in MYCN non-amplified young neuroblastoma patients (Fig. 5d).
High expression levels of ALCAM, CACNA2D3, DST, EPB41L4A or KIFIB are associated with the favorable prognosis of MYCN non-amplified neuroblastoma patients.
Next, we determined the prognostic effects of ALCAM, BTBD9, CACNA2D3, DST, EPB41L4A, FGD6, GMEB1, IGSF3 and KIFIB in MYCN non-amplified neuroblastoma patients. Previously, we had identified 226 prognostic genes in MYCN non-amplified neuroblastoma patients in TARGET, GSE49710 and GSE85047 datasets (Fig. 4c). ALCAM, CACNA2D3, DST, EPB41L4A and KIFIB were among those 226 prognostic genes and were associated with the good prognosis in MYCN non-amplified neuroblastoma patients. MYCN non-amplified neuroblastoma patients with low expression levels of ALCAM, CACNA2D3, DST, EPB41L4A or KIFIB were with low overall survival in TARGET dataset (Fig. 6a), GSE49710dataset (Fig. 6b) and GSE85047 dataset (Fig. 6c).
Interestingly, high expression levels of ALCAM, CACNA2D3, DST, EPB41L4A or KIFIB were not only associated with the prognosis of MYCN non-amplified neuroblastoma patients, but also were associated with the prognosis of all neuroblastoma patients. Neuroblastoma patients with high expression levels of ALCAM, CACNA2D3, DST, EPB41L4A or KIFIB had favorable clinical overall survival in TARGET dataset (Fig. 7a), GSE49710dataset (Fig. 7b) and GSE85047 dataset (Fig. 7c).
CACNA2D3, DST, EPB41L4A and KIFIB are down-regulated in the sub-consensus 3 of MYCN non-amplified neuroblastoma patients.
Previously, we had shown that there were three sub-consensuses of MYCN non-amplified neuroblastoma patients with different clinical outcomes (Fig. 3). Next, we tested the expression levels of ALCAM, CACNA2D3, DST, EPB41L4A and KIFIB in the three different sub-consensuses in TARGET, GSE49710 and GSE85047 datasets. The expression levels of ALCAM, CACNA2D3, EPB41L4A and KIFIB were not different in the three sub-consensuses of MYCN non-amplified neuroblastoma patients in TARGET dataset (Fig. 8a). Only, compared with sub-consensus 1, DST was lowly expressed in sub-consensus 3 of MYCN non-amplified neuroblastoma patients (Fig. 8a).
Moreover, DST was also lowly expressed in sub-consensus 3 of MYCN non-amplified neuroblastoma patients in GSE49710 (Fig. 8b) and GSE85047 (Fig. 8c) datasets, compared with MYCN non-amplified neuroblastoma patients in sub-consensus 1 or sub-consensus 2. Furthermore, the relative expression levels of CACNA2D3, EPB41L4A and KIFIB were lower in sub-consensus 3 in GSE49710 dataset, compared with MYCN non-amplified neuroblastoma patients in sub-consensus 1 (Fig. 8b). Also, compared with sub-consensus 2, CACNA2D3, EPB41L4A and KIFIB were lowly expressed in MYCN non-amplified neuroblastoma patients in sub-consensus 3 in GSE85047 dataset (Fig. 8c).
Expression level of DST is an independent prognostic factor in MYCN non-amplified pediatric neuroblastoma.
Next, we tried to determine the associations of ALCAM, CACNA2D3, DST, EPB41L4A and KIFIB in MYCN non-amplified neuroblastoma patients. First, based on their expression levels, we found those genes were highly correlated with each other, as demonstrated the high correlation coefficients of those genes in TARGET, GSE49710 and GSE85047 datasets (Fig. 9a).
Second, using multivariate cox regression, we determined the correlation of age, ALCAM, CACNA2D3, DST, EPB41L4A or KIFIB expressions in the prediction of the clinical overall survival of MYCN non-amplified neuroblastoma patients. In TARGET, GSE49710 and GSE85047 datasets, we found that the expression level of DST was an independent prognostic marker (Fig. 9b). Those results suggested that, although, ALCAM, CACNA2D3, DST, EPB41L4A and KIFIB shared similar expression signature, the prognostic significance of DST was different.
Young patients with high DST expression level have the best prognosis in MYCN non-amplified pediatric neuroblastoma.
Since age associated gene DST was an independent prognostic factor in MYCN non-amplified pediatric neuroblastoma, we wondered if the combination of DST, age and MYCN could achieve best prognostic significance. To test this hypothesis, MYCN non-amplified pediatric neuroblastoma patients in TARGET datasets were divided into old patients with high DST expression, old patients with low DST expression, young patients with high DST expression and young patients with low DST expression four sub-groups. We found that MYCN non-amplified young patients with high DST expression had the better prognosis than other three sub-groups (Fig. 10a). Similarly, in GSE49710 and GSE85047 datasets, MYCN non-amplified young patients with high DST expression also had the best prognosis, while, MYCN non-amplified old patients with low DST expression had the worst prognosis (Fig. 10a).
Furthermore, the contingency graphs showed that among the MYCN non-amplified young pediatric neuroblastoma patients with high DST expression, 73% or 62% patients were in favorable or low risk sub-group, receptively. Contrast with the 16% or 0% patients were in favorable or low risk sub-group in MYCN non-amplified young pediatric neuroblastoma patients with low DST expression (Fig. 10b). Combined all those results suggested that MYCN non-amplified young neuroblastoma patients with high DST expression levels had the best clinical overall survival.