The patient diagnosed as Ph-like ALL was identified with a novel TPR-PDGFRB fusion gene
A 55-year-old woman (patient ID: Case 13) was referred to our hospital due to fatigue in August 2019 (Fig. 1A and Additional file 1: Fig. S1A). Karyotype analysis showed a normal karyotype (Additional file 1: Fig. S1B), and the wright-stained smear presented with 74% blasts at diagnosis (Additional file 1: Fig. S1C). Immunophenotyping revealed the positive expression of the B-lymphoid markers cCD79α, CD19 and CD10 at diagnosis (Additional file 1:Fig. S1D). A multiplex RT-PCR analysis for 43 leukemia-related fusion genes showed a negative result (data not shown). After conventional chemotherapy, the patient reached CR (Fig. 1A and Additional file 3–4: Tables S2-S3) but developed a recurrence of the disease (1st relapse: 200201) while preparing for a hematopoietic stem cell transplantation (Fig. 1A). The leukemia cells were cleared after prolonged induction therapy; however, she developed a second relapse soon after (2nd relapse: 200414).
The rapid relapse combined with negative BCR-ABL1 fusion observed in this patient is, to a great extent, in line with the features of Ph-like ALL. RNA-seq was performed on 13 patients with B-ALL (including Case 13) and 8 healthy donors (Additional file 5: Table S4), and comparison of the expression profiles with healthy donors revealed the enrichment of the BCR-ABL1 fusion signature in the Case 13 (Additional file 1: Fig. S2A). Furthermore, the patient (Case 13) exhibited higher expression similarities with the Ph-like and Ph+ ALLs than with the other ALL subtypes in the correlation analysis (Additional file 1: Fig. S2B). Hierarchical clustering analysis based on the Ph-like ALL signature consisting of 192 genes [2] (Additional file 6: Table S5), showed that this patient (Case 13) blended into the Ph+ cluster in all three cohorts (Additional file 1: Fig. S2C). These results suggest that the expression profile of our patient was similar to that of the patients with Ph+ ALL.
In general, molecular genetic alterations that activate cytokine receptor and kinase signaling account for approximately 80%-90% of Ph-like ALL cases [6, 7]. In the current study, we identified a novel TPR-PDGFRB in-frame fusion gene, which consists of exons 1–46 of the TPR gene with exons 11–23 of the PDGFRB gene (Fig. 1B and Table S6). The PDGFRB rearrangement was confirmed using FISH (Fig. 1C), and the junction sequences of the TPR-PDGFRB fusion transcript was validated using RT-PCR and Sanger sequencing (Fig. 1D). The presumed TPR-PDGFRB fusion protein includes four coiled-coil domains from TPR, one transmembrane domain, and one tyrosine kinase domain from PDGFRB (Fig. 1E). Immunoblot analysis confirmed the presence of chimeric kinase protein TPR-PDGFRB as a TPR- and PDGFRB-immunoreactive band (Fig. 1F). Together, these results illustrate that the patient should be diagnosed with Ph-like ALL.
Previous studies have reported that PDGFRB fusion patients with Ph-like ALL were refractory to conventional therapy but amenable to TKI [8, 33]. Imatinib was given as monotherapy but failed to induce CR within 2 weeks (Fig. 1A). Moreover, the patient exhibited CNS-L with 96.5% blasts in the CSF (Fig. 1A). Subsequently, dasatinib combined chemotherapy was administered, and CR was achieved within 4 months (Fig. 1A). Unfortunately, the disease relapsed after dasatinib discontinuation due to the exorbitant medical costs (3rd relapse: 200911; Fig. 1A). Finally, the patient showed no response to further treatment with ponatinib and died on October 20, 2020 (Fig. 1A).
The single-cell expression atlas and cell types in the Ph-like ALL patient
The transcriptome landscape of Ph-like ALL at single-cell resolution remains unknown. In this study, scRNA-seq was used to profile the gene expression in cells taken from the BM specimens at diagnosis and first relapse (Fig. 2A). After quality control, an anchoring-based integration analysis was conducted using 10,273 cells to account for the technical and biological variances between individual samples [26, 34]. A reduction in the dimensionality was observed using the t-distributed stochastic neighbor embedding (tSNE) projection method implemented in the Seurat toolkit (Fig. 2B).
Unsupervised clustering analysis identified 17 transcriptionally distinct cell clusters (Additional file 1: Fig. S3A), and the heatmap showed the top five cluster specific marker genes (Additional file 1: Fig. S3B and Additional file 8: Table S7). By coupling the cell-type annotation by “scHCL” package and the well-known lineage-specific marker genes, 17 clusters were classified into six hematopoietic cell types including B cells, T cells, natural killer (NK) cells, megakaryocytes-erythroid progenitors (MEPs), classical monocytes and non-classical monocytes (Fig. 2C).
For the tSNE representation, 12 B-cell clusters were characterized by the high expression of CD19, CD79B, CD24, IGHM, and VPREB3 (Fig. 2C-E). Clusters 11 and 17 were assigned to classical and non-classical monocytes based on the high expression of CD14 and FCGR3A (CD16), respectively (Fig. 2C and 2E). Besides the well-known markers, we also observed the specific expression of S100A12 in classical monocytes, and CSF1R, CDKN1C, C1QA and C1QB in non-classical monocytes (Fig. 2D and Additional file 1: Fig. S3C). T cells (cluster 9) markedly expressed CD3D and consisted of both CD4+ and CD8+ T cells (Fig. 2D-E and Additional file 1: Fig. S3D). NK cells (cluster 16) were characterized by the expression of signature genes GNLY and KLRC1 and cytotoxicity-related genes PRF1 and FCER1G (Fig. 2D-E and Additional file 1: Fig. S3D). Cluster 13 was defined as MEPs based on the high expression of HBD, CA1, and APOC1, which harbored the potential for proliferation due to the expression of the cell cycle genes CENPE and TPX2 (Fig. 2D-E and Additional file 1: Fig. S3E). Gene set enrichment analysis (GSEA) across different cell types using the MSigDB hallmark gene sets demonstrated significantly up-regulated enrichment of the gene sets associated with cell cycle progression, ectopic KRAS and TP53 signaling pathway in the B cells (Fig. 2F). Taken together, scRNA-seq captured the panorama of the BM cells from the primary and relapsed Ph-like ALL case and depicted all major cell types and their gene expression states.
Heterogeneous transcriptional profiles of malignant B cells in the Ph-like ALL patient
B-ALL is a B-cell malignancy characterized by the abnormal proliferation and accumulation of B-lymphoid progenitor cells that invade both the peripheral blood and BM. At the single-cell level, the B cells were grouped into 12 clusters, suggesting the transcriptomic heterogeneity of the B cells in Ph-like ALL (Fig. 3A). By comparing the expression profile with normal B cell progenitors using “singleR” method, we classified the 12 B cell clusters into six subsets (Fig. 3A). Based on the expression of marker genes, cluster 1 was assigned to CD20 (MS4A1) high and IGLL1 low PreB cells; the subset of IGKC high and IGLL1 high PreB cells was consisted of clusters 2, 3, and 14 (Fig. 3A-B). Clusters 4 and 6 were defined as IGKC low and IGLL1 high PreB cells, whereas cluster 5 consisted of CD34+ ProB cells (Fig. 3A-B). Because of the specific expression of GPR183 and AIM2, cluster 15 was defined as memory B cells (Fig. 3C).
A previously reported core gene set, which was shown to denote the G0/G1, S, and G2/M phases [29], were used to infer the cell cycle states. Clusters 7, 8, 10, and 12 were characterized by the strong expression of the G2/M and S-phase genes and were grouped into cycling PreB cells, likely corresponding to the highly proliferating malignant B cells (Fig. 3D). Furthermore, cells in clusters 7 and 10 highly expressed the cell cycle progression genes (e.g., PCNA, MCM7, TYMS, GINS2, and PCLAF), demonstrating the enrichment of the S-phase cells (Fig. 3D-E). Representative proliferating genes (such as MIK67, TOP2A, ASPM, and CCNB1) were enriched in clusters 8 and 12, indicating the G2/M phase cells (Fig. 3D-E).
Monocle2 was usually used to order cells based on their expression patterns to represent the distinct cellular fates or biological processes [32]. Here, we performed the pseudotime inference on six B-cell subsets, and revealed the distinct bifurcated architecture of the cell trajectory among different cell subsets, implying a divergence in the transcriptional state (Fig. 3F). According to the pseudotime, the six B cell subsets appeared to start principally from the CD34+ ProB and cycling PreB cells and moved toward two IGLL1 high PreB cell subsets, following to CD20 high or GPR183 high cell subsets (Fig. 3F). These results suggest the remarkable heterogeneity within malignant B cells, which is reflected in their distinct proliferating and differentiation states.
A relapse-specific subpopulation associated with disease progression in the patient with Ph-like ALL
To decipher the relationship between the clusters and disease progression, the tSNE representation was divided on the basis of disease status (Fig. 4A). And the percentage of cells labeled under different disease status within each cluster was calculated (Additional file 1: Fig. S4A). Surprisingly, we found that all non-B cell clusters (cluster 9: T cells; cluster 11: classical monocytes; cluster 13: MEPs; cluster 16: NK cells; cluster 17: non-classical monocytes) were enriched in relapse sample (Additional file 1: Fig. S4A-B). Meanwhile, functional annotation analysis based on the DEGs from bulk RNA-seq showed the significant enrichment of HEMATOPOIETIC_CELL_LINEAGE (Additional file 1: Fig. S4C). The volcano plot showed that most of the well-known non-B cell lineage marker genes were up-regulated in relapse bulk RNA-seq data, and the B-cell marker gene CD20 was significantly down-regulated (Additional file 1: Fig. S4D). Further, we also validated the specific expression of these lineage marker genes in different cell types based on the scRNA-seq data (Additional file 1: Fig. S4E).
For the B cell clusters, we discovered that clusters 4, 6, and 15 were enriched with cells at the onset of the disease (> 80%), along with the low expression of CD19 and high expression of CD79A and CD10 (Fig. 4A-B). GSEA of primary specific B-cell clusters revealed a down-regulation in the MYC targets, oxidative phosphorylation, and the pathways associated with fatty acid metabolism (Fig. 4C and Additional file 1: Fig. S4F). Furthermore, relapse-specific B-cell clusters 2 and 14 (> 90%) characterized by the absence of CD20 expression were identified (Fig. 4A-B), consisting with the observation from bulk RNA-seq data (Additional file 1: Fig. S4D). By comparing the two clusters with the other B-cell clusters, the top 10 up-regulated genes (fold change > 2 & P < 0.01; including ACSM3, HRK, DPEP1, NSMCE1, and BTG2) were identified (Fig. 4D and Additional file 9: Table S8). GSEA analysis using all the up-regulated genes showed that B-cell receptor signaling, glycolysis, and the hypoxia pathways were significantly enriched (Fig. 4E). The relapse-specific B-cell feature was determined as the mean expression of the top 10 up-regulated genes, based on which, the 198 patients with B-ALL from the TARGET-ALL-P2 (Therapeutically Applicable Research to Generate Effective Treatments ALL Phase II) cohort [24] were categorized into a high group or a low group. The patients in the group with the high relapsed B-cell feature was found to be significantly associated with a poor clinical prognosis (Fig. 4F, P = 0.0077). Furthermore, the expression levels of the specific marker genes ACSM3 and HRK were significantly higher in relapsed patients with CNS leukemia, indicating their potential contribution to brain infiltration in the patient with Ph-like ALL (Fig. 4G and Additional file 1: Fig. S4G). Taken together, these findings uncover the single-cell transcriptome-related dynamic evolution during disease progression.
Integrative single-cell analysis revealed the presence of a Ph-like specific B-cell subset and a common feature
Considering the similarities between Ph-like and Ph+ ALL as determined by bulk RNA-seq, the expression relationship at the single-cell level was evaluated by integrating the scRNA-seq datasets from a previous study that involved two patients with Ph+ B-ALL [16]. After quality control, a total of 25,559 cells were used for the anchoring-based integration analysis and 23 clusters were identified and visualized as tSNE projection (Fig. 5A-B). According to the cell type annotation tool and the well-known lineage markers, 23 clusters were labeled as B cells (CD19, CD79B, and MS4A1), T cells (CD3D and CD8A), NK cells (GNLY and NKG7), hematopoietic stem and progenitor cells (HSPC: FAM30A and CD34), erythrocytic cells (HBD and AHSP), or myeloid cells (CD68, MNDA, and CD14) (Fig. 5C and Additional file 1: Fig. S5A). Cell cycle scoring showed the high expression of the genes related to proliferation (e.g., MKI67, TOP2A, PCNA, and PCLAF) in two B-cell clusters (clusters 3 and 19) and two erythrocytic cell clusters (clusters 11 and 20, Fig. 5D).
tSNE representation of clusters split by the disease subtype and the cell percentage within each cluster showed that most of the cells (> 90%) in clusters 1 and 8 originated from the Ph-like samples (Fig. 5B and Additional file 1: Fig. S5B). GSEA analysis showed that Ph-like specific B-cell clusters (Cluster 1 and 8) displayed a higher stemness feature comparing with other B cell clusters (Fig. 5E), which is consistent with the comparison between PDGFRB fusion positive B-ALL and Ph+ B-ALL according to bulk RNA-seq data (Additional file 1: Fig. S5C). Further, we identified the significantly up-regulated genes which included CD81, AES, H1FX, and JUND in clusters 1 and 8 (Fig. 5F and Additional file 10: Table S9). Survival analysis of the Ph-like specific B-cell feature, defined as the mean expression of the up-regulated genes (fold change > 2 & P < 0.01; n = 7), demonstrated that patients with high levels of Ph-like specific B-cell feature were significantly associated with an adverse prognosis (Fig. 5G, P = 3e-04).
Recently, some researchers have reported that a subset of CD8+ T cells with KLRB1 (CD161) overexpression displayed different cytotoxic states and NK cell signatures [30, 31]. The expression of classical cytotoxic genes (Fig. 5H, left panel) and NK cell genes (Fig. 5H, middle panel) were evaluated in clusters 4, 9, 10, and 16 (T cells) and clusters 15 and 23 (NK cells) and observed the expression of KLRB1 in cluster 9 (CD8 negative) and cluster 10 (CD8 positive) (Fig. 5C). Signature scoring showed that cluster 10 possessed both high cytotoxicity signature scores and high NK cell signatures (Fig. 5I-J) as described in glioma [31], cluster 4 and 9 showed the expression similarity of memory T cells characterized by the expression of CCR7 and GPR183 respectively, and cluster 16 showed the feature of stress T cells (Additional file 1: Fig. S5D-E). Additionally, cluster 10 exhibited a low exhausted state characterized by the low expression of well-known co-inhibitory receptors (Fig. 5H, right panel). Next, the presence of associations between KLRB1+ CD8 T cells in B-ALL and similar transcriptional programs in other human tumor types were evaluated. Extensive and significant overlap in the KLRB1 transcriptional signatures from each of the five cancer types and pan-cancer KLRB1 program was noted (Additional file 1: Fig. S5F and Additional file 11: Table S10).
KLRB1, encoding the CD161 protein, binds to CLEC2D to inhibit NK cell-mediated cytotoxicity and acts as the inhibitory immune checkpoint in NK cells [35]. Our scRNA-seq data revealed that CLEC2D mRNA was also expressed in malignant B cells, except the T and NK cells (Additional file 1: Fig. S5G), indicating the potential for immune evasion. Furthermore, the expression of CD24 and its ligand SIGLEC10 was detected in B cells and myeloid cells, respectively (Additional file 1: Fig. S5G). Despite the patients with high SIGLEC10 expression were significantly associated with a worse prognosis (Additional file 1: Fig. S5H, P = 0.013), the SIGLEC10 displayed low expression in Ph-like subtype and high expression in Ph+ subtype (Additional file 1: Fig. S5I). In contrary, the CLEC2D and KLRB1 exhibited high expression in both Ph-like and Ph+ subtypes (Additional file 1: Fig. S5I). Further, we also observed the expression of CLEC2D and KLRB1 in other cancer types from TCGA data (Additional file 1: Fig. S5J). These data imply that several paths might be simultaneously involved in tumor immune escape in patients with B-ALL patients.