Impact of Clonal Hematopoiesis on the Carcinogenic Process of Multiple Myeloma

doi:10.21203/rs.3.rs-4672454/v1

Download PDF

Article

Impact of Clonal Hematopoiesis on the Carcinogenic Process of Multiple Myeloma

https://doi.org/10.21203/rs.3.rs-4672454/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Clonal hematopoiesis (CH), a phenomenon linked to aging, correlates with inflammation and myeloid malignancies. Here, we explore the interaction of CH, with terminally differentiated lymphoid malignancy, and multiple myeloma (MM). Analysis of CH in clinical cohorts revealed a higher prevalence among MM patients and a lower deep response to proteasome inhibitors. By utilizing the bone marrow samples from MM patients with CH, single-cell transcriptome analyses indicated frequent interaction between CH and MM cells, mediated by CCR10-CCL2, resulting in the upregulation of the MAPK pathway and angiogenesis, findings corroborated by exosome RNA analysis. Conditioned media from TET2 knockdown macrophages significantly enhanced MM cell proliferation compared to that from wild-type cells, an effect reversible by a CCR10 inhibitor. Our results underscore the pivotal role of TET2 CH in driving CCR10-high myeloma progression through paracrine oncogenic effects via exosomal interactions on CCR10, suggesting its potential as a therapeutic target.

Biological sciences/Cancer/Haematological cancer/Myeloma

Health sciences/Biomarkers/Predictive markers

Biological sciences/Cancer/Cancer genomics

Biological sciences/Immunology/Haematopoiesis

clonal hematopoiesis of indeterminate potential

multiple myeloma

TET2 mutation

CCR10

Clonal expansion of blood cells that have acquired somatic mutations associated with expansion but have not transformed into cancer cells is called clonal hematopoiesis (CH).^1,2 CH can be found in as much as 9 to 18% of elderly populations older than 40 years,² and a large proportion of these clonal expansions remain benign.³ Such CH largely derives from the accumulation of somatic mutations throughout aging.⁴ However, CH can be an early stage of clonal evolution toward the development of myeloid leukemia and myelodysplastic syndrome.⁵

While early research focused on the evolution of CH into hematologic malignancies, the risk of all-cause mortality increases with the presence of CH.² Further exploration of the relationship demonstrated that the presence of CH is associated with atherosclerosis independent of age.⁶ The mechanism by which CHs affect atherosclerosis is through the inflammasome, which consists of several chemokine and cytokine genes.⁶ TET2 knockout macrophages, one of the most commonly found CHs, exhibited high transcript levels in CXCL1, CXCL2, CXCL3, PF4, IL1B, and IL6, which were increased in response to low-density lipoprotein cholesterol loading in vitro.⁶ Another study on TET2 loss in macrophages also showed enhanced expression of lipopolysaccharide-stimulated proinflammatory cytokines with delayed resolution, suggesting that TET2 loss is associated with impaired resolution of inflammation.⁷ Therefore, CHs can trigger inflammation that may disturb the microenvironment surrounding the CHs.

These findings raise questions about whether CHs may affect the development of cancer through mechanisms other rather than clonal evolution. For multiple myeloma (MM), as the disease originates from monoclonal gammopathy of unknown significance (MGUS),⁸ direct associations between CHs and MM seemed scarce. However, recent studies have shown that altered hematopoiesis, including CH, is a common event in MM and is associated with disturbances in the tumor microenvironment and adverse outcomes that could be abrogated by immunomodulatory drugs,^9–11 suggesting an interaction between CH and MM. Considering the potential effects of CH on tumor microenvironments, CHs may have a hidden role in the development and the pathophysiology of MM.¹² In this study, we sought to reveal these interplays between CH and MM.

Peripheral blood CH in MM patients and their clinical features

We evaluated the pattern of peripheral blood CH in MM patients. To mitigate the potential influence of cancer treatment on CH^13,14, we collected peripheral blood samples at the time of diagnosis from MM patients without any history of receiving cytotoxic chemotherapeutic agents or radiotherapy. Peripheral blood monocytes were extracted, and their DNA was sequenced using a next-generation sequencing panel designed to detect CH. A total of 194 peripheral blood samples passed quality control and were included. The depth of coverage for coding sequences of CH-associated genes is represented in Supplementary Fig. 1.

The demographic, laboratory, and clinical data of the 194 patients are provided in Table 1. Among these samples, 79 samples harbored CH (40.7%) with a variant allele frequency (VAF) cutoff of 1.5. Patients with CH were significantly older than those without CH (median age 68 vs 63, p = 0.002), as expected. When compared with the healthy population, patients with MM had a significantly higher incidence of CH (40.7% vs 12.1%, p < 0.001). This significant association persisted after adjustment for age and gender (odds ratio (OR): 3.50, 95% confidence interval (CI): 2.45–5.00, p < 0.001).

The overall results of the mutations of CHs are available in Fig. 1a and Supplementary Data 1. The most frequently mutated genes found in CHs were DNMT3A (15.5%, 30 samples), followed by TET2 (10.3%, 20 samples), and ASXL1 (5.7%, 11 samples). Each of the major three CHs (DTA CH) was significantly more frequent in MM patients compared with the healthy population (Fig. 1b, Supplementary Table 1). Among these CHs, TET2-mutated CHs were the most significantly enriched in the MM patients (OR: 3.40, 95% CI: 1.82–6.28, p < 0.001). These findings suggest that there may be underlying effects of CHs, especially for DTA CHs, on the development of MM.

There was no significant difference in disease stage at diagnosis between patients with and without CH (p = 0.550). Regarding the rate of deep response (very good partial response (VGPR) or complete remission (CR)) to first-line proteasome inhibitor-based treatment, patients with CH showed significantly lower VGPR/CR rate (39.3%, 95% CI: 27.1–52.7 for patients with CH vs. 74.5%, 95% CI: 64.5–82.8 for patients without CH, p < 0.001) (Fig. 1c). The low odds of VGPR/CR rate in patients with DTA CH remained significant even when adjusted for age, stage, and high-risk chromosomal abnormalities (OR: 0.28, 95% CI: 0.11–0.72, Supplementary Table 2).

Validation of impact of CH using UK Biobank data

To further validate whether CHs may be associated with MM development, we analyzed the incidence of MM according to CH status in the UK Biobank cohort. Our analysis revealed a temporal trend, with CH-positive individuals demonstrating a progressively increasing incidence of MM compared to CH-negative individuals (Supplementary Fig. 2a). This observation was further corroborated by the Cox proportional hazards regression model, which yielded a statistically significant 1.54-fold increase in the hazard ratio (HR) for MM development among CH carriers (95% CI: 1.26–1.90, p < 0.001) (Supplementary Fig. 2b).

Within the spectrum of CH mutations, TET2 displayed the most pronounced impact on MM risk (Fig. 1d). Carriers of this specific gene exhibited a substantial 2.44-fold increase in HR (95% CI: 1.67–3.55, p < 0.001). These findings were mirrored with respect to overall survival after MM diagnosis (Fig. 1e). CH carriers displayed diminished overall survival, as evidenced by a 1.39-fold increase in hazard for mortality (95% CI: 1.09–1.8, p = 0.009). Notably, the TET2 mutation again played a prominent role (Supplement Fig. 2c, 2d), independently impacting overall survival with a 1.72-fold increase in hazard (95% CI: 1.05–2.8, p = 0.031).

Interestingly, individuals harboring the TET2 CH showed rapid progression from MGUS to MM, suggesting a carcinogenic effect on pre-malignant plasma cells by TET2 mutated cells (Fig. 1f). This specific group demonstrated a 5.78-fold increase in the hazard for progression from MGUS to MM (95% CI: 2.62–12.8, p < 0.001). These findings collectively underscore the prognostic relevance of CH mutations, particularly the TET2 mutation, in influencing the development and progression of MM.

Single-cell multi-omics analysis confirms the myeloid lineages harbor CH

Single-cell multi-omics (DNA + protein) sequencing was performed to confirm the distribution of cells with CH mutations in MM patients with CH using bone marrow (BM) samples. CH variants, including TET2:chr4:106158215:C/A, were identified in the previously analyzed bulk DNA sequencing data. Based on this, we reviewed the detected CH variants within the single-cell DNA sequencing (scDNA-seq) data and confirmed the presence of the same TET2 mutation (TET2:chr4:106158215:C/A, 4.33%).

We visualized cells using Uniform Manifold Approximation and Projection (UMAP) and annotated cells based on the protein expression of canonical markers from the single-cell protein data (Fig. 2a, c). Additionally, cell-level genotyping was conducted using the scDNA-seq data, as shown in Fig. 2b. Upon analyzing the mutated cells, we observed that the majority of CH mutated cells were distributed within the myeloid lineage (Fig. 2d, e). The distribution of mutated cells was notably skewed, with TET2 mutated subclones comprising 29.9% of the myeloid cluster, while other cell types had less than 4% mutated cells (Fig. 2f). Delving into the TET2 mutated cell population, a significant proportion of 88.2% was identified in myeloid cells (Fig. 2g). These findings confirm that the oncogenic effect of CH mutations in the myeloid lineage may influence plasma cells, albeit not in a clonal evolution manner.

Single-cell RNA sequencing analysis reveals a myeloid commitment in the BM of MM patients with CH

To investigate the effect of CH mutation on the BM of MM patients, we collected BM samples from MM patients without CHs (Control, n = 3) and with TET2 mutated CHs (CH, n = 6). Each sample was sorted into CD138⁺ plasma cells and CD138⁻ non-plasma cells and equally mixed for single-cell RNA sequencing (scRNA-seq) analysis (Fig. 3a). After removing low-quality cells, we profiled a total of 93,642 high-quality cells from 9 samples and visualized them using UMAP (Fig. 3b). To confirm the identity of each cell, we classified them into 18 cell types based on the expression of canonical marker genes (Fig. 3c) and measured the cell type proportions of non-plasma cells for each patient (Fig. 3d). When examining the changes in cell type proportions between the two conditions, we found that the BM of the CH group had a lower proportion of naïve T cells but a higher proportion of myeloid lineage cells and hematopoietic stem cells (HSC) than the control group (Fig. 3e).

To identify cellular and molecular changes induced by TET2 mutated CHs in MM patients during hematopoiesis, we inferred the differentiation trajectories of HSCs into erythroid, myeloid, and B-cell progenitors (Fig. 3f). We observed self-renewal and myeloid commitment of HSCs in the CH group (Fig. 3g). Additionally, the CH group had a significantly higher branch probability to myeloid progenitors than the Control group (Fig. 3h). As HSCs displayed increased self-renewal and a myeloid bias, we performed gene set enrichment analysis (GSEA) to identify the biological pathways in HSCs that were affected by TET2 CH mutations. Notably, HSCs in the CH group exhibited enhanced proliferation and myeloid differentiation at the transcriptomic level compared to the Control group (Fig. 3i). Taken together, these findings indicate that HSCs are more proliferative and exhibit a myeloid bias in the BM of MM patients with TET2 CH mutations.

CCL2-CCR10 interaction between classical monocytes and plasma cells promotes plasma cell growth through activation of MAPK signaling

To identify the cell type with the greatest transcriptional changes induced by TET2 mutated CHs, we calculated the number of differentially expressed genes (DEGs) for each cell type (Fig. 4a). Classical monocytes exhibited the largest number of DEGs between conditions, even after adjusting for cell number, suggesting that classical monocytes are the primary cell type affected by TET2 mutated CHs. Several genes encoding chemokines (CCL2, CCL3, and CCL4), which are known to be secreted by classical monocytes, were upregulated in classical monocytes of the CH group (Fig. 4b).

Given the CH-dependent overexpression of chemokines in classical monocytes, we hypothesized that TET2 mutated CHs would induce stronger inflammatory features in classical monocytes. To confirm this, we first performed GSEA and found that classical monocytes in the CH group had enhanced cytokine and chemokine production and response (Fig. 4c). We also estimated the activity scores of NF-kB and its partner RelA, which are known to induce the expression of chemokine-encoding genes identified as DEGs in classical monocytes (Fig. 4d). This analysis revealed that the activity of inflammation-associated transcription factors was higher in the CH group for cell types belonging to the HSCs and myeloid lineages.

To characterize how TET2 mutated CHs perturb cellular interactions between classical monocytes with strong inflammatory features and abnormal plasma cells, we performed a cell-cell interaction analysis on ligand-encoding genes identified as DEGs in classical monocytes. We found the increased interaction of CCL2 in classical monocytes by TET2 mutated CHs with its non-canonical receptor, CCR10, which is mainly expressed in plasma cells (Fig. 4e). C-C chemokine receptor type 10 (CCR10), a member of the chemokine receptor subfamily, is known to be overexpressed in several tumors and plays an important role in cancer development and progression.¹⁵ Recent studies have shown that activation of CCR10 promotes breast cancer cell invasion and migration by inducing the ERK/MAPK signaling pathway.¹⁶ Indeed, the MAPK cascade is enriched in plasma cells of the CH group (Fig. 4f). Taken together, classical monocytes in the BM of MM patients with TET2 mutated CHs exhibit strong inflammatory features and produce more chemokines. Among these chemokines, C-C motif chemokine ligand 2 (CCL2) appears to further activate the MAPK signaling pathway, which promotes cancer cell growth through interaction with CCR10 on plasma cells.

Exosomal RNA analysis shows paracrine effect of CHs on MM via MAPK and integrin pathway

Exosomal RNA was analyzed to investigate whether CHs activate pathways in MM cells through a paracrine effect (Fig. 5a). Differential expression analysis identified 10 down-regulated miRNAs with baseMeans ≥ 20, adjusted P-value < 0.05, and log2 fold change ≤ -1 (Fig. 5b). Among these miRNAs, hsa-let-7f-5p, hsa-let-7a-5p, and hsa-miR-21-5p target MAP kinase, while hsa-miR-320a and hsa-let-7b-5p are linked to integrin, angiogenesis, and metastasis (Table 2).

A computational tool, miRSystem was employed to predict and consolidate the influenced target genes and pathways affected by these differentially expressed (DE) miRNAs. The findings showed that certain pathways, notably the MAPK signaling pathway, chemokine signaling pathway, and cytokine-cytokine receptor interaction, as well as integrins in angiogenesis, which are related to inflammation, paracrine effects, metastasis, and cancer, were not effectively suppressed in CH cases (Fig. 5c).

UK Biobank proteomics analysis confirms consistent paracrine effects of CHs on MM

We validated differentially expressed proteins using Olink proteomics data from the UK Biobank. A cohort was established based on patients who did not have MM at the initial collection from the UK Biobank but were subsequently diagnosed and had an International Classification of Diseases, 10th revision (ICD-10) code generated. We applied ordinal regression with age and gender as covariates to control for confounding factors. The results are plotted in a volcano plot (Fig. 5d).

After confirming statistical significance, post-hoc tests were conducted to calculate estimate values. Utilizing proteins that were statistically significantly identified through post-hoc tests (adjusted P-value < 0.05), we conducted over-representation analysis. The results showed a significant over-representation of pathways such as MAPK, STAT, and integrins, which were corroborated by scRNA-seq and miRNA comparisons between CH and controls. Notably, the role of the STAT3 pathway in cancer progression through anti-apoptotic effects on cancer cells, tumor invasion, migration, metastasis, and angiogenesis,¹⁷ was echoed in miRNA findings, supporting the high prevalence of MM malignancy in CH cases. The figure above illustrates the heightened expression of these pathways' proteins in CH (Fig. 5e).

TET2 knockdown promotes CCR10 high MM cell growth by overexpressing CCL2, which is inhibited by blocking CCR10

To test whether classical monocytes promote cancer cell growth through the CCL2-CCR10 axis and its downstream MAPK signaling activation, we incubated MOLP-8 (MM cancer cell line) in conditioned media (CM) of wild-type (WT) or TET2 knock-down (KD) THP-1 (monocyte cell line) (Fig. 6a). CM from TET2 KD macrophages significantly enhanced the proliferation of MM cells compared to those from wild-type cells (Fig. 6b). To identify the secretory factors responsible for the effects of CM from TET2 KD in macrophages, we performed RNA sequencing for both wild-type and TET2 KD macrophages and investigated differentially expressed genes. We observed an increase in the expressions of several cytokines, including IL1β, amphiregulin (AREG), IL1α, CXCL6, and CCL2, in TET2 knockdown macrophages compared with wild-type macrophages (Fig. 6c). Gene ontology (GO) enrichment analysis also showed that MAPK signaling activity is attenuated in MOLP-8 incubated in WT CM compared to TET2 KD CM (Fig. 6c), which is consistent with the scRNA-seq results. Furthermore, MOLP-8 incubated in TET2 KD CM had activated cytokine-mediated signaling pathway and cell cycle (Fig. 6d). Considering ligand-receptor interaction, we found an increase in the expressions of CCR10, the receptor for CCL2, in certain MM cell lines, especially in MOLP-8 cells (Supplementary Fig. 3). To explore the effect of CCL2-CCR10 signaling on MM cell proliferation, we employed a CCR10 inhibitor, BI-6901. The introduction of BI-6901 significantly inhibited the proliferation-promoting effect of the conditioned media from TET2 KD macrophages (Fig. 6e). These findings were not observed in other MM cell lines with low CCR10 expression (Data not shown). Taken together, our in vitro experiments confirm that CCL2 expression is increased in TET2 KD THP-1, which induces MAPK pathway activation and cell cycle promotion in MOLP-8, and that CCR10 plays an important role in cell growth of MOLP-8 in this situation.

CCR10 expression is associated with survival in MM patient

From the results indicating that CCR10 blocking affects MM cell survival, we hypothesized that CCR10 might be associated with the overall survival of MM patients. To confirm this, we analyzed the risk of CCR10 in the MM Research Foundation (MMRF) dataset.¹⁸ In the MMRF dataset, we found significantly higher expression of CCR10 in plasma cells compared to other cell types (Fig. 7a). Next, we divided the samples into two groups based on CCR10 expression and compared MM survival between the two groups (Fig. 7b). The high CCR10 expression group had significantly worse overall survival than the low expression group (Fig. 7c), and CCR10 showed a significantly higher hazard ratio for MM survival (HR 1.8, 95% CI: 1.3–2.5, p < 0.001). In summary, we identified a negative impact of CCR10 on MM survival in the MMRF dataset, which is independent of our data.

In this study, we investigated the relationship between CH and MM. Specifically, we found that CHs, particularly TET2 CHs, increase the hazard ratio for MM development, MGUS to MM progression, and adversely affect the treatment outcomes of MM patients by promoting MM cell proliferation. These impacts were believed to be facilitated through cytokine interactions between myeloid-biased cells and plasma cells, such as the CCL2-CCR10 axis resulting in the activation of oncogenic pathways, including the MAPK pathway. Especially, CCR10 is shown to play a pivotal role in this CH related MM pathogenesis, which could be reversed by its inhibitor.

This is the first paper showing the role of CH in lymphoid malignancy development. So far, the carcinogenic role of CH in hematologic cancer has been studied with myeloid neoplasms using the clonal evolution model.^2,19 Individuals harboring CH are at a high risk for developing myeloid neoplasms, possibly up to 50% in 10 years when combined with old age, red blood cell abnormalities, and cytopenia.²⁰ This clinical trajectory of CH is expected, as the additional accumulation of oncogenic mutations in HSCs is a major driver for myeloid neoplasms. However, scDNA-seq data in this study directly shows that the relationship between CH and MM could not be explained by the clonal evolution model. In fact, mutations in DNMT3A, TET2, and ASXL1, which are prevalent in CH, are scarcely found in MM patients.²¹ Single-cell transcriptomic data confirmed myeloid biased bone marrow environment in MM patients with co-existing CH. It is well known that TET2 CH derives myeloid differentiation bias in HSC.²² Experimentally, we observed that co-culture of TET2-mutated THP-1 cells with MOLP-8 cells, a MM cell line with high CCR10 expression, significantly increased the proliferation of MOLP-8 cells. All in all, our data strongly support the paracrine role of CH cells in the pathogenesis of MM. Hence, our discovery of the role of CH in MM is aligned with the paracrine effect of TET2 CH in non-hematologic disease such as cardiac diseases or osteoporosis.²³ It is well known that the pro-inflammatory effects of CHs may contribute to atherosclerosis, heart failure, stroke, and atrial fibrillation, leading to adverse overall survival.^2,6,23,24 These diverse clinical effects of CHs as an altered immunity suggest that their presence in bone marrow can impact the tumor milieu in MM. Our data consistently show that the presence of CH, especially TET2 CH, is related to MM development and the progression of MGUS to MM. However, it is not clear at this moment whether TET2 CH contributes to the malignant transformation itself of plasma cells or contributes to the progression of already mutated plasma cells. Thinking about the pathogenic mechanism in cardiac disease, we believe the role of CH might be more important for the progression and proliferation of malignant plasma cells. Its role in resistance to induction therapy shown in this study and previous knowledge of worsened course after stem cell transplantation²⁵ coincide with this hypothesis.

We discovered that CCR10 in malignant plasma cells might play an important role in CH related MM pathogenesis. From single cell transcriptomic data, we propose that one of the main interactions between TET2 mutated monocyte and MM cell axes involves the CCL2-CCR10. It is quite trivial that CCL2 is over-secreted from biased monocyte by TET2 mutation²⁶ as supported by our bulk RNA sequencing data. Our finding is consistent with previous literature indicating that CCR10/CCL27 myeloma-stroma crosstalk contributes to bortezomib resistance in MM cell lines.²⁷ In fact, blocking CCR10 significantly reduced the proliferative effect of TET2 mutated CH on MM cells, suggesting a potential therapeutic target. Recent surface proteomic analysis also suggests CCR10 as an important molecule in MM.²⁸ Previous literatures have also explored targeting CCR10, identifying it as widely expressed on malignant plasma cells, and demonstrating that CCL27-based chimeric antigen receptor T cells with CCR10 knockout exhibit in vitro killing activity against MM cells.²⁸ In addition to these previous observations, we found that targeting CCR10 may disrupt the interaction between MM cells and the surrounding BM microenvironment modified by aberrant myeloid biased milieu, particularly in MM patients with CHs. This is clinically relevant, as increased expression of CCR10 was associated with adverse outcomes in MM patients from the MMRF cohort. We believe further exploration of the interaction between CHs and MM could reveal strategies to manipulate BM microenvironments to treat MM. This approach is promising, as although treatment strategies for MM have improved dramatically, most of these approaches have focused solely on MM cells, potentially overlooking synergistic effects targeting both tumor cells and surrounding microenvironments.^29,30 In addition to the monocyte-plasma cell interactions identified by cell-cell interaction analysis in this study, the possibility exists for other immune cells, such as macrophages recruited by CCL2, to play a bridge role. Further studies are needed to investigate these points. It is also possible that IL-1β hypersecretion by macrophages may play a role in MM pathogenesis by CH. TET2 mutant macrophages have genetically driven activation of NLRP3 inflammasome, which is well documented.^31,32 However, this hypothesis could not be confirmed in our scRNA-seq data due to the low number of macrophages. Further investigation is required to confirm this.

Lastly, our analyses on exosomal RNA and UK Biobank proteomic data support the potential mechanism of the paracrine effect of CH on MM cells. In MM cells of patients with CH, we observed decreased miRNAs that suppress the MAPK pathway, which is crucial for cellular functions such as survival, growth, and differentiation. If not properly regulated, this pathway is well-documented as an oncogenic pathway.³³ The pathway serves as the canonical pathway for CCL2, reinforcing the results of ligand-receptor interactions revealed in the previous scRNA-seq analysis.³⁴ We also observed the enrichment of the integrins pathway. The integrins pathway plays a role in angiogenesis and metastasis. The enrichment of this pathway in plasma cells correlates with drug resistance in MM,^35–37 supporting the therapy resistance and unfavorable prognosis observed in MMRF data. Exosomes play a crucial role in cell-to-cell communication, particularly through the transfer of miRNAs. The differential miRNA profiles observed in exosomes with and without CH in MM patients suggest a potential paracrine effect of CH in MM pathogenesis and progression.

Several limitations should be acknowledged in this study. Firstly, the establishment of cutoffs for VAF to determine CH in MM patients requires further investigation, as there is no consensus on biologically and clinically significant cutoffs for VAF to determine CH. Additionally, the correlation between VAF values and prognosis needs to be explored further. Secondly, further validation is required to determine which CHs detected are significantly associated with MM cells, as many CHs may be bystanders without any effect. Nevertheless, our findings suggest that the presence of frequently detected CHs, especially those involving major genes such as TET2, DNMT3A, and ASXL1, appears to be significantly associated with MM prognosis. Thirdly, we did not perform in vivo confirmation of the interaction between CH and MM cells. However, we performed multiple analyses using scRNA, exosomal RNA, and proteomics to demonstrate a consistent paracrine effect of CH on CCR10 high MM cells through the CCR10 axis, which was validated by cell-line analyses. Lastly, the interaction of CH with MM cells without high expression of CCR10 is not apparent through our analysis. Further investigations on other interactions between CH and MM cells need to be elucidated.

In conclusion, our study has provided evidence of the association between CH and MM, indicating that CH contributes to MM development and resistance to proteasome inhibitors. These mechanisms are mediated through paracrine effects involving the CCL2/CCR10 axis, which activates oncogenic pathways. Targeting CCR10 hold promise as a strategy to modulate the unfavorable bone marrow microenvironment induced by CHs in MM patients.

Patient recruitment and statistical analysis for clinical data

The samples were obtained from patients who visited two tertiary health centers, Seoul National University Hospital and Samsung Medical Center. The study protocol was conducted after approval by the institutional review board of each center (No. 1805-122-948, 2013-09-009). The study was conducted in accordance with Declaration of Helsinki. Patients aged 18 years or older with newly diagnosed MM were included. The exclusion criteria for patients were a history of previous hematologic malignancy or solid cancer excluding localized skin squamous cell carcinoma and uterine cervical cancer, history of cancer treatment, history of hematopoietic stem cell transplantation, and those who did not agree to enroll. After recruiting patients, medical records including demographic data, laboratory data, radiographic data, and treatment outcome data were collected. We used the International Myeloma Working Group consensus criteria for response and minimal residual disease assessment for response evaluation.³⁸ Deep response was defined as either achievement of VGPR or CR, with VGPR defined by 90% or more reduction of the monoclonal component.

To compare categorical variables, Fisher’s exact test was utilized. For the comparison of continuous variables, the Mann-Whitney test was employed. Univariate and multivariate logistic regression analyses were conducted to assess the association of factors with deep response. All statistical analyses for the clinical data analysis were performed using R software version 4.0.0 (https://www.r-project.org).

Deep targeted sequencing of clonal haematopoiesis mutations

The peripheral blood sampling was conducted after obtaining written informed consent. DNA extraction from peripheral white blood cells was performed for targeted sequencing of 24 CH-associated genes (see Supplementary Table 3). The Twist target enrichment panel (Twist Bioscience, USA) and DNBSEQ-G400 Dx (MGI Tech, China) with 2× 100 bp paired-end reads were utilized, archiving an average depth of coverage over 1000×. All reliable non-synonymous variants associated with malignancy were annotated as CH-driven mutations, including truncating mutations or any somatic variants previously reported in the COSMIC database v83. Variants with a VAF of ≥ 1.5% were considered positive. The CH mutation calling and filtering process were identical to that described in previous literature.²⁴

Healthy cohort comparison

For the healthy control cohort, we included 5,487 healthy subjects aged 50–79 years who underwent self-referred health check-ups between February 2014 and March 2016 from the Gene-ENvironmental Interaction and phenotypE (GENIE) cohort at the Seoul National University Hospital Healthcare System Gangnam Center and had no history of cancer.³⁹ The cohort, blood sample collection, genome analysis, and the current study design were approved by the Institutional Review Board (IRB) of Seoul National University Hospital (IRB number: 2112-136-1284, 25-1103-127-357). Informed consent for blood sample storage and use for research purposes was obtained from the participants at the time of blood sample collection.

To evaluate the prevalence of CH mutations in MM and compare it with that of non-MM healthy subjects, we utilized a multivariable logistic regression model adjusted for age and sex to assess the association between CH mutations and MM. In order to mitigate potential selection bias related to age and sex, we conducted 1:4 case-control propensity score matching using the MatchIt package in R.

UK Biobank data analysis

CH variant calling and quality control

We conducted a somatic variant calling procedure for the UK Biobank cohort using Mutect2 following the GATK best practices pipeline (https://www.oreilly.com/library/view/genomics-in-the/9781491975183/). In summary, Mutect2 was executed over 1301 intervals where mutations are known to occur in CH, with the “OrientationBiasReadCounts” annotation and F1R2 outputs enabled. We utilized the GetPileupSummaries and CalculateContamination functions in GATK to estimate cross-sample contamination, and the LearnReadOrientationBias function to learn orientation bias priors. Subsequently, we employed the FilterMutectCalls function to annotate variants that passed or failed quality control filters, incorporating the cross-sample contamination estimate and orientation bias priors as inputs. Variant effects were annotated using Funcotator, and the annotated VCF was exported to a tabular form with bcftools.

Prior to manual review, we applied stringent quality control filters to the raw variant calls. We excluded variants with a total depth < 20, a depth for either allele < 5, or a variant allele fraction < 0.02. Additionally, variants flagged by the FilterMutectCalls function with the following filter flags were removed: "orientation", "haplotype", "base_qual", "contamination", "strand_bias", "map_qual", "position", "germline", and "clustered_events". Variants flagged with the "slippage" filter were excluded if their variant allele fraction was < 0.10. Furthermore, individual mutations occurring more frequently than the R882H mutation were excluded, except for the c.1926_1927insG mutation in ASXL1.

Diagnostic information and cohort selection

We utilized diagnostic information based on the ICD-10 code. Individuals diagnosed with MM at any point between attendance and the last follow-up were selected for analysis. To ensure the specificity in our analysis, individuals with a prior diagnosis of MM, monoclonal gammopathy of undetermined significance (MGUS), or plasmacytoma before enrollment were excluded.

Survival analysis

The Kaplan-Meier plot was employed to visualize the cumulative incidence of MM, with group comparisons made using the log-rank test. Additionally, a Cox proportional hazard regression model was utilized to estimate the HR for risk factors in the multivariate analysis. This survival analysis encompassed both disease incidence and death.

Time to MM incidence was calculated from the “date of attending assessment center” (data-field: f53) to the “date of first inpatient diagnosis icd10” (data-field: 41280). Similarly, time to death after MM diagnosis was computed from the “date of first inpatient diagnosis icd10” (data-field: 41280) until the “date of death” (data-field: 40000). In cases of multiple diagnoses, the first inpatient diagnosis was considered the primary event time.

The covariates incorporated into the Cox regression model included the following factors: genetic principal component (Genetic PC) (1–10), genetic sex (male/female), body mass index (BMI), and age at the time of disease diagnosis, specifically for MM and MGUS. Furthermore, to mitigate any potential influence of genetic background on MM predisposition, the list of known germline predisposition mutations in MM was included as a covariate.⁴⁰

For the analysis of MGUS to MM progression, only samples diagnosed with both MGUS and MM using the ICD-10 code were included. Event time was defined as the duration between MGUS diagnosis and MM diagnosis. The control group consisted of MGUS patients who did not progress to MM by the last follow-up.

Single cell DNA sequencing

Sampling and generation process of single-cell DNA sequencing data

Bulk DNA sequencing was conducted on PBMC samples extracted from two MM patients to validate CH, and CH-positive variants were detected. Following this, a custom amplicon panel targeting genes linked to clonal hematopoiesis was designed for single-cell DNA sequencing analysis. Subsequently, BM aspirates were obtained from the same two patients, and data were generated using a single-cell multi-omics (DNA + Protein) sequencing platform (Tapestri®, Mission Bio, Inc.).

Preprocessing and quality control of scDNA data

The generated FASTQ files underwent preprocessing through the Tapestri pipeline.⁴¹ This pipeline includes adapter sequence trimming, alignment of read content to the human genome (hg19), assignment of sequence reads to cell barcodes, and conducting genotype calling through GATKv3.7. The resulting data is compiled into a VCF file and exported as loom and HDF5 files for further processing. Basic filtering steps for low-quality genotypes or cells were performed in Tapestri Insights with default settings. The minimum variant quality score was set at 30, with a minimum of 10 reads per variant per cell. Cells with an alternate allele frequency below 20 were excluded. Additionally, variants occurring in less than 50% of cells and cells with less than 50% informative genotypes among potential variants were filtered out. Following these procedures, the data underwent subsequent analysis using Tapestri Mosaic (version 3.0.1, https://missionbio.github.io/mosaic) within a Python (3.8.17) environment.

Detection of CH variant in scDNA data

In the context of this multi-omics platform, the Tapestri Pipeline was utilized for DNA analysis as previously described. Confirmation and visualization of variants were performed in the Python environment using guidelines and scripts provided by Tapestri Mosaic. For protein analysis, we employed the same module to quantify the number of reads per antibody per cell. Subsequently, normalization was conducted within the same environment, and variants that were not validated in bulk sequencing were excluded.

Using this tool, we reviewed the scDNA data from the processed HDF5 files. We identified variants consistent with those validated through bulk sequencing. However, due to differences in panel design and the limitations of targeted sequencing using the amplicon method, only one of the two samples confirmed the same TET2 mutation as observed in bulk sequencing (TET2:chr4:106158215:C/A).

Single cell protein analysis and cell type profiling

To investigate the distribution of TET2 mutations within specific cell types identified through DNA analysis, we utilized the Tapestri Mosaic module in Python for the analysis of single-cell protein data. UMAP and clustering were performed using the run_umap and cluster functions of the respective tool. Using the same tool, we examined the normalized counts of marker proteins within each cluster and conducted cell type profiling.

Visualization of single-cell multi-omics analysis results

The expression of these markers was visualized as a heatmap using the same tool as mentioned above. We used the UMAP projection to display cell type profiling results. The distribution of cells harboring the previously identified TET2 mutation from the scDNA analysis was illustrated by overlaying it on the UMAP.

Single cell RNA sequencing

scRNA-seq Library Preparation

To prepare cells for single-cell sequencing, the LUNA-FL™ Automated Fluorescence Cell Counter is used alongside the 10x Genomics protocols and guidelines to ensure optimal sample preparation. Libraries are then prepared using the Chromium controller following the 10x Single Cell 3’ v3 protocol. This involves diluting cell suspensions to a target count (10,000 cells), mixing them with the master mix, and loading them into a chip with Gel Beads and Partitioning Oil. Within droplets, RNA transcripts are barcoded and reverse transcribed. The resulting cDNA undergoes end repair, 'A' base addition, adapter ligation, purification, and PCR enrichment to form the final library. Quantification is done via qPCR (KAPA), qualification via Agilent Technologies 4200 TapeStation, and sequencing on the HiSeq platform (Illumina) as per the user guide specifications.

scRNA-seq Data Processing

Raw FASTQ files for scRNA-seq were processed by using the Cell Ranger software (v3.1.0). Reads were mapped to the human reference genome (GRCh38) using the Ensembl GRCh38.100 GTF file. For each sample, a gene-by-cell unique molecular identifier (UMI) count matrix was generated with default parameters. Empty droplets were removed using the emptyDrops function of the DropletUtils (v1.10.3) R package with FDR < 0.05.⁴² To filter out low-quality cells, cells with more than 60% of UMIs assigned to mitochondrial genes and less than 2.5 log10-scaled UMI counts were excluded using the calculateQCMetrics function of the scater (v1.18.6) R package.⁴³ To normalize the count matrix for cell-specific biases, cells were clustered using the quickCluster function of the scran (v1.18.7) R package.⁴⁴ Cell-level size factors were calculated using the computeSumFactors function of the same package. Raw UMI counts were normalized by cell-level size factors and then log2-transformed with a pseudocount of 1. Highly variable genes (HVGs) were identified using the modelGeneVar function of the scran package with FDR < 0.05 for biological variability. For downstream analysis, cells were clustered into 22 clusters using the FindClusters function of the Seurat (v4.3.0) R package on the top 15 principal components (PCs) of HVGs with resolution = 0.5.⁴⁵ Cells were visualized on a two-dimensional UMAP plot using the RunUMAP function of the same package.

scRNA-seq Data Analysis

To identify differentially abundant cell subpopulations associated with CH mutation, we utilized a single-cell specific method based on mixed-effects modeling of associations of single cells (MASC). We obtained odds ratios and P-values indicating differences in abundance for each cell type across conditions.⁴⁶ Differential gene expressions between conditions within each cell subpopulation were assessed using the FindMarkers function of the Seurat package, implementing the MAST algorithm (v1.16.0), with patients considered as latent variables. With the log-scaled fold changes for each gene between the two conditions, gene set enrichment analysis (GSEA) was performed using the fgsea function of the fgsea (v1.16.0)⁴⁷ R package, utilizing the Gene Ontology gene sets used as a database with the msigdbr function of the msigdbr (v7.5.1) R package.⁴⁸ Trajectory analysis for hematopoiesis was conducted using the run_palantir function of the Palantir (v1.0.1) Python package.⁴⁹ To visualize cells on two-dimensional t-SNE plots, batch effects across patients were corrected using the run_harmony function of the Harmonypy (v0.0.9) Python package on the first 100 diffusion components (DCs).⁵⁰ Subsequently, a k-nearest neighbor (k-NN) graph (k = 30) was constructed using the first 10 DCs. Coordinates for the t-SNE plots were computed using the run_tsne function with perplexity = 700. Using the run_palantir function, branch probabilities for each of the three differentiation fates were calculated for all cells. Transcription factor activities were inferred using the DoRothEA (1.2.2) R package based on the Dorothea regulon database with A, B, and C levels out of 5 levels representing the confidence level based on the number of supporting evidence.^51,52 Cell-cell interactions between classical monocytes and plasma cells were inferred using the CellPhoneDB (v3.1.0) Python package with the normalized count matrix and default parameters.⁵³

Exosome RNA analysis

Sampling, exosome extraction, and small RNA sequencing

Exosome extraction was performed from BM samples of 30 MM patients, followed by sequencing and FASTQ file generation for small RNAs within the extracted exosomes. (NEXTflex®, Illumina, Inc.)

The FASTQ files have been uploaded to the Genboree Workbench for the purpose of employing the exceRpt small RNA-seq pipeline (http://genboree.org/java-bin/workbench.jsp) to conduct a consistent analysis across the entire dataset.

Preprocessing, mapping, and read counting

The small RNA FASTQ files extracted from the BM of MM patients were initially processed using the exceRpt small RNA-seq Pipeline (version 4.6.2) through a batch submission tool.

The exceRpt pipeline processes samples by aligning reads and filtering to eliminate contaminants, then aligns them to the endogenous sequence database.⁵⁴ The pipeline primarily utilizes default settings, including adapter trimming set to ‘auto detect’, which identifies and trims adapter sequences for various library types. Additional parameters were chosen to remove random degenerate 4N sequences of NEXTflex libraries. The default random barcode setting was employed, indicating the presence of a random 4 nucleotides sequence immediately preceding the 5′ and 3′ ends of the insert sequence. The sequences and identities of the adapters identified by the exceRpt pipeline were confirmed in output files, with no identified missing or incorrect adapters in the libraries.

Read quality assessment was conducted using FASTQC, with reads filtered out if they had PHRED scores below 30 (FASTX-Toolkit v0.0.13).⁵⁴ Additionally, reads shorter than 16 nucleotides were excluded from further analysis.

To address potential contamination from laboratory or ribosomal RNA (rRNA), sequences were mapped using Bowtie2 to UniVec (a database of common laboratory contaminant sequences maintained by NCBI) and human rRNA sequences, and subsequently removed.⁵⁴

After the filtering step, the reads were aligned to both the human genome and pre-miRNA sequence databases. Initially, the reads underwent mapping to miRbase version 21, gtRNAdb, piRNABank, Ensembl transcripts (hg19), and circBase databases. This process aimed to assign reads to microRNAs (miRNAs), transfer RNAs (tRNAs), PIWI-interacting RNAs (piRNAs), gencode transcripts, and circular RNAs, respectively.⁵⁴

Following the pipeline execution as described, a reevaluation of internal quality metrics of the exceRpt pipeline was conducted because the output did not include any samples with TET2 mutations that passed the tool's inherent quality control criteria. The inherent quality metrics, specifically transcriptome read count and transcriptome-genome ratio, were redefined to maximize the detection of TET2 mutation signals (Transcriptome reads ≥ 2000, Transcriptome-genome ratio ≥ 0.25). Modified criteria-compliant samples were integrated and combined into a raw read count matrix by the exceRpt pipeline on Genboree Workbench.

Differential expressed miRNA analysis

After preprocessing and quality control, the raw read counts of miRNA obtained from the Genboree Workbench exceRpt small RNA-seq pipeline were further analyzed using R (version 4.3.1) for differentially expressed miRNA analysis.

The calculation of differential expression fold-changes and adjusted P-values was performed using the DESeq2 package (version 1.40.2).⁵⁵

Altered down-regulated miRNAs with a baseMean greater than 20 were selected based on criteria of fold change ≤ -1.0 and adjusted P-value < 0.05. Ten differentially expressed miRNAs (DE miRNAs) were confirmed through this filtering. Subsequently, these DE miRNAs were visualized through a volcano plot using ggplot2 (version 3.4.2).

miRNA targeted gene and pathway prediction

To identify potential target genes for these DE miRNAs, we employed the miRSystem tool (https://mirsystem.cgm.ntu.edu.tw/), which integrates validated data from miRecords, TarBase, and seven miRNA target prediction databases (PITA, miRanda, DIANA-microT, mirBridge, PicTar, rna22, TargetScan), along with five pathway databases (Gene Ontology, KEGG, BioCarta, PID, Reactome).⁵⁶ The goal is to predict targets and decipher their biological roles and pathway involvements.

Upon entering DE miRNAs into miRSystem, it produced a target gene summary report, functional annotation summary report, pathway ranking summary, and individual reports for target genes and pathways for each miRNA. The DE miRNAs were found to regulate genes linked to the MAPK signaling pathway, integrins in angiogenesis, and various growth factors and inflammatory pathways.

UK Biobank Olink proteomics data preprocessing

The Olink Proteomic profiling in the UK Biobank involved analyzing blood plasma samples from 54,967 participants using the Olink® Explore 3072 platform, which measures 2,923 protein analytes across panels such as Cardiometabolic, Inflammation, Neurology, and Oncology. After whole-exome sequencing-based proteogenomic analyses on 52,217 samples, 50,065 passed Olink NPX quality control (96%). The quality control method was based on the paper by Sun et al.⁵⁷ A total of 2,923 unique proteins were measured across eight Olink® Explore 3072 panels (Cardiometabolic, Cardiometabolic II, Inflammation, Inflammation II, Neurology, Neurology II, Oncology, and Oncology II). However, we utilized only 1,463 protein assays that were publicly available at the time of our analysis out of 2,923 assay data. These assays belonged to the Cardiometabolic, Inflammation, Neurology, and Oncology panels.

CH variant calling was conducted on UK Biobank data using Mutect2, with the specific methodology similar to the description provided earlier. ICD-10 codes for MM were collected and organized in the same manner as described previously.

The proteomics NPX data, accessible via the UK Biobank research analysis platform (RAP) (https://ukbiobank.dnanexus.com), has been integrated and filtered with information regarding CH and MM diagnostic codes (ICD-10 code). Merged with UK Biobank showcase metadata, including assay details and quality control information, the data has been restructured into the Olink data format. This entire process was carried out in R (version 4.3.1).

Ordinal regression of proteomics data

CH is marked by somatic mutations that increase mutation rates as individuals age,⁵⁸ displaying a mosaic nature. Gender differences affect CH occurrence and progression.⁵⁹ To minimize the influence of the confounding factors of age and gender, an ordinal regression analysis was conducted using the OlinkAnalyze package (version 3.6.0, https://CRAN.R-project.org/package=OlinkAnalyze). The analysis incorporated these factors as covariates in the regression model (NPX ~ CH + Age + Gender).

Through this process, proteins exhibiting statistically significant expression differences based on the presence of CH were identified. A post-hoc test calculated estimate values, representing the difference in mean NPX between variables, leading to further validation of these differences.

Pathway analysis of proteomics data

Using the statistically significant protein set from the aforementioned post-hoc test results (adjusted P-value < 0.05), an over-representation analysis was conducted. Statistically significant pathways with adjusted P-value < 0.05 were confirmed. Based on these results, key pathways were selected by considering keywords where significant differences were observed in previous scRNA-seq and miRNA analyses, such as cytokines, MAPKs, and integrins. For a more detailed interpretation, the estimate values from the previous post-hoc test for each protein within the set of relevant proteins in pathways were visualized together in a heatmap. All analysis processes were conducted using the OlinkAnalyze package in R.

Cell culture

The human monocyte cell line, THP-1, human MM cell line, MOLP-8, and 293FT cells were obtained from the Korean Cell Line Bank. THP-1 cells were cultured in RPMI 1640 (Hyclone, Cat.#SH30027.01), MOLP-8 cells were cultured in MEM medium (Hyclone, Cat.#SH30024.01), and 293FT cells were cultured in DMEM medium (Hyclone, Cat.#SH30243.01). All media were supplemented with 10% FBS (Hyclone, Cat.#AH30007592) and 1% penicillin-streptomycin (Gibco, Cat.#15140-122).

Differentiation of THP-1 cells into macrophages was induced by the addition of 150 nM phorbol 12-myristate 13-acetate (PMA; Sigma, Cat.#P8139) for 72 hours, and conditioned media of the differentiated THP-1 cells were collected for 48 hours using fresh media. All cells were incubated at 37°C in a 5% CO2 atmosphere.

Generation of TET2 knockdown THP-1 cells

TET2 knockdown THP-1 cells were generated by lentivirus infection containing Cas9 and a single guide RNA (sgRNA) targeting the TET2 gene. For lentiviral sgRNA production, sgRNA was constructed with the lentiCRISPR v2-GFP vector. Lentiviral vector, VSVg, and psPAX2 were cotransfected into 293FT cells using lipofectamine 2000 for 72 hours. Viral particles were concentrated using Lenti-X™ concentrator (Takara) and used to infect cells with 8 µg/ml polybrene for 48 hours. After infection, cells were sorted by flow cytometry for GFP expression. Knockdown of TET2 was evaluated by western blotting.

Cell proliferation assay

Cell proliferation was estimated using the trypan blue exclusion assay. Cells were incubated with a 0.4% solution of trypan blue dye (Thermo Scientific), and cell numbers were counted using a hemocytometer. The numbers of blue-stained cells compared with the initial cell count were calculated for cell proliferation estimation. To investigate the effect of CCR10 inhibition on cell proliferation, BI-6901 (0.1 ng/ml) was treated with conditioned media for 120 hours.

Cell-line Bulk RNA analysis

Bulk RNA sequencing

Total RNA from differentiated THP-1 cells was extracted using the RNeasy Plus Mini Kit (Qiagen). Whole-transcriptome expression profiles were generated by RNA sequencing.

Bulk RNA-seq Data Processing and Analysis

Raw paired-end RNA-seq reads were aligned to the human genome (GRCh38) using STAR (v2.7.10b). The aligned reads were mapped with the Ensembl GRCh38.103 GTF file. The raw count matrix was normalized by scaling and size factors using the DESeq2 (v1.30.1) R package, with genes having a count of at least 10 for every sample.⁶⁰ Differential gene expression was tested using the DESeq2 R package and genes with P-values < 0.05 and log2-scaled fold change > 0.1 or < -0.1 were identified as DEGs. Functional enrichment analysis of DEGs was performed using the topGO (v2.42.0)⁶¹ R package with Gene Ontology Biological Process (GOBP) terms based on the org.Hs.eg.db (v3.12.0) annotation data package.⁶²

MMRF dataset analysis

To investigate the impact of a single gene on MM survival, we utilized the Survival Genie platform with the MMRF CoMMpass dataset.¹⁸ This dataset contains bulk RNA sequencing data from the CD138⁺ fraction of MM patients. Survival analysis was performed using the single gene option on primary MM samples for overall survival. Samples were divided into high and low gene expression groups using an optional cut point (cutp).

FUNDING

This work was funded by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2021R1A2C3005360 [Y.K.] and 2023R1A2C3004065 [J.K.K.]) and supported by a grant from the Seoul National University College of Medicine Research Foundation (No. 800-202200123).

ACKNOWLEDGEMENT

Our gratitude goes to the Korea Research Environment Open NETwork (KREONET) and the Global Science experimental Data hub Center (GSDC) for the data computation and networking facilitated by the Korea Institute of Science and Technology Information (KISTI).

AUTHOR CONTRIBUTIONS

CP, GC, GR, and JP designed the research, performed the research, collected data, analyzed, and interpreted data, performed statistical analysis, and wrote the manuscript.

HY, YO, CL, HA, CHS, S-HC, J-JL, BSK, JMB, D-YS, JH, IK, S-SY, DN, TM, S-YC, S-JK, C-HK performed research, collected data, and reviewed the manuscript.

KK, S-YC, SJ, JKK, and YK designed research, performed research, collected data, analyzed, and interpreted data, and reviewed the manuscript.

COMPETING INTERESTS

Choong Hyun Sun and Youngil Koh are founders and stockholders of Genome Opinion Incorporation. Chansub Lee and Hongyul An are employees of Genome Opinion Incorporation.

Steensma, D. P. et al. Clonal hematopoiesis of indeterminate potential and its distinction from myelodysplastic syndromes. Blood 126, 9–16 (2015).
Jaiswal, S. et al. Age-Related Clonal Hematopoiesis Associated with Adverse Outcomes. New England Journal of Medicine 371, 2488–2498 (2014).
Genovese, G. et al. Clonal Hematopoiesis and Blood-Cancer Risk Inferred from Blood DNA Sequence. New England Journal of Medicine 371, 2477–2487 (2014).
Xie, M. et al. Age-related mutations associated with clonal hematopoietic expansion and malignancies. Nat Med 20, 1472–1478 (2014).
Heuser, M., Thol, F. & Ganser, A. Clonal Hematopoiesis of Indeterminate Potential. Dtsch Arztebl Int 113, 317–22 (2016).
Jaiswal, S. et al. Clonal Hematopoiesis and Risk of Atherosclerotic Cardiovascular Disease. New England Journal of Medicine 377, 111–121 (2017).
Cull, A. H., Snetsinger, B., Buckstein, R., Wells, R. A. & Rauh, M. J. Tet2 restrains inflammatory gene expression in macrophages. Exp Hematol 55, 56-70.e13 (2017).
Kyle, R. A. et al. Long-Term Follow-up of Monoclonal Gammopathy of Undetermined Significance. New England Journal of Medicine 378, 241–249 (2018).
Maia, C. et al. Biological and clinical significance of dysplastic hematopoiesis in patients with newly diagnosed multiple myeloma. Blood 135, 2375–2387 (2020).
Padrnos, L. J. et al. Prevalence and significance of clonal hematopoiesis of indeterminate prognosis (CHIP) in multiple myeloma. Journal of Clinical Oncology 38, 8542 (2020).
Mouhieddine, T. H. et al. Clonal hematopoiesis is associated with adverse outcomes in multiple myeloma patients undergoing transplant. Nat Commun 11, 1–9 (2020).
Neri, P. Clonal hematopoiesis in myeloma: root of all maladies! Blood 135, 2330–2331 (2020).
Coombs, C. C. et al. Therapy-Related Clonal Hematopoiesis in Patients with Non-hematologic Cancers Is Common and Associated with Adverse Clinical Outcomes. Cell Stem Cell 21, 374-382.e4 (2017).
Gibson, C. J. et al. Clonal hematopoiesis associated with adverse outcomes after autologous stem-cell transplantation for lymphoma. Journal of Clinical Oncology 35, 1598–1605 (2017).
Korbecki, J., Grochans, S., Gutowska, I., Barczak, K. & Baranowska-Bosiacka, I. CC Chemokines in a Tumor: A Review of Pro-Cancer and Anti-Cancer Properties of Receptors CCR5, CCR6, CCR7, CCR8, CCR9, and CCR10 Ligands. Int J Mol Sci 21, 7619 (2020).
Lin, H. et al. CCR10 activation stimulates the invasion and migration of breast cancer cells through the ERK1/2/MMP-7 signaling pathway. Int Immunopharmacol 51, 124–130 (2017).
Ma, J., Qin, L. & Li, X. Role of STAT3 signaling pathway in breast cancer. Cell Communication and Signaling 18, 33 (2020).
Dwivedi, B., Mumme, H., Satpathy, S., Bhasin, S. S. & Bhasin, M. Survival Genie, a web platform for survival analysis across pediatric and adult cancers. Sci Rep 12, 3069 (2022).
Tuval, A. & Shlush, L. I. Evolutionary trajectory of leukemic clones and its clinical implications. Haematologica 104, 872–880 (2019).
Weeks, L. D. et al. Prediction of Risk for Myeloid Malignancy in Clonal Hematopoiesis. NEJM Evidence 2, (2023).
Bolli, N. et al. Analysis of the genomic landscape of multiple myeloma highlights novel prognostic markers and disease subgroups. Leukemia 1–13 (2018) doi:10.1038/s41375-018-0037-9.
Abegunde, S. O., Buckstein, R., Wells, R. A. & Rauh, M. J. An inflammatory environment containing TNFα favors Tet2-mutant clonal hematopoiesis. Exp Hematol 59, 60–65 (2018).
Zadeh, F. J. et al. The role of molecular mechanism of Ten-Eleven Translocation2 (TET2) family proteins in pathogenesis of cardiovascular diseases (CVDs). Mol Biol Rep 47, 5503–5509 (2020).
Ahn, H.-J. et al. Clonal haematopoiesis of indeterminate potential and atrial fibrillation: an east Asian cohort study. Eur Heart J (2024) doi:10.1093/eurheartj/ehad869.
Gibson, C. J. et al. Donor Clonal Hematopoiesis and Recipient Outcomes After Transplantation. Journal of Clinical Oncology 40, 189–201 (2022).
Vlasschaert, C. et al. Clonal hematopoiesis of indeterminate potential is associated with acute kidney injury. Nat Med 30, 810–817 (2024).
Thangavadivel, S. et al. CCR10/CCL27 Crosstalk Contributes to Failure of Proteasome-Inhibitors in Multiple Myeloma. Oncotarget vol. 7 www.impactjournals.com/oncotarget/ (2016).
Ferguson, I. D. et al. The surfaceome of multiple myeloma cells suggests potential immunotherapeutic strategies and protein markers of drug resistance. Nat Commun 13, (2022).
Giannakoulas, N., Ntanasis-Stathopoulos, I. & Terpos, E. The Role of Marrow Microenvironment in the Growth and Development of Malignant Plasma Cells in Multiple Myeloma. Int J Mol Sci 22, 4462 (2021).
García-Ortiz, A. et al. The Role of Tumor Microenvironment in Multiple Myeloma Development and Progression. Cancers (Basel) 13, 217 (2021).
Sano, S. et al. Tet2-Mediated Clonal Hematopoiesis Accelerates Heart Failure Through a Mechanism Involving the IL-1β/NLRP3 Inflammasome. J Am Coll Cardiol 71, 875–886 (2018).
Tall, A. R. & Fuster, J. J. Clonal hematopoiesis in cardiovascular disease and therapeutic implications. Nature Cardiovascular Research 1, 116–124 (2022).
Wagle, M.-C. et al. A transcriptional MAPK Pathway Activity Score (MPAS) is a clinically relevant biomarker in multiple cancer types. NPJ Precis Oncol 2, 7 (2018).
Chen, C.-Y. et al. Enhancement of CCL2 expression and monocyte migration by CCN1 in osteoblasts through inhibiting miR-518a-5p: implication of rheumatoid arthritis therapy. Sci Rep 7, 421 (2017).
Hosen, N. Integrins in multiple myeloma. Inflamm Regen 40, 4 (2020).
Damiano, J. S. & Dalton, W. S. Integrin-Mediated Drug Resistance in Multiple Myeloma. Leuk Lymphoma 38, 71–81 (2000).
Vacca, A. & Ribatti, D. Bone marrow angiogenesis in multiple myeloma. Leukemia 20, 193–199 (2006).
Kumar, S. et al. International Myeloma Working Group consensus criteria for response and minimal residual disease assessment in multiple myeloma. Lancet Oncol 17, e328–e346 (2016).
Lee, C. et al. Health and Prevention Enhancement (H-PEACE): a retrospective, population-based cohort study conducted at the Seoul National University Hospital Gangnam Center, Korea. BMJ Open 8, e019327 (2018).
Canzian, F. et al. A polygenic risk score for multiple myeloma risk prediction. European Journal of Human Genetics 30, 474–479 (2022).
Miles, L. A. et al. Single-cell mutation analysis of clonal evolution in myeloid malignancies. Nature 587, 477–482 (2020).
Lun, A. T. L. et al. EmptyDrops: distinguishing cells from empty droplets in droplet-based single-cell RNA sequencing data. Genome Biol 20, 63 (2019).
McCarthy, D. J., Campbell, K. R., Lun, A. T. L. & Wills, Q. F. Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R. Bioinformatics 33, 1179–1186 (2017).
Lun, A. T. L., McCarthy, D. J. & Marioni, J. C. A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor. F1000Res 5, 2122 (2016).
Butler, A., Hoffman, P., Smibert, P., Papalexi, E. & Satija, R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat Biotechnol 36, 411–420 (2018).
Fonseka, C. Y. et al. Mixed-effects association of single cells identifies an expanded effector CD4 ⁺ T cell subset in rheumatoid arthritis. Sci Transl Med 10, (2018).
Korotkevich, G. et al. Fast gene set enrichment analysis. bioRxiv 060012 (2021) doi:10.1101/060012.
Liberzon, A. et al. The Molecular Signatures Database Hallmark Gene Set Collection. Cell Syst 1, 417–425 (2015).
Setty, M. et al. Characterization of cell fate probabilities in single-cell data with Palantir. Nat Biotechnol 37, 451–460 (2019).
Korsunsky, I. et al. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat Methods 16, 1289–1296 (2019).
Garcia-Alonso, L., Holland, C. H., Ibrahim, M. M., Turei, D. & Saez-Rodriguez, J. Benchmark and integration of resources for the estimation of human transcription factor activities. Genome Res 29, 1363–1375 (2019).
Badia-i-Mompel, P. et al. decoupleR: ensemble of computational methods to infer biological activities from omics data. Bioinformatics Advances 2, (2022).
Efremova, M., Vento-Tormo, M., Teichmann, S. A. & Vento-Tormo, R. CellPhoneDB: inferring cell–cell communication from combined expression of multi-subunit ligand–receptor complexes. Nat Protoc 15, 1484–1506 (2020).
Rozowsky, J. et al. exceRpt: A Comprehensive Analytic Platform for Extracellular RNA Profiling. Cell Syst 8, 352-357.e3 (2019).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15, 550 (2014).
Lu, T.-P. et al. miRSystem: An Integrated System for Characterizing Enriched Functions and Pathways of MicroRNA Targets. PLoS One 7, e42390 (2012).
Sun, B. B. et al. Plasma proteomic associations with genetics and health in the UK Biobank. Nature 622, 329–338 (2023).
Brown, D. W. et al. Shared and distinct genetic etiologies for different types of clonal hematopoiesis. Nat Commun 14, 5536 (2023).
Kamphuis, P. et al. Sex Differences in the Spectrum of Clonal Hematopoiesis. Hemasphere 7, e832 (2023).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15, 550 (2014).
Alexa A & Rahnenfuhrer J. topGO: Enrichment Analysis for Gene Ontology. Preprint at (2023).
Carlson M. org.Hs.eg.db: Genome wide annotation for Human. (2019).

Table 1. Patient demographics

	Number (%), Total N = 194
Median age (Range)	64 (32 – 89)
Sex
Male	114 (58.8%)
Female	80 (41.2%)
ISS stage
I	63 (32.5%)
II	64 (33.0%)
III	44 (22.7%)
NA	23 (11.9%)
Heavy chain type
IgG	110 (56.7%)
IgA	41 (21.1%)
IgD	2 (1.0%)
IgM	1 (0.5%)
None	38 (19.6%)
NA	2 (1.0%)
Light chain type
Kappa	111 (57.2%)
Lambda	76 (39.2%)
None	5 (2.6%)
NA	2 (1.0%)
Abnormal 1q
Yes	81 (41.2%)
No	55 (28.4%)
NA	58 (29.9%)
IgH rearrangement
Yes	65 (33.5%)
No	80 (41.2%)
NA	49 (25.3%)
TP53 deletion
Yes	16 (8.2%)
No	128 (66.0%)
NA	50 (25.8%)
Bone lesion
Yes	105 (54.1%)
No	62 (32.0%)
NA	27 (13.9%)

Abbreviations: ISS, International Staging System

Table 2. DE miRNAs down-regulated in CH and their major target genes and pathways.

miRNA	adjusted P-value	baseMean	Log2FC	Up/Down	Target Genes (Total Hit)	Target Pathways (p < 0.01)
miR-21-5p	6.86E-06	86.51288845	-7.001280394	Down	RASA1(6), CCL20(5), IL12A(5), MAP3K1(4), MAPK10(4), IL1B(3)	KEGG: MAPK signaling pathway
						KEGG: Cytokine-cytokine receptor interaction
						KEGG: JAK-STAT signaling pathway
						REACTOME: NF-κB and MAPK activation mediated by TLR4 signaling repertoire
miR-320a	0.000183532	45.43069301	-5.723839133	Down	CDK6(6), AKT3(5), RAC1(5), RAB18(5), RASA1(5), MAPK1(4)	PID: mTOR signaling pathway
						KEGG: MAPK signaling pathway
						KEGG: TGF-β singaling pathway
						REACTOME: Signaling to RAS
miR-423-3p	0.000290462	46.80747094	-5.448251279	Down	KIAA0652(3), RAC1(3), RAP2C(3), VEGFA(3)	PID: mTOR signaling pathway
						KEGG: Renal cell carcinoma
						PID: Integrins in angiogenesis
						KEGG: VEGF signaling pathway
let-7f-5p	0.000355197	45.83253273	-5.227263655	Down	MAP4K3(7), MAP4K4(6), NRAS(6), ITGB3(5), MAP3K3(5), MAPK6(5), MYCN(5)	KEGG: MAPK signaling pathway
						PID: Integrins in angiogenesis
						REACTOME: Signaling by PDGF
						KEGG: Cytokine-cytokine receptor interaction
let-7g-5p	0.000472223	20.59953694	-4.913492322	Down	COL1A1(6), COL1A2(6), ITGB3(6), MAP4K3(6), MAP4K4(6), NRAS(6), MYCBP1(5), MAP3K1(4), MYCN(4), PDGFB(4)	KEGG: MAPK signaling pathway
						PID: Integrins in angiogenesis
						PID: Syndecan-1-mediated signaling events
						REACTOME: PI3K-Akt activation
miR-423-5p	0.000863627	103.1820267	-5.053503196	Down	MAP3K3(3), ELK1(3), MAZ(3), PIK3R3(3)	PID: ErbB1 downstream signaling
						KEGG: MAPK signaling pathway
						KEGG: Focal adhesion
						REACTOME: Signaling by interleukins
miR-451a	0.001011554	38.35242195	-4.507162554	Down	CAB39(5), RAB5A(4), MIF(3)	PID: Signaling mediated by p38-α and p38-β
						REACTOME: Signaling by NOTCH
						KEGG: mTOR signaling pathway
						PID: Notch signaling pathway
let-7a-5p	0.001091073	37.74806932	-4.406541142	Down	NRAS(7), MAP4K3(7), COL1A1(6), ITGB3(6), MAP4K4(6), MAPK6(6), PDGFB(4)	KEGG: MAPK signaling pathway
						PID: Integrins in angiogenesis
						KEGG: Wnt signaling pathway
						KEGG: JAK-STAT signaling patway
let-7b-5p	0.004373118	106.0664831	-4.08818452	Down	MAP4K3(7), COL1A1(6), ITGB3(6), MAP3K3(6), MAP4K4(6), MAPK6(6), NRAS(6)	KEGG: MAPK signaling pathway
						PID: Integrins in angiogenesis
						REACTOME: Signaling by PDGF
						PID: Syndecan-1-mediated signaling events
miR-486-5p	0.014835585	303.9519132	-3.326159184	Down	PI3KR1(4), SMAD2(4), MAP3K7(3)	KEGG: Wnt signaling pathway
						KEGG: Pathways in cancer
						REACTOME: PI3K-Akt activation
						KEGG: Focal adhesion

All miRNAs were considered significant if they passed the criteria of baseMean > 20, adjusted P-value < 0.05, log2FC > 1 (Up) or log2FC < -1 (Down). We utilized miRSystem to determine the number of hits of miRNA target genes according to well-known major miRNA databases (PITA, miRanda, DIANA-microT, mirBridge, PicTar, rna22, and TargetScan) and experimental validation records. We curated key target genes with high hit counts for each miRNA. The pathways targeted by each miRNA were assessed against major pathway databases (Gene Ontology, KEGG, BioCarta, Pathway Interaction Database, and Reactome) using the same tools, and major target pathways were curated among the pathways with a P-value < 0.05.

Yes there is potential Competing Interest. Choong Hyun Sun and Youngil Koh are founders and stockholders of Genome Opinion Incorporation. Chansub Lee and Hongyul An are employees of Genome Opinion Incorporation.

MMCHIPSupplementaryInformation.docx

Download PDF

Version 1

posted

You are reading this latest preprint version

Impact of Clonal Hematopoiesis on the Carcinogenic Process of Multiple Myeloma

Status:

Version 1

Abstract

Figures

INTRODUCTION

RESULTS

Peripheral blood CH in MM patients and their clinical features

Validation of impact of CH using UK Biobank data

Single-cell multi-omics analysis confirms the myeloid lineages harbor CH

Exosomal RNA analysis shows paracrine effect of CHs on MM via MAPK and integrin pathway

UK Biobank proteomics analysis confirms consistent paracrine effects of CHs on MM

CCR10 expression is associated with survival in MM patient

DISCUSSIONS

MATERIALS AND METHODS

Patient recruitment and statistical analysis for clinical data

Deep targeted sequencing of clonal haematopoiesis mutations

Healthy cohort comparison

UK Biobank data analysis

CH variant calling and quality control

Diagnostic information and cohort selection

Survival analysis

Single cell DNA sequencing

Sampling and generation process of single-cell DNA sequencing data

Preprocessing and quality control of scDNA data

Detection of CH variant in scDNA data

Single cell protein analysis and cell type profiling

Visualization of single-cell multi-omics analysis results

Single cell RNA sequencing

scRNA-seq Library Preparation

scRNA-seq Data Processing

scRNA-seq Data Analysis

Exosome RNA analysis

Sampling, exosome extraction, and small RNA sequencing

Preprocessing, mapping, and read counting

Differential expressed miRNA analysis

miRNA targeted gene and pathway prediction

UK Biobank Olink proteomics data preprocessing

Ordinal regression of proteomics data

Pathway analysis of proteomics data

Cell culture

Cell proliferation assay

Cell-line Bulk RNA analysis

Bulk RNA sequencing

Bulk RNA-seq Data Processing and Analysis

Declarations

References

Tables

Additional Declarations

Supplementary Files

Status:

Version 1