Multi-center study on predicting breast cancer lymph node status from core needle biopsy specimens using multi-modal and multi-instance deep learning

doi:10.21203/rs.3.rs-2189946/v1

Objective

To develop a deep learning model based on clinicopathological data and digital pathological images of core needle biopsy specimens for predicting lymph node metastasis in breast cancer.

Methods

We enrolled 3701 patients from the Fourth Hospital of Hebei Medical University and 190 patients from four medical centers in Hebei Province. Clinicopathological data and image features were integrated to build a multi-modal and multi-instance (MMMI) deep learning model that produces the final prediction.

Results

1. For predicting the presence or absence of lymph node metastasis, the AUCs were 0.770, 0.709, and 0.809 for the clinicopathological features, WSI, and MMMI, respectively. 2. For the four-class prediction of lymph node status (no metastasis, isolated tumor cells (ITCs), micrometastasis, and macrometastasis), predictions based on clinicopathological features, WSI, and MMMI were compared. The AUCs for no metastasis were 0.770, 0.709, and 0.809, respectively; for ITCs, 0.619, 0.531, and 0.634; for micrometastasis, 0.636, 0.617, and 0.691; and for macrometastasis, 0.748, 0.691, and 0.758. MMMI gave the most accurate predictions. 3. Across the molecular subtypes of breast cancer, MMMI showed better predictive performance for every lymph node status, especially in triple negative breast cancer (TNBC). 4. In the external validation set, MMMI also showed better predictive performance in the four-class prediction, with AUCs of 0.725, 0.757, 0.525, and 0.708, respectively.

Conclusions

We developed a breast cancer lymph node metastasis prediction model using MMMI. Tests on all cases showed that its overall predictive ability was high and relatively stable.

Biological sciences/Cancer/Breast cancer

Health sciences/Oncology/Cancer/Breast cancer

Deep learning

Multiple modal

Breast cancer

Lymph node metastasis

Breast cancer is the most prevalent malignant cancer among women worldwide ^[1]. Observing the occurrence of axillary lymph node (ALN) metastasis in breast cancer patients is not only important for prognosis, but also for clinical diagnosis and treatment decisions^[2–3]. Sentinel lymph node (SLN) is the first drainage site to experience the lymphatic spread of breast cancer. SLN biopsy (SLNB) is the standard method of ALN staging, which can guide clinicians in deciding axillary lymph node dissection (ALND), surgery, and follow-up treatment^[4–5]. Preoperative prediction of lymph node status is critical for individualized treatment and for avoiding unnecessary surgery. Based on the idea of noninvasive prediction, several studies have attempted to utilize clinical predictors for establishing models to evaluate the possibility of SLN metastasis, and certain important prediction models have been developed^[6–7].

Deep learning has achieved progress and application in the medical field^[8–11], yielding remarkable results in diagnosis and prognosis by automatically learning the latent features from medical data (i.e., histopathological images and clinical characteristics)^[12–16]. For example, Cao et al. developed a deep learning model to predict the microsatellite instability status^[17]. Meanwhile, clinical characteristics, which reflect the clinical status of the patients, are easier to obtain and have also been used by deep learning. Liang et al. applied a deep learning model to clinical characteristics at admission to predict the risk of COVID-19 patients developing critical illness^[18]. Additionally, extensively utilizing various modal information has become an increasingly developed technology in the medical artificial intelligence (AI) field, and it has been widely demonstrated to be significantly useful^[19–21].

Multivariate logistic regression, tree-based methods, and shallow neural network-based methods have been used in previous studies to analyze clinical indicators^[22–24]. Recent studies have shown that tabular learning can effectively extract latent features and their interactions, outperforming previous methods on several tasks^[25]. For histopathological image analysis, a weakly supervised method based on multi-instance learning can be used to learn latent features for subsequent analysis^[26–27].

Our study uses multi-modal prediction from clinicopathological indicators and pathological images, drawing on all the information that can be obtained before surgery. We therefore combined clinicopathological indicators with digital pathological images to establish a prediction model for breast cancer lymph node metastasis. The model analyzes each case more comprehensively, with the aim of improving accuracy in clinical application.

1. Patients characteristics

This study enrolled 3701 female breast cancer patients (mean age, 53 years) with clinicopathological data and corresponding digital pathological images. Patients were divided into a training set (2222 cases), a validation set (736 cases), and a test set (743 cases). According to postoperative pathological results confirmed by immunohistochemistry, 1953 of the 3701 patients had no lymph node metastasis, 118 had isolated tumor cells (ITCs), 564 had micrometastasis, and 1066 had macrometastasis. Clinicopathological features did not differ significantly among the three cohorts (P > 0.05) (Table 1).

Table 1

Patient and tumor characteristics of training set, validation set and test set
Characteristic	total	train	val	test	P value
No.	3701	2222	736	743
Menopause status					0.520
premenopausal	1522 (41%)	897 (40%)	311 (42%)	314 (42%)
menopause	2179 (59%)	1325 (60%)	425 (58%)	429 (58%)
Tumor size					0.185
≤ 2cm	1921 (52%)	1147 (52%)	368 (50%)	406 (55%)
> 2cm	1780 (48%)	1075 (48%)	368 (50%)	337 (45%)
Histological grade					0.121
1	227 (6%)	153 (7%)	34 (5%)	40 (5%)
2	2616 (71%)	1562 (70%)	536 (73%)	518 (70%)
3	858 (23%)	507 (23%)	166 (23%)	185 (25%)
Lymph node status					0.215
negative	1953 (53%)	1173 (53%)	387 (53%)	393 (53%)
ITCs	118 (3%)	67 (3%)	29 (4%)	22 (3%)
micrometastasis	564 (15%)	340 (15%)	95 (13%)	129 (17%)
macrometastasis	1066 (29%)	642 (29%)	225 (31%)	199 (27%)
Tumor location					0.739
UOQ	2090 (56%)	1251 (56%)	427 (58%)	412 (55%)
UIQ	865 (23%)	511 (23%)	176 (24%)	178 (24%)
LOQ	441 (12%)	276 (12%)	74 (10%)	91 (12%)
LIQ	305 (8%)	184 (8%)	59 (8%)	62 (8%)
Vascular invasion					0.196
absent	2975 (80%)	1774 (80%)	609 (83%)	592 (80%)
present	726 (20%)	448 (20%)	127 (17%)	151 (20%)
Nerve invasion					0.312
absent	3199 (86%)	1936 (87%)	627 (85%)	636 (86%)
present	502 (14%)	286 (13%)	109 (15%)	107 (14%)
ER					0.960
negative	781 (21%)	466 (21%)	158 (21%)	157 (21%)
positive	2920 (79%)	1756 (79%)	578 (79%)	586 (79%)
PR					0.672
negative	957 (26%)	584 (26%)	190 (26%)	183 (25%)
positive	2744 (74%)	1638 (74%)	546 (74%)	560 (75%)
HER2					0.996
negative	2625 (71%)	1577 (71%)	522 (71%)	526 (71%)
positive	1076 (29%)	645 (29%)	214 (29%)	217 (29%)
Ki67					0.491
< 20%	531 (14%)	320 (14%)	113 (15%)	98 (13%)
≥ 20%	3170 (86%)	1902 (86%)	623 (85%)	645 (87%)
TILs					0.373
low	2657 (72%)	1572 (71%)	543 (74%)	542 (73%)
middle	768 (21%)	481 (22%)	145 (20%)	142 (19%)
high	276 (7%)	169 (8%)	48 (7%)	59 (8%)
BI-RADS grade					0.197
4a	1032 (28%)	615 (28%)	193 (26%)	224 (30%)
4b	679 (18%)	431 (19%)	119 (16%)	129 (17%)
4c	1052 (28%)	617 (28%)	224 (30%)	211 (28%)
5	938 (25%)	559 (25%)	200 (27%)	179 (24%)
Molecular subtype					0.870
Luminal	2931 (79%)	1762 (79%)	580 (79%)	589 (79%)
HER2 over-expression	580 (16%)	344 (15%)	115 (16%)	121 (16%)
TNBC	190 (5%)	116 (6%)	41 (5%)	31 (5%)
Note: UOQ, upper outer quadrant; UIQ, upper inner quadrant; LOQ, lower outer quadrant; LIQ, lower inner quadrant.

2. The characteristics of the deep learning model of clinicopathological parameters, WSI and MMMI

For histopathological images, we tiled each image into patches and used a multi-instance learning method to learn an embedding for the bag-level feature. For the clinicopathological parameters, we applied a tabular learning model to learn the interactions between features and to abstract the final representation of the tabular data through nonlinear combinations of the features. After pre-processing, data cleaning, and imputation, we developed a novel modal fusion module that borrows information from the clinicopathological parameters to focus on discriminative patches during multi-instance learning of the histopathological images, and that promotes the flow of complementary information between modalities through intermediate fusion (Fig. 1).

3. Predictive performance of lymph node metastasis model

The test set was used to evaluate the deep learning models for binary prediction of lymph node status (no metastasis versus metastasis). The area under the curve (AUC) was 0.770 for the model based on clinicopathological features, 0.709 for WSI, and 0.809 for MMMI. MMMI, which combines clinicopathological features with WSI, predicted lymph node status more accurately (Fig. 2).

To predict lymph node status more accurately and provide a more detailed basis for clinical decisions, we further classified lymph node status into four categories (no metastasis, ITCs, micrometastasis, and macrometastasis). For the no-metastasis class, the tabular deep learning model achieved an AUC of 0.770 (95% CI: 0.737–0.804), an accuracy of 0.723, a sensitivity of 0.791, and a specificity of 0.649. The WSI deep learning model achieved an AUC of 0.709 (95% CI: 0.672–0.746), an accuracy of 0.669, a sensitivity of 0.593, and a specificity of 0.757. MMMI achieved an AUC of 0.809 (95% CI: 0.779–0.840), an accuracy of 0.751, a sensitivity of 0.768, and a specificity of 0.734. MMMI thus had the highest AUC and accuracy. The same pattern in AUC was found for ITCs, micrometastasis, and macrometastasis (Table 2). For every lymph node status, MMMI achieved a higher AUC than either single-modality model based on clinicopathological features or digital pathological images. The ROC curves are shown in Fig. 3A–D.

Table 2

Performance comparison of different models for predicting lymph node status
Class	Methods	AUC	ACC	SEN	SPE
negative	Tabular	0.770(0.737–0.804)	0.723(0.693–0.752)	0.791(0.662–0.845)	0.649(0.580–0.769)
	MIL-WSI	0.709(0.672–0.746)	0.669(0.637–0.703)	0.593(0.458–0.728)	0.757(0.617–0.874)
	MMMI	0.809(0.779–0.840)	0.751(0.720–0.779)	0.768(0.616–0.855)	0.734(0.637–0.874)
ITCs	Tabular	0.619(0.501–0.738)	0.701(0.265–0.938)	0.600(0.240–0.960)	0.705(0.241–0.962)
	MIL-WSI	0.531(0.424–0.639)	0.346(0.230–0.938)	0.880(0.200–1.000)	0.329(0.205–0.964)
	MMMI	0.634(0.519–0.749)	0.746(0.392–0.880)	0.600(0.320–0.960)	0.751(0.375–0.897)
micrometastasis	Tabular	0.636(0.582–0.690)	0.538(0.351–0.743)	0.770(0.450–0.960)	0.508(0.261–0.787)
	MIL-WSI	0.617(0.561–0.673)	0.490(0.380–0.682)	0.800(0.510–0.930)	0.440(0.302–0.706)
	MMMI	0.691(0.638–0.744)	0.623(0.431–0.773)	0.710(0.450–0.910)	0.611(0.355–0.818)
macrometastasis	Tabular	0.748(0.710–0.785)	0.723(0.638–0.759)	0.658(0.582–0.827)	0.757(0.568–0.807)
	MIL-WSI	0.691(0.650–0.731)	0.616(0.552–0.692)	0.769(0.542–0.871)	0.544(0.415–0.747)
	MMMI	0.758(0.721–0.796)	0.734(0.647–0.773)	0.653(0.556–0.822)	0.776(0.591–0.844)
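
The per-class accuracy (ACC), sensitivity (SEN), and specificity (SPE) columns of Table 2 treat each lymph node status as the positive class against the other three. The following is a minimal sketch of that one-vs-rest calculation. It assumes hard predicted labels; the reported values were presumably obtained by thresholding per-class scores at a cutoff, and the function name and toy labels are hypothetical, not study data.

```python
from typing import Dict, Sequence


def one_vs_rest_metrics(
    y_true: Sequence[str], y_pred: Sequence[str], positive: str
) -> Dict[str, float]:
    """Accuracy, sensitivity, and specificity for one class against all others."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive and pred == positive:
            tp += 1
        elif truth != positive and pred == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return {
        "ACC": (tp + tn) / (tp + fp + tn + fn),
        "SEN": tp / (tp + fn) if tp + fn else float("nan"),
        "SPE": tn / (tn + fp) if tn + fp else float("nan"),
    }


# Toy labels for illustration only (not study data).
y_true = ["negative", "negative", "ITCs", "micro", "macro", "macro"]
y_pred = ["negative", "micro", "ITCs", "micro", "macro", "negative"]
macro_metrics = one_vs_rest_metrics(y_true, y_pred, positive="macro")
```

Calling the function once per class reproduces the row structure of Table 2 for a single model.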

4. Predictive performance of lymph node status in different molecular subtypes

We also analyzed the performance of the proposed model in predicting lymph node status within each molecular subtype. In luminal breast cancer, the AUCs for the different lymph node statuses were 0.784 (95% CI: 0.747–0.821), 0.611 (95% CI: 0.479–0.743), 0.663 (95% CI: 0.603–0.723), and 0.733 (95% CI: 0.69–0.776), respectively. In the HER2 over-expression group, the AUCs were 0.885 (95% CI: 0.823–0.947), 0.76 (95% CI: 0.548–0.972), 0.78 (95% CI: 0.665–0.895), and 0.849 (95% CI: 0.65–0.895), respectively. In TNBC, the AUCs were 0.895 (95% CI: 0.781–1), 0.968 (95% CI: 0.905–1), and 0.75 (95% CI: 0.583–0.917), respectively. Because there were too few ITC samples, no AUC for ITCs was obtained in the TNBC group. Comparing all results, MMMI showed better predictive performance for every lymph node status, especially in the TNBC subtype. The ROC curves are shown in Fig. 4.

5. Feature importance analysis

We explored the feature importance by using MMMI. The analysis results showed that the characteristics of mitosis, glandular ducts and vascular invasion played an important role in predicting lymph node metastasis (Fig. 5).

6. Testing of external data sets (multi-center study)

To test the applicability of MMMI, we selected 190 external cases from four other centers for verification; the AUC for predicting lymph node metastasis was 0.626. MMMI also achieved better results in predicting the four-class lymph node status (no metastasis, ITCs, micrometastasis, and macrometastasis), with AUCs of 0.725 (95% CI: 0.65–0.8), 0.757 (95% CI: not available), 0.525 (95% CI: 0.325–0.725), and 0.708 (95% CI: 0.63–0.787), respectively (Fig. 6). Except for the low AUC for micrometastasis, attributed to the small number of cases, the other groups showed higher predictive performance.

ALN metastasis of breast cancer not only determines the surgical approach but is also an important prognostic factor. Accurate prediction of lymph node metastasis in breast cancer patients can help clinicians plan axillary lymph node dissection, reduce postoperative complications, and improve prognosis. According to the AJCC, lymph node metastasis is divided, by the number of cancer cells in the metastatic lymph nodes and the size of the tumor focus, into ITCs (≤ 200 scattered tumor cells or tumor clusters ≤ 0.2 mm), micrometastasis (tumor > 0.2 mm and ≤ 2 mm), and macrometastasis (tumor > 2 mm)^[38]. Previous studies predicted ALN status from clinicopathological data such as tumor grade, tumor size, and lymphovascular invasion. However, these studies only predicted the presence or absence of lymph node metastasis and could not distinguish between ITCs, micrometastases, and macrometastases. Moreover, the characteristics of the tumor micro-environment in pathological images cannot be described in words or quantified as clinicopathological indicators, and the accuracy in clinical application and the external validation of these models remain insufficient.
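
The size thresholds quoted above can be written as a simple decision rule. The sketch below is a simplified illustration of those thresholds only; the function name, the two input measurements, and the choice to treat a deposit of more than 200 cells as exceeding the ITC category are assumptions made for the example, not a statement of the staging manual or of the pathologists' workflow in this study.

```python
def lymph_node_category(largest_deposit_mm: float, tumor_cell_count: int) -> str:
    """Map the largest metastatic deposit to one of the four status labels.

    Simplified, assumed rule based on the thresholds quoted in the text:
    ITCs (<= 200 cells or clusters <= 0.2 mm), micrometastasis
    (> 0.2 mm and <= 2 mm), macrometastasis (> 2 mm).
    """
    if largest_deposit_mm <= 0 and tumor_cell_count <= 0:
        return "no metastasis"
    if largest_deposit_mm > 2.0:
        return "macrometastasis"
    if largest_deposit_mm > 0.2 or tumor_cell_count > 200:
        return "micrometastasis"
    return "ITCs"


# Hypothetical measurements for illustration.
example_status = lymph_node_category(largest_deposit_mm=1.0, tumor_cell_count=500)
```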

Based on the idea of noninvasive prediction, several studies have used clinical predictors to build models that estimate the probability of SLN metastasis or predict ALN status. The most prominent is the nomogram developed at Memorial Sloan–Kettering Cancer Center (MSKCC)^[6] to predict SLN metastasis; its area under the ROC curve was 0.75, indicating an adequate level of prediction and discrimination. Liu et al.^[39] adopted the SMOTE-bagged-tree algorithm to establish a model for predicting SLN metastasis in early breast cancer patients; the area under the ROC curve was 0.801, and the model was reported to be accurate and stable.

Deep learning has gained increasing attention in the field of medical imaging. Currently, deep convolutional neural networks (DCNNs) are one of the well-known types of deep learning algorithms. DCNNs are widely used in medical image processing and pattern recognition because of their simple structure and strong applicability, especially in imaging and pathology. In a previous study, researchers successfully developed a prediction model for lymph node metastasis in breast cancer patients using a deep learning neural network. The AUC of the CNN model with the best performance was 0.89. Additionally, the ROC performance of this model was better than that of the three experienced radiologists. These results demonstrated the feasibility of using CNNs to predict whether early primary breast cancer will metastasize and determine the feasibility of using deep learning methods to predict clinically negative ALN metastasis from ultrasound images in patients with primary breast cancer^[40]. A deep learning radiography (DLR) method based on clinical parameters of breast conventional ultrasound (CUS) and shear wave elastography (SWE) images has been developed and verified^[41], which can be used to predict the ALN status of clinical T1 or T2 breast cancer patients before surgery. The differential diagnosis effect of this method on axillary negative (N0) and axillary metastasis (N+(≥ 1)) is better than that of the single method. Furthermore, the model indicated high discrimination between patients with low risk of axillary metastasis (N+(1–2)) and high risk (N+(≥ 3)).

In clinical practice, an increasing number of patients wish to know their SLN status before undergoing surgery. Predictions from such models are more reliable than simple clinical estimates. In our study, the four categories of lymph node status can be accurately predicted from preoperative multi-modal data that combine clinicopathological indexes with pathological image features. Tailoring the surgical approach to the extent of metastasis can avoid over-treatment and improve patients’ quality of life. Some information in HE-stained slides, such as the tumor micro-environment, cannot be quantified in tables, whereas deep learning can extract more of this information from pathological images. The two data sources reflect the patient’s information at different levels, and combined they provide a more comprehensive representation of the patient’s condition and disease progression. Several imaging studies of lymph node metastasis have obtained promising results. However, imaging focuses on macroscopic features and easily misses small early metastases such as micrometastases or ITCs. In this study, we predicted whether lymph nodes had metastasis and, more specifically, their status (no metastasis, ITCs, micrometastases, and macrometastases), and compared the predictive performance of the models. The results showed that the MMMI model had better predictive ability than the single-modality models. We also verified the predictive ability within each molecular subtype, and the results showed that MMMI could predict the lymph node status of each group, especially TNBC. Although the number of cases with ITCs and micrometastases was small, MMMI showed satisfactory predictive ability. We will further expand the dataset to make these results more convincing.

To test the applicability of MMMI, we selected 190 cases for multi-center verification. For predicting the presence of lymph node metastasis, the AUC was 0.6258. In addition, we tested the performance of the model for predicting no metastasis, ITCs, micrometastases, and macrometastases. Except for the low AUC for micrometastases, attributed to the small number of cases, the other groups showed high predictive performance. The performance of the model declined in external validation because of differences in interpretation between centers and the influence of HE staining. The model can be improved by adding external data, unifying interpretation and annotation, and optimizing the WSIs.

This study has some limitations. There was no predictive verification of ITCs in TNBC due to the few cases of ITCs and uneven distribution, and an excessive AUC value was observed in other molecular classifications. Although MMMI can predict lymph node metastasis more accurately than single clinicopathological factors or WSI features, it has a certain decline in the four classifications of lymph node metastasis. In the future, we plan to optimize MMMI by increasing the sample size, adding other central sample data or gene test results to obtain more accurate and detailed prediction results of lymph node status.

1. Patients

We collected the clinicopathological data and pathological images of preoperative core needle biopsies from 4038 female invasive breast cancer patients at the Fourth Hospital of Hebei Medical University from January 2015 to December 2018. Additionally, the clinicopathological data and whole slide images (WSIs) of 190 female invasive breast cancer patients from four medical centers in Hebei Province were collected for external validation of the proposed method. The inclusion criteria were as follows: 1) three experienced pathologists confirmed that all breast biopsy specimens were invasive breast cancer; 2) no neoadjuvant treatment (NAT) was given before surgery; 3) histopathology and immunohistochemistry were used to postoperatively confirm lymph node metastasis; and 4) complete clinicopathological data were available. The exclusion criteria were as follows: 1) microinvasive carcinoma (invasive lesions < 1 mm); 2) special types of invasive carcinoma; 3) poor or blurred scanned pathological image quality; 4) preoperative treatment (NAT, chemotherapy, radiotherapy and chemotherapy, ablation, etc.); and 5) incomplete clinicopathological data. Finally, 3701 patients were selected for this study.

Patients’ clinicopathological data from biopsy tissues were collected and evaluated, including age, menopausal status, tumor size, histological grade, nuclear atypia, mitosis counts, tumor-infiltrating lymphocytes (TILs), ER (estrogen receptor) status, PR (progesterone receptor) status, HER2 (human epidermal growth factor receptor 2) status, and postoperative lymph node metastasis.

2. Pathological evaluation

Histological grading was based on the World Health Organization classification of breast tumors (5th Edition)^[28] and the Nottingham grading system. All cases were classified as grade I, grade II, or grade III. TILs were evaluated as the area occupied by mononuclear inflammatory cells over the total stromal area^[29–30]. Tumors with more than 1% positively stained tumor cell nuclei were considered positive for ER or PR. HER2 positivity was defined as an IHC (immunohistochemistry) score of 3+ or amplification on FISH (fluorescence in situ hybridization). All cases were divided into three subtypes: luminal (hormone receptor-positive, including luminal A and luminal B), HER2 over-expression (hormone receptor-negative, HER2-positive), and triple negative breast carcinoma (TNBC; both hormone receptor- and HER2-negative).
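
The subtype definitions above amount to a small decision rule, sketched below. Function and parameter names are hypothetical, and treating a tumor as hormone receptor-positive when either ER or PR exceeds the 1% threshold is an assumption; the text does not state how discordant ER and PR results were handled.

```python
def hormone_receptor_positive(er_percent: float, pr_percent: float) -> bool:
    """Assumed rule: positive if ER or PR stains more than 1% of tumor nuclei."""
    return er_percent > 1.0 or pr_percent > 1.0


def her2_positive(ihc_score: int, fish_amplified: bool) -> bool:
    """HER2-positive if the IHC score is 3+ or FISH shows amplification."""
    return ihc_score == 3 or fish_amplified


def molecular_subtype(
    er_percent: float, pr_percent: float, ihc_score: int, fish_amplified: bool
) -> str:
    """Assign one of the three subtypes used in this study."""
    if hormone_receptor_positive(er_percent, pr_percent):
        return "luminal"
    if her2_positive(ihc_score, fish_amplified):
        return "HER2 over-expression"
    return "TNBC"


# Hypothetical case: ER 90%, PR 80%, HER2 IHC 1+, FISH not amplified.
example_subtype = molecular_subtype(90.0, 80.0, 1, False)
```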

3. Structure and standardization of the data.

Clinicopathological parameters were extracted from the reports using a text pattern-matching algorithm. Categorical variables were encoded as numerical variables with the LabelEncoder function in the scikit-learn package, so that the pipeline generated structured data for each patient. Missing data were imputed using multivariate imputation by chained equations^[31]. Color normalization was performed on histopathological images at all scales using an enhanced cycle-consistent generative adversarial network^[32].
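
To illustrate the encoding step, the following dependency-free sketch mimics what label encoding does: each distinct category is mapped to an integer code assigned in sorted order, which is how scikit-learn's LabelEncoder orders its classes. The helper names and the example column are hypothetical; this is not the authors' code.

```python
from typing import Dict, List, Sequence


def fit_label_mapping(values: Sequence[str]) -> Dict[str, int]:
    """Assign integer codes 0..K-1 to the sorted distinct categories."""
    classes = sorted(set(values))
    return {label: code for code, label in enumerate(classes)}


def encode(values: Sequence[str], mapping: Dict[str, int]) -> List[int]:
    """Replace each category by its integer code."""
    return [mapping[value] for value in values]


# Hypothetical BI-RADS column for five patients.
bi_rads = ["4a", "5", "4c", "4b", "4a"]
mapping = fit_label_mapping(bi_rads)
codes = encode(bi_rads, mapping)
```

Integer codes impose an arbitrary order on nominal variables such as tumor location, so this encoding is best paired with a model that can learn per-category behavior.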

4. Data partitioning, image preprocessing, and data augmentation.

The dataset was stratified at the patient level and randomly divided into training (60%), validation (20%), and test (20%) sets. Given the gigantic size (typically 130,000 × 50,000 pixels) of a WSI, the WSIs were tiled into 512 × 512 patches in the form of a grid for subsequent processing. In this study, three magnification scales (5×, 10×, and 20×) were explored, under which tiling was performed^[33]. The threshold of overlap varied among different magnifications. Data augmentation was applied to the patches during the training process to improve the generalization.
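
The two steps described above, a stratified 60/20/20 partition and grid tiling, can be sketched as follows. The function names, the random seed, and the overlap values are illustrative assumptions; the text does not report the overlap used at each magnification, and this sketch keeps only tiles that lie fully inside the slide.

```python
import random
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple


def stratified_split(
    labels: Sequence[str],
    fractions: Tuple[float, float, float] = (0.6, 0.2, 0.2),
    seed: int = 0,
) -> Tuple[List[int], List[int], List[int]]:
    """Split patient indices into train/val/test, preserving class proportions."""
    rng = random.Random(seed)
    by_class: Dict[str, List[int]] = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)
    train: List[int] = []
    val: List[int] = []
    test: List[int] = []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_train = round(fractions[0] * len(indices))
        n_val = round(fractions[1] * len(indices))
        train += indices[:n_train]
        val += indices[n_train:n_train + n_val]
        test += indices[n_train + n_val:]  # remainder goes to the test set
    return train, val, test


def tile_grid(
    width: int, height: int, tile: int = 512, overlap: int = 0
) -> List[Tuple[int, int]]:
    """Top-left (x, y) pixel coordinates of grid tiles fully inside the slide."""
    stride = tile - overlap
    if stride <= 0:
        raise ValueError("overlap must be smaller than the tile size")
    return [
        (x, y)
        for y in range(0, height - tile + 1, stride)
        for x in range(0, width - tile + 1, stride)
    ]


# Hypothetical labels: one entry per patient, so the split is at the patient level.
toy_labels = ["negative"] * 10 + ["macro"] * 5
train_idx, val_idx, test_idx = stratified_split(toy_labels)
coords = tile_grid(1024, 1024)
```

Splitting by patient index rather than by patch keeps all patches of one WSI in the same partition, which avoids leakage between the training and test sets.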

5. Development, validation and interpretation of the model.

MIL-based representation of WSI.

Each WSI was tiled into patches, and the prediction of lymph node metastasis (LNM) relied on the entire region of interest (ROI) of the WSI rather than on individual patches^[34]. EfficientNet^[35] pre-trained on the ImageNet dataset^[36] was used to extract patch-level features, and attention layers at the instance level and feature level served as the backbone of the WSI modality network.
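
As an illustration of instance-level attention, the sketch below implements attention-based pooling in the style of the attention MIL formulation cited earlier^[26]: each patch embedding receives a score, the scores are normalized with a softmax over the bag, and the bag-level feature is the weighted sum of the embeddings. The parameters `V` and `w` would be learned in practice; here they are fixed toy values. The feature-level attention and the fusion with clinicopathological parameters used in MMMI are not reproduced, so this is a sketch under stated assumptions rather than the authors' network.

```python
import math
from typing import List, Sequence, Tuple


def attention_pool(
    instances: Sequence[Sequence[float]],
    V: Sequence[Sequence[float]],
    w: Sequence[float],
) -> Tuple[List[float], List[float]]:
    """Return (bag_feature, attention_weights) for a bag of patch embeddings.

    score_k = w^T tanh(V h_k); weights = softmax(scores) over the bag.
    """
    scores = []
    for h in instances:
        hidden = [math.tanh(sum(v * x for v, x in zip(row, h))) for row in V]
        scores.append(sum(wi * zi for wi, zi in zip(w, hidden)))
    # Numerically stable softmax across instances.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(instances[0])
    bag = [sum(a * h[j] for a, h in zip(weights, instances)) for j in range(dim)]
    return bag, weights


# Toy bag of three 2-dimensional patch embeddings (hypothetical values).
patches = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
V_toy = [[1.0, -1.0]]  # one hidden unit
w_toy = [2.0]
bag_feature, attn = attention_pool(patches, V_toy, w_toy)
```

Because the softmax runs over however many instances the bag contains, the same function handles WSIs with different patch counts, and the attention weights indicate which patches contribute most to the bag-level feature.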

Tabular learning-based representation of the clinicopathological parameters.

We adopted an attentive interpretable tabular learning network, TabNet^[25], to generate a representation of the clinicopathological parameters. The network employed sequential attention on features for inference in each decision step and learned the salient features from the structured clinicopathological parameters.

Integrating the representation of WSI and clinicopathological parameters.

Deep learning, as a form of representation learning, transforms raw data into a suitable representation for pattern recognition in specific tasks^[37]. We developed a new multi-modal multi-instance (MMMI) fusion module comprising multi-modal joint instance aggregate learning and global-aware instance aggregation. The representations of WSIs and clinicopathological parameters were input to the module and embedded as the global multi-modal feature, which was used to guide the learning process of each modality in turn.

Model training and testing.

Because WSIs in the MIL method have a variable number of patches, the model was designed to accept a variable number of instances as input. Label smoothing was used to prevent the model from learning label-related bias. A weighted sampling method was applied during distributed training to compensate for the imbalanced distribution of samples across the four categories. The final loss was computed as follows:

$$\mathcal{L}=\sum_{i=1}^{n}\left\{\left(1-\epsilon\right)\left[-\sum_{y=1}^{K}p\left(y|x_{i}\right)\log q_{\theta}\left(y|x_{i}\right)\right]+\epsilon\left[-\sum_{y=1}^{K}u\left(y|x_{i}\right)\log q_{\theta}\left(y|x_{i}\right)\right]\right\}$$

where $q_{\theta}\left(y|x_{i}\right)$ denotes the predicted likelihood from the model for sample $x_{i}$, $p\left(y|x_{i}\right)$ is the ground-truth label distribution, $n$ is the number of samples, $K$ is the number of candidate labels, and $\epsilon\in\left[0,1\right]$ is a weight factor. In practice, $u\left(y|x_{i}\right)$ is not dependent on data; thus, we set $u\left(y|x\right)=\frac{1}{K}$.
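
A minimal sketch of a label-smoothed cross-entropy of this kind, in its standard non-negative form, is given below. It assumes a one-hot ground-truth distribution and the uniform prior of 1/K stated above; the function name and the value of epsilon are illustrative, since the paper does not report the value used.

```python
import math
from typing import Sequence


def label_smoothing_loss(
    probs: Sequence[Sequence[float]],
    targets: Sequence[int],
    epsilon: float = 0.1,
) -> float:
    """Sum over samples of (1 - eps) * CE(one-hot, q) + eps * CE(uniform, q).

    probs[i][y] is the predicted probability q(y | x_i); targets[i] is the
    index of the true class of sample i.
    """
    total = 0.0
    for q, y in zip(probs, targets):
        k = len(q)
        ce_true = -math.log(q[y])  # cross-entropy with one-hot p
        ce_uniform = -sum(math.log(qc) for qc in q) / k  # with u = 1/K
        total += (1.0 - epsilon) * ce_true + epsilon * ce_uniform
    return total


# Toy prediction over the four lymph node classes (hypothetical values).
toy_probs = [[0.7, 0.1, 0.1, 0.1]]
toy_targets = [0]
loss = label_smoothing_loss(toy_probs, toy_targets, epsilon=0.1)
```

With epsilon set to 0 the expression reduces to the ordinary cross-entropy, and larger values penalize over-confident predictions more strongly.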

Feature importance.

Both MIL and tabular methods are based on the attention mechanism. We investigated the feature importance based on the learned weights of the instances in the MIL and the features of the clinicopathological parameters after the joint learning process.

6. Statistical analysis

The area under the receiver operating characteristic (ROC) curve was calculated using the pROC package in R (version 3.6.1), and the DeLong test was applied to compare ROC curves. The cutpointr package was used to estimate the optimal cutoff points of the ROC curves. The Wilcoxon rank-sum test was used to compare the signatures. Pearson correlation coefficients were used for the correlation analysis.
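
For readers who want to see the quantities involved, the sketch below computes the AUC as the probability that a positive case scores higher than a negative one, and selects a cutoff by maximizing Youden's J (sensitivity + specificity − 1). It is a Python illustration, not the R code used in the study, and the choice of Youden's J is an assumption: the text does not state which criterion was optimized. All names and toy values are hypothetical.

```python
from typing import Sequence, Tuple


def roc_auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """AUC via pairwise comparison of positives and negatives (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def youden_cutoff(
    scores: Sequence[float], labels: Sequence[int]
) -> Tuple[float, float]:
    """Threshold t (positive if score >= t) that maximizes Youden's J."""
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    best_t, best_j = float("nan"), -1.0
    for t in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= t)
        tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < t)
        j = tp / n_pos + tn / n_neg - 1.0
        if j > best_j:  # keep the lowest threshold among ties
            best_t, best_j = t, j
    return best_t, best_j


# Toy scores for four cases (hypothetical values).
toy_scores = [0.1, 0.4, 0.35, 0.8]
toy_labels = [0, 0, 1, 1]
auc = roc_auc(toy_scores, toy_labels)
cutoff, j_stat = youden_cutoff(toy_scores, toy_labels)
```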

Funding

This work was supported by a grant from the Beijing Jingjian Foundation for the Advancement of Pathology (No. 2019–0007).

Declaration of competing interest

There was no conflict of interest relevant to this article.

Author contributions

Yueping Liu and Jianhua Yao conceived and designed the study. Chunhui Li, Yanan Wang, Xin Xu, Min Zhao, Meng Zhao, Meng Yue, Huiyan Deng, and Huichai Yang collected the experimental data and literature. Ying Ding, Fan Yang, Mengxue Han, and Yueping Liu wrote the manuscript, made the figures, edited, and made significant revisions to the manuscript. All authors read and approved the final manuscript. Yan Ding, Fan Yang, and Mengxue Han contributed equally to this article. Yueping Liu is the corresponding author.

Data availability statement

The code and data may be used only for non-commercial purposes and with the permission of the corresponding author.

Siegel, R. L., Miller, K. D., Jemal, A. Cancer statistics, 2018. CA Cancer J. Clin. 2018, 68(1): 7–30.
Jatoi I, Hilsenbeck SG, Clark GM, et al. Significance of axillary lymph node metastasis in primary breast cancer. J Clin Oncol 1999, 17 (8): 2334–2340.
Mamounas EP, Kuehn T, Rutgers EJT, et al. Current approach of the axilla in patients with early-stage breast cancer. Lancet. 2017, 14. doi: 10.1016/S0140-6736(17)31451-4.
Purushotham AD, Upponi S, Klevesath MB, et al. Morbidity After Sentinel Lymph Node Biopsy in Primary Breast Cancer: results From a Randomized Controlled Trial. J Clin Oncol. 2005, 23(19): 4312–4321.
Lucci A, McCall LM, Beitsch PD, et al. Surgical complications associated with sentinel lymph node dissection (SLND) plus axillary lymph node dissection compared with SLND alone in the American College of Surgeons Oncology Group trial Z0011. J Clin Oncol. 2007, 25(24): 3657–3663.
Bevilacqua JL, Kattan MW, Fey JV, et al. Doctor, what are my chances of having a positive sentinel node? A validated nomogram for risk estimation. J Clin Oncol. 2007, 25(24): 3670–3679.
Chen, Jy., Chen, Jj., Yang, Bl. et al. Predicting sentinel lymph node metastasis in a Chinese breast cancer population: assessment of an existing nomogram and a new predictive nomogram. Breast Cancer Res Treat. 2012, 135(3): 839–848.
Ardila D, Kiraly AP, Bharadwaj S, et al. End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography. Nat Med. 2019, 25(6): 954–961.
Dejun Zhou, Fei Tian, Xiangdong Tian, et al. Diagnostic evaluation of a deep learning model for optical diagnosis of colorectal cancer. Nat Commun. 2020, 11(1): 2961.
Gao Y, Cui Y. Deep transfer learning for reducing health care disparities arising from biomedical data inequality. Nat Commun. 2020, 11(1): 5131.
Oliver Schoppe, Chenchen Pan, Javier Coronel, et al. Deep learning-enabled multi-organ segmentation in whole-body mouse scans. Nat Commun. 2020, 11(1): 5626.
Pierre Courtiol, Charles Maussion, Matahi Moarii, et al. Deep learning-based classification of mesothelioma improves prediction of patient outcome. Nat Med. 2019, 25(10): 1519–1525.
David Dov, Shahar Z Kovalsky, Serge Assaad, et al. Weakly supervised instance learning for thyroid malignancy prediction from whole slide cytopathology images. Med Image Anal. 2021, 67: 101814.
Gabriele Campanella, Matthew G Hanna, Luke Geneslaw, et al. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat Med, 2019, 25(8): 1301–1309.
Yuan Liu, Ayush Jain, Clara Eng, et al. A deep learning system for differential diagnosis of skin diseases. Nat Med. 2020, 26(6): 900–908.
Dongguang Li, Jacob R Bledsoe, Yu Zeng, et al. A deep learning diagnostic platform for diffuse large B-cell lymphoma with high accuracy across multiple hospitals. Nat Commun. 2020, 11(1): 6004.
Rui Cao, Fan Yang, Si-Cong Ma, et al. Development and interpretation of a pathomics-based model for the prediction of microsatellite instability in Colorectal Cancer. Theranostics. 2020, 10(24): 11080–11091.
Wenhua Liang, Jianhua Yao, Ailan Chen, et al. Early triage of critically ill COVID-19 patients using deep learning. Nat Commun. 2020, 11(1): 3543.
Nasha Zhang, Jia Wu, Jinming Yu, et al. Integrating Imaging, Histologic, and Genetic Features to Predict Tumor Mutation Burden of Non–Small-Cell Lung Cancer. Clin Lung Cancer. 2020, 21(3): e151–e163.
Xueyan Mei, Hao-Chih Lee, Kai-Yue Diao, et al. Artificial intelligence–enabled rapid diagnosis of patients with COVID-19. Nat Med. 2020, 26(8): 1224–1228.
Azra Bihorac, Tezcan Ozrazgat-Baslanti, Ashkan Ebadi, et al. MySurgeryRisk: Development and Validation of a Machine-learning Risk Algorithm for Major Complications and Death After Surgery. Ann Surg. 2019, 269(4): 652–662.
Y Egashira, T Yoshida, I Hirata, et al. Analysis of pathological risk factors for lymph node metastasis of submucosal invasive colon cancer. Mod Pathol. 2004, 17(5): 503–511.
Rudi Agius, Christian Brieghel, Michael A Andersen, et al. Machine learning can identify newly diagnosed patients with CLL at high risk of infection. Nat Commun. 2020, 11(1): 363.
Salah Al-Zaiti, Lucas Besomi, Zeineb Bouzid, et al. Machine learning-based prediction of acute coronary syndrome using only the pre-hospital 12-lead electrocardiogram. Nat Commun. 2020, 11(1): 3966.
Arik S Ö, Pfister T. TabNet: Attentive Interpretable Tabular Learning[C]//Proceedings of the AAAI Conference on Artificial Intelligence. 2021, 35(8): 6679–6687.
Ilse M, Tomczak J, Welling M. Attention-based deep multiple instance learning[C]//International Conference on Machine Learning. PMLR, 2018: 2127–2136.
Amores J. Multiple instance classification: Review, taxonomy and comparative study[J]. Artificial intelligence, 2013, 201: 81–105.
WHO Classification of Tumours Editorial Board. Breast tumours[M]. 5th ed, vol 2. Lyon: IARC Press, 2019.
R Salgado, C Denkert, S Demaria, et al. International TILs Working Group 2014. The evaluation of tumor-infiltrating lymphocytes (TILs) in breast cancer: recommendations by an International TILs Working Group 2014. Ann Oncol. 2015, 26(2): 259–271.
Maria Vittoria Dieci, Nina Radosevic-Robin, Susan Fineberg, et al. Update on tumor-infiltrating lymphocytes (TILs) in breast cancer, including recommendations to assess TILs in residual disease after neoadjuvant therapy and in carcinoma in situ: A report of the International Immuno-Oncology Biomarker Working Group on Breast Cancer. Semin Cancer Biol. 2018, 52(Pt 2): 16–25.
Van Buuren S, Groothuis-Oudshoorn K. mice: Multivariate imputation by chained equations in R[J]. Journal of statistical software, 2011, 45(1): 1–67.
Zhou N, Cai D, Han X, et al. Enhanced cycle-consistent generative adversarial network for color normalization of H&E stained images[C]//International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, Cham, 2019: 694–702.
N Otsu. A threshold selection method from gray-level histograms. IEEE transactions on systems, man, and cybernetics. 1979, 9: 62–66.
Y Zhao, F Yang, Y Fang, et al. Predicting Lymph Node Metastasis Using Histopathological Images Based on Multiple Instance Learning with Deep Graph Convolution. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
Tan M, Le Q. EfficientNet: Rethinking model scaling for convolutional neural networks[C]//International Conference on Machine Learning. PMLR, 2019: 6105–6114.
Deng J, Dong W, Socher R, et al. ImageNet: A large-scale hierarchical image database[C]//2009 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2009: 248–255.
Esteva A, Robicquet A, Ramsundar B, et al. A guide to deep learning in healthcare. Nat Med. 2019, 25(1): 24–29.
Amin MB, Edge SB, Greene FL, et al. AJCC Cancer Staging Manual[M]. 8th ed. New York: Springer, 2017.
Liu C, Zhao Z, Gu X, et al. Establishment and verification of a bagged-trees-based model for prediction of sentinel lymph node metastasis for early breast cancer patients. Front Oncol. 2019, 9: 282.
Zhou LQ, Wu XL, Huang SY, et al. Lymph Node Metastasis Prediction from Primary Breast Cancer US Images Using Deep Learning. Radiology. 2020, 294(1): 19–28.
Zheng X, Yao Z, Huang Y, et al. Deep learning radiomics can predict axillary lymph node status in early-stage breast cancer. Nat Commun. 2020, 11(1): 1236.


Status:

Journal Publication

Version 1
