Deep Learning-Enabled Ultrasound Radiomics for Accurate Prediction of Breast Cancer Lymph Node Metastasis

doi:10.21203/rs.3.rs-4519585/v1

Download PDF

Research Article

Deep Learning-Enabled Ultrasound Radiomics for Accurate Prediction of Breast Cancer Lymph Node Metastasis

https://doi.org/10.21203/rs.3.rs-4519585/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

Background: Lymph node metastasis serves as a pivotal prognostic marker in breast cancer progression. The present study endeavors to devise a deep learning-driven ultrasound radiomics model for precise forecasting of lymph node metastasis in breast cancer patients.

Methods: A retrospective analysis was conducted on clinical and ultrasound imaging data of breast cancer patients diagnosed surgically and pathologically at our institution between January 2018 and January 2023. The dataset was randomly stratified into training and testing subsets at a 7:3 ratio. Initially, tumor ultrasound images of breast cancer patients were annotated. Subsequently, a pre-trained Densenet121 convolutional neural network(CNN) was employed to extract intricate features from the annotated images. Principal component analysis (PCA) and feature selection techniques were implemented to diminish the dimensionality of the extracted features. These features were then consolidated and optimized using various machine learning algorithms to predict lymph node metastasis. The optimal algorithm was chosen to estimate the probability of lymph node metastasis for each patient utilizing the ultrasound radiomics model. Univariate and multivariate analyses were further conducted on clinical features to identify independent predictors of lymph node metastasis in breast cancer. Ultimately, these clinical predictors were integrated with the prediction probability of the ultrasound radiomics model to formulate a clinical-ultrasound fusion model, whose predictive accuracy was assessed on the testing subset.

Results: The ultrasound radiomics model grounded in deep learning exhibited remarkable performance in forecasting lymph node metastasis in breast cancer. Among the tested algorithms, Logistic Regression (LR) outperformed its counterparts, attaining an AUC (95%CI) of 0.823 (0.775-0.872), along with a sensitivity of 0.835 and specificity of 0.699. Notably, the clinical-ultrasound fusion model further enhanced the predictive accuracy, achieving an accuracy of 0.796 in the training set and 0.820 in the testing set. Moreover, the AUC (95%CI) values were 0.863 (0.837-0.890) and 0.885 (0.847-0.922) for the training and testing sets, respectively, with corresponding sensitivities of 0.744 and 0.741, and specificities of 0.854 and 0.912.

Conclusion: This study successfully developed and validated a deep learning-based ultrasound radiomics model, which exhibited high predictive accuracy and stability in forecasting lymph node metastasis in breast cancer. This model provides clinicians with valuable insights, enabling them to make more personalized and informed treatment decisions for breast cancer patients.

Breast Cancer

Lymph Node Metastasis

Deep Learning

Prediction Model

Breast cancer has become the most common malignant tumor among women worldwide, posing a significant threat to women's health. Global cancer data indicates that around 2.3 million fresh instances of breast cancer were identified worldwide in 2022, making up 11.6% of total cancer diagnoses[1]. Lymph node metastasis(LMN) is a crucial indicator for assessing the progression of breast cancer, and its accurate prediction is essential for developing treatment strategies and assessing prognosis. As a non-invasive, convenient, and relatively low-cost imaging method, ultrasonography is widely used in the screening, diagnosis, and evaluation of treatment response for breast cancer. However, the ultrasonography results are influenced by multiple factors, such as the operator's experience, technical proficiency, and equipment performance. Although traditional clinical pathological parameters are currently used to predict the risk of lymph node metastasis to some extent, many studies have identified shortcomings in their precision and dependability. Tang et al.demonstrated that relying solely on these traditional parameters often fails to comprehensively reflect breast cancer's complexity and individual differences, leading to deviations in prediction results[2]. Furthermore, Jiang et al. highlighted the limitations of traditional clinical pathological parameters in accurately predicting lymph node metastasis, potentially impacting treatment decisions[3].Therefore, this study aims to build a more accurate breast cancer lymph node metastasis prediction model by combining deep learning techniques and clinical pathological characteristics. The objective is to enhance the precision of predictions and offer tailored and scientifically informed recommendations for diagnosing and treating breast cancer.

In recent years, the application of deep learning techniques in medical image processing and analysis has become increasingly extensive and profound. Its exceptional feature extraction and classification capabilities have injected new vitality into the early detection and accurate diagnosis of various diseases. Deep learning exhibits significant advantages compared to traditional omics and clinical pathological feature analysis[4, 5]. While traditional omics and clinical pathological feature analysis provide essential information in diagnosing breast cancer, these methods often rely on manual feature extraction and statistical models, which may be influenced by subjectivity and experience[6]. On the other hand, convolutional neural network(CNN), a type of deep learning, can extract image details more unbiasedly and thoroughly by autonomously acquiring image features, minimizing the impact of human biases[7]. Multiple studies have confirmed that deep learning models exhibit excellent performance in analyzing mammography, ultrasound, and MRI images, significantly improving breast cancer detection's sensitivity and classification accuracy[8, 9]. McKinney et al. suggested that deep learning models can extract valuable characteristics from mammograms, leading to enhanced breast cancer detection accuracy [10]. With its powerful image feature learning ability, CNNs can deeply mine potential information in medical images, revealing subtle changes that are difficult for experts to observe directly. Thus, they perform outstandingly in the early detection and benign-malignant classification of breast cancer[11]. In terms of prognosis evaluation, deep learning has also demonstrated its value.

In contrast to conventional approaches that heavily depend on clinical pathological parameters, deep learning can more precisely assess extensive medical image data from breast cancer patients, predict patient survival rates and recurrence risks with greater accuracy, and offer doctors guidance for tailored treatment plans[12]. Besides image-based characteristics, lymph node metastasis in breast cancer is strongly associated with various clinical-pathological factors. Several studies have incorporated clinical characteristics associated with lymph node metastasis in breast cancer into risk stratification systems through screening clinical features[13, 14]. Hence, integrating clinical characteristics with image features obtained through deep learning in a multimodal fusion model could significantly enhance the accuracy of predicting lymph node metastasis. This fusion method can comprehensively utilize information from different sources, providing a more reliable basis for precise breast cancer treatment.

This study aims to create a model that combines deep learning characteristics and clinical pathological factors to enhance the accuracy of predicting breast cancer lymph node metastasis. Firstly, we used the Densenet121 deep learning model for image feature extraction. In order to improve the accuracy and efficiency of predictions, we utilized principal component analysis (PCA) and feature selection techniques to decrease the dimensionality of the extracted high-dimensional features, keeping only the most important features associated with lymph node metastasis. Finally, we combined a logistic regression (LR) model to analyze and predict these features further. By utilizing this model, we anticipate better-pinpointing patients at risk of lymph node metastasis, ultimately offering valuable decision-making assistance to healthcare providers.

Patient selection

The Medical Ethics Committee of First Affiliated Hospital of Guangxi Medical University has approved the protocol (Approval Number 2024-E393-01). We retrospectively collected data from patients who were surgically and pathologically diagnosed with breast cancer at our hospital from January 2018 to January 2023. Clinical and pathological characteristics (age, location, size, reproductive history, menstrual history, clinical presentation, ultrasound clinical features, grading, pathological diagnosis, etc.) were collected accordingly. Patients were enrolled in the research study if they had received a pathological diagnosis of breast cancer. Patients underwent ultrasound imaging within two weeks before surgery. Lymph node dissection was performed with a precise pathological diagnosis. Complete clinical, ultrasound and pathological data were available. Figure 1 displays the participation of 1986 individuals with breast cancer in the research. Following the evaluation of the criteria, 977 patients were chosen to participate in the research on breast cancer lymph node metastasis models. Based on the post-surgery pathological findings, the patients were categorized into two groups: one with lymph node metastasis (LNM, N = 513) and one without (non-LNM, N = 464). Figure 1 illustrates the process of selecting patients. Participants were assigned randomly to a training group (N = 683) and a testing group (N = 294) in a ratio of 7 to 3. The training group was utilized to construct the model, while the testing group was used to validate the model.

Image Acquisition and Processing

For all patients, one transverse ultrasound image of each breast tumor sample was collected. All images were retrieved and downloaded from our institution's Picture Archiving and Communication System (PACS). The static breast ultrasound images (JPG format) were obtained from ultrasound equipment from manufacturers such as Philips and Toshiba. Meanwhile, image markers and annotations were eliminated through minimal edge cropping to achieve anonymization. As the images were collected from different doctors and equipment, we first standardized the resolution of all ultrasound images to a uniform resolution of 960×720 using interpolation methods to eliminate image differences due to different scanning devices and parameters. Next, a mean filter was applied to reduce image noise and artifacts, improving image quality and enhancing the accuracy of subsequent analysis.

The ITK-SNAP tool (ITK-SNAP Home (itksnap.org)), an open-source software with version 4.0.0, was utilized to outline the area of focus in the ultrasound images. After converting the breast ultrasound images from JPG to Nii.gz format and importing them into ITK-SNAP, the liver region in the files was manually labeled as the ROI using the polygon tool in the toolbar. After the delineation, the ROI files and original images were exported. An ultrasound physician manually segmented the ROI with five years of experience in breast ultrasound images, and each segmentation was then reviewed by a senior ultrasound physician with 15 years of experience in breast ultrasound.

Transfer Learning

Convolutional Neural Networks (CNNs) are network models trained on large-scale datasets. Utilizing transfer learning, this research addressed the issue of limited training data in deep learning, resulting in improved speed, time, and performance effectiveness during the process. Transfer learning was implemented using Python 3.7.2 in the Anaconda distribution. TensorFlow 2.0 and Keras 2.2.4 Python were utilized to code all the deep learning models. During the parameter setup for transfer learning, the Rectified Linear Unit (ReLU) served as the activation function, the binary cross-entropy function was utilized for loss, Adam was chosen as the optimizer, and the sigmoid function was employed for classification. The learning rate was set to 0.001, Alpha to 0.5, and the epoch to 100. Ultrasound images of the breast were divided into training and test sets in a ratio of 7 to 3, chosen at random. Initially, the model was trained with the training set and fine-tuned with the test set. Each epoch's loss value and accuracy during training were recorded, and the CNN weight values were saved. The deep feature extraction model was the CNN model, which achieved the best accuracy and lowest loss value. In this research, the Densenet121 model was employed for transfer learning, being initially trained on the training set and subsequently fine-tuned on the test set. Loss and accuracy metrics were tracked for every epoch, and the network weights were stored. The deep feature extraction model parameters were chosen based on the CNN model, which achieved the most fantastic accuracy and smallest loss value.

Extraction and Screening of Deep Features

Deep features were extracted from the region of interest (ROI) using the pre-trained Densenet121 model through extraction and screening processes. This study extracted features from the second-to-last layer (features. Norm 5 layer) as the deep features for building subsequent machine learning models. Because the deep feature data has many dimensions, Principal Component Analysis (PCA) was employed to reduce the dimensionality initially by converting the original data into a collection of independent representations that are not correlated with each dimension through a linear transformation. After PCA dimensionality reduction, each deep feature matrix was reduced to 512 dimensions. Feature extraction features with a correlation coefficient exceeding 0.9 were eliminated using Spearman's rank correlation coefficient as a threshold. Finally, Least Absolute Shrinkage and Selection Operator (LASSO) was used to reduce the dimensionality of these features further to obtain depth features related to BA diagnosis.

Model Construction and Validation

Eleven machine-learning algorithms from the Scikit-Learn library were used to construct ultrasound radionics models based on the deep features of the training set. The algorithms in the list were Support Vector Machine (SVM), K-Nearest Neighbor (KNN), ExtraTrees, Random Forest (RF), eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), Logistic Regression (LR), Multilayer Perceptron (MLP), AdaBoost, GradientBoosting, and NaiveBayes.Optimal parameters were selected using a five-fold cross-validation technique. Subsequently, the model performance was further evaluated and screened in the test set. Following a thorough assessment, the top-performing machine learning algorithm was chosen to determine the likelihood of breast cancer lymph node metastasis in ultrasound imaging for each sample, known as the ultrasound radionics signature (US signature). In order to enhance the model's performance and stability, clinical risk factors for breast cancer lymph node metastasis were identified by analyzing patient clinical characteristics using univariate and multivariate methods. A clinical ultrasound prediction model for breast cancer lymph node metastasis was constructed by integrating the ultrasound radionics signature with clinical independent risk factors, and the model's performance was evaluated. Evaluation metrics included the AUC value (95% CI), accuracy, sensitivity, specificity, etc. Calibration curves, confusion matrices, decision curve analysis(DCA), and patient prediction score distributions were plotted to compare the models' calibration efficiency and clinical application value. The model construction and validation process are shown in Fig. 2.

Statistical Analysis Methods

Statistical analysis was conducted using Python software version 3.7.2. The experimental findings that followed a normal distribution were presented as the average plus or minus the standard deviation. Independent sample t-tests or Wilcoxon tests were used for continuous variables between two groups, and Pearson's chi-square test was used for categorical variables. Statistical analyses were conducted using a two-tailed approach, with statistical significance defined as P < 0.05. The ITK-SNAP program was utilized for preprocessing images and outlining regions of interest, with Python being employed for extracting features, filtering features, and creating and validating models.

Patient Clinical Data

Comparison of patient clinical data between the group with lymph node metastasis and the group without metastasis (Table 1) showed notable variations in tumor size, grade, and lymphovascular invasion (LVI) (p<0.001), with no significant differences in other clinical characteristics observed between the groups. Additional multivariate analysis verified that the dimensions of the tumor [OR (95%CI) 1.191(1.164-1.219), p<0.001], the grade [OR (95%CI) 1.104(1.079-1.130), p<0.001], and the presence of LVI [OR (95%CI) 1.105(1.080-1.131), p<0.001] were autonomous factors that increased the risk of lymph node metastasis in cases of breast cancer. Table 2 indicates no significant variances in initial clinical features between the training and testing groups.

Table 1 Comparison of clinical characteristics between participants in the non-LNM and LNM groups

	non-LNM (N=464)	LNM (N=513)	Overall (N=977)	P-value
Age (years)	50.8±10.9	50.8±10.3	50.8±10.5	0.999
Position				0.997
Right	229 (49.4%)	252 (49.1%)	481 (49.2%)
Left	235 (50.6%)	261 (50.9%)	496 (50.8%)
Pregnant				0.982
No	32.0 (6.9%)	37.0 (7.2%)	69.0 (7.1%)
Yes	432 (93.1%)	476 (92.8%)	908 (92.9%)
Menopause				1
No	261 (56.3%)	289 (56.3%)	550 (56.3%)
Yes	203 (43.8%)	224 (43.7%)	427 (43.7%)
Nipple retraction				0.436
No	421 (90.7%)	477 (93.0%)	898 (91.9%)
Yes	43.0 (9.3%)	36.0 (7.0%)	79.0 (8.1%)
Nipple.discharge				0.46
No	448 (96.6%)	487 (94.9%)	935 (95.7%)
Yes	16.0 (3.4%)	26.0 (5.1%)	42.0 (4.3%)
Number.of.tumor				0.872
Single	436 (94.0%)	486 (94.7%)	922 (94.4%)
Multiple	28.0 (6.0%)	27.0 (5.3%)	55.0 (5.6%)
US tumor size (mm)	25.8±10.2	36.2±14.1	31.2±13.4	<0.001
Aspect ratio				0.508
≤1	410 (88.4%)	465 (90.6%)	875 (89.6%)
＞1	54.0 (11.6%)	48.0 (9.4%)	102 (10.4%)
US tumor borderline				0.461
Clear	47.0 (10.1%)	65.0 (12.7%)	112 (11.5%)
Blurring	417 (89.9%)	448 (87.3%)	865 (88.5%)
US.tumor.form.				0.98
Rule	23.0 (5.0%)	24.0 (4.7%)	47.0 (4.8%)
Lrregular	441 (95.0%)	489 (95.3%)	930 (95.2%)
US tumor blood				0.894
No	77.0 (16.6%)	91.0 (17.7%)	168 (17.2%)
Yes	387 (83.4%)	422 (82.3%)	809 (82.8%)
US BI-RADS				0.905
3	13.0 (2.8%)	12.0 (2.3%)	25.0 (2.6%)
4	374 (80.6%)	410 (79.9%)	784 (80.2%)
5	63.0 (13.6%)	81.0 (15.8%)	144 (14.7%)
6	14.0 (3.0%)	10.0 (1.9%)	24.0 (2.5%)
Calcification				0.941
No	148 (31.9%)	169 (32.9%)	317 (32.4%)
Yes	316 (68.1%)	344 (67.1%)	660 (67.6%)
Echo				0.731
Low-echo	396 (85.3%)	448 (87.3%)	844 (86.4%)
Iso-echo	46.0 (9.9%)	38.0 (7.4%)	84.0 (8.6%)
High-echo	22.0 (4.7%)	27.0 (5.3%)	49.0 (5.0%)
Pathological type				0.974
Others	45.0 (9.7%)	52.0 (10.1%)	97.0 (9.9%)
Invasive ductal carcinoma	419 (90.3%)	461 (89.9%)	880 (90.1%)
Grade				<0.001
1	170 (36.6%)	110 (21.4%)	280 (28.7%)
2	227 (48.9%)	237 (46.2%)	464 (47.5%)
3	67.0 (14.4%)	166 (32.4%)	233 (23.8%)
LVI				<0.001
No	303 (65.3%)	205 (40.0%)	508 (52.0%)
Yes	161 (34.7%)	308 (60.0%)	469 (48.0%)
Ki67				0.932
No	119 (25.6%)	137 (26.7%)	256 (26.2%)
Yes	345 (74.4%)	376 (73.3%)	721 (73.8%)
CK7				0.534
No	243 (52.4%)	287 (55.9%)	530 (54.2%)
Yes	221 (47.6%)	226 (44.1%)	447 (45.8%)
EGFR				0.86
No	357 (76.9%)	387 (75.4%)	744 (76.2%)
Yes	107 (23.1%)	126 (24.6%)	233 (23.8%)
ER				0.902
No	242 (52.2%)	275 (53.6%)	517 (52.9%)
Yes	222 (47.8%)	238 (46.4%)	460 (47.1%)
HER2				0.729
No	176 (37.9%)	182 (35.5%)	358 (36.6%)
Yes	288 (62.1%)	331 (64.5%)	619 (63.4%)
PR				0.983
No	171 (36.9%)	192 (37.4%)	363 (37.2%)
Yes	293 (63.1%)	321 (62.6%)	614 (62.8%)

Table 2 Univariate and Multivariate analysis of risk factors related to LNM in Breast Cancer

Variable	Univariate analysis		Multivariate analysis
Variable	OR (95%CI)	P value	OR (95%CI)	P value
US tumor size	1.212(1.183-1.242)	0.00	1.191(1.164-1.219)	0.00
Grade	1.121(1.093-1.150)	0.00	1.104(1.079-1.130)	0.00
LVI	1.135(1.106-1.164)	0.00	1.105(1.080-1.131)	0.00

Performance of the Deep Learning Model

We trained Densenet121 as a deep feature extraction model for this analysis, extracting 50,176 deep features from each ROI using a pre-trained CNN. Furthermore, PCA was employed to decrease the dimension of pixel-level features to the top 512 most significant features (Supplementary File 1). Subsequently, with the lymph node status of breast cancer as the outcome target, Spearman and LASSO were applied to reduce the extracted deep features to 32 breast cancer lymph node metastasis-related features (Fig. 3A and 3B). Various machine learning algorithms were utilized to build models by integrating the filtered deep characteristics. The LR model showed the best performance in the training set based on the AUROC metric, as depicted in Fig. 3C and Supplementary Table 1. The LR, NaiveBayes, SVM, KNN, RandomForest, ExtraTrees, XGBoost, LightGBM, GradientBoosting, AdaBoost, and MLP models achieved accuracies of 0.772, 0.755, 0.745, 0.497, 0.517, 0.544, 0.684, 0.667, 0.639, 0.680, and 0.735, correspondingly, in the independent testing set (Fig. 3E). Table 4 and Fig. 3F display the AUC (95%CI) values of 0.823 (0.775-0.872), 0.790 (0.738-0.843), 0.796 (0.744-0.849), 0.590 (0.528-0.653), 0.630 (0.567-0.693), 0.603 (0.539-0.666), 0.719 (0.661-0.777), 0.710 (0.651-0.769), 0.683 (0.623-0.744), 0.684 (0.623-0.745), and 0.772 (0.718-0.826), demonstrating that the LR model performed the best in the testing dataset. The LR model in the testing set had a sensitivity of 0.835, specificity of 0.699, PPV of 0.763, and NPV of 0.785. This result demonstrates the high efficacy and stability of the LR model algorithm in predicting lymph node status based on deep features of breast cancer.

Table 3 Comparison of baseline characteristics between the training set and the test set

	Training-set (N=683)	Test-set (N=294)	P-value
Age (years)	50.4±10.4)	51.9±10.8	0.114
Position			0.642
Right	343 (50.2%)	138 (46.9%)
Left	340 (49.8%)	156 (53.1%)
Pregnant			0.831
No	46.0 (6.7%)	23.0 (7.8%)
Yes	637 (93.3%)	271 (92.2%)
Menopause			0.0676
No	401 (58.7%)	149 (50.7%)
Yes	282 (41.3%)	145 (49.3%)
Nipple retraction			0.474
No	623 (91.2%)	275 (93.5%)
Yes	60.0 (8.8%)	19.0 (6.5%)
Nipple.discharge			0.663
No	651 (95.3%)	284 (96.6%)
Yes	32.0 (4.7%)	10.0 (3.4%)
Number.of.tumor			0.908
Single	646 (94.6%)	276 (93.9%)
Multiple	37.0 (5.4%)	18.0 (6.1%)
US tumor size (mm)	31.4±13.3	30.8±13.7	0.826
Aspect ratio			0.701
≤1	608 (89.0%)	267 (90.8%)
＞1	75.0 (11.0%)	27.0 (9.2%)
US tumor borderline			0.881
Clear	76.0 (11.1%)	36.0 (12.2%)
Blurring	607 (88.9%)	258 (87.8%)
US.tumor.form.			0.245
Rule	38.0 (5.6%)	9.00 (3.1%)
Lrregular	645 (94.4%)	285 (96.9%)
US tumor blood			0.59
No	123 (18.0%)	45.0 (15.3%)
Yes	560 (82.0%)	249 (84.7%)
US BI-RADS			0.964
3	16.0 (2.3%)	9.00 (3.1%)
4	544 (79.6%)	240 (81.6%)
5	105 (15.4%)	39.0 (13.3%)
6	18.0 (2.6%)	6.00 (2.0%)
Calcification			0.302
No	232 (34.0%)	85.0 (28.9%)
Yes	451 (66.0%)	209 (71.1%)
Echo			0.834
Low-echo	586 (85.8%)	258 (87.8%)
Iso-echo	59.0 (8.6%)	25.0 (8.5%)
High-echo	38.0 (5.6%)	11.0 (3.7%)
Pathological type			0.982
Others	67.0 (9.8%)	30.0 (10.2%)
Invasive ductal carcinoma	616 (90.2%)	264 (89.8%)
Grade			1
1	196 (28.7%)	84.0 (28.6%)
2	323 (47.3%)	141 (48.0%)
3	164 (24.0%)	69.0 (23.5%)
LVI			0.609
No	348 (51.0%)	160 (54.4%)
Yes	335 (49.0%)	134 (45.6%)
Ki67			1
No	179 (26.2%)	77.0 (26.2%)
Yes	504 (73.8%)	217 (73.8%)
CK7			0.997
No	370 (54.2%)	160 (54.4%)
Yes	313 (45.8%)	134 (45.6%)
EGFR			0.0469
No	505 (73.9%)	239 (81.3%)
Yes	178 (26.1%)	55.0 (18.7%)
ER			0.98
No	360 (52.7%)	157 (53.4%)
Yes	323 (47.3%)	137 (46.6%)
HER2			0.826
No	246 (36.0%)	112 (38.1%)
Yes	437 (64.0%)	182 (61.9%)
PR			0.897
No	257 (37.6%)	106 (36.1%)
Yes	426 (62.4%)	188 (63.9%)
LNM			0.88
No	328 (48.0%)	136 (46.3%)
Yes	355 (52.0%)	158 (53.7%)

Table 4 Comparison of the performance of ultrasound imaging machine learning models in the test set

Model	Acc	AUC	95% CI	Sens	Spec	PPV	NPV	F1
LR	0.772	0.823	0.7747 - 0.8718	0.835	0.699	0.763	0.785	0.798
NaiveBayes	0.755	0.790	0.7381 - 0.8427	0.766	0.743	0.776	0.732	0.771
SVM	0.745	0.796	0.7435 - 0.8485	0.671	0.831	0.822	0.685	0.739
KNN	0.497	0.590	0.5282 - 0.6525	0.082	0.978	0.812	0.478	0.149
RandomForest	0.517	0.630	0.5670 - 0.6928	0.247	0.831	0.629	0.487	0.355
ExtraTrees	0.544	0.603	0.5388 - 0.6664	0.291	0.838	0.676	0.504	0.407
XGBoost	0.684	0.719	0.6611 - 0.7771	0.747	0.610	0.690	0.675	0.717
LightGBM	0.667	0.710	0.6511 - 0.7685	0.684	0.647	0.692	0.638	0.688
GradientBoosting	0.639	0.683	0.6225 - 0.7441	0.620	0.662	0.681	0.600	0.649
AdaBoost	0.680	0.684	0.6230 - 0.7452	0.873	0.456	0.651	0.756	0.746
MLP	0.735	0.772	0.7177 - 0.8261	0.778	0.684	0.741	0.727	0.759

Acc: Accuracy; AUC: Area Under the Curve; Sens: Sensitivity; Spec: Specificity; PPV: Positive Predictive Value; NPV: Negative Predictive Value; F1: F1 Score

Performance of the Fusion Model

The deep features of breast tumors were used to calculate the predicted probability of lymph node metastasis for each subject (US signature) using the LR machine learning algorithm. Further, the US signature was integrated with the three independent clinical risk factors for breast cancer lymph node metastasis using the LR machine learning algorithm to construct a fusion model, as visualized in Fig. 4. Feature fusion further improved the model's performance. As shown in Table 5, the accuracy in the training and testing sets was 0.796 and 0.820, respectively. The AUC (95%CI) values were 0.863 (0.837-0.890) and 0.885 (0.847-0.922), with the results visualized in Fig. 5A. The sensitivity was 0.744 and 0.741, while the specificity was 0.854 and 0.912. The PPV was 0.846 and 0.907, and the NPV was 0.755 and 0.752. In order to confirm the practical significance of the model, clinical assessments showed that utilizing the fusion model for forecasting lymph node metastasis in breast cancer patients with a probability between 0.05 and 0.95 could result in a positive outcome (Fig. 5B). The confusion matrix in Fig. 5C shows True Negatives (TN) of 0.82 and True Positives (TP) of 0.79, indicating that the fusion model has higher predictive accuracy for non-lymph node metastasis patients than for lymph node metastasis patients. Fig. 5D illustrates the fusion model's ability to accurately predict lymph nodes and classify the risk of lymph node metastasis in breast cancer. The findings indicate that the fusion model, which combines tumor-deep features and clinical characteristics, is highly effective in predicting the risk of lymph node metastasis in breast cancer.

Table 5 Performance of the fusion model in the training set and the test set

	Acc	AUC	95% CI	Sens	Spec	PPV	NPV	F1
Training-set	0.796	0.863	0.8367 - 0.8903	0.744	0.854	0.846	0.755	0.792
Test-set	0.820	0.885	0.8465 - 0.9225	0.741	0.912	0.907	0.752	0.815

Breast cancer as a common malignant tumor among women has significant implications for prognosis in terms of its lymphatic metastasis. Accurate prediction of lymphatic metastasis is crucial for treatment and prognosis evaluation. With the rapid development of deep learning technology, its application in medical image processing and disease prediction has gradually attracted widespread attention. The research revealed that the combination model had higher accuracy in predicting patients without lymphatic metastasis, possibly due to the similarity in characteristics among these patients. Meanwhile, we also validated the importance of tumor size, grade, and lymphovascular invasion (LVI) as independent risk factors for breast cancer lymphatic metastasis. Furthermore, our research demonstrated that integrating deep learning technology with traditional clinical features could provide a more comprehensive and accurate method for predicting breast cancer lymphatic metastasis, potentially offering new insights for future breast cancer treatment and prognosis evaluation. Cancer cells have a higher chance of spreading to different body areas via the lymphatic system when they infiltrate lymphatic vessels, resulting in lymphatic metastasis. The study findings indicate that tumor size, grade, and LVI are significant factors that independently increase the risk of breast cancer spreading to the lymphatic system. This finding aligns with previous studies. Tseng et al. demonstrated a significant correlation between tumor size and grade with lymphatic metastasis through a comprehensive analysis of clinical data from a large number of breast cancer patients[15]. Similarly, Chen et al. found that LVI is a powerful predictor of lymphatic metastasis[16]. Additionally, some studies have emphasized the importance of tumor size, grade, and LVI in predicting breast cancer prognosis[17–19]. By considering these factors comprehensively, doctors can more accurately assess patients' conditions and develop personalized treatment plans. In the construction of the deep learning model, we selected Densenet121 for feature extraction. The Densenet architecture effectively mitigates the gradient vanishing problem in image processing and analysis through dense connections, enhancing feature propagation and making the learning and extraction of critical image information more efficient[20]. In the realm of breast cancer, deep learning has made impressive advancements. In a single study, a CNN was employed to categorize breast pathology images, successfully achieving automated identification and categorization of breast cancer cells with a precision similar to that of conventional pathologists[21]. Another study constructed a prognosis evaluation model by combining deep learning techniques with clinical data from breast cancer patients, which accurately predicted patients' survival rates and recurrence risks[22, 23]. Furthermore, deep learning can accurately classify breast cancer molecular subtypes by analyzing complex information such as gene expression data. A study utilizing a deep learning model to analyze gene expression profiles of breast cancer samples successfully divided the samples into different molecular subtypes, providing important evidence for precision medicine[24–26]. Our study successfully reduced the dimensionality of high-dimensional features to 32 features most relevant to lymphatic metastasis through PCA and feature selection methods. This dimensionality reduction strategy improves computational efficiency and helps the model focus on key information, thereby enhancing prediction accuracy. The subsequent results showed that the LR model exhibited the highest predictive performance based on deep learning features. This finding aligns with previous studies, indicating that the LR model is efficient and stable in handling carefully selected features[27]. More importantly, we constructed a more powerful predictive model by integrating deep learning features with clinical risk factors. This fusion method utilizes information from different sources, enabling the model to comprehensively evaluate patients' conditions from multiple perspectives. Research has indicated that combining image characteristics derived from advanced machine learning with patients' medical data can greatly enhance the precision and reliability of predicting diseases[28, 29]. A breast cancer detection model using deep learning has been created through research, integrating medical imaging characteristics and clinical information to achieve accurate breast cancer identification[30, 31]. The fusion model demonstrated notable enhancements in performance in both training and testing datasets, including improved accuracy and decreased false positives, confirming the efficacy of the fusion approach. Notably, the fusion model performed better in predicting non-lymphatic metastasis patients. The differences in characteristics between non-lymphatic and lymphatic metastasis patients may explain this discrepancy, with the former group displaying more consistent traits and the latter exhibiting more complex and varied clinical and pathological features. However, even in this case, the fusion model was still able to effectively classify the risk of breast cancer lymphatic metastasis, demonstrating its strong generalization ability and practicality.

Although a predictive model for breast cancer lymph node metastasis was successfully built by combining deep learning features and clinical data, this study is still subject to certain constraints. Firstly, as a single-center study, all our data were sourced from the same medical institution, which may limit the diversity and representativeness of the samples. Due to potential differences in equipment, technical proficiency, and patient populations among different medical institutions, our model may require adjustment or retraining when applied to other centers to adapt to different data distributions and characteristics. This, to a certain extent, restricts the model's generalizability and scalability. Secondly, this study relies on ultrasonic static image features to predict lymph node metastasis. While static images can provide rich morphological information, they fail to capture dynamic tumor changes, such as blood flow conditions and tissue elasticity. These factors may be valuable in comprehensively assessing tumor biological characteristics and predicting lymph node metastasis. To overcome these limitations, future studies could consider conducting multi-center collaborations to collect a broader and more representative dataset, thus enhancing the model's generalizability and robustness. Furthermore, incorporating dynamic ultrasound images, other imaging techniques like MRI and CT scans, and biomarkers into a holistic predictive model may enhance the precision and dependability of predicting lymph node metastasis.

This study effectively developed a predictive model for breast cancer lymph node metastasis by combining deep learning characteristics with clinical data and confirmed its efficacy. In the future, we will strive to conduct multi-center collaborations and integrate data from more modalities to enhance the model's accuracy, stability, and broad applicability. In summary, this study introduces a new approach and viewpoint for forecasting the spread of breast cancer to lymph nodes, which could assist healthcare providers in creating more accurate treatment strategies.

PCA: Principal component analysis; LMN: Lymph node metastasis; LR: Logistic regression; CNN: Convolutional neural network; PAC: Picture archiving and communication system; DCA: Decision curve analysis; TN: True negatives; TP: True positives. LVI: Lymphovascular invasion; Acc: Accuracy; AUC: Area Under the Curve; Sens: Sensitivity; Spec: Specificity; PPV: Positive Predictive Value; NPV: Negative Predictive Value; F1: F1 Score.

Acknowledgments

We thank the Key Laboratory of Ultrasonic Molecular Imaging and Artificial Intelligence, Guangxi Zhuang Autonomous Region Engineering Research Center for Artificial Intelligence Analysis of Multimodal Tumor Images, and Guangxi Key Laboratory of Early Prevention and Treatment for Regional High Frequency Tumor/Key Laboratory of Early Prevention and Treatment for Regional High Frequency Tumor (Guangxi Medical University), Ministry of Education for their support of this study.

Authors’ contributions

MW and YW have contributed to the conception, design, and model construction of the study. QC, SG, and ZL have contributed to the clinical data collection, collation, and statistical analysis. The collection and collation of imaging data were carried out by WH，DD and YC. HY was responsible for guiding the delineation of ROI. JW, YC, and EW handled the ultrasonic image data processing and feature extraction. MW and WH were responsible for writing the manuscript. YW provided technical support and rigorously revised and approved the manuscript. All authors have contributed to the article and approved the submitted version.

Funding

This study was funded by the Youth Science Foundation of Guangxi Medical University（GXMUYSF202327），the National Natural Science Foundation of China (82160336), the Natural Science Foundation of Guangxi (2023GXNSFDA026013, 2020GXNSFDA238005).

Availability of data and materials

Individual patient data is only accessed by authors who received approval of institutional review board according to institutional policies. Aggregate summary data may be provided upon request in keeping with protection of healthcare information.

Ethics approval and consent to participate

This study has been approved by the Medical Ethics Committee of the First Affiliated Hospital of Guangxi Medical University (Approval No. 2024-E393-01).

Consent for publication

Our study was not a case report and the need for informed consent for publication was waived by our institutional Medical Ethics Committee due to the noninvasiveness(retrospective analysis of existing data).

Competing interests

The authors declare no competing interests.

Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. Journa. 2024;74(3):229–63.
Tang YL, Wang B, Ou-Yang T, Lv WZ, Tang SC, Wei A, et al. Ultrasound radiomics based on axillary lymph nodes images for predicting lymph node metastasis in breast cancer. Journa. 2023;13:1217309.
Jiang M, Li CL, Luo XM, Chuan ZR, Lv WZ, Li X, et al. Ultrasound-based deep learning radiomics in the assessment of pathological complete response to neoadjuvant chemotherapy in locally advanced breast cancer. Journa. 2021;147:95–105.
Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM, et al. Dermatologist-level classification of skin cancer with deep neural networks. Journa. 2017;542(7639):115–18.
Litjens G, Kooi T, Bejnordi BE, Setio AAA, Ciompi F, Ghafoorian M, et al. A survey on deep learning in medical image analysis. Journa. 2017;42:60–88.
Gulshan V, Peng L, Coram M, Stumpe MC, Wu D, Narayanaswamy A, et al. Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. Journa. 2016;316(22):2402–10.
Chen X, Wang X, Zhang K, Fung KM, Thai TC, Moore K, et al. Recent advances and clinical applications of deep learning in medical image analysis. Journa. 2022;79:102444.
Shen L, Margolies LR, Rothstein JH, Fluder E, McBride R, Sieh W. Deep Learning to Improve Breast Cancer Detection on Screening Mammography. Journa. 2019;9(1):12495.
Wu N, Phang J, Park J, Shen Y, Huang Z, Zorin M, et al. Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening. Journa. 2020;39(4):1184–94.
McKinney SM, Sieniek M, Godbole V, Godwin J, Antropova N, Ashrafian H, et al. International evaluation of an AI system for breast cancer screening. Journa. 2020;577(7788):89–94.
Kooi T, Litjens G, van Ginneken B, Gubern-Mérida A, Sánchez CI, Mann R, et al. Large scale deep learning for computer aided detection of mammographic lesions. Journa. 2017;35:303–12.
Ehteshami Bejnordi B, Veta M, van Johannes P, van Ginneken B, Karssemeijer N, Litjens G, et al. Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer. Journa. 2017;318(22):2199–210.
Du LW, Liu HL, Gong HY, Ling LJ, Wang S, Li CY, et al. Adding contrast-enhanced ultrasound markers to conventional axillary ultrasound improves specificity for predicting axillary lymph node metastasis in patients with breast cancer. Journa. 2021;94(1118):20200874.
Zong Q, Deng J, Ge W, Chen J, Xu D. Establishment of Simple Nomograms for Predicting Axillary Lymph Node Involvement in Early Breast Cancer. Journa. 2020; 12:2025-35.
Tseng HS, Chen LS, Kuo SJ, Chen ST, Wang YF, Chen DR. Tumor characteristics of breast cancer in predicting axillary lymph node metastasis. Journa. 2014;20:1155–61.
Chen H, Meng X, Hao X, Li Q, Tian L, Qiu Y et al. Correlation Analysis of Pathological Features and Axillary Lymph Node Metastasis in Patients with Invasive Breast Cancer. Journa. 2022; 2022:7150304.
Akrami M, Meshksar A, Ghoddusi JM, Safarpour MM, Tahmasebi S, Zangouri V, et al. Prognostic Role of Lymphovascular Invasion in Patients with Early Breast Cancer. Journa. 2021;12(4):671–77.
Peng G, Zhou Z, Jiang M, Yang F. Can a subgroup at high risk for LRR be identified from T1-2 breast cancer with negative lymph nodes after mastectomy? A meta-analysis. Journa. 2019; 39(9).
Vranes V, Rajković N, Li X, Plataniotis KN, Todorović Raković N, Milovanović J et al. Size and Shape Filtering of Malignant Cell Clusters within Breast Tumors Identifies Scattered Individual Epithelial Cells as the Most Valuable Histomorphological Clue in the Prognosis of Distant Metastasis Risk. Journa. 2019; 11(10).
Huang G, Liu Z, Pleiss G, Maaten LV, Weinberger KQ. Convolutional Networks with Dense Connectivity. Journa. 2022;44(12):8704–16.
Jiang Y, Chen L, Zhang H, Xiao X. Breast cancer histopathological image classification using convolutional neural networks with small SE-ResNet module. Journa. 2019;14(3):e0214587.
Arya N, Saha S. Deviation-support based fuzzy ensemble of multi-modal deep learning classifiers for breast cancer prognosis prediction. Journa. 2023;13(1):21326.
Afrin H, Larson NB, Fatemi M, Alizad A. Deep Learning in Different Ultrasound Methods for Breast Cancer, from Diagnosis to Prognosis: Current Trends, Challenges, and an Analysis. Journa 2023; 15(12).
Niyas S, Bygari R, Naik R, Viswanath B, Ugwekar D, Mathew T et al. Automated Molecular Subtyping of Breast Carcinoma Using Deep Learning Techniques. Journa. 2023; 11:161 – 69.
Boulenger A, Luo Y, Zhang C, Zhao C, Gao Y, Xiao M, et al. Deep learning-based system for automatic prediction of triple-negative breast cancer from ultrasound images. Journa. 2023;61(2):567–78.
Ma M, Liu R, Wen C, Xu W, Xu Z, Wang S, et al. Predicting the molecular subtype of breast cancer and identifying interpretable imaging features using machine learning algorithms. Journa. 2022;32(3):1652–62.
Cherkassky V, Ma Y. Another look at statistical learning theory and regularization. Journa. 2009;22(7):958–69.
Yuan L, Yang L, Zhang S, Xu Z, Qin J, Shi Y, et al. Development of a tongue image-based machine learning tool for the diagnosis of gastric cancer: a prospective multicentre clinical cohort study. Journa. 2023;57:101834.
Kalafi EY, Nor NAM, Taib NA, Ganggayah MD, Town C, Dhillon SK. Machine Learning and Deep Learning Approaches in Breast Cancer Survival Prediction Using Clinical Data. Journa. 2019;65(5–6):212–20.
Lotter W, Diab AR, Haslam B, Kim JG, Grisot G, Wu E, et al. Robust breast cancer detection in mammography and digital breast tomosynthesis using an annotation-efficient deep learning approach. Journa. 2021;27(2):244–49.
Yala A, Lehman C, Schuster T, Portnoi T, Barzilay R. A Deep Learning Mammography-based Model for Improved Breast Cancer Risk Prediction. Journa. 2019;292(1):60–6.

No competing interests reported.

Download PDF

Editor assigned by journal
05 Jun, 2024
Submission checks completed at journal
03 Jun, 2024
First submitted to journal
03 Jun, 2024

You are reading this latest preprint version

Deep Learning-Enabled Ultrasound Radiomics for Accurate Prediction of Breast Cancer Lymph Node Metastasis

Status:

Version 1

Abstract

Figures

Introduction

Materials and Methods

Patient selection

Image Acquisition and Processing

Transfer Learning

Extraction and Screening of Deep Features

Model Construction and Validation

Statistical Analysis Methods

Results

Patient Clinical Data

Performance of the Deep Learning Model

Performance of the Fusion Model

Discussion

Conclusions

Abbreviations

Declarations

References

Additional Declarations

Supplementary Files

Status:

Version 1