AML and ccRCC are respectively the most common benign and malignant renal tumors. It is essential to distinguish between these two entities due to their distinct treatment approaches and prognoses. For ccRCC, common treatment methods involve surgical intervention and ablation therapy, whereas AML often involves observation and follow-up, with treatment being considered for patients with tumor diameters greater than 4cm or those experiencing tumor bleeding [Dell'Atti et al. 2022; Silson et al. 2021].
In clinical practice, CT and MRI are widely used as effective imaging assessment tools for renal tumors. The features that differentiate fat-poor AML and ccRCC include age, gender, CT density on plain scan, and T2WI signal at all [Tannka et al. 2017; Jomoto et al.2022]. The logistic regression analysis results of this study reveal that ccRCC is more prevalent in elderly male patients. Furthermore, ccRCC exhibits a more heterogeneous T2WI signal compared to fat-poor AML. This increased heterogeneity in ccRCC is attributed to its stronger tumor heterogeneity, making it more susceptible to cystic degeneration and necrosis. Consequently, ccRCC demonstrates a greater mixture of high and low abnormal signals on T2WI, consistent with previous research findings. However, DWI signal and ADC values were not included as an independent predictive factor in the clinical model. It is believed that this may be due to smaller ccRCC tumors often having lower malignancy, resulting in DWI features of these low-grade tumors being somewhat similar to lipid-poor AML. Therefore, the similarity between these features emphasizes the necessity of utilizing more detailed and advanced radiomics and machine learning techniques in tumor differentiation to enhance accuracy.
In recent years, the development and advancement of radiomics have significantly enhanced the ability to predict and classify renal tumors. CT and MRI imaging have been extensively utilized in radiomics and machine learning research to differentiate renal masses [Kim et al. 2022; Sun et al. 2022]. Given that T2, DWI, and ADC maps are widely adopted sequences in MRI examinations in various clinical settings, these three sequences were selected for feature extraction in the current study. A total of 3948 radiomic features were extracted from the T2WI, DWI and ADC sequences, from which 13 most significant features were ultimately identified to construct the radiomics model. In the test set, the model achieved an AUC value of 0.778, representing a substantial improvement compared to previous studies [Matsumoto et al. 2022; Arita et al. 2021] that had focused on extracting only a few dozen features. This expanded feature analysis permits a more thorough exploration of tumor information, thereby enhancing the predictive and classificatory capabilities of radiomics in the context of renal tumors.
Based on the content mentioned above, this study combined conventional MRI features and radiomics features to establish a radiomics nomogram with improved predictive value for distinguishing lipid-poor AML and small renal ccRCC in both the training and validation sets. The AUC values for the nomogram were 0.974 and 0.833, respectively, surpassing those of the standalone clinical model or the imaging radiomics model. This suggests that radiomics features complement conventional clinical and imaging features. The results align with Nie et al.'s [Nie et al. 2020] work, who developed a CT radiomics nomogram. By leveraging both CT and MRI advantages, combining information from both modalities proves to be more effective in distinguishing these entities. Jian et al. [Jian et al. 2022] conducted a study wherein they focused on the clinical value of urinary creatinine by using T2WI sequences and IVIM sequences alongside clinical data and MRI radiomics models to construct a radiomics nomogram for differentiating lipid-poor AML and RCC. Their clinical-radiomics model achieved a commendable AUC of 0.931, demonstrating superior performance compared to the current study. The reason behind this superiority lies in the ability of IVIM sequences to differentiate between the diffusion of water molecules and tissue microcirculation perfusion, thereby providing more accurate descriptions of cellular functional changes and tissue microstructure when compared to DWI sequences. Consequently, IVIM sequences offer a more comprehensive range of information [Malagi et al. 2022; Malagi et al.2023]. However, despite these advantages, IVIM sequences are not extensively utilized in clinical practice, thereby limiting their practical value. Future research could consider integrating other imaging features and clinical factors to further enhance the application effectiveness of radiomics nomograms. These studies further demonstrate that models built using multiple parameters have higher predictive efficacy. By comprehensively utilizing conventional imaging features and radiomics techniques, the ability to differentiate between these two types of small renal masses is improved, providing strong support for more precise treatment plans for patients.
In addition to radiomics models, deep learning models are also being integrated into the study of renal tumors and have achieved good predictive results [Xu et al. 2022; Xi et al. 2020]. The application of deep learning may bring more innovations to the study of kidney tumors, and when combined with traditional methods such as radiomics, it is expected to achieve better outcomes in clinical practice.
There were several limitations to this study. Firstly, the small sample size of this study, particularly for fp-AML, which corresponds to the low overall incidence of this subtype in the population. In addition, for solid renal masses, MRI is not often the preferred imaging modality, and most of the patients in our study had diagnostic difficulties with enhanced CT or ultrasound. Therefore, the potential for selection bias within the patient cohort of this retrospective study highlights the need for further prospective investigations to accurately assess the diagnostic significance in real-world clinical scenarios. Secondly, the data used to train the radiomics models was obtained on two MRI scanning instruments. Previous literature [Kalpathy et al. 2016; Timmeren et al. 2020] has reported that the CT scanner models affect the reproducibility of radiomics features. Thus, different MRI scanner models may also impact data analysis. However, it is essential to highlight that this variation would enable the creation of more generalized models. Future research opportunities may arise from the need to provide additional training or validation of machine learning models on an external dataset to account for this variability and assess its appropriateness. This would require extra effort in accommodating the variability and evaluating the applicability, thereby offering a potential avenue for further investigation.
To sum up, the research conducted in our study focused on the development of a radiomics nomogram based on multiparametric MRI. This nomogram demonstrated excellent predictive effectiveness in distinguishing fp-AML from small ccRCC before surgical intervention. Utilizing radiomics nomograms, which are quantitative and non-invasive, for clinical decision-making has the potential to be highly beneficial. However, it is essential to validate these findings extensively before implementing them widely in clinical practice.