Data from three unique sites per organ (prostate, glioblastoma, and breast) were assessed for this study. Details from each site are further detailed in the subsequent sections; however, a simplified table of these data sites and organs is provided in Table 7.
Table 7. Breakdown of prostate, glioblastoma, and breast cancer data by data site, MR manufacturer, and magnetic field strength.
|
|
Demographics
|
MR Vendor
|
Magnetic Field Strength
|
|
|
Patients
|
Sex
|
GE
|
Siemens
|
Philips
|
1.5 T
|
3T
|
Prostate
|
Total
|
641
|
M: 641
|
256
|
295
|
90
|
89
|
552
|
Site 1
|
385
|
M: 385
|
256
|
125
|
4
|
3
|
382
|
Site 2
|
86
|
M: 86
|
0
|
0
|
0
|
86
|
0
|
Site 3
|
170
|
M: 170
|
0
|
170
|
86
|
0
|
170
|
Glioblastoma
|
Total
|
1016
|
M: 615
|
410
|
606
|
0
|
51
|
965
|
F: 401
|
Site 1
|
52
|
M:35
|
36
|
16
|
-
|
36
|
16
|
F: 17
|
Site 2
|
590
|
M: 358
|
0
|
590
|
-
|
15
|
575
|
F: 232
|
Site 3
|
374
|
M: 222
|
374
|
0
|
-
|
0
|
374
|
F: 152
|
Breast
|
Total
|
236
|
F: 236
|
190
|
46
|
0
|
185
|
51
|
Site 1
|
68
|
F: 68
|
68
|
0
|
-
|
68
|
0
|
Site 2
|
100
|
F: 100
|
54
|
46
|
-
|
49
|
51
|
Site 3
|
68
|
F: 68
|
68
|
0
|
-
|
68
|
0
|
4.1. Prostate Cancer Cohort
4.1.1. Site 1 – LOCAL
Data from 385 prospectively recruited patients treated locally at our institution (Table 7; Figure 7 A, top) with pathologically confirmed prostate cancer undergoing radical prostatectomy between 2014 and 2023 were analyzed for this institutional review board (IRB) approved study. Written informed consent was obtained from all patients. Inclusion criteria for this cohort included clinical imaging including T2-weighted imaging prior to surgery.
Patients underwent multi-parametric magnetic resonance imaging (MP-MRI) prior to prostatectomy on 1.5 T (n1.5T = 3) or 3T (n3T = 382) GE (nGE = 256), Siemens (nS = 125) or Philips (nP = 4) MRI scanner (General Electric, Waukesha, WI, USA; Siemens Healthineers, Erlangen, Germany; Philips, Amsterdam, Netherlands) (Figure 7, B). A subset of patients (n = 68) had additional imaging after removal of the endorectal coil on either the GE or Siemens scanner (nGE = 52, nS = 16) (Figure 7, C). Each protocol included T2-weighted imaging with acquisition parameters as follows: repetition time (TR) = 3370 milliseconds, FOV = 120 mm, voxel dimensions = 0.23 × 0.23 × 3 mm, acquisition matrix = 512, and slices = 26. All image contrasts used in this study were acquired axially.
4.1.2. Site 2 – PROSTATE-DIAGNOSIS
A publicly available dataset including prostate T2WI scanned on a 1.5 T Philips Achieva using a combined surface and endorectal coil was used for our second site31,32. From a total of 92 patients, images from 86 patients were ultimately used in this analysis due to image quality (Table 7; Figure 7 A, middle).
4.1.3. Site 3 – PROSTATEx
The final dataset used in this analysis was a collection of retrospective prostate MR studies including T2WI acquired on two different 3T Siemens MR scanners (MAGNETOM Trio and Skyra)32,33. T2W imaging acquisition parameters include a turbo spin echo sequence with a resolution of ~0.5 mm in plane and a slice thickness of 3.6 mm. All images were acquired without an endorectal coil. After exclusion of images with poor quality, a total of 170 patients’ images were used (Table 7; Figure 7 A, bottom).
4.2. Glioblastoma Cohort
4.2.1. Site 1 – LOCAL
Written, informed consent was obtained from 52 patients for this cohort, each diagnosed with a glioblastoma in concordance with the 2021 WHO classification standards for brain tumors. Inclusion criteria for this cohort included autopsy confirmed GBM and axial clinical imaging including pre- and post-contrast T1-weighted images (T1, T1C), FLAIR, and DWI 1.5 T (n1.5T = 36, n3T = 16, nGE = 36, nS = 16). Due to the use of clinical imaging, acquisition parameters were not standardized across patients. Axial T1, T1C, FLAIR, and ADC images were selected as the primary acquisitions for this study. ADC maps were calculated using the patient’s clinical DWI. T1, T1C, and ADC images were rigidly aligned to patient’s FLAIR image using SPM12 (https://www.fil.ion.ucl.ac.uk/spm/software/spm12/) (Table 7; Figure 8 A, B, C, D top rows). Examples of images scanned on the GE and Siemens scanners in Figure 8 are from this dataset.
4.2.2. Site 2 – UPENN-GBM
Data from this online repository includes MP-MRI for de novo GBM patients from the University of Pennsylvania Health System32,34. All axial images in this dataset, including T1, T1C, FLAIR, and ADC, were skull-stripped co-registered by an automated computational method10. A total of 590 patients from this dataset were used after excluding images without all four pre-surgery acquisitions or poor quality (Table 7; Figure 8 A, B, C, D middle rows).
4.2.3. Site 3 – UCSF-PDGM
Site 3 data comes from the publicly available University of California San Francisco Preoperative Diffuse Glioma MRI (UCSF-PDGM) dataset32,35. This dataset includes 501 subjects with histopathologically-proven diffuse gliomas who were imaged with a preoperative MRI using a 3T GE Discovery 750. Each image contrast was registered to the FLAIR image (1 mm isotropic resolution) using automated non-linear registration (Advanced Normalization Tools). Resampled co-registered data were then skull stripped using a publicly available deep-learning algorithm36,37 Table 7; Figure 8 A, B, C, D bottom rows). Though a total of 501 adult patients with pathologically confirmed grade II-IV diffuse gliomas were collected for this database, only the 374 patients with confirmed GBM were used.
4.3. Breast Cancer Cohort
All datasets used for our breast imaging analyses were available online (https://cancerimagingarchive.net)32 and analysis was performed on non-fat suppressed T1 images (T1nFS) (Figure 9).
4.3.1. Site 1 – ACRIN 6698
The ACRIN trial 6698, organized by the American College of Radiology Imaging Network, was a multi-institutional research project38,39. Its purpose was to determine the efficacy of quantitative DWI in measuring the response of breast cancer to neoadjuvant chemotherapy (NAC). A total 406 women with invasive breast cancer were prospectively enrolled to ACRIN 6698 at ten institutions between August 2012 to January 2015. However, after applying our exclusion criteria described previously in 2.3. Breast Cancer Cohort, only 68 patients’ images were assessed. All patients underwent breast MRI at 4 timepoints over the course of NAC, though only the pre-treatment images are analyzed in this study. MR imaging was performed on a 1.5T GE scanner using a dedicated breast radiofrequency coil (Figure 9 A, top; Figure 9 top). Detailed MRI protocol parameter specifications can be found on https://cancerimagingarchive.net/40.
4.3.2. Site 2 – Duke-Breast-Cancer-MRI
This breast cancer cohort was downloaded from the publicly available MRI dataset 41. The Duke-Breast-Cancer-MRI dataset contains 922 female patients recruited between 2000 and 2014, however, only 351 patients were included in our analyses due to availability of T1nFS images and image quality. Because of annotation constraints described below, a random selection of 100 patients were chosen from the eligible patients for this analysis. As with our local GBM cohort, clinical imaging was provided in the dataset, thus acquisition parameters were not standardized across patients (n1.5T = 49, n3T = 51, nGE = 54, nS= 46) (Figure 9 A, middle; Figure 9 B).
4.3.3. Site 3 – ISPY2
I-SPY 2 (Investigation of Serial Studies to Predict Your Therapeutic Response with Imaging And moLecular analysis 2) is an ongoing, multi-center study. Its objective is to swiftly assess the effectiveness of novel treatments for breast cancer within the context of NAC42. Adult women diagnosed with locally advanced breast cancer (tumor size ≥2.5 cm) without distant metastasis recruited between 2010 and 2016 were analyzed for this study. Breast MRI data was acquired prospectively at over 22 clinical centers using a standardized image acquisition protocol. Patients underwent 4 MRI exams before and during NAC, though only the first scan was assessed in the current study. This is a comprehensive, highly curated imaging data set with histopathologic outcome that can be used to develop, test, and compare imaging metrics and prediction models for breast cancer response to treatment. A total of 719 patients were included in this dataset, however, only 68 were assessed after applying the exclusion criteria. MR imaging was performed on a 1.5T GE scanner. All required imaging was performed axially with full bilateral coverage43 (Figure 9 A, bottom; Figure 9 B top).
4.4. MRI Normalization
Multiple normalization methods were used for each of the three tissue types. Tissue and regions of interest (ROIs) were defined for each tissue type using AFNI (Analysis of Functional NeuroImages, http://afni.nimh.nih.gov/)44. Prostate masks were created on each slice of the patient’s T2-weighted image (T2WI) and additional 10-voxel radius circular ROIs were defined on one slice of the patient’s T2WI within the (1) bladder and (2) levator ani muscle. Corresponding masks were created on the T2WI for patients who had an additional scan done post-endorectal coil removal. Examples of these masks can be viewed in Figure 10, top. The three prostate acquisitions tested in this study were normalized using (1) unnormalized, (2) the standard deviation of intensity within the prostate mask, (3) the Z-score of masked intensity, and the mean intensity within the (4) bladder and (5) levator ani muscle.
Brain imaging masks were segmented using SPM12, defined as the combination of the white and gray matter masks. Cerebral spinal fluid (CSF) masks were created by thresholding the ADC for the high diffusion areas, as this is an indicator of fluid. Finally, intensity within the brain mask was scaled into values of 0-255 as used in45. Aligned T1, T1C, FLAIR, and ADC were normalized using (1) unnormalized, (2) standard deviation and (3) Z-score of intensity within the brain mask, and mean intensity within the (4) CSF, and (5) scaled intensity (Figure 10, middle).
Breast masks were manually drawn on MR images using ITK-Snap. Due to the size of each patient’s imaging, only the center 15 slices were annotated. Similarly to the previously mentioned organs, a breast mask was first drawn, encompassing the breast tissue only. A mask of the sternum was drawn on the axial images and location was verified using the sagittal and coronal images. Finally, the thorax was annotated, avoiding any additional tissue. Five normalization methods were applied to the T1nFS images: (1) unnormalized, (2) standard deviation and (3) Z-score of intensity within the breast mask and using the mean intensity within the (4) sternum and (5) thorax masks (Figure 10, bottom).
4.5. Radiomic feature calculation
Radiomic features were calculated across each image using Matlab’s radiomics function which calculates a total of 197 features. These include 136 texture features (i.e., 50 gray level co-occurrence matrix (GLCM), 16 gray level dependence zone matrix (GLDZM), 32 gray level run length matrix (GLRLM), 16 gray level size zone matrix (GLSZM), 17 neighboring gray level dependence matrix (NGLDM), and 5 neighboring gray tone difference matrix (NGTDM)), and 61 intensity features (i.e., 18 Intensity Based Statistics, 23 Intensity Histogram, 18 Intensity Volume Histogram, and 2 Local Intensity).
4.6. Statistical analysis
Mean and standard deviation of MRI intensity as well as radiomic features were calculated across patients following normalization. Intensity distributions were compared across sites, MR vendors, magnetic field strength (i.e., 1.5T v 3T) using a two one-sided (TOST) test, a test of equivalence that is based on the classical t-test46. While the TOST test requires both one-sided tests to be statistically significant (i.e., < 0.05), all results described below use the highest p-value for each test.