2.1 Chemical and Reagents
Cell culture media and supplements were purchased from Life Technologies, Inc (Grand Island, NY). Fetal bovine serum (FBS) was purchased from Atlanta Biologicals (Flowery Branch, GA), and dextran-coated-charcoal (DCC) stripped was purchased from Gemini Bioscience (Sacramento, CA). Estradiol (E2), ≥98%, was purchased from Sigma Aldrich (St. Louis, MO). Propylpyrazoletriol (PPT), ≥99% (HPLC), was purchased from Tocris Bioscience (Minneapolis, MN). Dimethylsulfoxide (DMSO) was purchased from Acros Organics (Fair Lawn, NJ). Agarose was purchased from Fisher Scientific (Agawam, MA). All the chemicals and reagents were obtained from commercial suppliers.
2.2 Cell Culture and Chemical Treatment
2D Cell Culture: MCF-7 (ATCC No. HTB-22) human breast carcinoma cells (Soule et al. 1973) were cultured according to the previously described protocol (Vantangoli et al. 2015). Briefly, the MCF7 cells were limited to use within the first 15 passages from the original purchased vial from ATCC, to control for genomic drift due to instability. The MCF7 cells were maintained in a growth medium made of phenol-red free DMEM-F12 medium containing 10% FBS, MEM nonessential amino acids, gentamicin, and 10μg/mL insulin in a 5% CO2 incubator at 37°C.
3D Microtissue Culture: The 3D Petri Dish 12-256-small spheroids molds (Microtissues Inc., Providence, RI) was used to make non-adhesive agarose hydrogels, which were seeded with cells as previously described (Vantangoli et al. 2015) MCF-7 cells grown in monolayer in tissue culture flasks were trypsinized, counted and seeded into agarose hydrogels at a density of 600,000 cells/mL. MCF-7 cells were allowed to settle into recesses for 30 minutes before 2mL of treatment media was added.
Estrogenic Compound Treatment: Solutions of E2, PPT, or vehicle control (DMSO) were made in treatment media made of phenol-red free DMEM-F12 medium containing 5% DCC FBS, MEM nonessential amino acids, gentamicin, and 6ng/mL insulin. Following the seeding of MCF-7 cells into hydrogels, 2mL of treatment media with E2, PPT, or DMSO was added. Plates were kept in a 5% CO2 incubator and cultured for up to 7 days. Treatment media was changed on day 3 and 5 of the experiment.
2.3 RNA Isolation and Gene expression
MCF7 microtissues were collected from hydrogels by centrifugation, pelleted, and lysed in Tri Reagent. The total RNA was extracted according to a previous protocol{Li, 2023 #39} (PMID:37230229). Each experiment was designed with three biological replicates. For each biological replicate, seeding cells were from separate flasks, and microtissues from six hydrogels (256 microtissues/ gel) were collected. RNA quantity was determined using a Nanodrop ND1000. For use in qRT-PCR, cDNA was made using the RT2 First Strand Kit (Qiagen) per the manufacturer's instructions. qRT-PCR was performed using RT2 SYBR Green Rox qPCR Mastermix with RT2 qPCR Primer Assays (Qiagen) to determine gene expression levels of PDZ domain containing 1 (PDZK1, PPH08038E), apolipoprotein D (APOD, PPH02630A), cytochrome P450, family 1, subfamily A, polypeptide 1 (CYP1A1, PPH01271F), transforming growth factor, beta 3 (TGFB3, PPH00531F) and normalized to ribosomal protein, large P0 (RPLPO, FWD GTGTTCGACAATGGCAGCAT, REV GACACCCTCCAGGAAGCGA). Plates were run on an Applied Biosystems ViiA 7 machine using cycling conditions recommended by the manufacturer. The mean CT for the target genes and the geometric mean CT for the endogenous control (RPLP0) genes were calculated, and the mean CT for the endogenous controls was subtracted from the mean CT for each target gene within each experiment to give the Δ Mean. The ΔCт Mean at each treatment (E2 or PPT) was subtracted from the control (untreated) ΔCт Mean to provide the ΔΔCт for each treatment. Finally, the ΔΔCт values were raised to the power of 2 (2-ΔΔCт) to give the fold change in the target gene at each time point relative to the DMSO control.
2.4 Imaging Feature Extraction
Cell clearing and imaging: Following treatment, microtissues were rinsed in PBS, fixed in formalin for 15 minutes at room temperature, rinsed in PBS twice, and then stored in PBS at 40C until ready to image. Before imaging, microtissues were switched to ScaleS4 containing 1:1000 Hoechst 33342 and 1:200 rhodamine-phalloidin. ScaleS4 is composed of 40 w/v% D-(-)-sorbitol, 10 w/v% glycerol, 4M Urea, 0.2 w/v% Triton X-100, and 15 v/v% DMSO in deionized water. After 3 hours, ScaleS4 was removed. Agarose hydrogels were removed from a 12-well plate, placed on a paper towel, the extra agarose was removed from the sides, and then flipped over into a 24-well cell imaging plate (Eppendorf) containing 50 uL of fresh ScaleS4. Cell imaging was performed using an Opera Phenix™ High Content Screening System (Perkin Elmer) using a 20x water objective (NA 1.0, HH1400421, PerkinElmer). Image stacks were taken with a 5 µm step size. A 3D image screening protocol was set up to obtain the 3D image of the MCF7 microtissues.
Cell counts: Based on the 3D microtissue image acquired above, the Harmony software built a cell count protocol for each microtissue's total cell count. Briefly, channels of three views were summed, filtered to remove background noise, and bright areas above the set absolute threshold were identified via the 'find image region' method. Several positions and morphology properties (including contact area and the nearest neighbor distance) were calculated and used to filter out image artifacts. After that, nuclei were segmented within each aggregate region via the ‘find nuclei’ method, algorithm 'C.' Similarly, property calculation and filtering were performed to further select bonafide nuclear regions for counting (Supplemental_Data_1).
2D Image selection and feature extraction: In Harmony, the cellular region area on each image slice was measured, and the image slice with the largest cellular region area was selected as the representative 2D image of the respective microtissue. A 2D image feature extraction pipeline was built in Harmony, and the pipeline was applied to the 2D images selected above. Briefly, the pipeline identified objects, such as the image or nuclear region, and then extracted morphological features, such as area, length, roundness, and a collection of texture features. (See Supplemental_Data_2 and Supplemental_Data_3 for a detailed feature extraction pipeline and Supplemental_Data_4 for a complete list of features). A quantitative value was calculated for each feature, and a number matrix was generated and exported for further analysis.
2.5 Image Feature Analysis
Data normalization and Regrouping: A well-established analysis method designed for enhancing multi-class data normalization was adopted here to identify the optimal normalization method for the data. This method is capable of (1) normalizing the multi-class data using 168 different normalization methods/strategies, (2) evaluating the performances of every single method/strategy from multiple perspectives, and (3) comparing the performance of all these normalization methods/strategies based on a comprehensive ranking to identify superior one (Yang et al. 2020).
Since none of the normalization method performed well in analyzing the data in the original groups, we regrouped the samples for reanalysis. For PPT, 1nM PPT, 3nM PPT, and 10nM PPT were combined into the high concentration group, and the 0.1nM PPT was referred to as the medium concentration group, respectively and the 0.01nM PPT as low concentration group. For E2, the 0.1nM E2 and 1nM E2 were combined as the high concentration group, 0.0001nM E2 and 0.001 E2 were combined as the low concentration group, and the left 0.01nM E2 as the medium concentration group.
After regrouping the samples in the PPT group and E2 group, we then analyzed the regrouped data with the method above and successfully identified several normalization methods, which were evaluated as well performed: for the regrouped PPT data, the best normalization method is Range Scaling, and for regrouped E2 data, the best normalization method is Power Scaling (van den Berg et al. 2006) (Supplemental_Data_5).
Feature selection: For multi-class data, the orthogonal partial least squares-discriminant analysis (OPLS-DA) is a commonly used strategy for identifying differential markers (Thevenot et al. 2015) is therefore adopted in our studies for feature selection. The OPLS-DA was conducted by running the opls function in the ropls R package (Thevenot et al. 2015). Parameters ‘orthoI’, ‘crossvalI’, and ‘predI’ of the opls function were set to ‘NA’, ‘2’, and ‘1’, respectively, which means that the number of orthogonal components will be computed and optimized based on 2-fold cross-validation and one predictive component. Principle Component Analysis (PCA): The PCA was conducted via the MetaboAnalystR R package (Pang et al. 2020).
Machine Learning Classification Model: The machine learning algorithm we adopted for constructing the classification models based on our identified markers was Random Forest (RF) since our data all contains more than 2 sample groups (Breiman, L. (2001) Random forests. Machine Learning 45, 5-32.). The RF method combined several decision tree predictors and classified the samples based on the majority of votes of a series of binary questions about given features. In our study, a training set and a test set were generated by stratified sampling from the same group in a ratio of 8:2, then the training set was used to train the RF model via the randomForest function in randomForest R packages, and the parameter ntree was set to 100; finally, the test set was used to evaluate the performance of trained RF model by calculating the AUC value via the multi_roc function in multiROC R packages.
2.6 Luminal Volume Acquisition and Analysis
An automated system was built to perform the luminal volume acquisition and analysis using the data generated by the high-content imaging instrument. The system consists of three parts: an image processing pipeline, a deep learning pipeline, and a volumetric analysis step. The image processing pipeline first enhances the input 2D images (z-slices) then applies a sequence of image processing operations to prepare images for the classification phase. Once the images are ready, we use our deep learning classifier (that we have trained – transfer learning – using 1000 manually marked lumens) to differentiate “true” lumens from “false” ones. The volumetric analysis step re-constructs 3D lumens from the groups of nearby “true” 2D lumens identified by the classifier. Finally, the last step calculates the volume and surface area of each of these 3D re-constructed lumens. While the user interface of the system is developed using Java, the core functionality (image processing, deep learning, re-construction, and volumetric analysis) uses MATLAB R2018a image processing toolbox, transfer learning functionality, and computational geometry toolbox, respectively. The code and associated information have been archived at https://github.com/000haitham000/lumen-explorer.
2.7 Statistical Analysis
The cell count results are represented as the mean ± SD. The gene expression data are expressed as the mean ± SD value of the relative fold change. For all comparisons of the cell count and gene expression values, one-way analysis of variance (ANOVA) statistical analysis was employed with Turkey’s multiple comparisons posttest to compare among different concentrations. All analysis was carried out using GraphPad Prism software (GraphPad Software, Inc., La Jolla, California, USA).