Glioblastoma and neuroblastoma cell culture and transfections
Glioblastoma U251 and U343 cells were obtained from the University of Uppsala (Sweden) and cultured in Dulbecco’s modified Eagle’s medium supplemented with 10% fetal bovine serum, 100 U/ml penicillin, and 100 µg/ml streptomycin. Cells were maintained at 37°C in a 5% CO2 atmosphere. Neuroblastoma BE(2)C were obtained from the American Type Culture Collection and Kelly cells were obtained from the Cancer Therapy and Research Center, San Antonio, TX. Neuroblastoma cells were grown in DMEM/F-12 supplemented with 10% fetal bovine serum, 100 U/ml penicillin, and 100 µg/ml streptomycin. Glioma stem cell (GSC) lines (Mesenchymal: 3565, 3128, 1123NS and Proneural: 1919, 19NS, 84NS) were gifts from Drs. Jeremy Rich, Christopher Hubert, and Ichiro Nakano [24, 25]. GSCs were cultured in serum-free media consisting of Neurobasal-A media supplemented with B-27, sodium pyruvate, Glutamax, penicillin/streptomycin, 20 ng/ml EGF (ThermoFisher), and 20 ng/ml hFGF (PeproTech). Every 72 hours, GSCs were pulsed with EGF/FGF. Dissociation was performed by incubating GSCs with Accutase (ThermoFisher) at room temperature for 10 minutes.
miRNA mimics were obtained from Qiagen (Negative Control Mimic: YM00479903, miR-124: YM00471256, miR-128: YM00471226, miR-137: YM00472450, miR-29a-5p: YM00470481, miR-218-5p: YM00471984, miR-101-5p: YM00470928), and were used in the experiments as described below.
Proliferation assays
Glioblastoma and neuroblastoma cells were reverse-transfected with control or miRNA mimics into 96-well plates at densities ranging from 0.8-5 x 103. Growth curves were generated by an automated and non-invasive Incucyte® system live cell imaging system (Essen BioSciences). Cells were imaged every 4-8 hours. All experiments were performed with three biological and technical replicates.
Differentiation assays
Neuroblastoma cells were reverse-transfected with control or miRNA mimics into 96-well plates at a density of 2.5-5 x 103. Cells were cultured for 120 hours and were imaged with the Incucyte® system live cell imaging system. Differentiation was assessed by total neurite length using IncuCyte® NeuroTrack Software Module (Essen BioSciences). All experiments were performed with three biological and technical replicates.
MTS assays
For cell viability assays, 103 cells per well were plated in a 96-well plate and reverse- transfected with control or miRNA mimics. 96 hours after transfection, cell viability was measured using CellTiter-Glo (Promega) following the manufacturer’s instructions. For GSC lines, cells were dissociated and reverse-transfected at a density of 104 cells/well with miRNA control or miRNA mimics and plated into 96-well plates precoated 3 hours prior with Geltrex™ LDEV-Free Reduced Growth Factor Basement Membrane Matrix (19.2-28.8 mg/ml). Absorbance was quantified at 490 nm. All experiments were performed with technical triplicates.
Colony formation assays
U251 or U343 cells were reverse-transfected with control or miRNA mimics. 24 hours later, cells were trypsinized and re-plated at a density of 0.2 cells/μl. Cells were kept in culture for 10-14 days until colonies were clearly visible. Colonies were fixed with 4% paraformaldehyde solution and visualized by staining with 1% crystal violet. Crystal violet was dissolved from stained plates and absorbance was measured with at 570 nm. All experiments were performed with technical triplicates.
Response to radiation
U251 or U343 cells were reverse-transfected with control or miRNA mimics. 24 hours later, cells were trypsinized and exposed to varying doses of ionizing radiation, using a CP-160 Cabinet X-Radiator (Faxitron X-Ray Corp). Cells were then re-plated, either for MTS assays to assess viability or plated for clonogenic potential. MTS assays were performed as described earlier. Two weeks after irradiation, clonogenic potential was assessed as described earlier. All experiments were performed in triplicate.
Traffic light reporter assays
Traffic Light Reporter (Addgene: 31481) and d20GFP-Donor (Addgene: 31485) plasmids were used to generate lentiviral particles. Lentiviruses were generated and titered as previously described [26]. Traffic light reporter assays were performed as previously described with minor modifications [27]. U343 and U251 cells were first infected with an MOI of 13. 96 hours later, cells were replated into 96-well plates at a density of 104. Cells were co-transfected with 100 ng of I-SceI plasmid along with either 50 nM of control or miRNA mimics using Lipofectamine 3000. 72 hours after transfection, whole wells were imaged with a Celigo automated cell imager (Nexcelom Bioscience). Total numbers of GFP and RFP positive cells were counted and used to calculate the HDR:mutNHEJ Ratio. The I-SceI plasmid was a kind gift from Dr. Alexander JR Bishop. Experiments were performed in triplicates.
miRNA correlation and survival analysis in patients
MiRNA expression and survival data for the TCGA LGG cohort were obtained from UCSC Xena (http://xena.ucsc.edu) [28]. TCGA GBM data were obtained from (https://gdc.cancer.gov/about-data/publications/gbm_2013) [29]. MiRNA and survival data for the neuroblastoma cohort were obtained from the study of Schulte et al. [30]. GBM survival data were obtained from the study by D’Urso et al. (GEO: GSE13030) [31]. Correlation was assessed by Pearson’s correlation coefficient. For survival analysis, miRNA expression was normalized by z-score and the resulting average z-score for each patient was used for log-rank and Kaplan-Meier analysis.
Statistical analysis - Biological Assays
Statistical significance of differences in proliferation and differentiation at the 120-hour endpoint were determined by a one-way ANOVA with a post-hoc Tukey test for multiple comparisons. Synergy combination indices were calculated using: 1) Linear interaction model CI= (EA+EB+EC)/EABC; 2) Bliss independence model CI= ((EA+EB+EC)-(EAEBEC))/EABC. MTS and clonogenic assays were analyzed by one-way ANOVA and a post-hoc Tukey test for multiple comparisons. For radiation experiments, a two-way ANOVA with a post-hoc Tukey test for multiple comparisons was used. Results from the traffic light reporter assays were analyzed with Student's t-tests.
miR-124, miR-128 and miR-137 target compilation
miR-124, miR-128, and miR-137 targets were compiled from our previous studies [18]. We also included target genes of the three miRNAs obtained from mirTarBase [32] for which there was strong experimental evidence.
RNA extraction and RNA-Seq analysis
BE(2)C, 3565, and 1919 cells were reverse-transfected in technical triplicates with 30 nM of miRNA mimics (control vs. miR-124/miR-128/miR-137 combination). 48 hours later, total RNA from transfected cells was extracted using TRIzol reagent (Thermo Fisher) according to the manufacturer’s instructions. RNA was purified and concentrated utilizing RNA Clean & Concentrator-5 kit (Zymo Research). Libraries for RNA sequencing were prepared using TruSeq RNA Library Prep Kit v2 (Illumina), following the manufacturer’s instructions, and sequenced at the GCCRI Genome Sequencing Facility on a HiSeq-3000 sequencer (Illumina). Three biological replicates of control and experimental samples were used in each study.
Data were deposited in European Nucleotide Archive (ENA, www.ebi.ac.uk/ena) with the study identifier PRJEB40058.
Transcriptomic analyses
Transcript quantification was performed using Kallisto (v0.43.1, parameters: --bootstrap-samples=100 --single -l-l 350 -s 10)[33] with insert metrics obtained from the library construction. Gene-level counts were obtained using the R package tximport v1.0.3 [34]. GENCODE v29 (gencodegenes.org) was used as the reference for the human transcriptome. Differential gene expression analyses were carried out with DESeq2 v3.6.2 [35]. Genes were classified as differentially expressed using adjusted p-values < 0.05 and |log2FoldChange| ≥ 0.5.
Two main gene lists containing the overlap of the three different studies were generated. The upregulated gene set consists of the genes considered up-regulated after transfection of the miRNA combination in at least two studies. The same strategy was used to generate the downregulated gene set. For both sets, only protein coding genes were included and conflicting expression behavior genes (i.e. genes downregulated in one comparison and upregulated in another) were removed from the lists.
Correlations between binding sites and gene silencing
To assess correlations between silencing and presence or absence of binding sites for each of the three miRNAs, log2FoldChange of down-regulated genes and not differentially expressed genes were analyzed accordingly with the number of miRNAs targeting the gene. Wilcoxon tests to analyze differences between the group without miRNA targeting and with at least one target were performed for each cell line.
To evaluate correlations between silencing and the number of binding sites, each gene down-regulated in at least two of three studies was assigned with a total amount of miR-124,-138 ,and -137 binding sites indiscriminately. Binding site data were retrieved from TargetScan [36]. Genes were classified between high and low silencing according to the 1st and 3rd quartile. Genes with Log2FoldChange’s standard deviation greater than 0.5 between studies were removed from the analysis. Wilcoxon tests were used to assess differences between the high silencing and low silencing groups. Data were further categorized as being in a single site or multiple sites and statistical differences between groups were assessed by Chi-squared tests.
Permutation analyses
To determine if the down-regulated sets contain more targets of miR-124, miR-128, and miR-137 than expected by chance, we performed a permutation test with 10,000 iterations. Random samples of the same size as the down-regulated sets in all three cell lines (n=253) were extracted from all differentially expressed genes. Next, the percentage of genes targeted by 2 or more miRNA in the sample was calculated.
LncRNA analyses
To identify potential interactions between miR-124, miR-128, miR-137, and lncRNAs, a list with the lncRNA down-regulated in at least two cell lines was collected. Sequences of lncRNAs were obtained from LNCipedia v5.2 [37]. Binding site prediction for these lncRNAs was performed using miRmate [38]. For further analyses, we created a list of lncRNAs that contained predicted miRNA binding sites for all three miRNAs.
Gene Ontology analyses
Gene Ontology (GO) enrichment analysis was performed with the PANTHER statistical overrepresentation test webtool [39]. For all analyses, the whole human genome was used as background. For a summarized GO term selection, PANTHER’s term list was inputted to REVIGO [40] and the output table was sorted by uniqueness and dispensability. KEGG pathway enrichment analysis was performed using the ShinyGO web tool [41]. In both analyses, terms and pathways with FDR-adjusted p- values < 0.05 were considered enriched.
Transcription Factor and network analysis
A list with transcription factors was obtained from Lambert et al. [42] and the network was generated with Cytoscape software [43] using STRING’s protein-protein interaction data [44] (minimum interaction score of 0.7).
Expression of miRNA targets in neuroblastoma and glioblastoma samples
Expression levels of identified miRNA targets in neuroblastoma samples were investigated. Stage 1 and Stage 4 neuroblastoma expression data were obtained from R2 (r2platform.com). The dataset from Kocak et al. contains microarray data from 266 patients (118 Stage 1 and 148 Stage 4)[45]. Following the procedures available in the R2 differential expression webtool, p-values were generated by ANOVA tests between log2 transformed data. Genes were considered differentially expressed if p-values (adjusted for false-discovery rates [FDR]) were below 0.01.
Mapped RNA sequencing data from 154 primary glioblastoma (GBM) and 248 primary glioma grade II (LGG) samples were obtained from The Cancer Genome Atlas Program (TCGA). TCGA datasets were first reprocessed using Kallisto [33] with GENCODE v29 (gencodegenes.org) as the reference for the human transcriptome. Gene-level counts were also obtained using Tximport v1.0.3 [34]. Gene-level counts from 465 healthy (frontal) cortex samples were directly obtained from the Genotype-Tissue Expression - GTEx portal v8 (gtexportal.org). Analyses of differential gene expression were performed between glioblastoma and glioma grade II samples, as well as between glioblastoma and healthy cortex samples using the R package DESeq2 [35]. Only genes presenting a |log2FoldChange| ≥ 1 and FDR-adjusted p-values < 0.05 were considered differentially expressed.
miRNA in silico cluster identification
Target predictions for all broadly conserved miRNA families were first obtained from Targetscan [46]. We selected miRNA families with at least 1,000 predicted targets and generated a list of targets shared by at least two of the selected miRNAs. Finally, the selected miRNA families were hierarchically clustered based on information of their shared targets. The miRNA list was further filtered after a literature search for potential roles in glioblastoma and neuroblastoma. We included additional miRNAs sharing a large number of targets with miRNAs in the cluster. The final dataset contains targets of 11 miRNA (miR-101-3p.2, miR-124-3p.1, miR-128-3p, miR-137, miR-181-5p, miR-218-5p, miR-26-5p, miR-29-3p, miR-144-3p, miR-153, miR-543) collected from TargetScan. Hierarchical clustering was carried out in R by pvclust package v2.2 [47] using Euclidean distance and Ward's hierarchical clustering method.