This study complied with all relevant ethical regulations and was approved by the UCSF Institutional Review Board (13-12587, 17-22324, 17-23196, and 18-24633). As part of routine clinical practice, all patients who were included in this study signed a written waiver of informed consent to contribute de-identified tissue for research.
Single-cell RNA sequencing and analysis
Single cells were isolated from human SFT or meningioma samples, as previously described17. Single-cell suspensions were processed for single-cell RNA sequencing using the Chromium Single Cell 3’ GEM, Library & Gel Bead Kit v3.1 (1000121, 10x Genomics) and a 10x Chromium or Chromium X controller, using the manufacturer recommended default protocol and settings for a target recovery of 5,000 cells per sample. Libraries were sequenced on an Illumina NovaSeq 6000, targeting >50,000 reads per cell, at the UCSF Center for Advanced Technology. Library demultiplexing, read alignment, identification of empty droplets, and UMI quantification were performed using CellRanger (https://github.com/10xGenomics/cellranger). Cells were filtered based on the number of unique genes, and single-cell UMI count data were preprocessed in R with the Seurat package (v4.3.0)24,25 using the sctransform workflow. Dimensionality reduction was performed using PC analysis. UMAP and Louvain clustering were performed on the reduced data, followed by marker identification and differential gene expression.
Clusters were defined using a combination of automated cell type classification6, cell signature gene sets7, cell cycle analysis, and differentially expressed cluster marker genes. The scType R package was used for automated cell type classification, with default parameters and an augmented list incorporating package-provided human 'Brain’ marker genes specific to each cell type6. Gene set enrichment analysis was performed on clusters using cell type signature gene sets from from MSigDB (https://www.gsea-msigdb.org/gsea/msigdb) with the fgsea R package (Bioconductor v3.16). Cell cycle phases of individual cells were assigned with the ‘CellCycleScoring’ function in Seurat, using single-cell cell cycle marker genes26.
Meningeal SFT and meningioma single-cell samples were aligned to the GRCh38 human reference genome; filtered to cells with greater than 250 unique genes, less than 7,500 unique genes, and less than 25% of reads attributed to mitochondrial transcripts; scaled based on regression of UMI count and percentage of reads attributed to mitochondrial genes per cell; and corrected for batch effects using Harmony5. Parameters for downstream analysis were a minimum distance metric of 0.2 for UMAP, resolution of 0.15 for Louvain clustering, determined using clustree (v0.5.0, analyzing resolutions 5, 2, 1, 0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.25, 0.2, 0.15, 0.1, 0.0), with a minimum difference in fraction of detection of 0.25 and a minimum log-fold change of 0.25 for marker identification.
Deconvolution of SFT cell types from reference perinatal or adult vascular cell type single-cell RNA sequencing dataset was performed using SCDC (v0.0.0.9000)27. Single-cell transcriptomic data from the reference datasets were subsampled to 1000 cells per cluster, and the top differentially expressed genes were selected for each cell type. Using this expression set, SFT single cells were deconvolved to yield a matrix with predicted proportions of cell type for each cell, which were visualized using feature plots.
Gene enrichment analysis was performed using a list of 50 most differentially expressed candidate genes from previously published single-cell perinatal or adult vascular and mural cell types22,23. Average counts per cell were summarized, scored as mean, and visualized using feature plots.
Cell-cell communication networks were inferred and visualized using the CellChat R package (v1.5.0)11. Briefly, differentially expressed signaling genes were identified, noise was mitigated by calculating the ensemble average expression, intercellular communication probability was calculated by modeling ligand-receptor interactions using the law of mass action, and statistically significant communications were identified. The CellChat commands ‘computeCommunProb’, ‘computeCommunProbPathway’, and ‘aggregateNet’, were used for analysis, and ‘netVisual_aggregate’ was used for visualization.
Trajectory analyses was performed using monocle3 (v1.3.1)10,28,29 for pseudotime, and velocyto (v0.17.16)8 with scVelo (v0.2.5)9 for RNA velocity. For pseudotime analysis, data were normalized followed by UMAP dimensionality reduction as described above. The ‘cluster_cells’ and ‘learn_graph’ monocle commands were used with default parameters and cells were ordered along pseudotime after manually selecting a root node (based on cluster, cell type, and cell cycle information). For RNA velocity analysis, velocyto was used to generate loom files with spliced and unspliced mRNA count information. scVelo was used to filter and normalize gene expression using criteria “min_shared_counts=3’, and ‘n_top_genes=2000’ prior to computing RNA velocity and latent time. RNA velocity was visualized by projecting on to the UMAP generated using R and Seurat.
Spatial RNA sequencing and analysis
Spatial transcriptomic profiling was performed on FFPE sections using the 10x Genomics Visium Spatial assay (1000336, v1). 6 mm cores were mounted within capture areas on Visium glass slides, deparaffinized, stained with H&E, and imaged at the Gladstone Institutes Histology Core. Libraries were prepared according to manufacturer instructions at the Gladstone Institutes Genomics Core. Libraries were sequenced on an Illumina NovaSeq 6000 instrument at the UCSF Center for Advanced Technology. Sequencing was performed with the recommended protocol (read 1: 28 cycles, i7 index read: 10 cycles, i5 index read: 10 cycles, read 2: 91 cycles). FASTQ sequencing files and histology images were processed using the 10x SpaceRanger pipeline and the Visium Human Transcriptome Probe Set v1.0 GRCh38-2020-A. Data were visualized using the 10x Loupe Browser software (v6.4.0) and Seurat package (v4.3.0)24,25.
Spaceranger generated filtered feature matrices were imported into a Seurat object (v4.3.0, arguments min.cells=3, min.features=100) using R (v4.2.1) and RStudio (v2022.07.2 Build 576). The individual count matrices were normalized based on nFeature_RNA count (subset=nFeature_RNA>1500 and nFeature_RNA<9500) with less than 10% of reads attributed to mitochondrial transcripts and integration using Harmony (v0.1.1). Parameters for downstream analysis were a minimum distance metric of 0.2 for UMAP, resolution of 0.2 for Louvain clustering, determined using clustree (v0.5.0, analyzing resolutions 5, 2, 1, 0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.25, 0.2, 0.15, 0.1, 0.0), and a minimum difference in fraction of detection of 0.25 and a minimum log-fold change of 0.25 for marker identification. UMAP projections and cluster distributions were visualized in the Loupe browser as needed, after combining spatial transcriptomic data from individual capture areas using the 10x Spaceranger aggr pipeline (v2.0.0).
Deconvolution of SFT cell types from reference SFT single-cell RNA sequencing was performed using SCDC (v0.0.0.9000)27. To do so, each spatial transcriptome was treated as a pseudobulked RNA sequencing dataset and leveraged against known cell types from reference single-cell RNA sequencing datasets comprised of 40,022 cells from 4 human SFT samples (Fig. 1a), perinatal human brain vasculature (139,134 cells from gestational weeks 15, 17, 18, 20, 22, or 23)22, or adult human brain vasculature (84,138 cells)23. Single-cell transcriptomic data were subsampled to 1000 cells per cluster, and the top differentially expressed genes were selected for each cell type. Using this expression set, spatial transcriptomes were deconvolved to yield a matrix with predicted proportions of cell type for each spatial transcriptome, which were visualized using spatial feature plots.
Gene enrichment analysis was performed using a list of 50 most differentially expressed candidate genes from previously published single-cell perinatal or adult vascular and mural cell types22,23. Average counts per spatial transcriptome were summarized, scored as a mean, and visualized using spatial feature plots.
The cell-cell communication network was inferred and visualized using the CellChat R package (v1.5.0)11 similar to the method used for single cell RNA sequencing samples. Briefly, differentially expressed signaling genes were identified, noise was mitigated by calculating the ensemble average expression, intercellular communication probability was calculated by modeling ligand-receptor interactions using the law of mass action, and statistically significant communications were identified. The CellChat commands ‘computeCommunProb’, ‘computeCommunProbPathway’, and ‘aggregateNet’ were used for analysis, and ‘netVisual_aggregate’ was used for visualization. ‘computeCommunProb’ was run using spatial information from the Visium assay, including spatial dot coordinates and scale.factors for the fiducials and low/high-resolution tissue images.
DNA methylation profiling and analysis
Genomic DNA underwent bisulfite conversion using the EZ DNA Methylation kit (D5004, Zymo Research), followed by amplification, fragmentation, and hybridization to Infinium EPIC 850k Human DNA Methylation BeadChips (20020530, Illumina) according to manufacturer’s instructions at the University of Southern California Molecular Genomics Core. Bioinformatic analysis was performed in R (v4.2.1). SFT or meningioma DNA methylation data were preprocessed using the minifi pipeline30. In brief, probes were filtered and analyzed using normal-exponential out-of-band background correction, nonlinear dye bias correction, p-value with out-of-band array hybridization masking, and β value calculation (β=methylated/[methylated+unmethylated]). Principal component analysis was performed on the β methylation values from minfi pre-processing pipeline in R. Variable probes were identified from the first three principal components (PCs). PCs greater than 4 contributed to less than 5% of β value variance. The top 2000 probes were selected for analysis by ranking the absolute gene loading score values within PCs and the tumors were projected along the first two PCs.
Targeted next-generation DNA sequencing and analysis
Targeted DNA sequencing was performed using the UCSF500 NGS panel, as previously described19. In brief, this capture-based next-generation DNA sequencing assay targets all coding exons of 479 cancer-related genes, select introns, and upstream regulatory regions of 47 genes to enable detection of structural variants such as gene fusions and DNA segments at regular intervals along each chromosome to enable genome-wide copy number and zygosity analyses, with a total sequencing footprint of 2.8 Mb. Multiplex library preparation was performed using the KAPA Hyper Prep Kit (07962355001, Roche). Hybrid capture of pooled libraries was performed using a custom oligonucleotide library (Nimblegen SeqCap EZ Choice). Captured libraries were sequenced as paired-end reads on an Illumina NovaSeq 6000 at >200x coverage for each sample. Sequence reads were mapped to the reference human genome build GRCh37 (hg19) using the Burrows-Wheeler aligner (v0.7.17). Recalibration and deduplication of reads was performed using the Genome Analysis Toolkit (v4.3.0.0). Coverage and sequencing statistics were determined using Picard (v2.27.5), CalculateHsMetrics, and CollectInsertSizeMetrics. Single nucleotide variant and small insertion/deletion mutation calling was performed with FreeBayes, Unified Genotyper, and Pindel. Large insertion/deletion and structural alteration calling was performed with Delly. Variant annotation was performed with Annovar. Single nucleotide variants, insertions/deletions, and structural variants were visualized and verified using Integrative Genome Viewer (v.2.16.0). Genome-wide copy number and zygosity analysis was performed by CNVkit and visualized using NxClinical (Biodiscovery, v6.0).
Histology and microscopy
For adult human tissue samples, deparaffinization and rehydration of 5µm formalin-fixed, paraffin-embedded (FFPE) tissue sections and H&E staining were performed using standard procedures. Immunostaining was performed on an automated Ventana Discovery Ultra staining system and detection was performed with Multimer HRP (Ventana Medical Systems) followed by fluorescent detection (DISCOVERY Rhodamine and CY5) or DAB.
Immunostaining for NOTCH3 was performed using mouse monoclonal NOTCH3/N3ECD primary antibody (MABC594, Millipore Sigma, 1:25-1:100) with incubation for 32min following CC1 antigen retrieval for 32min. For dual staining, primary antibody incubations were carried out serially with inclusion of positive, negative, and single antibody controls. Following staining for NOTCH3/N3ECD, tissue sections were stained with primary antibodies recognizing SMA (ab7817, Abcam, mouse polyclonal, 1:30,000) for 32min or VWF (A0082, Dako, rabbit polyclonal, 1:1,000) for 20min. All IF experiments were imaged on a LSM 800 confocal laser scanning microscope with Airyscan (Zeiss) and analyzed using ImageJ.
Clinically validated immunohistochemistry for STAT6 (ab32520, Abcam, 1:100 dilution, YE361 clone), SSTR2A (ab134152, Abcam, 1:2000 dilution, UMB1 clone), were performed at UCSF on core mounts with appropriate controls using a Leica Bond III platform and imaged using light microscopy on an Olympus BX43 microscope with standard objectives. Images were obtained and analyzed using the Olympus cellSens Standard Imaging Software package (v1.16).
Statistics
All experiments were performed with independent biological replicates and repeated, and statistics were derived from biological replicates. Biological replicates are indicated in each figure panel or figure legend. No statistical methods were used to predetermine sample sizes, but sample sizes in this study are similar or larger to those reported in previous publications. Data distribution was assumed to be normal, but this was not formally tested. Investigators were blinded to conditions during clinical data collection and analysis. Bioinformatic analyses were performed blind to clinical features, outcomes, or molecular characteristics. The clinical samples used in this study were retrospective and nonrandomized with no intervention, and all samples were interrogated equally. Thus, controlling for covariates among clinical samples was not relevant. Cells and animals were randomized to experimental conditions. No clinical, molecular, or cellular data points were excluded from the analyses.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.