Participants
Current study employed the dataset shared from the Open Access Series of Imaging Studies (OASIS-3) (https://www.oasis-brains.org/). Evaluated by the clinical dementia rating (CDR) score, all participants were categorized as 53 AD (CDR = 1), 90 MCI (CDR = 0.5), and 100 age-, sex-, and education-matched CU (CDR = 0). All participants provided written informed consent to the research procedures. The MMSE were used to evaluate clinical symptoms.
MRI acquisition
High-resolution T1-weighted images, resting-state fMRI were acquired in a 16-channel head coil of Siemens TIM Trio 3T scanner. Since the detailed scanning parameters were described elsewhere71, we have only briefly described it here. The main scanning parameters of resting-state fMRI were as follows: TR = 2200 ms, TE = 27 ms, flip angle = 90°, number of slices = 33, slice thickness = 4 mm, voxel size = 4 × 4 × 4 mm3.The scanning parameters of T1-weighted anatomical images were as follows: repetition time (TR) = 2400 ms, echo time (TE) = 3.16 ms, flip angle = 8°, voxel size = 1 × 1 × 1 mm3, and slice thickness = 1 mm.
Pre-processing steps of functional images
The pre-processing of neuroimaging data was performed using the Statistical Parametric Mapping (SPM12, https://www.fil.ion.ucl.ac.uk/spm/software/spm12/) and Data Processing Assistant for Resting-State fMRI (DPARSF, V4.4_180801, http://rfmri.org/dparsf). The first 10 volumes of functional image were discarded to prevent magnetization disequilibrium for each participant. We performed the realignment to the mean functional images, and obtained the 6-parameter rigid body linear transformation of head motion. All participants have maximum motion < 3 mm and 3°. The mean functional image was co-registered with the T1 image, and then T1 structural images were segmented into GM, WM and cerebrospinal fluid (CSF). Nuisance signals (i.e., the 24-parameters and the mean signal of CSF) were regressed. Temporal scrubbing regressor was also utilized to reduce the effect from head motion. The linear trend was removed for the signal drift correction. The band-pass filtering (0.01 ~ 0.1 Hz) was used to reduce the effect of low-frequency drift and high-frequency physiological noise. Spatial smoothing was performed separately within GM and WM template with a 4 mm FWHM. Finally, the smoothed functional images were normalized into the Montreal Neurological Institute (MNI) space and resampled to 3 × 3 × 3 mm3.
Obtaining the group-level white matter and gray matter masks
To avoid the confusion between GM and WM signals, we first created the group-level GM and WM mask in line with previous study13,14. In specific, by determining the maximum probability of whether each voxel belongs to the GM, WM or CSF, we can get binarized GM and WM masks for each participant. Then, the individual masks were compared across all participants. Voxels that account for more than 60% of the total number of individual WM templates were classified as WM. Noticeably, a lenient threshold with greater than 20% across participants were adopted for GM, and any voxels identified in the WM mask have been removed. In addition, the resulting GM and WM masks were compared to the functional images. Those voxels that were identified as GM or WM, but accounted for less than 80% of functional data across all participants were also excluded. Finally, the Harvard-Oxford Atlas31 were used to mark and exclude the deep regions from group-level WM mask, including the thalamus, caudate nucleus, putamen, globus pallidus, and nucleus accumbens.
Constructing functional connectivity matrices in white matter and gray matter
Before performing the network properties analysis, we conducted the point multiplication on the group-level WM mask and the JHU ICBM-DTI-81 WM atlas30 to obtain a new WM mask with 48 regions of interest (ROIs). We excluded 3 ROIs with no voxels or the number of voxels was one. Eventually, 45 ROIs were retained as the final WM mask. Additionally, the group-level GM mask was point multiplied with the Harvard-Oxford Atlas31 to obtain the final GM mask with 96 ROIs. Details on ROIs see Supplementary Table S1 and S2.
The averaged time series of each WM/GM ROI was separately extracted from the functional images for each individual. By conducting the Pearson’s correlation analysis between each pair of WM masks, we obtain a 45 × 45 WM correlation matrix for each participant, then transform the correlation coefficient into Fisher’s Z score. Similarly, we obtained a 96 × 96 GM connection matrix and a 141 ×141 whole-brain connectivity matrix for each individual.
Network topological property analysis
Graph theoretical analysis was carried out using the Graph Theoretical Network Analysis (GRETNA) Toolbox (http://www.nitrc.org/projects/gretna/). The sparsity threshold values were identified in following steps by referring to previous study22. The maximal threshold was defined as the correlation coefficient value which ensured all connections were significant (P < 0.05). The minimum threshold was defined as follows: (1) the mean degree over all nodes of each network was > 2 × log(N) (N expresses the number of nodes, N = 45 in WM network, N = 96 in GM network, and N = 141 in the whole brain network); (2) the number of nodes in the largest component was > 70% × N. Accordingly, the range of sparsity threshold was 0.23 ~ 0.46 for WM functional network, 0.11 ~ 0.50 for GM functional network, and 0.09 ~ 0.47 for the whole-brain network with a common step of 0.01.
To characterize the topological properties of brain networks, we calculated 2 global network metrics (i.e., assortativity and global clustering coefficient) and 2 nodal parameters (i.e., nodal clustering coefficient and nodal local efficiency) on the unweighted, undirected graphs.
Assortativity quantified the resilience of a network. The formula is as follows:
$$r= \frac{{l}^{-1}\sum _{(i,j)\in L}{k}_{i}{k}_{j}-{\left[{l}^{-1}\sum _{\left(i,j\right)\in L}\frac{1}{2}\left({k}_{i}+{k}_{j}\right)\right]}^{2}}{{l}^{-1}\sum _{(i,j)\in L}\frac{1}{2}\left({{k}_{i}}^{2}+{{k}_{j}}^{2}\right)-{\left[{l}^{-1}\sum _{\left(i,j\right)\in L}\frac{1}{2}\left({k}_{i}+{k}_{j}\right)\right]}^{2}}$$
1
\({k}_{i}\) , \({k}_{j}\) are the degrees of the nodes at the ends of the edge \(i\), with \(i=1\dots l\).
Clustering coefficient of a node \(i\) characterizes prevalence of clustered connectivity around individual nodes, and is defined as follows:
$$C\left(i\right)=\frac{2{t}_{i}}{{k}_{i}({k}_{i}-1)}$$
2
$$C=\frac{1}{n}\sum _{i\in N}C\left(i\right)$$
3
\(C\left(i\right)\) is the clustering coefficient of a node \(i\), \({t}_{i}\) is the number of edges in the sub-graph \({N}_{i}\), and \({k}_{i}\) represents the number of nodes connected to node \(i\).
Local efficiency measures how efficient the communication is between the first neighbors of a node \(i\) when \(i\) is removed. The formula is as follows:
$${E}_{loc}\left(i\right)=\frac{1}{{k}_{i}({k}_{i}-1)}\sum _{j,h\in N,j\ne i}\frac{{a}_{ij}{a}_{ih}}{{d}_{jh}\left({N}_{i}\right)}$$
4
\({k}_{i}\) is the number of nodes in the subgraph \({N}_{i}\) that includes all neighboring nodes of node i, \({d}_{jh}\left({N}_{i}\right)\) is the shortest path lengths between node j and node h in subgraph \({N}_{i}\).
Statistical analyses
To investigate the differences among AD, MCI and CU groups, we conducted statistical analysis on FC, graph theoretical metrics and RFCS separately. We perform non-parametric permutation test on FC (10,000 times) and utilized the Kruskal-Wallis test on RFCS and the area under curve (AUC) of topological metrics. In the post-hoc analysis, Mann-Whitney U test (corrected by FDR) were used for all measures except for topological global metrics, which were examined using the Dunn’s test. Among all the above analyses, the effect of age, sex and education were all regressed as covariates. Finally, a robust Spearman’s rank correlation with a bootstrap of 1,000 times was performed to examine the association between clinical characteristics and FC. Unless otherwise specified, all P-values were adjusted for multiple comparisons using the false discovery rate (FDR) procedure with a significant threshold of 0.05.
Relating gene expression to functional changes in brain region
We used the abagen toolbox32 on the AHBA dataset (http://human.brain-map.org) to make the transcriptional matrix. To relate the FC of each brain region to the transcriptional values, we calculated the RFCS by averaging its FC with all other 44 WM or 95 GM regions respectively. Group differences in RFCS were examined by Kruskal-Wallis test (see Methods, subsection Statistical analysis). Because the right hemispheres in the AHBA dataset were incomplete (i.e., two of the six right hemispheres were collected)72, only 48 GM ROIs and 22 WM ROIs in the left hemisphere were used for gene expressions matrix estimation.
Two PLS regression models33 were built by using the principal components of imaging-transcription matrix as the independent variables to predict the group-differences in GM or WM RFCS respectively. Then the permutation test (10,000 times) was used to evaluate the significance of the variance. After that, we performed bootstrapping to estimate the PLS1 weights for each gene, and normalized the weight scores with a threshold of Z < -3 and Z > 373.
Enrichment analysis
To figure out the molecular features of the thousands of genes, we submitted the PLS1+ (Z > 3) gene list and the PLS1- (Z < -3) list to the Metascape74 (http://metascape.org). Two core default ontologies (i.e., GO biological processes and KEEG pathways) were selected. The significant threshold with 0.05 was set for all pathways and FDR correction was used for multiple comparison. Moreover, to compare the gene list obtained by data-driven analysis versus genes derived through meta-analysis, we conducted the multi-gene-list meta-analysis between the PLS1 + or PLS1- gene list and polygenic risk for AD based on the genome-wide meta-analysis studies (GWAS)24,34.
Cell types
We assigned the gene lists into 7 types of cells, including the astrocytes, endothelial cells, microglia, excitatory, and inhibitory neurons, oligodendrocytes, and oligodendrocyte precursors, by overlapping the PLS1+/- genes lists with previously reported gene sets under each of the seven cell types75. Then, a permutation test was conducted on the number of the genes belonging to each cell type. Finally, we performed functional enrichment analysis based on gene list within each cell type to investigate the cellular signatures associated with the abnormal WM RFCS.