Subjects
The structural sMRI data used in our study were acquired from the Soochow University, which is composed of 68 undergraduates. The study was approved by the Ethics Committee of the Third Affiliated Hospital of Soochow University. Written informed consents was obtained from all subjects. All subjects have not received stimulants or hypnotics before acquisition in order to keep them awake and let the brain work normally. All participants' vision was normal or corrected to normal, and they were right-handed. After the test, each participant will receive a small gift or financial reward. All subjects are required to perform Rosenberg Self-esteem Scale (RSES) test. The RSES is originally developed by Rosenberg in 1965 to assess the overall feelings of undergraduates about self-worth and self-acceptance. It is the most used self-esteem measurement tool in the psychology community [18]. We ranked the RSES test scores from highest to lowest, and then divided them into two groups: high self-esteem group and low self-esteem group. Table 5 provides detailed information of all participants.
Imaging acquisition and preprocessing
All images were collected on a 3T Siemens Medical Systems equipment. The acquisition parameters are set as: echo time (TE) = 2.98 ms, repetition time (TR) = 2300 ms, flip angle (FA) = 9 deg, voxel size = 1 × 1 × 1 mm3, slice thickness = 1 mm, field of view (FoV) = 256mm.
We use an automatic pipeline for sMRI image processing. Firstly, image preprocessing is conducted to remove the gray-scale unevenness of the image and adjust the image orientation to match the template [19]. Secondly, brain extraction is performed to remove the non-brain tissue, such as the skull and cerebellum [20]. Thirdly, gray matter (GM), white matter (WM) and cerebrospinal fluid (CSF) were segmented from the background [21]. Fourth, the segmented image was registered to the template labeled with the Automated Anatomical Labeling (AAL) template [22]. Fifth, in order to calculate the morphological features based on the cortex, the middle layer of the cerebral cortex was depicted [23]. After the whole processing, the morphological measurements of GM volume, WM volume, CSF volume, cortical thickness, and cortical surface area of each ROI were obtained for each subject. It should be noted that we removed 12 subcortical ROIs from AAL template considering that the cerebral cortex contains more neurons.
Classification framework
The framework of the proposed classification algorithm based on multi-resolution ROI brain network is shown in Figure 5, mainly including multiple anatomical network construction, feature selection, and classification. Multi-resolution ROI based multiple anatomical brain network were constructed based on morphological features (volume of different brain tissue, cortical thickness, and cortical surface area). Feature selection can reduce the dimensionality of high-dimensional brain network features, only retaining the features that can maximize the specificity of the subjects. The optimal feature subset can be trained by the classifier as neuroimaging markers representing different self-esteem levels.
Construction of multiple anatomical networks
Through the above image processing steps, GM volume, WM volume, CSF volume, cortical thickness, and cortical surface area of each ROI can be obtained from the MRI image of each subject. In order to reduce individual differences, standardization was performed, dividing the measured value of each ROI by the total intracranial volume, mean cortical thickness, and whole cerebral cortical surface area of the subject. Therefore, we used normalized volume and cortical features to provide a more appropriate representation. More objective measurements can be received by such processing. In order to improve the performance of the classifier, we propose a four-layer hierarchical network framework in this paper. We used brain templates with different ROI resolution in each layer to construct brain network nodes and edges.
Specifically, the bottommost template containing 78 ROIs is defined, the remaining three layers are defined as, where . A larger value indicates a higher-resolution ROI, which is located in the brain network layer closer to the bottom of the hierarchy. By merging small brain regions into large brain functional areas, the number of ROIs are reduced. In the layer , there are 36 ROIs by dividing the whole brain into lateral, medial and inferior surfaces. In the layer , 14 ROIs are defined reefing to the anatomical brain structure of central, frontal, parietal, occipital, temporal, limbic, and insula lobe. The specific definition rules of these ROIs can be found in Table 6. It is worth noting that in the first layer , we study the brain as a whole.
For each layer, correlation between ROIs can be calculated using brain template defined above. Its node correspond to the ROIs in different resolution, and the edge corresponds to the interaction between pairs of ROIs. Take the bottom layeras an example, an 7878 matrix can be calculated by computing the Pearson correlation coefficient between the i-th ROI and j-th ROI. We define
Where and represent the mean thickness of the cerebral cortex corresponding to the -th and -th ROIs.
is defined as where and represent the standard deviation of cortex for the -th and -th ROI. Due to the symmetry of the correlation matrix, we only use the upper triangular elements of the matrix to construct the feature vector. We connect the 3003 upper triangular elements to form the corresponding feature vector for . Since the ROIs in the remaining three layers are obtained by merging ROIs in the bottommost layer, the mean and standard deviation of these compound ROIs can be obtained by calculating the average value of all ROIs. The definition of correlation matrix for other layers is similarly to . The union of the hierarchical networks is constructed by junction of the four upper triangular correlation matrix into a long vector.
Feature selection
In order to reduce the feature dimension and filter out the most discriminative features, we adopted a hybird feature selection method. First, we use the statistical t-test method for preliminary selection of features with the significant p value less than the threshold (p <0.05). Then, the redundant features are removed using the minimum redundancy and maximum correlation (mRMR) method, and only the features that can express the difference between groups in the minimum number are retained [24]. After the above two filter-based feature selections, the machine learning recursive feature elimination (SVM-RFE) method [25] is used to further reduce the feature dimension. After completing the entire feature selection steps, the optimal feature subset is obtained.
Classification using multi-kernel SVM
There are various types of features in the multiple brain network, one is the high-resolution ROI features in the fourth layer, and the other is the brain network features corresponding to different layers. Multi-kernel machine learning method can integrate these various types of features into a final classifier. Firstly, a Gaussian Radial Basis Function (RBF) kernel function is used to construct a kernel matrix for each type of feature. Secondly, the two kernel matrices are integrated into the multi-kernel matrix through appropriate weight coefficients [25]. Comparing the results of using linear kernel function and using RBF function (non-linear), we discover that the RBF kernel can significantly improve the classification performance. Therefore, we choose the RBF kernel function to construct the multi-kernel classifier. Finally, the optimal features subset can be obtained.
Cross-validation
The nested cross-validation method has been applied in our previous research [26]. In the inner loop, the training set are used to determine the parameters of the classifier. In the outer loop, the testing set is used to evaluate the generalization ability of the classifier. It should be noted that at the beginning of the experiment, the entire data set was randomly divided into two parts, one for training and the other one for testing. The training set and testing set can be exchanged throughout the verification process, while the processing steps remain unchanged.