In recent years, several studies on the prediction and diagnosis of PTSD using machine learning methods (collect the symptoms, demographics, trauma types, psychometric information, and MRI data of patients who experienced traumatic events, followed with text mining or machine learning to predict and diagnose PTSD) have been conducted. These methods mostly had AUCs of 0.75–0.96, ranging in accuracy from 57.58% to 92.5% [45-48]. MRI techniques, such as fMRI and DTI, combined with machine learning have been used in the prediction, diagnosis, and assessment of PTSD [49,50]. However, machine learning models based on conventional MRI radiomics for PTSD diagnosis have not been reported.
Previously, we used stepwise discriminant analysis (SDA) and LASSO regression methods to screen mPFC texture features based on T2W images and build radiomics classification models for rats in the control and PTSD groups [33]. SDA analysis showed an overall classification accuracy of 86.5% (non-cross-validation) and 80.4% (cross-validation). Radiomics signatures based on LASSO regression showed that the AUCs for classifying PTSD rats between the control group and each PTSD group were greater than 0.935, and the AUCs among four PTSD groups were greater than 0.889. All these results indicated that mPFC T2W texture analysis possessed good classification performance. The hippocampus plays an important role in the occurrence and development of PTSD. Clinical studies [31,51] have shown that the hippocampal volume shrinkage and decreased N-acetylaspartate levels in patients with PTSD are closely related to pathological changes, such as hippocampal neuronal loss and glial cell reduction, which have been verified by experimental studies [30,52]. Based on our previous study and relevant theories, the present study used LR, SVM, and RF machine learning models based on hippocampal T2-FLAIR radiomics to diagnose PTSD after road traffic accidents. The results showed that, in the training group, the AUCs of LR, SVM, and RF for diagnosing PTSD were 0.829, 0.899, and 0.865, with sensitivities of 74.19%, 96.77%, and 87.10%, specificities of 77.13%, 74.29%, and 77.14%, respectively. The AUCs of the three machine learning models in the test group were 0.779, 0.810, and 0.728, with sensitivities of 76.92%, 61.54%, and 92.31%, specificities of 80.00%, 86.67%, and 53.33%, respectively. These results demonstrate that the machine learning models based on hippocampal T2-FLAIR radiomics have good diagnostic performance. However, compared to our previous experimental study, this study had lower AUCs for the diagnosis of PTSD. With the exception of differences in the selection of brain region, the AUCs were lower probably due to the following reasons. First, few confounding factors and biases were generated in the experimental study. Second, the PTSD animal model highly simulated the pathogenesis and pathological changes of PTSD in humans. However, significant differences in pathophysiology and mental, emotional, and social environments persist. Third, the differences in age, gender, physical exertion, and educational level between PTSD and non-PTSD cases included in this study were not statistically significant. Nevertheless, for each individual, many of these characteristics or factors and their interactions will be important in the occurrence and development of PTSD [53].
Several clinical studies in recent years have demonstrated that machine learning combined with fMRI can predict and assess the status, severity, and symptom clusters in patients with PTSD. Furthermore, it can determine comorbidity with major depression and reveal variables associated with the onset and classification of PTSD in trauma survivors [54-56]. Liu et al. [57] used the combination of multi-level features extracted from fMRI images and multi-kernel learning to classify PTSD cases and healthy controls with an accuracy of 92.5%. Saba et al. [49] used machine learning combined with rs-fMRI to classify PTSD cases and healthy controls with an accuracy of 93.7–99.2% (K-nearest neighbor and SVM with radial basis function kernel) in the training, validation, and test groups. This study had a lower diagnostic performance than previous clinical studies due to the following reasons: differences in the types of trauma, assessment methods, demographic characteristics of participants, MRI techniques, brain region selection, or machine learning algorithms. All of these factors led to differences in the classification and diagnostic performance. However, compared to the above-mentioned fMRI, DTI, and other imaging techniques, conventional MRI has many advantages, including a shorter examination duration, greater convenience, and no hardware or software limitations, making it convenient for use in daily clinical settings.
In this study, although we used 10-fold cross-validation LASSO regression to reduce the effect of model over-fitting (LASSO regression is a sparse learning method for high-dimensional data and commonly used for feature optimization and risk factor screening [20,58,59]) and the Hosmer–Lemeshow test to assess the goodness-of-fit of each model, the small sample size was an important factor for over-fitting or insufficient expression in the machine learning models. This study preliminarily showed that machine learning models based on hippocampal T2-FLAIR radiomics are promising as a potential imaging method for PTSD diagnosis after road traffic accidents. However, further multicenter studies with large samples are needed.
This study had some limitations. First, it was a retrospective study, had a small sample size and short follow-up period, and excluded individuals with any complication of TBI. Second, ROIs were manually selected from the largest area of hippocampal T2-FLAIR images. The hippocampal substructures, other brain regions, and other sequence images were not analyzed in detail. Third, only LR, SVM, and RF machine learning algorithms were used; other algorithms were not investigated.