While there has been an increase in studies researching early identification of PD through biological models, there still remains a major gap in early detection and, even more so, prediction of PD within the prodromal stage31. While multiple studies have used other clinical diagnoses as precursors to predict PD, this might limit the use of such models due to the possible lack of symptoms at the pre-screening stage. Therefore, accurate identification of prodromal PD is essential for implementation of cost-effective clinical trials of putative neuroprotective agents, and ultimately for risk stratification and population-level disease prevention once therapeutic efficacy is established for one or more agents. In order to be effective on a population level, a predictive test of risk of PD should be universally available, cost-effective and non-invasive. Studies found that heart rate variability (HRV) determined from 5-minute ECGs is reduced in prevalent PD, and results from a single prospective study showed that lower HRV was associated with an increased risk of incident PD.34 Prior work by our group utilized machine learning approaches to predict prodromal PD using clinical variables in one study32 and a proof-of-concept study using standard 10-second printed ECGs in another study30. Both studies resulted in moderate accuracy, however the data and model developed in the latter study was biased towards an elderly population and also lacked an external cohort for model validation.30
The current research builds on the use of ECGs in Parkinsonism studies, using a deep learning approach to develop a predictive model for prodromal PD using standard 10-second raw digitized ECGs stored in a large healthcare system EHR and validate this model using an independent cohort from a different healthcare system EHR. The developed model had moderately good classification accuracy in the validation cohort, with an AUC of 0.74 between 6 and 12 months before PD diagnosis and an AUC of 0.67 when PD 5 years before diagnosed onset. While AUC can be deemed as somewhat low, it should be noted that the main aim of this model was to help clinicians in assessing probable risk of people developing the disease and not for diagnosis. This means that the model’s ability to correctly distinguish between the majority of cases and controls, collectively, has important implications for the potential in its use for early detection and pre-screening.
Furthermore, we validated PD diagnosis and incidence dates in the external LUC validation population thus ensuring accurate performance metrics. Given that the model was developed in a cohort without chart reviews in MLH, a proportion of which were likely mislabeled, the predictive performance of the model in the replication cohort provides an important indication for the use of ECGs in prodromal PD prediction as well as for future studies. This is even more important since almost half of the cases were correctly identified as such, and most of the controls were also classified properly. We acknowledge that the developed deep learning model needs improvement in case identification, however, it should be noted that the MLH dataset used for training was largely imbalanced between cases and controls, making the model slightly more effective in identifying controls. The model’s performance can be improved following re-training using a larger cohort or the implementation of transfer learning in combination with a larger cohort.
In comparison to the machine learning models utilizing ECG feature-engineering as inputs (Supplementary Table S1), the CNN model predicted prodromal PD at higher accuracy. These results are also comparable to resulted in published research that included multiple different biological and demographic information, some of which can be quite costly to obtain (e.g. serum analysis), within a machine learning system. Furthermore, we emphasize the fact that while the AUC results of the CNN model for the prediction of prodromal PD within 5 years before onset are somewhat moderate, this research shows that PD cardiac markers, which are found to be present before any of motor changes involved in the onset of PD35, can act as signals within an artificial intelligence framework without the need of full ECG feature extraction. Furthermore, there is the potential to include demographic and clinical variables as additional inputs with the predicted outcomes from the ECG CNN model, to increase the accuracy of predicting prodromal PD within an extended time window. Karabayir et al.32 (2022) showed that there is added benefit in including demographic and clinical variables in the prediction of prodromal PD, allowing for opportunity for exploration of improving this work. Such variables can also extend to the inclusion of data from the Cognitive Abilities Screening Instrument (CASI), olfactory tests and the simple choice reaction times test. At the time of development of the models in this research, such clinical and demographic variables were not available from the MLH and LUC data sources.
Overall, the CNN model developed in this research classified individuals based solely on a 10-second ECG input to an artificial intelligence framework. It is highly likely that incorporation of our ECG-based predictive model with other known prodromal markers of PD would result in improved risk stratification for PD. This further suggests the importance of routine collection of ECGs within cohort studies aiming to understand pathophysiology of PD and other neurodegenerative diseases.
This study has some limitations. Predominantly, we were able to perform chart reviews on LUC data, but not on MLH data. Based on the high false positive rate of PD diagnosis based exclusively on EHR ICD codes at LUC, there might have been the potential that a portion of MLH data included false case annotations, which would have negatively influenced model training and consequently its testing. Therefore, implementing transfer learning on our final deep learning model and retraining it on large well-annotated datasets can result in a more accurate PD prediction tool.
Another limitation of this study was that our chart reviews at LUC showed that out of 47 cases identified in LUC, only 29 of them were real PD cases. Therefore, our study also highlights the challenges around finding PD patients via her-based queries. However, it is worth noting that the developed PD risk prediction model produced significantly higher risk prediction for validated cases compared to controls.
We, note that the raw ECG data used in this study was gathered from two different cardiology information systems: GE MUSE at LUC and Epiphany at MLH. Therefore, a major strength of this study lies in the fact that the concordant results between internal testing and the validation process provides evidence that raw ECGs exported from different platforms can be used reliably within AI frameworks, with minimal pre-processing.
Future works largely lie in the inclusion of demographic, clinical variables and results from known PD diagnoses tests within the current deep learning model. Continuous improvement of the models’ prediction accuracy, especially following the use of larger cohort data, and inclusion of other ‘simple’ variables can lead to the integration of these models within ECG-capable smart wearables. The development of smart applications within wearables allows for cost-effective, non-invasive and minimal clinical burden for screening people for PD risk, allowing for early detection, quicker follow-up times and timely intervention strategies.
The result from this research shows that simple 10-second ECGs allow for the development of a deep learning predictive model that correctly classifies individuals with prodromal PD with modest accuracy. This model was effective in distinguishing between PD cases and controls within an independent cohort, increasing in accuracy closer to disease diagnosis, although still notably within the prodromal stage of the disease. Therefore, the use of standard ECGs, which are easily and routinely collected by healthcare institutes, may help identify individuals at high-risk of PD, allowing for timely inclusion in disease-modifying therapeutic trials and possible intervention strategies to help delay or slow down disease progression.