Noninvasive brain stimulation during EEG improves machine learning classification in chronic stroke

doi:10.21203/rs.3.rs-4809587/v1

Download PDF

Research Article

Noninvasive brain stimulation during EEG improves machine learning classification in chronic stroke

https://doi.org/10.21203/rs.3.rs-4809587/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Background:

In individuals with chronic stroke and hemiparesis, noninvasive brain stimulation (NIBS) may be used as an adjunct to therapy for improving motor recovery. Specific states of movement during motor recovery are more responsive to brain stimulation than others, thus a system that could auto-detect movement state would be useful in correctly identifying the most effective stimulation periods. The aim of this study was to compare the performance of different machine learning models in classifying movement periods during EEG recordings of hemiparetic individuals receiving noninvasive brain stimulation. We hypothesized that transcranial direct current stimulation, a form of NIBS, would modulate brain recordings correlating with movement state and improve classification accuracies above those receiving sham stimulation.

Methods:

Electroencephalogram data were obtained from 10 participants with chronic stroke and 11 healthy individuals performing a motor task while undergoing transcranial direct current stimulation. Eight traditional machine learning algorithms and five ensemble methods were used to classify two movement states (a hold posture and an arm reaching movement) before, during and after stimulation. To minimize compute times, preprocessing and feature extraction were limited to z-score normalization and power binning into five frequency bands (delta through gamma).

Results:

Classification of disease state produced significantly higher accuracies in the stimulation (versus sham) group at 78.9% (versus 55.6%, p < 0.000002). We observed significantly higher accuracies when classifying stimulation state in the chronic stroke group (77.6%) relative to healthy controls (64.1%, p < 0.0095). In the chronic stroke cohort, classification of hold versus reach was highest during the stimulation period (75.2%) as opposed to the pre- and post-stimulation periods. Linear discriminant analysis, logistic regression, and decision tree algorithms classified movement state most accurately in participants with chronic stroke during the stimulation period (76.1%). For the ensemble methods, the highest classification accuracy for hold versus reach was achieved using low gamma frequency (30–50 Hz) as a feature (74.5%), although this result did not achieve statistical significance.

Conclusions:

Machine learning algorithms demonstrated sufficiently high movement state classification accuracy in participants with chronic stroke performing functional tasks during noninvasive brain stimulation. tDCS improved disease state and movement state classification in participants with chronic stroke.

chronic stroke

machine learning

electroencephalogram

noninvasive brain stimulation

transcranial direct current stimulation

Chronic stroke affects over seven million people in the US and remains a major source of worldwide disability.¹ Brain computer interfaces (BCIs) are a potential method to improve quality of life for affected individuals given their ability to detect stroke severity, sense ongoing motor behavior and assist with longitudinal recovery.^2–6 To advance BCIs for chronic stroke towards clinical practice, many groups are interested in creating a simple yet flexible BCI embedded with machine learning (ML) that can be deployed at population scale. Yet it is currently unknown what neural input features and ML approaches are optimized for this task.

Electroencephalography (EEG) recordings are noninvasive and an easily accessible method of sampling brain activity. Moreover, EEG can be paired with noninvasive brain stimulation (NIBS) to enhance certain features of control signal classification. In a recent study, high-frequency repetitive transcranial magnetic stimulation (rTMS) was used to augment detection of motor-related activity by an EEG BCI in participants performing imaginary arm flexion.⁷ While mental rehearsal has evolved into the standard paradigm for BCI studies,^8–12 particularly in those with tetraplegia, the vast majority of participants with stroke retain some level of motor capability. Accordingly, many have questioned whether ML-assisted EEG classification in stroke participants performing real movements requires modeling a substantively different parameter space. As an example, Mebarkia et al.¹³ found that layering three support vector machines (SVM) (i.e., multi-voting) was necessary to exceed 90% accuracy in classifying left versus right hand movements in 3D space, though this BCI architecture was tested exclusively in healthy participants.

The parameter space for EEG BCIs designed for stroke rehabilitation is extensive. First, the spectral content of EEG recordings in healthy controls is significantly different from that of individuals with chronic stroke.^14–16 For example, mu (8–12 Hz) and beta (13–30 Hz) power are attenuated following stroke, yet event-related desynchronization (ERD) and synchronization (ERS) of mu and beta are frequently used to provide the feedback signal for decoding movement intention.^17–19 For those with severe motor deficits, ERD/ERS can be effectively absent and thus alternate control signals must be identified, though it is not clear what approach should be used to select surrogate features. Second, vascular compromise gives rise to hemispheric asymmetry (i.e., ipsi- vs contralesional), and this asymmetry is reflected as imbalanced oscillatory patterns following stroke.^20–22 Finally, as mentioned, NIBS modalities such as rTMS have been used in combination with BCIs, however rTMS must be used in a discontinuous fashion with EEG recording due to magnetic pulse-related artifacts. Whether these factors preclude ML classification of brain state during real movement is a critical question to address prior to development of a clinically accepted BCI for chronic stroke.

In this study, we aimed to test the overall hypothesis that EEG data recorded during different movement states in individuals with chronic stroke can be accurately classified using ML. We further tested whether transcranial direct current stimulation (tDCS), which can be activated continuously during EEG recording, boosts classification accuracy compared to sham stimulation. As a control, we employed an identical ML pipeline in healthy participants.

Participants

Ten chronic stroke (CS) participants with hemiparesis and 11 healthy controls (HC) were included in this study (IRB approved by the Medical University of South Carolina, Pro#00087153). Chronic stroke was defined as greater than six months from last ischemic stroke as determined by a fellowship-trained stroke neurologist. Mean time after stroke for CS participants was 98.8 ± 36.7 months. Those lacking the ability to reach with the affected arm were excluded.

Randomization and Blinding

Study participants were randomly assigned to either the stimulation or sham group in a single-blinded fashion. The sham group underwent a placebo procedure that applied stimulation for 30 seconds to induce a tingling sensation in the scalp, mimicking tDCS applied for 20 minutes in the active stimulation group. Research staff operating the tDCS device were aware of group assignment.

Task and Dataset

During a single session, participants were fitted with Ag-AgCl EEG electrodes using the 10–20 International EEG system (DC115, Rhythmlink International LLC, Columbia, SC). Next, a 5-electrode, center-surround, anodal high-density (HD) transcranial direct current stimulation (tDCS) montage was arranged on the scalp contralateral to the affected arm (model 2001tE and 4x1-C3ASKU 4x1-C3ASoterix Medical; 4x1 HD-tDCS/HD-tES adaptor, Soterix Medical, Inc, Woodbridge, NJ). The central anodal tDCS electrode was positioned near the C3 or C4 EEG electrode, depending on the laterality of the upper extremity performing the task. The surrounding cathodal tDCS electrodes were positioned in a square configuration in relation to the central anode. All tDCS electrodes were placed adjacent to, but out of direct contact with, nearby EEG electrodes. Next, an Oculus Rift (Model C4-A, Menlo Park, CA) virtual reality headset was carefully fitted over the EEG-tDCS arrangement, and a wireless controller containing an accelerometer was placed in the affected hand. EEG signals were sampled at 1024 Hz (Natus® Neuroworks®, Pleasanton, CA). An arm reaching task designed in the Unity® (version 2022.2, Unity Technologies, San Francisco, CA) programming environment began with a holding position requiring a slightly outstretched upper limb to remain in place for several seconds prior to the appearance of a colored sphere. Appearance of the sphere prompted a reaching movement (range of 15 to 50 cm) by the participant to virtually ‘touch’ the sphere and return to the holding position. The duration of the holding position was randomly varied between 2–5 seconds, and the location of the sphere was varied randomly between different locations within the virtual environment to reduce learning of the task over time. In between the hold and reach periods, a 0.5 second preparatory (or ‘prep’) cue was delivered in the form of a vibratory pulse of the controller. Participants were not explicitly instructed to attend to this cue. The trial ended after 12 consecutive hold, prep, and reach cycles were completed (average trial duration for all 12 reaches was ~ 3 minutes). Each movement cycle was divided into the following states, or “epochs”: the hold epoch, occurring prior to the prep epoch, and the reach epoch occurring from initiation of movement to return to the resting position. After an initial trial during the pre-stimulation period, the tDCS system was activated and within 30 seconds reached a maximum current delivery of 2.0 mA. At 5- and 15-minutes each after tDCS activation, two additional trials of 12 reaches were performed. Next, at 20 minutes after activation of tDCS, the tDCS current was switched off. Five minutes after deactivation of tDCS, a fourth and final trial was performed. Thus, all participants performed a total of four trials of 12 reaches each. Participants in the sham control group had tDCS activated for only 30 seconds before being immediately deactivated. The HC group underwent the same randomization and procedure. The entire experiment for each participant lasted approximately an hour.

Preprocessing and Feature Extraction

All analyses were conducted in Python 3.9 using the NumPy,²³ SciPy,²³ and Scikit-Learn libraries.²⁴ EEG recordings were labeled according to movement state using synchronous VR data. To achieve a minimally pre-processed pipeline, resampling and filtering of the raw signal were omitted. All data were z-score normalized over the entire EEG signal for each channel per participant, i.e., signals from participants were normalized with respect to only that subject due to heterogeneity of stroke characteristics. This was performed to significantly decrease computational demand and time during ML training. Normalization was performed via the method shown in Eq. 1.0.

$$\:xnorm=\:\frac{x-\stackrel{-}{x}}{\sigma\:}$$

Equation 1: Z-score normalization equation used within a single EEG channel

Recordings were then divided into 1-second epochs and power spectral density (PSD) was calculated for each epoch. PSDs were generated using Welch’s method, which applies the discrete Fourier transform (DFT) to several contiguous windowed subsets of the original signal. Hann windowing was used to generate windowed segments with 50% overlap, and the number of FFT segments was set to the sample rate in order to maintain a spectral resolution of 1 Hz. PSD values were subsequently binned into frequency bands as follows: delta (1–4 Hz), theta (4–8 Hz), alpha (8–12 Hz), beta (12–30 Hz), and gamma (30–50 Hz). As bands did not exceed 50 Hz, a 60 Hz notch filter was not applied. The labeled dataset was split 70:30 for training and testing with grid search cross-validation used for hyperparameter tuning. Training was performed 10 times for each model per trial and training times were recorded. Training was performed on a 6th Gen Intel Xeon(R) Gold 6226R processor at 2.90GHz with 64 cores and 187.5 GB RAM in serial processing alongside two NVIDIA RTX A5000 GPUs.

Machine Learning Implementation

Classification accuracy was compared among 13 different machine learning algorithms on the same data set. The following models were chosen for this study: logistic regression (LR), linear discriminant analysis (LDA), decision trees (DT), Naïve Bayes (NB), K-nearest neighbors (KNN), random forest (RF), AdaBoost, XGBoost and heuristic voting classifiers.^25–28 LR was chosen due to its use in motor imagery classification derived from EEG signals.²⁹ The LDA classifier models the distribution of each class and was included due to its ability to perform dimensionality reduction and minimize training time.²⁴ DT and RF were chosen to detect complex patterns in binarized data that may lead to a higher classification accuracy. The Naïve Bayes classifier applies Bayes theorem to calculate the probability of an observation belonging to a given class based on the assumption that the data are distributed in a Gaussian manner and was included due to its minimal training time and mathematical simplicity.²⁴ KNN was included to determine if movement states exhibited clustering behavior in the associated feature space as that would reveal insights beyond improved classification accuracy. The boosting algorithms XGboost and AdaBoost were included to detect the importance of incorrectly classified data points.

The hyperparameter search process for each classifier was defined in the following way: For LR, a univariate grid search was performed on the parameter C for values 2^x for 15 ≤ x < 35. For LDA, comparisons were conducted for accuracies achieved by singular value decomposition (SVD), least squares (LSQR), and eigen solvers. Although all models demonstrated similar accuracies, SVD was chosen since it does not compute the covariance matrix and therefore has a shorter run time.

For NB classification, a Gaussian implementation was used due to the Gaussian nature of epoched EEG data. A parameter representing the variance was computed using 10^x for − 15 ≤ x < 0. For KNN, the number of neighbors considered was varied for 3 ≤ x < 10. For DT, the minimum weight fraction of each leaf node was empirically determined to be 0. A grid search was performed to optimize the minimum number of samples required to split individual nodes (varied for 2 ≤ x < 11) and the maximum depth allowed for trees (varied for 2 ≤ x < 30). For RF, the minimum number of samples per leaf was optimally determined to be 1, the method of determining the maximum samples to split a node was set to the square root of the total number of samples, and the maximum depth of each tree was set to 30. A grid search was performed to optimize the minimum number of samples required to split individual nodes (varied from 2 ≤ x < 5); the number of individual trees was varied over the set 5n for 5 ≤ n < 21. Feature importance was calculated by taking each feature’s average depth of use and weighing the average from one relative to the other features’ depths. The earlier a feature was used in a tree, the more important it was considered. For AdaBoost, NB and DT were contrasted as base estimators, and the number of individual estimators was varied by 25x for 8 ≤ x ≤ 16. For XGBoost, the number of individual estimators was varied by 25x for 50 ≤ x ≤ 400 and max tree depth varied by 5x for 5 ≤ x ≤ 30. The “Hist” method as implemented by XGBoost 2.0.3³⁰ was chosen for the Tree Method hyperparameter to reduce training time. For voting classifiers, an ensemble of pre-trained models for LR, LDA, DT, RF, NB, and KNN was first created. One hard voting classifier and four soft voting classifiers among these were then utilized. Soft voting classifiers used the following weight methods: uniform (“uni”) weights, weights determined by the individual models’ training set accuracy (termed “train”), uniform weights determined by the highest model accuracy (termed “hard”), and weights predetermined based upon empirical global accuracy of all models within the ensemble (termed “global”). The ensemble labeled “me” was weighted based on the mean of several selected base estimators that appeared to perform better during the initial classification tests used to classify hold versus reach. Effect weights, hyperparameters and compute times were saved for each training. Prediction results were stored as csv files labeled by electrode and feature.

Statistical analysis

Statistical analysis was conducted using the rstatix package for R (version 4.3.1).³¹ Combining data by two to three features for each model (e.g., grouping all electrodes and frequencies) resulted in groups sufficiently large to satisfy the central limit theorem and were therefore treated as parametric data. For comparisons between multiple groups, one-way ANOVA was used. Sphericity was confirmed using Mauchly’s test. Data visualizations were generated using the ggpubr package.³² The default threshold for significance was set at p < 0.05 for all tests.

Twenty-one participants (CS = 10; HC = 11) completed EEG recordings while performing a VR-guided motor task. CS and HC groups did not differ with regard to sex (χ²_[1] < 0; p > 0.05). CS participants (63.3 ± 10.2 years) had a higher mean age than HC participants (46.3 ± 11.3 years; t_[18] = 2.31, p = 0.0319). Six CS participants had left-sided infarct while four had right-sided infarct. From this data set, a total of 4030 models were trained with a total compute time of 103 hours.

We used 13 different ML algorithms (see Methods) to explore the effect of classification accuracy on disease state, stimulation state, movement states, frequency band, time period and electrode location. Time periods examined were pre-stimulation, intra-stimulation at 5 and 15 minutes after stimulation began, and post-stimulation, hereafter referred to as ‘Pre’, ‘Intra5’, ‘Intra15’ and ‘Post’. Features were tested in limited combinations to 1) reduce the exponential increase in model expansion and 2) to narrow clinical interpretability of the parameter space. Note that all accuracies depicted are the classification accuracies on the validation set; no samples in the validation set were used to train any of the models.

To investigate the baseline model accuracy of discriminating between healthy and chronic stroke participants, as a control we compared each algorithm using all electrodes, frequency bands, and movement states grouped together. We observed that the mean accuracy of classifying HC and CS participants was 71.1% for the sham group and 83.4% for the stim group during the pre-stimulation time period, likely due to the increased hemispheric asymmetry evident in the recordings relative to healthy controls (p < 0.0016). A higher mean classification accuracy for stim versus sham groups persisted throughout all intra- and post-stimulation time periods and was greatest at the intra5 time period (stim: 80.4 ± 11.7% versus sham: 58.5 ± 9.1%; t_[24] = 6.3653 p < 1.4e-6, Fig. 1).

For the intra5, intra15 and post-stimulation time periods, the five ensemble models (global, hard, me, train, uni) converged to produce similar accuracies: intra5 (93.6%), intra15 (90.0%), and post-stimulation (93.8%) (Supplementary Table 1).

To investigate the baseline model accuracy in discriminating between sham and stimulation states, as a control we compared each algorithm using all electrodes, frequency bands, and movement states grouped together. We observed that the mean accuracy of classifying stim versus sham state was 70.4% for the HC group and 86.8% for the CS group during the pre-stimulation time period (p < 0.00023). A higher classification accuracy for the sham versus stim groups persisted throughout all time periods and was greatest at the intra15 time period (CS: 80.3 ± 10.8% versus HC: 64.0 ± 8.4%; t_[24] = 5.4557, p < 4.5e-5, Fig. 2).

For the intra5, intra15 and post-stimulation time periods, the five ensemble models (global, hard, me, train, uni) converged to produce similar accuracies: intra5 (92.3%), intra15 (92.0%), and post-stimulation (92.0%) (Supplementary Table 2).

To investigate the accuracy of each algorithm in discriminating between hold and reach movement states, we created models using all electrodes and frequency bands (Fig. 3). We observed that the mean accuracy of classifying hold versus reach was 68.6 ± 0.2% for the CS sham group, 72.2 ± 0.5% for the CS stim group, 79.6 ± 0.3% for the HC sham group, and 71.6 ± 0.6% for the HC stim group at the pre-stimulation time period (Fig. 3C). A higher mean classification accuracy for sham groups persisted throughout all time periods except for the CS cohorts at the pre and intra15 time periods. At the intra15 time period, the mean accuracy for classifying hold versus reach was 75.3 ± 1.3% for the CS stim group and 71.5 ± 1.5% for the CS sham group (t_[23] = 9.7250; p = 8.45e-10, Fig. 3, Supplemental Table 3).

In the CS stim group, LR, LDA, and DT each performed this classification with 76.1% accuracy at the shortest average training time of 0.77 sec per model (lines superimposed in Fig. 3). By comparison, XGBoost performed this classification with 75.2% accuracy at the longest average training time of 1 minute 3.8 seconds sec per model.

To investigate the accuracy of each algorithm in discriminating between hold and reach movement states by frequency band, we created models using all electrodes recorded in the CS stim group. We observed that the mean accuracy of classifying hold versus reach was consistently higher in the stimulation and post-stimulation time periods (Fig. 4A, note that for the pre-stimulation period, RF classification accuracy was below 65% for all frequency bands and is not shown). Interestingly, 10 out of 13 algorithms showed no differences in accuracy between frequency bands. At the intra15 time period, LR, LDA, and DT classified hold versus reach equally at the highest accuracy (76.1%, Fig. 4B) for all frequency bands.

In contrast, the five ensemble models (global, hard, me, train, uni) showed the highest classification accuracy for the gamma frequency band at the intra15 time period in comparison to all other bands (alpha = 74.5%, beta = 75.0%, theta = 75.1%, delta = 75.2%, and gamma = 75.6%; Supplementary Table 4), although this difference was not statistically significant (p = 0.37).

To investigate the accuracy of each algorithm in discriminating between hold and reach states by electrode laterality, we created models using all frequency bands. We chose the electrode overlying primary motor cortex (C3 and C4) according to the contralateral hand used to perform the reaching task. That is, if the right hand performed the task, then the C3 electrode was labeled as the ipsi-stimulated electrode while the C4 electrode was labeled as the contra-stimulated electrode and vice versa. We observed that the mean accuracy of classifying hold versus reach was consistently higher in the contra-stimulated electrode for the sham groups (i.e., HC and CS) except for the pre-stimulation period in the HC sham cohort. In contrast, in the CS stim group, classification accuracy was highest in the ipsi-stimulated electrode during the stimulation periods only (intra5: t_[29]=-4.26, p = 0.0003; intra15: t_[23]=-3.72, p = 0.0011, Fig. 5A, Supplemental Table 5).

Notably, the classification accuracy was similar in both the contra-stimulated and ipsi-stimulated electrodes in the HC stim group, suggesting that the higher accuracy in the ipsi-stimulated electrode in the CS stim group is likely not driven purely by stimulation artifact. Moreover, the classification accuracy in the ipsi-stimulated electrode in the CS stim group is constant relative to the pre-stimulation state, again suggesting a physiological response to stimulation that peaks at the intra15 time period (Fig. 5A, Supplemental Table 5).

As neuromodulation becomes an accepted adjunct for chronic stroke recovery, the potential use of brain stimulation to assist in identifying movement states from brain recordings has become a topic of great interest. Detection of movement intention using EEG recordings has been successfully performed in several studies using healthy and tetraplegic participants. However, real movement classification in individuals with hemiparesis is not well understood, including the dimensionality involved in modeling relevant parameters. Moreover, in implanted brain recording and stimulation systems, compute power is limited to the onboard processor, which prohibits the typical types of algorithms employed, e.g., deep learning and other neural network-based strategies. To surpass these constraints, using tuning optimized for biosignals, supervised ML approaches can be used to model high-dimensional parameter sets in order to arrive at acceptable accuracies and modeling times for control signal classification. The central problem addressed in this study is whether movement classification based on EEG recordings is possible (i.e., above chance) in chronic stroke survivors undergoing NIBS during performance of a functional task. The dataset used in this study was obtained from 10 chronic stroke survivors and 11 healthy control participants. Each group contained both active and sham stimulation cohorts. All participants followed a cued virtual reality (VR) arm reaching task, which was repeated before, during and following stimulation. EEG was recorded throughout the entire experiment. Our overall finding is that NIBS improved classification accuracy primarily in the chronic stroke group with a preference for gamma frequency band in the ensemble methods.

Comparing our results to similar studies that developed EEG classifiers for movement states, SVM performed modestly better in 46.4% of models than LDA and LR in classifying rest, simple arm movement, goal-oriented arm movement and hand clenching using motor imagery.³³ In our study, the performance of LDA for hold versus reach (at 76.1% accuracy during the intra15 time period) was within range of the accuracies reported by Yong and Menon³³ (75–81% accuracy) and higher than those of Rodrigo et. al.³⁴ (64–68% accuracy). The voting classifiers trained in this study for hold versus reach in the chronic stroke cohort (at the intra15 time period, see Table 3) achieved an accuracy of 73.8%, which is not as strong as the voting classifier created by Khrishna et. al.³⁵ (86% accuracy). Similar to the dataset used by Mebarkia and Reffad,¹³ the dataset used by Khrishna et. al.³⁵ classifies motor imagery in right arm, left arm, right foot, and left foot without any consideration for hold or preparation, which may explain the stronger performance. The AdaBoost classifier trained as part of our study achieved an accuracy of 75.3% using decision trees as the classifier base, which is comparable to the AdaBoost classifiers trained by Gao et. al.²⁸ who used SVM and LDA bases to achieve an accuracy of 74% and 72%, respectively.

A few studies have combined tDCS and BCI device technology with mixed results. Matsumoto et al.³⁶ used a motor imagery (MI) BCI in concert with multiple 1 mA 10 min tDCS sessions in 6 healthy participants. In their investigation, mu ERD improved with anodal and attenuated with cathodal stimulation.³⁶ Kasashima and colleagues³⁷ repeated the paradigm in participants with stroke and hemiparesis, demonstrating similar results. In Wei et al’s³⁸ study, tDCS specifically modulated upper mu (10–14 Hz) and beta (14–26 Hz) frequencies in 32 healthy controls. Hong and colleagues³⁹ introduced diffusion and perfusion MRI following tDCS in combination with MI-BCI. Tractography estimates showed significant changes on the ipsilesional side for participants receiving tDCS, though no difference in motor improvement was observed between stim and sham groups.³⁹ Interestingly, the authors also showed bilateral changes in parietal cerebral blood flow correlating with functional recovery and were one of the first to suggest an interhemispheric mechanism driving tDCS/MI-BCI function.³⁹ Notwithstanding, tDCS did not appear to influence MI-BCI performance in a randomized, double-blinded controlled trial in 19 participants with stroke.⁴⁰ These results did not differ from a subsequent study in which functional MRI was used to derive a low-frequency fluctuation metric.⁴¹ None of the studies outlined utilized real movements as a control.

Although we observed that the ensemble methods achieved the highest hold versus reach classification using gamma power as a feature, modulation of gamma in individuals with stroke is only sparsely reported. In Tecchio et al’s⁴² study, increased gamma power (33.5–44 Hz) in the affected hemisphere of chronic stroke participants was correlated with motor improvement using magnetoencephalography recordings. Moreover, Pellegrino and colleagues⁴³ demonstrated that gamma reactivity to an auditory stimulus in chronic stroke participants was tightly correlated to clinical outcome as measured by Barthel Index and Functional Independence Measure. Yet in a recent systematic analysis of randomized controlled trials examining the utility of BCIs in stroke motor recovery, gamma power was absent from the frequency bands investigated as a potential biomarker.⁶

Limitations

This study was not without limitations. First, a small sample size limits interpretability of our classification results. Second, we observed significant differences in mean age between our chronic stroke (63.3 ± 10.2 years) and healthy (46.3 ± 11.3 years) cohorts, potentially confounding our observed differences in classification accuracy which may have been affected by age-related changes in the brain. Third, to attempt to mimic the processing and bit rate constraints of a fully implantable BCI system,⁴⁴ we avoided training more comprehensive deep learning models. Nevertheless, a robust classification pipeline using Convolutional Neural Networks was described by Lun et. al.⁴⁵ who trained a 5-layer model on the Physionet database.^46,47 Remarkably, using only 10 participants from that dataset, a global accuracy of 94% or above was demonstrated. Similar to our study, they limited pre-processing of the EEG signal and still achieved high classification accuracies.⁴⁵ In our study, training was limited to 12 repetitions per task and four tasks per participant. Furthermore, results are almost certainly confounded by some proportion of learning of the task, though we attempted to limit this by randomizing several features, including time between movement states and location of reach target. The effect of learning on our results should be mitigated by the controls that were incorporated, including healthy participants and sham stimulation. We limited the parameter space explored to aspects of the task design, and we have not carried out a full examination of the preprocessing and feature extraction components of the ML pipeline. For example, Chen et al.⁴⁸ used differential entropy to improve classification performance in EEG recordings during a cognitive task. Finally, we did not include asymmetry index measures for the chronic stroke participants as some authors have in order not to bias the comparisons to healthy participants.¹⁵ Given the heterogeneity of each chronic stroke participant, the extent to which each of these factors contributes to the need to personalize the training models for each individual should be explored further in future studies.

Machine learning models were able to classify movement state in participants with chronic stroke using EEG even with minimal preprocessing. Classification accuracy was improved with tDCS in these participants, and the highest accuracies were obtained during stimulation rather than post-stimulation. These findings portend a brain-computer interface which can auto-detect movement state and deliver therapeutic stimulation when most beneficial for chronic stroke rehabilitation.

ADA, adaptive boost; CS, chronic stroke; DT, decision tree; EEG, electroencephalogram; Global, soft-vote ensemble weighted based on overall accuracy; Hard, hard vote ensemble; KNN, k-nearest neighbors; LDA, linear discriminant analysis; LR, logistic regression; Me, Soft-vote ensemble with weights chosen based on the mean of multiple base estimators; NB, naive bayes; RF, random forest; Stim, stimulation; tDCS, transcranial direct current stimulation; Train, Soft-vote ensemble weighted based on training performance; Uni, soft-vote ensemble weighted equally; VR, virtual reality; XG, XGboost.

Acknowledgements: N/A

Author contributions: NR performed the initial experiment and data collection. NR and RS performed the machine learning analyses. NR, RS, MSZ, BFS, MT contributed to the manuscript and provided critical feedback.

Funding statement: Center of Biomedical Research Excellence (COBRE) in Stroke Recovery—Junior Investigator Research Project. Source: National Institutes of Health (5 P20 GM109040).

Availability of data and materials: The data, as well as code, that support the findings of this study are available on request from the lead author.

Ethical approval and consent to participate: All participants were provided a thorough description of the study prior to participation. Approval was provided by the Medical University of South Carolina’s Institutional Review Board.

Consent for publication: Consent for publication was given by all participants.

Competing interests: The authors declare no competing interests.

Author details: ¹College of Medicine, Medical University of South Carolina, Charleston, SC 29425 USA. ²MUSC Institute for Neuroscience Discovery (MIND), Medical University of South Carolina, SC 29425 USA. ³Department of Neurosurgery, Medical University of South Carolina, SC 29425 USA

Tsao CW, Aday AW, Almarzooq ZI, et al. Heart Disease and Stroke Statistics-2023 Update: A Report From the American Heart Association. Circulation. Feb 21 2023;147(8):e93-e621. doi:10.1161/cir.0000000000001123
Chaudhary U, Birbaumer N, Ramos-Murguialday A. Brain-computer interfaces for communication and rehabilitation. Nat Rev Neurol. Sep 2016;12(9):513-25. doi:10.1038/nrneurol.2016.113
Cervera MA, Soekadar SR, Ushiba J, et al. Brain-computer interfaces for post-stroke motor rehabilitation: a meta-analysis. Ann Clin Transl Neurol. May 2018;5(5):651-663. doi:10.1002/acn3.544
Hughes C, Herrera A, Gaunt R, Collinger J. Bidirectional brain-computer interfaces. Handb Clin Neurol. 2020;168:163-181. doi:10.1016/b978-0-444-63934-9.00013-5
Young MJ, Lin DJ, Hochberg LR. Brain-Computer Interfaces in Neurorecovery and Neurorehabilitation. Semin Neurol. Apr 2021;41(2):206-216. doi:10.1055/s-0041-1725137
Fu J, Chen S, Jia J. Sensorimotor Rhythm-Based Brain-Computer Interfaces for Motor Tasks Used in Hand Upper Extremity Rehabilitation after Stroke: A Systematic Review. Brain Sci. Dec 28 2022;13(1)doi:10.3390/brainsci13010056
Jia T, Mo L, Li C, Liu A, Li Z, Ji L. 5 Hz rTMS improves motor-imagery based BCI classification performance. Annu Int Conf IEEE Eng Med Biol Soc. Nov 2021;2021:6116-6120. doi:10.1109/embc46164.2021.9630102
Pichiorri F, De Vico Fallani F, Cincotti F, et al. Sensorimotor rhythm-based brain-computer interface training: the impact on motor cortical responsiveness. J Neural Eng. Apr 2011;8(2):025020. doi:10.1088/1741-2560/8/2/025020
Mokienko OA, Chervyakov AV, Kulikova SN, et al. Increased motor cortex excitability during motor imagery in brain-computer interface trained subjects. Front Comput Neurosci. 2013;7:168. doi:10.3389/fncom.2013.00168
Hänselmann S, Schneiders M, Weidner N, Rupp R. Transcranial magnetic stimulation for individual identification of the best electrode position for a motor imagery-based brain-computer interface. J Neuroeng Rehabil. Aug 25 2015;12:71. doi:10.1186/s12984-015-0063-z
Mihelj E, Bächinger M, Kikkert S, Ruddy K, Wenderoth N. Mental individuation of imagined finger movements can be achieved using TMS-based neurofeedback. Neuroimage. Nov 15 2021;242:118463. doi:10.1016/j.neuroimage.2021.118463
Gao T, Hu Y, Zhuang J, Bai Y, Lu R. Repetitive Transcranial Magnetic Stimulation of the Brain Region Activated by Motor Imagery Involving a Paretic Wrist and Hand for Upper-Extremity Motor Improvement in Severe Stroke: A Preliminary Study. Brain Sci. Dec 29 2022;13(1)doi:10.3390/brainsci13010069
Mebarkia K, Reffad A. Multi optimized SVM classifiers for motor imagery left and right hand movement identification. Australas Phys Eng Sci Med. Dec 2019;42(4):949-958. doi:10.1007/s13246-019-00793-y
Rabiller G, He JW, Nishijima Y, Wong A, Liu J. Perturbation of Brain Oscillations after Ischemic Stroke: A Potential Biomarker for Post-Stroke Function and Therapy. Int J Mol Sci. Oct 26 2015;16(10):25605-40. doi:10.3390/ijms161025605
Sato Y, Schmitt O, Ip Z, et al. Pathological changes of brain oscillations following ischemic stroke. J Cereb Blood Flow Metab. Oct 2022;42(10):1753-1776. doi:10.1177/0271678x221105677
Leonardi G, Ciurleo R, Cucinotta F, et al. The role of brain oscillations in post-stroke motor recovery: An overview. Front Syst Neurosci. 2022;16:947421. doi:10.3389/fnsys.2022.947421
Pfurtscheller G, Aranibar A. Evaluation of event-related desynchronization (ERD) preceding and following voluntary self-paced movement. Electroencephalogr Clin Neurophysiol. Feb 1979;46(2):138-46. doi:10.1016/0013-4694(79)90063-4
Wolpaw JR, McFarland DJ. Control of a two-dimensional movement signal by a noninvasive brain-computer interface in humans. Proc Natl Acad Sci U S A. Dec 21 2004;101(51):17849-54. doi:10.1073/pnas.0403504101
Ramos-Murguialday A, Broetz D, Rea M, et al. Brain-machine interface in chronic stroke rehabilitation: a controlled study. Ann Neurol. Jul 2013;74(1):100-8. doi:10.1002/ana.23879
Bundy DT, Souders L, Baranyai K, et al. Contralesional Brain-Computer Interface Control of a Powered Exoskeleton for Motor Recovery in Chronic Stroke Survivors. Stroke. Jul 2017;48(7):1908-1915. doi:10.1161/strokeaha.116.016304
Dodd KC, Nair VA, Prabhakaran V. Role of the Contralesional vs. Ipsilesional Hemisphere in Stroke Recovery. Front Hum Neurosci. 2017;11:469. doi:10.3389/fnhum.2017.00469
Hasegawa K, Kasuga S, Takasaki K, Mizuno K, Liu M, Ushiba J. Ipsilateral EEG mu rhythm reflects the excitability of uncrossed pathways projecting to shoulder muscles. J Neuroeng Rehabil. Aug 25 2017;14(1):85. doi:10.1186/s12984-017-0294-2
Harris CR, Millman KJ, van der Walt SJ, et al. Array programming with NumPy. Nature. Sep 2020;585(7825):357-362. doi:10.1038/s41586-020-2649-2
Pedregosa F, Varoquaux G, Gramfort A, et al. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research. 01/02 2012;12
Altman NS. An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression. The American Statistician. 1992/08/01 1992;46(3):175-185. doi:10.1080/00031305.1992.10475879
Balakrishnama S, Ganapathiraju A. Linear Discriminant Analysis—A Brief Tutorial. 01/01 1998;11
Bentlemsan M, Zemouri ETT, Yahya-Zoubir B, Ferroudji K. Random Forest and Filter Bank Common Spatial Patterns for EEG-Based Motor Imagery Classification. 2014.
Gao L, Cheng W, Zhang J, Wang J. EEG classification for motor imagery and resting state in BCI applications using multi-class Adaboost extreme learning machine. Rev Sci Instrum. Aug 2016;87(8):085110. doi:10.1063/1.4959983
Khan RA, Rashid N, Shahzaib M, et al. A novel framework for classification of two-class motor imagery EEG signals using logistic regression classification algorithm. PLoS One. 2023;18(9):e0276133. doi:10.1371/journal.pone.0276133
Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016;
Kassambara A. Pipe-Friendly Framework for Basic Statistical Tests [R package rstatix version 0.6.0]. 2020:
Kassambara A. Ggpubr: ‘Ggplot2’ Based Publication Ready Plots. . 2022;
Yong X, Menon C. EEG classification of different imaginary movements within the same limb. PLoS One. 2015;10(4):e0121896. doi:10.1371/journal.pone.0121896
Rodrigo M, Montesano L, Minguez J. Classification of resting, anticipation and movement states in self-initiated arm movements for EEG brain computer interfaces. Annu Int Conf IEEE Eng Med Biol Soc. 2011;2011:6285-8. doi:10.1109/iembs.2011.6091551
Khrishna D, Pasha IA, Savithri T. Multi-level voting method to classify motor imagery EEG signals. ARPN Journal of Engineering and Applied Sciences. 06/01 2018;13:3815-3819.
Matsumoto J, Fujiwara T, Takahashi O, Liu M, Kimura A, Ushiba J. Modulation of mu rhythm desynchronization during motor imagery by transcranial direct current stimulation. J Neuroeng Rehabil. Jun 11 2010;7:27. doi:10.1186/1743-0003-7-27
Kasashima Y, Fujiwara T, Matsushika Y, et al. Modulation of event-related desynchronization during motor imagery with transcranial direct current stimulation (tDCS) in patients with chronic hemiparetic stroke. Exp Brain Res. Sep 2012;221(3):263-8. doi:10.1007/s00221-012-3166-9
Wei P, He W, Zhou Y, Wang L. Performance of motor imagery brain-computer interface based on anodal transcranial direct current stimulation modulation. IEEE Trans Neural Syst Rehabil Eng. May 2013;21(3):404-15. doi:10.1109/tnsre.2013.2249111
Hong X, Lu ZK, Teh I, et al. Brain plasticity following MI-BCI training combined with tDCS in a randomized trial in chronic subcortical stroke subjects: a preliminary study. Sci Rep. Aug 23 2017;7(1):9222. doi:10.1038/s41598-017-08928-5
Chew E, Teo WP, Tang N, et al. Using Transcranial Direct Current Stimulation to Augment the Effect of Motor Imagery-Assisted Brain-Computer Interface Training in Chronic Stroke Patients-Cortical Reorganization Considerations. Front Neurol. 2020;11:948. doi:10.3389/fneur.2020.00948
Hu M, Cheng HJ, Ji F, et al. Brain Functional Changes in Stroke Following Rehabilitation Using Brain-Computer Interface-Assisted Motor Imagery With and Without tDCS: A Pilot Study. Front Hum Neurosci. 2021;15:692304. doi:10.3389/fnhum.2021.692304
Tecchio F, Pasqualetti P, Zappasodi F, et al. Outcome prediction in acute monohemispheric stroke via magnetoencephalography. J Neurol. Mar 2007;254(3):296-305. doi:10.1007/s00415-006-0355-0
Pellegrino G, Arcara G, Cortese AM, et al. Cortical gamma-synchrony measured with magnetoencephalography is a marker of clinical status and predicts clinical outcome in stroke survivors. Neuroimage Clin. 2019;24:102092. doi:10.1016/j.nicl.2019.102092
Leuthardt EC, Freudenberg Z, Bundy D, Roland J. Microscale recording from human motor cortex: implications for minimally invasive electrocorticographic brain-computer interfaces. Neurosurg Focus. Jul 2009;27(1):E10. doi:10.3171/2009.4.Focus0980
Lun X, Yu Z, Chen T, Wang F, Hou Y. A Simplified CNN Classification Method for MI-EEG via the Electrode Pairs Signals. Front Hum Neurosci. 2020;14:338. doi:10.3389/fnhum.2020.00338
Goldberger AL, Amaral LA, Glass L, et al. PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. Circulation. Jun 13 2000;101(23):E215-20. doi:10.1161/01.cir.101.23.e215
Schalk G, McFarland DJ, Hinterberger T, Birbaumer N, Wolpaw JR. BCI2000: a general-purpose brain-computer interface (BCI) system. IEEE Trans Biomed Eng. Jun 2004;51(6):1034-43. doi:10.1109/tbme.2004.827072
Chen DW, Miao R, Yang WQ, et al. A Feature Extraction Method Based on Differential Entropy and Linear Discriminant Analysis for Emotion Recognition. Sensors (Basel). Apr 5 2019;19(7)doi:10.3390/s19071631

No competing interests reported.

SupplementaryMaterial.docx

Download PDF

Version 1

posted

You are reading this latest preprint version

Noninvasive brain stimulation during EEG improves machine learning classification in chronic stroke

Status:

Version 1

Abstract

Background:

Methods:

Results:

Conclusions:

Figures

Background

Materials and methods

Participants

Randomization and Blinding

Task and Dataset

Preprocessing and Feature Extraction

Equation 1: Z-score normalization equation used within a single EEG channel

Machine Learning Implementation

Statistical analysis

Results

Discussion

Limitations

Conclusions

Abbreviations

Declarations

References

Additional Declarations

Supplementary Files

Status:

Version 1