Predicting medical refractoriness of patients with temporal lobe epilepsy: EEG-based parameter optimization and network analysis

doi:10.21203/rs.3.rs-4677811/v1

Download PDF

Article

Predicting medical refractoriness of patients with temporal lobe epilepsy: EEG-based parameter optimization and network analysis

https://doi.org/10.21203/rs.3.rs-4677811/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 04 Sep, 2024

Read the published version in Scientific Reports →

You are reading this latest preprint version

The early identification of refractory epilepsy is important to provide surgical treatment. However, limited studies have used electroencephalography (EEG)-based features to predict medical refractoriness. In this study, we employed feature-based machine learning algorithms to analyze resting-state EEG data to predict drug refractoriness in patients with temporal lobe epilepsy (TLE). This retrospective observational multicenter study included consecutive unilateral TLE patients treated with monotherapy at the time of the first EEG acquisition. Multiple EEG features were extracted from the EEG. The optimal features and frequencies were identified to predict drug refractoriness. Classification was conducted using random forest, extreme gradient boosting, and light gradient boosting models. The features were selected using filter methods and the wrapper method. Graph measurements were compared between the groups. Among the 48 participants, 34 (70.8%) were responsive, while 14 (29.2%) were refractory over a mean follow- up duration of 38.5 months. Coherence feature within the gamma frequency band exhibited the most favorable performance. The light gradient boosting model, employing the mutual information filter-based feature selection method, demonstrated the highest performance (AUROC = 0.821). Interchannel coherence displayed larger values in the refractory epilepsy. Graph theory measurements were higher in the refractory group than in the responsive group. Our study has demonstrated a promising method of identifying the early identification of refractory TLE, a population that may benefit from surgical intervention.

Health sciences/Biomarkers

Health sciences/Medical research

Health sciences/Neurology

prediction

machine learning

optimization

monotherapy

network

temporal lobe epilepsy

Epilepsy is a neurological disorder characterized by recurrent seizures. The primary treatment modality for epilepsy is anti-seizure medication (ASM) and regular maintenance of ASM is required to minimize seizure recurrence, even in patients who experience infrequent seizures.

Numerous cohort studies have revealed that optimal ASM provides seizure freedom in 60 − 70% of patients with newly diagnosed epilepsy. ¹ That is, the remaining 30 − 40% experience recurrent seizures even after adequate ASM therapy and are therefore classified as having refractory epilepsy. The International League Against Epilepsy Task Force proposed a consensus definition of refractory (or drug-resistant) epilepsy as “failure of adequate trials of two tolerated and appropriately chosen and used ASM schedules (whether as monotherapies or in combination) to achieve sustained seizure freedom.” ² This definition has been used to facilitate early identification of refractory epilepsy. Consequently, it encourages the exploration of alternative treatment modalities, including epilepsy surgery, neuromodulation, and ketogenic diet. ³

Ideally, the earlier the refractoriness is determined, the sooner epileptologists can consider alternative treatment options beyond rigorous medical treatment. This concept is supported by the finding that a longer duration of epilepsy before resective surgery leads to poorer surgical outcomes. ⁴ The likelihood of achieving seizure control decreases substantially with an increasing number of ASM trials. ⁵ Seizures relapse in approximately 50% of patients after the failure of the first ASM regimen. Therefore, the early stages of ASM treatment are critical for identifying refractory epilepsy. For this purpose, researchers have used test results from drug-naïve patients to predict medical refractoriness. ⁶

Another challenge in assessing refractoriness and the occurrence of seizures is the sole reliance on the patient's memory. Recent research has shown that more than half of focal impaired awareness seizures or nocturnal seizures go unnoticed and are not reported. ⁷ This underscores the need for objective tools such as electroencephalography (EEG) or imaging to observe the current status or predict refractoriness.

Several clinical factors, such as early onset of epilepsy, symptomatic or cryptogenic epilepsy, multiple seizure types, many seizures before ASM treatment, and a family history of epilepsy, have been reported to be associated with refractory epilepsy in previous studies. ^{1, 8} In a recent meta-analysis, EEG abnormality was a consistent predictive factor for refractory epilepsy. ⁹ Both slow waves and epileptiform discharges have been associated with refractory epilepsy in newly diagnosed patients with epilepsy. ^{10 11 12}

Recently, machine learning (ML) algorithms have been employed to predict epilepsy outcomes. Many researchers have used diverse features that were previously established by conventional statistical methods. ^{6 13} In these models, the EEG results were presented as categorical variables (i.e., normal, non-epileptiform abnormality, or epileptiform discharge). Presurgical clinical, electrographic, neuropsychological, imaging, and surgical data were used to predict surgical outcomes in patients with temporal lobe epilepsy. ^{14 15} Some studies utilized features from raw EEG data to predict treatment responses to levetiracetam. ^{16 17} However, only a few studies have used EEG-based features to predict medical refractoriness. Lin et al. built an SVM model to predict medical refractoriness in children with idiopathic epilepsy ¹⁸, and Wang et al. built an SVM model for children and adults with epilepsy. ¹⁹

In an earlier study, the first ASM led to a seizure-free rate of 47%, the second ASM achieved a seizure-free rate of 13%, and the third option resulted in a seizure-free rate of only 4%. ⁸ Considering these statistics, patients who fail to reach a seizure-free status with initial monotherapy seem to have a likelihood of seizure freedom of less than 20% after further ASM trials. Therefore, we generated an ML model using EEG-based features to predict medical refractoriness in patients with temporal lobe epilepsy on initial monotherapy.

Demographic and clinical characteristics

Forty-eight patients with unilateral TLE treated with monotherapy between 2014 and 2021 were identified; 33 (68.8%) patients had left-sided TLE, and 15 (31.3%) had right-sided TLE. The age of epilepsy onset was 44.9 ± 19.2 years old (mean ± standard deviation), and the age in the EEG study was 54.1 ± 15.5 years old. The follow-up duration from the EEG study to the last follow-up (when the final outcome was determined) was 38.5 ± 21.8 months. Of the 48 patients, 34 (70.8%) were responsive, and 14 (29.2%) were refractory to ASM treatment at the last follow-up. Hippocampal sclerosis was identified in 5 (10.4%) patients, trauma in 5 (10.4%) patients, and hemorrhage in 5 (10.4%) patients. The most frequently used ASM in the EEG study was levetiracetam (N = 20, 41.7%), followed by oxcarbazepine (N = 9, 18.8%) and lacosamide (N = 8, 16.7%). No demographic or clinical characteristics were significantly different between the responsive and refractory groups. (Table 1)

Table 1

Demographic and clinical characteristics of the responsive and refractory groups.
	Responsive group (N = 34)	Refractory group (N = 14)	p-value
Sex (N, %) Male Female	19 (55.9%) 15 (44.1%)	9 (64.3%) 5 (35.7%)	0.830^a
Age of epilepsy onset (years, mean ± s.d.)	45.4 ± 19.5	43.9 ± 19.0	0.809^b
Age at EEG study (years, mean ± s.d.)	53.4 ± 16.7	55.9 ± 12.6	0.626^b
Follow-up duration (months, mean ± s.d.)	39.0 ± 22.5	37.2 ± 21.0	0.800^b
Seizure types (N, %) Focal seizures only Focal & focal to bilateral seizures	20 (58.8%) 14 (41.2%)	7 (50.0%) 7 (50.0%)	0.810^a
Etiology (N, %) Hippocampal sclerosis Trauma Hemorrhage Cerebral infarction Moyamoya disease Encephalitis Focal cortical dysplasia Cavernous malformation Unknown	3 (8.8%) 4 (11.8%) 2 (5.9%) 1 (2.9%) 2 (5.9%) 1 (2.9%) 1 (2.9%) 1 (2.9%) 19 (55.9%)	2 (14.3%) 1 (7.1%) 3 (21.4%) 1 (7.1%) 0 (0.0%) 1 (7.1%) 0 (0.0%) 0 (0.0%) 6 (42.9%)	0.692^c
History of febrile convulsion (N, %)	2 (5.9%)	0 (0.0%)	0.895^c
History of CNS infection (N, %)	1 (2.9%)	0 (0.0%)	1.000^c
Epileptic focus (N, %) Left Right	25 (73.5%) 9 (26.5%)	8 (57.1%) 6 (42.9%)	0.441^a
ASM at EEG study (N, %) Levetiracetam Oxcarbazepine Lacosamide Valproic acid Carbamazepine Lamotrigine Topiramate	16 (47.1%) 7 (20.6%) 4 (11.8%) 4 (11.8%) 2 (5.9%) 0 (0.0%) 1 (2.9%)	4 (28.6%) 2 (14.3%) 4 (28.6%) 1 (7.1%) 2 (14.3%) 1 (7.1%) 0 (0.0%)	0.361^c
Seizure frequency at EEG study (per month, mean ± s.d.)	0.5 ± 1.7	0.6 ± 0.5	0.788^b
IED on first EEG (N, %)	15 (44.1%)	10 (71.4%)	0.160^a
^a Chi-square test. ^b Mann‒Whitney U test. ^c Fisher’s exact test. Abbreviations: EEG = electroencephalography; s.d. = standard deviation; ASM = antiseizure medication; IED = interictal epileptic discharge; CNS = central nervous system.

Predictive performance across various frequency bands and features

Figure 2 shows the predictive performance of the responsive and refractory groups across different frequency bands using various features extracted from EEG signals. On average, features based on interchannel connectivity, such as Pearson’s correlation coefficient and coherence, outperformed those derived from single-channel information, including the Hjorth parameter, statistical measures, energy metrics, and zero-crossing rate. In a comparative evaluation of the highest AUROC values among the various frequency bands and features, single-channel features yielded an average AUROC of 0.518, whereas interchannel features yielded an average AUROC of 0.611. Notably, the coherence feature with the gamma frequency attained the highest AUROC over the 5-fold (0.635 ± 0.131).

Predictive performance across various machine learning models and feature selection methods

In the analysis of coherence features within the gamma frequency, which demonstrated the highest performance in Fig. 2, 190 features were extracted and subsequently analyzed. When implementing feature selection across various ML models, the optimal performance was achieved using the mutual information filter-based feature selection method, in conjunction with LGB (Fig. 3), with the extraction of 25 features. At the window level, the model exhibited an AUROC of 0.774 (95% CI 0.643–0,904), accuracy of 0.757 (95% CI 0.659–0.855), sensitivity of 0.667 (95% CI 0.457–0.876), specificity of 0.807 (95% CI 0.687–0.926), positive predictive value of 0.681 (95% CI 0.522–0.840), and negative predictive value of 0.818 (95% CI 0.726–0.910). Advancing to a patient-level evaluation via soft voting, an AUROC of 0.821 (95% CI 0.654–0.988), accuracy of 0.791 (95% CI 0.640–0.943), sensitivity of 0.683 (95% CI 0.389–0.977), specificity of 0.838 (95% CI 0.692–0.984), positive predictive value of 0.700 (95% CI 0.439–0.961), and negative predictive value of 0.855 (95% CI 0.724–0.985) were achieved. For a comprehensive view of the performance metrics, refer to Table 2.

Table 2

Detailed prediction performances at window and patient level.
Level	AUROC	Accuracy	Sensitivity	Specificity	PPV	NPV
Window	0.774 [0.643–0.904]	0.757 [0.659–0.855]	0.667 [0.457–0.876]	0.807 [0.687–0.926]	0.681 [0.522–0.840]	0.818 [0.726–0.910]
Patient	0.821 [0.654–0.988]	0.791 [0.640–0.943]	0.683 [0.389–0.977]	0.838 [0.692–0.984]	0.700 [0.439–0.961]	0.855 [0.724–0.985]
Values are presented with [95% confidence interval]
Abbreviations: AUROC = average area under the receiver operating characteristic curve; PPV = positive predictive value; NPV = negative predictive value.

Network analysis of selected channel pairs

For network analysis, only the top 20 features (i.e. coherence values between EEG channel pairs) consistently selected across the folds (≥ three times out of five folds) were used. These channel pairs are as follows: Cz-C3, F3-C3, F4-Cz, Fz-C3, Fz-C4, Fz-Cz, P3-C3, P3-Fz, Pz-C3, Pz-O2, Pz-P7, F3-Ca, Pf-Fp1, P4-Fz, Pz-Cz, Pz-Fz, Pz-O1, Pz-P3, P4-F4, and Pz-F4. The SHAP index and importance of the selected channel pairs observed across the five folds are illustrated in Supplementary Fig. S2 and S3, respectively. Supplementary Table S1 shows how many times each channel pair was selected during the folds. Coherence values of selected channel pairs at the window and patient levels are presented in Supplementary Table S2 and S3.

Figure 4 presents a direct comparison of the top 20 selected channel pairs between the responsive and refractory groups at both window and patient levels. In particular, interchannel coherence displayed larger values in the refractory group (blue lines) than in the responsive group (red lines). Coherences with larger values in the responsive group were primarily observed in the hemisphere ipsilateral to the epileptic focus, which is represented as red edges in Fig. 4. Conversely, coherences with larger values in the refractory group were distributed across the contralateral hemisphere (depicted as blue edges in Fig. 4) as well as in the ipsilateral hemisphere. Notably, only one channel pair (Pz-P7) was selected from among the channel pairs that involved the temporal area.

Graph measurements

Table 3 presents a comparison of graph measurements between the responsive and refractory groups. At the window level, the modularity, closeness centrality, clustering coefficient, betweenness centrality, and degree coefficient were significantly higher in the refractory group. Similarly, at the patient level, the modularity, eigenvector centrality, clustering coefficient, betweenness centrality, and degree coefficient were significantly higher in the refractory group.

Table 3

Graph measurement comparisons for patients with resting-state lengths exceeding 10 min at both window and patient levels.
Level	Graph measure	p-value	Responsive group (mean ± s.d.)	Refractory group (mean ± s.d.)
Window	Small worldness	0.636	1.042 ± 0.042	1.040 ± 0.041
	Modularity	< 0.001	0.136 ± 0.114	0.170 ± 0.096
	Eigenvector centrality	0.102	0.221 ± 0.043	0.226 ± 0.047
	Closeness centrality	< 0.001	0.260 ± 0.066	0.297 ± 0.032
	Clustering coefficient	< 0.001	0.323 ± 0.183	0.389 ± 0.181
	Betweenness centrality	< 0.001	0.062 ± 0.034	0.084 ± 0.025
	Degree coefficient	< 0.001	0.171 ± 0.087	0.200 ± 0.088
Patient	Small worldness	0.951	1.048 ± 0.048	1.047 ± 0.044
	Modularity	0.003	0.108 ± 0.104	0.208 ± 0.032
	Eigenvector centrality	0.032	0.216 ± 0.046	0.244 ± 0.024
	Closeness centrality	0.188	0.264 ± 0.066	0.295 ± 0.034
	Clustering coefficient	0.024	0.294 ± 0.195	0.442 ± 0.115
	Betweenness centrality	0.008	0.053 ± 0.037	0.086 ± 0.020
	Degree coefficient	0.022	0.156 ± 0.093	0.228 ± 0.052
Abbreviations: s.d. = standard deviation

In this study, we developed an ML model to predict medical refractoriness using the initial EEGs of patients with TLE who were on monotherapy. The best prediction performance was achieved by the coherence of the gamma frequency band by applying a mutual information filter-based feature selection method utilizing LGB. The refractory group exhibited higher coherence values in the hemisphere contralateral to the epileptic focus than in the responsive group. In the graph analysis, the refractory group exhibited higher graph measurement values than the responsive group.

Among the various features analyzed in our study, coherence within the gamma frequency band demonstrated the most substantial predictive performance. Coherence, a measure of synchrony between EEG signals from different brain regions, offers valuable insights into brain functional connectivity. Disruptions in normal brain connectivity are the hallmark features of epilepsy. Our results align with this understanding, suggesting that higher coherence values in the gamma band may reflect altered or intensified neural communication, which is a characteristic feature of refractory epilepsy. ^{40 41}

Notably, the refractory group exhibited higher coherence values than the responsive group, predominantly in the hemisphere contralateral to the epileptic focus. This observation may indicate compensatory or maladaptive network reorganization in refractory patients. The increased synchrony in the contralateral hemisphere may reflect the brain's attempt to counterbalance the disruption caused by epileptic activity in the affected hemisphere. However, this compensatory mechanism may contribute to the persistence or aggravation of seizures, leading to medical refractoriness.

Furthermore, the superior predictive performance of coherence over that of Pearson’s correlation underscores the importance of considering frequency-specific brain connectivity measures in the study and management of epilepsy. Due to its sensitivity to frequency-specific synchronization relevant to epilepsy, coherence in the gamma frequency band has emerged as a more precise tool for predicting medical refractoriness. ^{42 43 44}

It is important to interpret the study results in the context of the study population, which consisted exclusively of individuals receiving monotherapy. Although it is essential to diagnose refractory epilepsy after failure of two or three ASMs, early identification is considered advantageous. Ideally, predicting refractory epilepsy before treatment initiation would be the optimal approach. Consequently, several studies have been conducted using parameters while patients are in a drug-naïve state.

A previous study involving 287 drug-naïve patients reported a notably high AUROC and F1 score for predicting seizure remission over a follow-up period exceeding three years. ⁶ This study incorporated clinical data, dichotomized imaging data, and EEG results. However, it is crucial to acknowledge that individuals who used multiple ASMs throughout the follow-up period were excluded from the study, potentially limiting the generalizability of the findings to a broader clinical population. Another study that utilized claims data from a cohort of 582,258 patients reported an AUROC of 0.753. ⁴⁵ However, in this study, refractory epilepsy was operationally defined as the prescription of more than four ASMs. A limitation of this study is the paucity of information on seizure occurrence, which is a consequence of utilizing claims data to harness the advantages of big data.

Regarding EEG features, one study aimed to predict the responsiveness to levetiracetam based on the sample entropy of the delta, theta, alpha, and beta waves before the initiation of levetiracetam therapy. The prediction accuracy reached 72.2% using an SVM with 5-fold cross-validation. ¹⁶ In a similarly structured study reported an AUROC of 0.75 for the prediction of the responsiveness to levetiracetam based on baseline EEG data. Additionally, it incorporated EEG data taken 3 months after treatment initiation, which increased the AUROC to 0.80. ¹⁷

Lin et al. developed an SVM model to predict medical refractoriness in 23 children with idiopathic epilepsy. ¹⁸ They extracted 24 EEG features from nine categories (autoregressive modeling predictive error, decorrelation time, energy, entropy, Hjorth, relative power, spectral edge, statistic, and energy of the wavelet coefficients). The model achieved a precision rate of 94.2% and a recall rate of 93.3%, respectively. Wang et al. also developed an SVM model to predict medical refractoriness in a group of 164 drug-naive patients with epilepsy. ¹⁹ This model utilized a combination of clinical characteristics and EEG functional connectivity features (phase-lag index). An accuracy rate of 94%, sensitivity of 95%, specificity of 93%, and AUROC of 0.98 were achieved. It is worth noting that although the models developed by Lin et al. and Wang et al. demonstrated high performance, they included a heterogeneous group of patients with both focal and generalized epilepsy. In contrast, our study exclusively focused on patients with unilateral TLE, allowing us to interpret the functional connectivity results in relation to the epileptic focus.

Horstmann et al. identified higher clustering coefficients and average path lengths in patients with temporal or neocortical extratemporal epilepsy than in controls. ⁴⁶ This distinction was particularly notable in the delta band. Van Diessen et al. studied various graph theory metrics (degree centrality, path length, clustering coefficient, betweenness centrality, closeness centrality, and eigenvector centrality) between children with focal epilepsy and controls. ⁴⁷ Although none of the graph theory measurements showed significant differences between the two groups, an RF-based model utilizing these variables successfully distinguished children with focal epilepsy, achieving an AUROC of 0.89. Regarding the prediction of refractory epilepsy, Lee et al. observed a higher mean clustering coefficient within the hippocampal network in patients with refractory TLE than in those with responsive TLE. ⁴⁸ Consistent with these findings, we observed altered graph theory parameters in the refractory group within our study population. Specifically, the modularity, eigenvector centrality, clustering coefficient, betweenness centrality, and degree coefficient were higher in the refractory group than in the responsive group.

This study has few limitations. (1) Small sample size. This study has a relatively small sample size of 48 patients. The limited sample size could be attributed to two factors. First, the EEG recordings were restricted to a single EEG system, which constrained the number of available subjects. Second, the study focused exclusively on patients with TLE, which contributed to a limited sample size. Therefore, future research may benefit from a multicenter approach and the application of transfer-learning techniques to overcome machine- and site-specific disparities. (2) Timing of EEG acquisition. EEG data were collected after the administration of the first ASM rather than before ASM initiation. This choice was predominantly influenced by the practical difficulty of conducting an EEG immediately after a seizure due to the extended waiting times in the institutions participating in this study. (3) Asymmetry of left and right hemisphere was not considered in adjustment of EEG signals from participants whose epileptic focus was in the right hemisphere. Functional differentiation of language and visuospatial domain exists in cerebral hemispheres, therefore EEG signals from left and right hemispheres are exactly symmetric. However, in this study, we were able to perform graph analyses in regards to epileptic focus because we flipped EEG signals from participants with right epileptic focus.

In this study, we developed an ML model to predict medical refractoriness in patients with TLE using EEG coherence features. By limiting the study subjects to patients with unilateral TLE, we were able to interpret the connectivity analysis results with respect to the epileptic focus. After initial diagnosis of TLE and initiation of single ASM, this ML model could help identify refractory TLE in referral hospitals, where most patients with refractory epilepsy are treated.

Patients and data collection

This is a retrospective observational study using routine clinical data. Adult patients (≥ 18 years) were enrolled at 2 tertiary referral centers for epilepsy, Seoul National University Hospital and Kangbuk Samsung Hospital between 2014 and 2021. Inclusion criteria were as follows: (1) TLE (temporal lobe epilepsy) diagnosis based on seizure semiology, EEG, and magnetic resonance imaging; (2) monotherapy (1 ASM) during the first EEG recording; (3) unilateral epileptic focus. Demographic and clinical characteristics, including baseline and final seizure frequencies, were obtained through a retrospective review of medical records. A total of 48 patients with TLE were selected and divided into two groups according to the final outcome, regardless of the final ASM regimen: the responsive group (no seizures during the last 1 year of follow-up) and the refractory group (one or more seizures in the last 1 year of follow-up). (Fig. 1)

Statistical analysis

We used the mean (standard deviation) or frequency (proportion) for statistical analyses. Normality tests were performed using the Shapiro–Wilk test. The chi-square test was used to compare the distributions of sex, seizure type, epileptic focus, and interictal epileptic discharge on the first EEG between the groups. Fisher’s exact test was used to compare the distributions of the etiology of epilepsy, history of febrile convulsion, history of central nervous system infection, and ASM at the first EEG between the groups. Mann–Whitney U test was used to analyze the differences in age at epilepsy onset, age at EEG study, follow-up duration, and seizure frequency at the time of EEG study between the groups.

EEG recording

Resting-state EEG data were recorded using the NicoletOne® EEG system (Natus, San Carlo, CA, USA), in accordance with the international 10–20 electrode placement protocol, with a sampling frequency of 250 Hz, a hardware high-pass filter of 0.1 Hz, and a hardware low-pass filter of 500 Hz. To ensure optimal signal quality, the impedance of all electrodes was meticulously maintained below 10 kΩ. This study leveraged datasets from two separate organizations to foster a comprehensive analysis. To guarantee uniformity across datasets, only 19 channels (electrodes: Fp1, F7, T7, P7, F3, C3, P3, O1, Fp2, F8, T8, P8, F4, C4, P4, O2, Fz, Cz, and Pz) universally present in both organizations were incorporated into the analysis. EEG data without any stimulus recorded with eyes closed were utilized for this study.

Preprocessing

Based on the results of a previous study, a minimum data length of 2 min was deemed necessary to analyze significant epileptic seizure signals effectively. ²⁰ Adhering to this guideline, several data windows were created from the individual patient data, each spanning 120 s with a 50% overlap. The increased dataset size helps mitigate overfitting that originates from small datasets, as the model is less likely to learn from the idiosyncrasies of a small dataset and more from generalizable patterns. Subsequently, the data were referenced from the average of the following EEG channels: F3, Fz, F4, C3, Cz, C4, P3, Pz, P4, O1, and O2.

To facilitate a nuanced analysis accounting for the initial site of a patient's epileptic seizures, a methodical strategy was employed to position the electrodes. For individuals with an epileptic focus on the left side, the existing EEG electrode placements were retained. Conversely, for those with an epileptic focus on the right side, the electrode positions were symmetrically adjusted. With this adjustment, the epileptic focus was positioned in the left hemisphere for each individual.

Prior to analysis, the signals underwent bandpass filtering across various frequency bands: delta (0.5−3 Hz), theta (3−8 Hz), alpha (8−12 Hz), low-beta (12−20 Hz), high-beta (20−30 Hz), and gamma (30−50 Hz), to segregate and highlight the relevant signal components for a more robust analysis.

Feature extraction

In the feature extraction phase, four time-domain features (Hjorth parameters, statistical measures, energy metrics, and zero-crossing rate) and two connectivity-based features (Pearson’s correlation and coherence) were used for the analysis, owing to their proven significance in EEG analyses. In addition, connectivity analysis was conducted using Pearson’s correlation and coherence analysis. To avoid duplication and to preserve analytical precision, connectivity values related to duplicated and symmetrically redundant information were omitted from the dataset.

Hjorth parameters: Hjorth parameters have been used to detect and diagnose seizures, as well as predict seizure recurrence after ASM withdrawal. This set encompasses three components: activity, which indicates the signal power; mobility, representing the mean frequency; and complexity, reflecting changes in frequency. ²¹²²
Statistical measures: Statistical parameters have been employed as features to differentiate patients with epilepsy from healthy controls and predict the response to levetiracetam. ¹⁷²³ Six prevalent statistical indicators were used as features: skewness, kurtosis, mean, median, minimum, and maximum values.
Energy metrics: Energy metrics serve as markers for assessing brain activity. ²⁴ Therefore, the linear and nonlinear energies of the EEG signals were included to offer insights into the energy patterns present within the signal. ²⁵
Zero-crossing rate: This parameter indicates the rate at which a signal transitions from positive to zero to negative or vice versa. It has been a prominent tool in numerous studies for distinguishing seizures from normal EEG signals. For this study, both the zero-crossing rate and its first derivative were incorporated into the analysis. ²⁶²⁷
Interchannel Pearson’s correlation coefficient: Pearson’s correlation is a pivotal feature in brain analysis. It computes the linear relationship between two EEG channels and provides a measurement of both the strength and direction of the association between signal sets. This facilitates the identification of intricate patterns and potential anomalies within EEG signals. ²⁸²⁹
Interchannel coherence: Coherence is a frequency-domain measure that offers insights into the synchrony between EEG channels in specific frequency bands. By evaluating the cross-spectral and auto-spectral densities, coherence facilitates the understanding of connectivity patterns and potential neural network alliances within EEG data. ³⁰³¹

Feature selection

Robust feature selection techniques were utilized to improve the performance of the ML model and reduce the risk of overfitting. Two principal methods were employed: filter-based and wrapper-based feature selection. It is critical to highlight that the feature selection process was confined exclusively to the training set. During our 5-fold cross-validation procedure, we meticulously maintained a clear separation between the training and validation datasets. Feature selection was conducted exclusively using the training data. Subsequently, the performance metrics were evaluated solely based on the validation data for each fold.

Filter-based feature selection is a technique that selects relevant features based on statistical properties. Three commonly used filter-based strategies (chi-square, ANOVA F-value, and mutual information) were employed. ³² Each of these methods was applied to assess the significance and contribution of individual features within our dataset.
Wrapper-based method uses a search algorithm to evaluate different subsets of features and selects the optimal subset that achieves the best performance for a given ML model. Recursive feature elimination (RFE) was utilized as our wrapper method, systematically reducing the feature set to identify the most predictive features. ³³

Evaluation

Three robust classifiers, random forest (RF) ³⁴, extreme gradient boosting (XGB) ³⁵, and light gradient boosting (LGB), were employed in this study. ³⁶ The optimal feature selection method was determined based on the average area under the receiver operating characteristic curve (AUROC) ascertained during a five-fold cross-validation process. A comprehensive assessment of the model's performance was facilitated through the analysis of various metrics, including the AUROC, accuracy, F1 score, sensitivity, and specificity. Moreover, both positive and negative predictive values were meticulously scrutinized to gauge the proficiency of the model in accurately delineating the respective classes.

A 5-fold cross-validation was implemented at the patient level, rather than at the individual window level. By implementing cross-validation at the patient level, all data pertaining to a single patient, including their respective windows, are grouped together. This ensures that the model is tested on completely unseen patients, providing a more reliable and accurate assessment of its ability to generalize and its true predictive power. After identifying the superior model and feature selection method at the window level, an evaluation at the patient level was conducted using a soft voting mechanism (Supplementary Fig. S1), which is a critical method for aggregating probabilistic predictions across each individual patient's window, thereby ensuring more nuanced, reliable, and comprehensive insights into the model's predictive capabilities.

Feature interpretation and graph measurement

The selected channel pairs may vary during the five-fold cross-validation process, highlighting the importance of focusing on channel pairs that are consistently chosen in at least three of the five folds. The average feature importance and Shapley additive explanation (SHAP) values ³⁷ for the chosen edges were systematically analyzed to understand their respective contributions to model predictions. Furthermore, a statistical comparative analysis was conducted between the responsive and refractory groups. A two-tailed paired t-test was employed to analyze each feature, both at the individual window levels and at the patient level (average window basis), with a significance threshold set at 0.05.

Given the prominence of coherence as a principal feature, the visualization results are depicted graphically. Each channel is represented as a node, and the coherence value is illustrated as an edge between the nodes. Graph visualization and analysis were performed using the NetworkX ³⁸ and nilearn ³⁹ Python libraries. To compare graph measurements, edges were connected in each window only if the coherence values were higher than 0.5. At the patient level, a single graph per patient was generated by averaging the values across all windows and subsequently connecting or disconnecting the edges based on a threshold of 0.5. Given the sensitivity of averaging to outliers, especially in cases with a limited number of windows, the analyses were restricted to patients with resting-state lengths exceeding 10 min, ensuring a minimum of 10 windows.

Ethics declarations

This study was approved by the Institutional Review Board of Seoul National University Hospital (reference number H-2308-010-1455). The study was performed in accordance with the Declaration of Helsinki. Informed consent was waived because of the study’s retrospective design.

Contributions

S.H., Y.S., S.B.L., S.K.L., Y.G.K., and K.I.P. conceived and designed the study; S.H., J.S.S., and H.S. collected the data; S.H. and Y.S. conducted the analyses; S.H., Y.S., Y.G.K., and K.I.P. interpreted the data; S.H. and Y.S. wrote the manuscript; S.H., Y.S., J.S.S., H.S., S.B.L., K.C., K.Y.J., S.K.L., Y.G.K. and K.I.P. edited and approved the manuscript.

Competing interest

The authors declare no competing interests.

Author Contribution

Acknowledgement

This study was supported by a grant from the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (Grant Number: RS-2023-00265638 and HI22C0776).

Data Availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

Chen Z, Brodie MJ, Liew D, Kwan P. Treatment Outcomes in Patients With Newly Diagnosed Epilepsy Treated With Established and New Antiepileptic Drugs: A 30-Year Longitudinal Cohort Study. JAMA Neurol 2018;75:279–286.
Kwan P, Arzimanoglou A, Berg AT, et al. Definition of drug resistant epilepsy: consensus proposal by the ad hoc Task Force of the ILAE Commission on Therapeutic Strategies. Epilepsia 2010;51:1069–1077.
Brodie MJ, Sills GJ. Combining antiepileptic drugs—Rational polytherapy? Seizure 2011;20:369–375.
Simasathien T, Vadera S, Najm I, Gupta A, Bingaman W, Jehi L. Improved outcomes with earlier surgery for intractable frontal lobe epilepsy. Ann Neurol 2013;73:646–654.
Kwan P, Brodie MJ. Epilepsy after the first drug fails: substitution or add-on? Seizure 2000;9:464–468.
Yao L, Cai M, Chen Y, Shen C, Shi L, Guo Y. Prediction of antiepileptic drug treatment outcomes of patients with newly diagnosed epilepsy by machine learning. Epilepsy Behav 2019;96:92–97.
Elger CE, Hoppe C. Diagnostic challenges in epilepsy: seizure under-reporting and seizure detection. Lancet Neurol 2018;17:279–288.
Kwan P, Brodie MJ. Early identification of refractory epilepsy. N Engl J Med 2000;342:314–319.
Xue-Ping W, Hai-Jiao W, Li-Na Z, Xu D, Ling L. Risk factors for drug-resistant epilepsy: A systematic review and meta-analysis. Medicine (Baltimore) 2019;98:e16402.
Aaberg KM, Bakken IJ, Lossius MI, et al. Short-term Seizure Outcomes in Childhood Epilepsy. Pediatrics 2018;141.
Berg AT, Shinnar S, Levy SR, Testa FM, Smith-Rapaport S, Beckerman B. Early development of intractable epilepsy in children: a prospective study. Neurology 2001;56:1445–1452.
Ko TS, Holmes GL. EEG and clinical predictors of medically intractable childhood epilepsy. Clin Neurophysiol 1999;110:1245–1251.
Hakeem H, Feng W, Chen Z, et al. Development and Validation of a Deep Learning Model for Predicting Treatment Response in Patients With Newly Diagnosed Epilepsy. JAMA Neurol 2022.
Grigsby J, Kramer RE, Schneiders JL, Gates JR, Brewster Smith W. Predicting outcome of anterior temporal lobectomy using simulated neural networks. Epilepsia 1998;39:61–66.
Armañanzas R, Alonso-Nanclares L, Defelipe-Oroquieta J, et al. Machine learning approach for the outcome prediction of temporal lobe epilepsy surgery. PLoS One 2013;8:e62819.
Zhang JH, Han X, Zhao HW, et al. Personalized prediction model for seizure-free epilepsy with levetiracetam therapy: a retrospective data analysis using support vector machine. Br J Clin Pharmacol 2018;84:2615–2624.
Croce P, Ricci L, Pulitano P, et al. Machine learning for predicting levetiracetam treatment response in temporal lobe epilepsy. Clin Neurophysiol 2021;132:3035–3042.
Lin LC, Ouyang CS, Chiang CT, Yang RC, Wu RC, Wu HC. Early prediction of medication refractoriness in children with idiopathic epilepsy based on scalp EEG analysis. Int J Neural Syst 2014;24:1450023.
Wang B, Han X, Yang S, et al. An integrative prediction algorithm of drug-refractory epilepsy based on combined clinical-EEG functional connectivity features. J Neurol 2022;269:1501–1514.
Shin Y, Hwang S, Lee SB, et al. Using spectral and temporal filters with EEG signal to predict the temporal lobe epilepsy outcome after antiseizure medication via machine learning. Sci Rep 2023;13:22532.
Päivinen N, Lammi S, Pitkänen A, Nissinen J, Penttonen M, Grönfors T. Epileptic seizure detection: a nonlinear viewpoint. Comput Methods Programs Biomed 2005;79:151–159.
Tanveer M, Pachori RB, Angami NV. Classification of seizure and seizure-free EEG signals using Hjorth parameters. 2018 IEEE Symposium Series on Computational Intelligence (SSCI) 2018:2180–2185.
Gemein LAW, Schirrmeister RT, Chrabąszcz P, et al. Machine-learning-based diagnostics of EEG pathology. Neuroimage 2020;220:117021.
Lanzone J, Ricci L, Tombini M, et al. The effect of Perampanel on EEG spectral power and connectivity in patients with focal epilepsy. Clin Neurophysiol 2021;132:2176–2183.
Ricci L, Assenza G, Pulitano P, et al. Measuring the effects of first antiepileptic medication in Temporal Lobe Epilepsy: Predictive value of quantitative-EEG analysis. Clin Neurophysiol 2021;132:25–35.
Pyrzowski J, Le Douget JE, Fouad A, Siemiński M, Jędrzejczak J, Le Van Quyen M. Zero-crossing patterns reveal subtle epileptiform discharges in the scalp EEG. Sci Rep 2021;11:4128.
Shahidi Zandi A, Tafreshi R, Javidan M, Dumont GA. Predicting temporal lobe epileptic seizures based on zero-crossing interval analysis in scalp EEG. Annu Int Conf IEEE Eng Med Biol Soc 2010;2010:5537–5540.
Morgan VL, Englot DJ, Rogers BP, et al. Magnetic resonance imaging connectivity for the prediction of seizure outcome in temporal lobe epilepsy. Epilepsia 2017;58:1251–1260.
Antony AR, Alexopoulos AV, González-Martínez JA, et al. Functional connectivity estimated from intracranial EEG predicts surgical outcome in intractable temporal lobe epilepsy. PLoS One 2013;8:e77916.
van Mierlo P, Papadopoulou M, Carrette E, et al. Functional brain connectivity from EEG in epilepsy: seizure prediction and epileptogenic focus localization. Prog Neurobiol 2014;121:19–35.
Zaveri HP, Pincus SM, Goncharova, II, Duckrow RB, Spencer DD, Spencer SS. Localization-related epilepsy exhibits significant connectivity away from the seizure-onset area. Neuroreport 2009;20:891–895.
Chandrashekar G, Sahin F. A survey on feature selection methods. Computers & Electrical Engineering 2014;40:16–28.
Guyon I, Weston J, Barnhill S, Vapnik V. Gene Selection for Cancer Classification using Support Vector Machines. Machine Learning 2002;46:389–422.
Breiman L. Random Forests. Machine Learning 2001;45:5–32.
Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2016.
Ke G, Meng Q, Finley T, et al. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. Neural Information Processing Systems; 2017.
Lundberg SM, Lee S-I. A Unified Approach to Interpreting Model Predictions. Neural Information Processing Systems; 2017.
Hagberg A, Swart P, S Chult D. Exploring network structure, dynamics, and function using NetworkX: Los Alamos National Lab.(LANL), Los Alamos, NM (United States), 2008.
Abraham A, Pedregosa F, Eickenberg M, et al. Machine learning for neuroimaging with scikit-learn. Front Neuroinform 2014;8:14.
Matos J, Peralta G, Heyse J, Menetre E, Seeck M, van Mierlo P. Diagnosis of Epilepsy with Functional Connectivity in EEG after a Suspected First Seizure. Bioengineering (Basel) 2022;9.
Jiruska P, Csicsvari J, Powell AD, et al. High-frequency network activity, global increase in neuronal activity, and synchrony expansion precede epileptic seizures in vitro. J Neurosci 2010;30:5690–5701.
Engel J, Jr., Bragin A, Staba R, Mody I. High-frequency oscillations: what is normal and what is not? Epilepsia 2009;50:598–604.
Fisher RS, Webber WR, Lesser RP, Arroyo S, Uematsu S. High-frequency EEG activity at the start of seizures. J Clin Neurophysiol 1992;9:441–448.
Pereda E, Quiroga RQ, Bhattacharya J. Nonlinear multivariate analysis of neurophysiological signals. Prog Neurobiol 2005;77:1–37.
An S, Malhotra K, Dilley C, et al. Predicting drug-resistant epilepsy - A machine learning approach based on administrative claims data. Epilepsy Behav 2018;89:118–125.
Horstmann MT, Bialonski S, Noennig N, et al. State dependent properties of epileptic brain networks: comparative graph-theoretical analyses of simultaneously recorded EEG and MEG. Clin Neurophysiol 2010;121:172–185.
van Diessen E, Otte WM, Braun KP, Stam CJ, Jansen FE. Improved diagnosis in children with partial epilepsy using a multivariable prediction model based on EEG network characteristics. PLoS One 2013;8:e59764.
Lee HJ, Park KM. Intrinsic hippocampal and thalamic networks in temporal lobe epilepsy with hippocampal sclerosis according to drug response. Seizure 2020;76:32–38.

No competing interests reported.

Supplementaryinformation.pdf

Download PDF

Journal Publication

published 04 Sep, 2024

Read the published version in Scientific Reports →

Reviews received at journal
19 Jul, 2024
Reviews received at journal
18 Jul, 2024
Reviews received at journal
10 Jul, 2024
Reviewers agreed at journal
09 Jul, 2024
Reviewers agreed at journal
08 Jul, 2024
Reviewers agreed at journal
08 Jul, 2024
Reviewers invited by journal
08 Jul, 2024
Editor assigned by journal
07 Jul, 2024
Editor invited by journal
07 Jul, 2024
Submission checks completed at journal
04 Jul, 2024
First submitted to journal
03 Jul, 2024

You are reading this latest preprint version

Predicting medical refractoriness of patients with temporal lobe epilepsy: EEG-based parameter optimization and network analysis

Status:

Journal Publication

Version 1

Abstract

Figures

Introduction

Results

Discussion

Methods

Patients and data collection

Statistical analysis

EEG recording

Preprocessing

Feature extraction

Feature selection

Evaluation

Feature interpretation and graph measurement

Declarations

Ethics declarations

Contributions

Competing interest

Author Contribution

Acknowledgement

Data Availability

References

Additional Declarations

Supplementary Files

Status:

Journal Publication

Version 1