The first study recruited 80 patients from the inpatient rehabilitation department at Dorot Geriatric Medical Center, with a mean age of 73.51 (10.45) years, evenly distributed between males and females. Both groups exhibited a diverse age range. An age difference was observed between the healthy male group and the MCI-R male group. This difference is primarily attributed to the presence of a male participant in the MCI-R group, whose age (101 years) is more than two standard deviations above the group average. When this outlier is excluded from the analysis, the age difference between the male subjects is no longer significant (t = -1.86, p = 0.07). Other than that, no significant age differences were found between the groups.
The second study included 77 patients from the same department, with a mean age of 74.17 (8.90) years, comprising 52% females and 48% males. Each group displayed a wide age range. Differences in age were observed between the MD group and the other two groups (MCI-R and Healthy), particularly among female participants. This can be explained by the established understanding that the prevalence of dementia increases with age. Previous research indicated a modest rise in MCI rates with age [33], unlike dementia, where prevalence nearly doubles every 5-year increase in age [34]. Consistent with the current study, a large-scale study of older women found that females with dementia were significantly older than cognitively healthy participants [35]. Full demographic details are provided in Tables 1 and 2.
In both studies, clinical staff identified potential participants during hospital admissions. Participants were selected based on study inclusion criteria and had MMSE scores of 24–30 (first study) or 10–30 (second study). All patients provided informed consent in line with the Declaration of Helsinki. Individuals who objected or had neurological comorbidities, scalp or skull damage, facial skin irritation, significant hearing impairments, or a history of significant drug abuse were excluded. Ethical approval for both studies was granted by the Ethics Committee (EC) of Dorot Geriatric Medical Center. The approval of the first study was granted on September 07, 2020, NIH Clinical Trials Registry number: NCT04683835. The approval of the second study was granted on March 01, 2022, NIH Clinical Trials Registry number: NCT05528445.
For the meta-analysis, we included additional 146 healthy participants (aged 18–80) who completed auditory cognitive tasks. Ethical approval was obtained from Tel Aviv University.
2.1.1 Study groups
Figure 1 illustrates the group allocation and analysis details for each part of the study.
In the first study, participants were divided into two groups based on their MMSE scores: Healthy group (MMSE scores of 28–30, n = 40); and MCI-R group (MMSE scores of 24–27, n = 40);
In the second study, participants were divided into three groups based on their MMSE scores: Healthy group (MMSE scores of 28–30, n = 30); and MCI-R group (MMSE scores of 24–27, n = 30); MD group (MMSE scores of 10–23, n = 17).
We used MMSE score cutoffs of 24 and 27 for group allocation, focusing on timely detection of cognitive decline. Previous evidence suggests that a higher cutoff score enhances diagnostic accuracy [36]. Additionally, research indicates that educated individuals scoring below 27 on the MMSE are at increased risk of developing dementia [37].
Finally, the meta-analysis included data from both studies and additional healthy participants, totaling 237 elderly individuals (allocated as in the second study) and 112 healthy young participants.
Clinical and demographic data
To enhance the validation of clinical assessments and cognitive states of participants, additional evaluations were conducted alongside the MMSE in both studies. In the first study, participants underwent Instrumental Activities of Daily Living (IADL) assessments, which measures daily living tasks across eight domains, with scores ranging from 0 (low functioning) to 23 (high functioning) [38]. The IADL is self-reported and assessed through interviews and has seldom been linked to objective measures like brain activity. However, a study using single-channel EEG effectively classified elderly subjects based on IADL scores [39].
In the second study, several clinical assessment methods were collected including the Montreal Cognitive Assessment (MoCA), the Geriatric Depression Scale (GDS) for depression diagnosis, and the Executive Clock Drawing Task (CLOX) for assessing cognitive impairment. Additionally, demographic and sleep-related data were collected in the second study.
The MoCA, scoring from 0 to 30, identifies MCI and early dementia, with a score of 26 or higher indicating normal cognitive function. Designed for the detection of MCI or early Dementia by healthcare professionals, the MoCA evaluates various cognitive domains including visuospatial abilities, memory, attention, and delayed recall [40]. The GDS, designed for elderly individuals, consists of "yes" or "no" questions about the past week's emotional experiences, scores from 0 to 15, with higher scores indicating more severe depression [41]. The CLOX task, involving drawing and replicating a clock, scores from 0 to 15, with lower scores indicating greater cognitive impairment [42].
EEG device
EEG recordings were conducted using the Neurosteer® single-channel high dynamic range EEG (hdrEEG) Recorder. A three-electrode medical-grade patch was placed on each subject’s forehead, using dry gel for optimal signal transduction. The non-invasive monopolar electrodes were positioned at the prefrontal regions, with the single-EEG-channel derived from the difference between Fp1 and Fp2 in the International 10/20 electrode system and a reference electrode in Fpz. The data were digitized continuously at a 500-Hz sampling frequency.
2.1.2 Signal processing and high-level features
In recent years, a time-frequency approach has been adopted for analyzing EEG data to characterize brain states in AD [43], [44], [45]. In line with this approach, our study employs an advanced time-frequency method to process the EEG signal, as previously described [23], [25], [28]. The EEG features are produced by a secondary layer of machine learning applied to labeled datasets previously gathered by Neurosteer, to derive several linear combinations. Specifically, the EEG features VC9 and A0 were calculated employing the linear discriminant analysis (LDA) technique [46]. LDA is designed to identify an optimal linear transformation that maximizes class separability. Previous studies employing LDA models on imaging data have demonstrated success in predicting the development of cognitive decline. Simple LDA models using MRI and PET data were shown to predict cognitive decline or stability up to four years prior to the manifestation of decline symptoms [47]. The calculation of EEG feature ST4 utilized principal component analysis (PCA) [48], a technique employed for reducing feature dimensionality before classification. Research indicates that features extracted through PCA exhibit a significant correlation with MMSE scores and effectively distinguish individuals with AD from healthy subjects [49], [50], [51]. Notably, all three EEG features were derived from datasets different from those analyzed in the current study, to avoid overfitting the data. Consequently, the weight matrices previously determined were applied to transform the data acquired in the present study.
In studies conducted on young healthy participants, VC9 feature showed increased activity with escalating levels of cognitive load manipulated by a numeric n-back task [24]. Furthermore, during an arithmetic task, VC9 activity decreased in response to external visual interruptions [26]. Additionally, in a surgery simulator task performed by medical interns, VC9 activity declined with task repetition, correlating with individual performance [25]. VC9 demonstrated greater sensitivity than Theta particularly for tasks with lower cognitive load, making it more suitable for clinical and elderly populations. Notably, in the preceding pilot study [23], higher cognitive load levels resulted in increased VC9 activity exclusively in the healthy young group compared to the healthy senior group, highlighting different activity patterns between young and senior participants across various cognitive states. In clinical settings, VC9 activity correlated with the auditory mismatch negativity (MMN) component in minimally responsive patients [52].
EEG feature A0, previously identified as a classifier for distinguishing cognitive load from rest in healthy subjects, has proven to be a robust predictor of cognitive decline in individuals with mild-to-moderate impairment [23]. Furthermore, A0 effectively differentiates between healthy controls and Parkinson’s disease (PD) patients, with higher activity observed in healthy individuals [28].
EEG feature ST4 was found to correlate with individual performance in the numeric n-back task, specifically correlating the disparity in RTs between high and low cognitive load levels to differences in ST4 activity per participant [24]. In the preceding pilot study [23], ST4 demonstrated the ability to differentiate between individuals with low MMSE scores, those with scores between 24 and 27, and those with scores above 28, as well as healthy young participants. This suggests that ST4 can detect subtle changes in cognitive states, indicating its potential as a sensitive marker of cognitive functioning.
2.1.3 Power spectrum and frequency bands
The EEG power spectrum was obtained through the fast Fourier transform (FFT) of the EEG signals within a 4-second window, using a Hamming window to minimize spectral leakage. Power spectral density was calculated from the frontal channel (Fp1-Fp2) and transformed to dB (logarithm base 10), for Delta (0.5-4 Hz), Theta (4–7 Hz), Alpha (8–15 Hz), Beta (16–31 Hz), and lower Gamma (32–45 Hz) frequency bands.
Previous research has extensively explored the impact of cognitive load on various frequency bands, particularly within the frontal lobe. Results from EEG studies reveal enhanced frontal Theta activity in high cognitive load conditions, which is increasing with the growing demands on memory retention across various cognitive tasks like the n-back [53], [54], [55]. Additionally, studies have highlighted the significance of frontal Delta power in inhibiting potential interferences that might affect performance in high-load cognitive tasks [56]. Gamma activity exhibited positive correlations with fMRI-BOLD signal in various prefrontal cortex regions, indicating modulation during cognitive processing [57]. Middle-aged adults showed heightened frontal Gamma activity than young adults during the high cognitive load level of verbal n-back task[58]. Alongside this, reduced Gamma oscillations was observed in elderly subjects (mean age 75) compared to younger subjects [59], suggesting that Gamma activity increases with age until midlife, and starts to decline in older age. Similar to Gamma, Beta EEG activity has shown positive correlations with fMRI-BOLD signal in various frontal regions and exhibited a positive load effect specifically during cognitive working memory tasks [57]. In the prefrontal cortex, heightened Beta activity aids in information erasure from working memory, cessation of long-term memory retrieval, and preserves contents during delay periods [60]. Furthermore, while behavioral performance was similar between young and healthy elderly participants in an auditory memory task study, notable differences in Beta band desynchronization during retrieval suggest age-related influences on Beta responses during working memory task [61]. Understanding how cognitive load influences frequency bands in the frontal lobe contributes valuable insights into the neural mechanisms underlying cognitive processes and can shed light on cognitive decline.
2.1.4 EEG recording and auditory battery
EEG recording followed the previously described protocols [23], [28], lasting 20–30 minutes, including a 15-minute cognitive assessment battery. This battery consisted of pre-recorded tasks: musical detection, musical n-back, and resting state tasks as outlined in prior studies [23], [28]. In the first study, each patient was re-examined under the same conditions over the next seven days, with sessions at least one day apart. In the second study, patients participated in an additional EEG session involving auditory instructions and two C-IADL sub-tasks from PASS: telephone use and medication management. Each task is rated on a 4-point scale (0–3), and patients receive three types of scores: independence, safety, and adequacy (quality) [31].
Statistical Analysis
2.1.5 Overview
The statistical analysis was conducted separately for the first and second studies, followed by a meta-analysis incorporating data from a total of 349 participants from both studies and previously collected data. In the first study, the analysis began with dimensionality reduction using Lasso, Elastic Net, Ridge, and SVM with RBF kernel models to identify key features correlated with MMSE scores. This was followed by Linear Mixed Model (LMM) analyses to assess the relationships between EEG variables, MMSE groups, and cognitive load levels.
For the first study, the LMM model included the following variables: MMSE group (numeric, between), visit (categorical, within), and cognitive load (numeric, within). Separate LMMs were then conducted for each visit, considering MMSE group and cognitive load.
In the second study, LMM analyses incorporated the MMSE group (numeric, between) and cognitive load (numeric, within) variables. Additionally, correlation models were employed to examine the associations between EEG variables and clinical test scores. Logistic regression models were applied to predict both MMSE and MoCA results based on brain activity features and collected clinical data (e.g., CLOX, GDS, and PASS scales).
The significance level for all analyses was set at p < 0.05. Post-hoc effects with Benjamini-Hochberg correction [62] were applied following significant main effects and interactions. All analyses were carried out using Python Statsmodel [63].
2.1.6 Variables
These studies included EEG variables, performance data, and clinical scales. EEG variables comprised frequency bands: Delta, Theta, Alpha, Beta and lower Gamma, as well as three EEG features: VC9, ST4, and A0 (normalized to a scale of 0-100). All EEG variables were calculated every second using a moving window of four seconds, and mean activity per condition was analyzed. Behavioral variables included mean response accuracy and mean RTs per participant. The independent variable representing cognitive load was constructed as follows: tasks performed during rest were categorized as cog_load 0; Detection task level 1 and 0-back were categorized as cog_load 1; Detection task level 2 and 1-back were categorized as cog_load 2. Finally, 2-back was categorized as cog_load3.