This paper introduces a method for distinguishing individuals with autism from neurotypical controls. Accurate classification supports early intervention and can enhance the lives of those affected. The structure and flowchart of the proposed method are depicted in Fig. 1.
To begin, the EEG signal must be obtained or prepared. In this study, the signal was acquired by downloading it from [49]. The dataset comprises EEG recordings gathered with the Biosemi ActiveTwo EEG system and includes data from 28 individuals diagnosed with autism spectrum conditions and 28 neurotypical controls, aged 18 to 68 years. The recordings were made during a 2.5-minute (150-second) eyes-closed resting period. Permission for data collection and sharing was obtained from the Health Research Authority under IRAS ID 212171 [49].
The raw EEG signals underwent several preprocessing stages to eliminate unwanted noise and artifacts. First, a Butterworth bandpass filter with cutoff frequencies of 0.5 and 100 Hz was applied to minimize noise. Subsequently, independent component analysis (ICA) was used to separate the independent components associated with brain activity. Components identified as unrelated to brain function, such as those caused by blinking, electromyography, 50 Hz powerline interference, or auditory artifacts, were removed from the component list, yielding a refined signal. For more detailed insight into the techniques used to identify artifact and brain-related components, refer to [50–52].
Once preprocessing is complete, the filtered signals are subjected to a source localization technique, which estimates the active brain sources or regions. Its primary goal is to reduce complexity by identifying a limited number of active sources, yielding the active brain regions and their corresponding spatial coordinates as the final output. In source localization, the aim is to minimize the following function:
\(F={‖{V}_{K}-G{J}_{K}‖}^{2}+\alpha {‖{J}_{K}‖}^{2}\) (Eq. 1)
where \({V}_{K}\left(m\times 1\right)\) is the recorded EEG signal at the Kth sample, \({J}_{K}\left(3n\times 1\right)\) is the source activity of the brain at the Kth sample, and G \(\left(m\times 3n\right)\) is the leadfield matrix, computed by solving the forward problem with methods such as the finite element method (FEM) or the boundary element method (BEM) [53, 54]. This function consists of two parts: the first measures the estimation error, while the second controls noise amplification and suppresses sharp variations in the source activities. The regularization parameter α determines the relative weight of these two terms; it can be chosen by methods such as Tikhonov regularization or the L-curve technique [55].
Various approaches exist for minimizing Eq. 1; one of them is sLORETA, known for its ability to achieve zero localization error [55, 56]. In sLORETA, an explicit solution can be obtained from the known values of G and \({V}_{K}\):
\(\widehat{{J}_{K}}=T{V}_{K}\) (Eq. 2)
where \(\widehat{{J}_{K}}\left(3n\times 1\right)\) is the estimated source activity of the brain and T \(\left(3n\times m\right)\) is the inverse operator that maps the EEG signals (\({V}_{K}\)) to source activity estimates; in sLORETA it is calculated from the following equation:
\(T={G}^{T}H{\left[HG{G}^{T}H+\alpha H\right]}^{+}\) (Eq. 3)
where \({\left[\right]}^{+}\) denotes the Moore–Penrose pseudoinverse and H is the centering (average-reference) matrix used for smoothness control, defined as follows:
\(H=I-1{1}^{T}/{1}^{T}1\) (Eq. 4)
Here I denotes the \(\left(m\times m\right)\) identity matrix and 1 denotes the \(\left(m\times 1\right)\) vector of ones. These constructs re-reference the data and thereby facilitate the subsequent active region selection process.
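For illustration, the inverse operator of Eqs. 2–4 can be sketched in NumPy. The leadfield G and the regularization parameter α are placeholders here; in practice G comes from the forward solution and α from a method such as the L-curve:

```python
import numpy as np

def sloreta_operator(G, alpha):
    """Compute T = G^T H [H G G^T H + alpha H]^+ (Eq. 3).

    G : (m, 3n) leadfield matrix; alpha : regularization parameter.
    H = I - 1 1^T / (1^T 1) is the (m, m) centering matrix of Eq. 4.
    """
    m = G.shape[0]
    ones = np.ones((m, 1))
    H = np.eye(m) - (ones @ ones.T) / m          # Eq. 4
    return G.T @ H @ np.linalg.pinv(H @ G @ G.T @ H + alpha * H)

def estimate_sources(T, V_k):
    """Eq. 2: J_hat = T V_k for one EEG sample V_k of shape (m,)."""
    return T @ V_k
```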
To initiate the active region selection, the source activity estimation technique (Eq. 2) is applied to the time-varying EEG signals. For each sample, the estimation identifies certain sources as active. The final selection is based on the regions or sources that exhibit a higher probability of being active across samples; in essence, the brain sources that are active more frequently than the others throughout the analysis window are selected.
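The per-sample selection rule described above can be sketched as follows. Treating the source with the largest absolute activity as "active" in each sample is our reading of the text, not a detail the paper specifies:

```python
import numpy as np

def select_active_regions(J_hat, n_regions):
    """Pick the regions most frequently active across samples.

    J_hat : (n_sources, n_samples) estimated source activity.  A source is
    counted as 'active' in a sample when it carries the largest absolute
    activity (an assumption).  Returns the indices of the n_regions sources
    that were active most often.
    """
    winners = np.argmax(np.abs(J_hat), axis=0)            # most active source per sample
    counts = np.bincount(winners, minlength=J_hat.shape[0])
    return np.argsort(counts)[::-1][:n_regions]
```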
Subsequently, a linear dynamic model is fitted to the selected active sources. This process involves aligning the parameters of the model with the extracted active sources to establish a meaningful representation of their dynamics.
\({J}_{K}={{F}_{K}J}_{K-1}+{\eta }_{K}\) (Eq. 5)
where \({\eta }_{K}\) is the state noise and \({F}_{K}\) is the relation matrix at the Kth sample. \({F}_{K}\) characterizes the interdependencies among the active brain regions, as well as each region's dependence on its own activity at sample K-1. The relationship between the active sources and the EEG signals, in turn, is governed by the leadfield matrix obtained by solving the forward problem; this matrix encapsulates the spatial sensitivity patterns connecting the active sources to the measured EEG signals. The combined state-space model can be expressed as follows:
\(\left\{\begin{array}{c}{J}_{K}={{F}_{K}J}_{K-1}+{\eta }_{K} \\ \\ {V}_{K}=G{J}_{K}+{\epsilon }_{K}\end{array}\right.\) (Eq. 6)
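A minimal sketch of the state-estimation half of a Kalman recursion for the model in Eq. 6 is given below; a full dual Kalman filter alternates this with an analogous step that updates \({F}_{K}\), and the noise covariances Q and R are assumed known here:

```python
import numpy as np

def kalman_state_step(J_prev, P_prev, F, G, V_k, Q, R):
    """One predict/update step for the state J_k of Eq. 6, with F fixed.

    J_prev : (n,) previous state estimate; P_prev : (n, n) its covariance;
    F : (n, n) relation matrix; G : (m, n) leadfield; V_k : (m,) EEG sample;
    Q, R : state- and measurement-noise covariances (assumed known).
    """
    # Predict from the dynamic model J_k = F J_{k-1} + eta_k
    J_pred = F @ J_prev
    P_pred = F @ P_prev @ F.T + Q
    # Update with the observation model V_k = G J_k + eps_k
    S = G @ P_pred @ G.T + R                     # innovation covariance
    K = P_pred @ G.T @ np.linalg.inv(S)          # Kalman gain
    J_new = J_pred + K @ (V_k - G @ J_pred)
    P_new = (np.eye(len(J_prev)) - K @ G) @ P_pred
    return J_new, P_new
```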
In this problem, the source activity over time (\({J}_{K}\)) and the relation matrix in time and space (\({F}_{K}\)) are the unknown parameters, and they are estimated with a dual Kalman filter; the estimation equations and formulas are described in [57, 58]. After the dynamic source activity (\({J}_{K}\)) has been computed, and with \({V}_{K}\) known as the recorded EEG signal, the next step is to fit a multivariate autoregressive moving average model to the EEG channels, modeling \({V}_{K}\) in terms of previous samples of V and J. This multivariate ARMA(p,q) model can be expressed in its general form as follows:
\({V}_{K}=\sum _{i=1}^{p}{a}_{i}{V}_{K-i}+ \sum _{i=0}^{q}{b}_{i}{J}_{K-i}\) (Eq. 7)
where \({a}_{i}\) and \({b}_{i}\) are the model parameter matrices for lag i. Given that V is a vector of size m and J is a vector of size n, each coefficient matrix \({a}_{i}\) is of size m×m (with p such matrices) and each coefficient matrix \({b}_{i}\) is of size m×n (with q+1 such matrices).
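The paper does not state how the ARMA parameter matrices are estimated; one simple possibility, sketched below, is ordinary least squares over the stacked regressors of Eq. 7:

```python
import numpy as np

def fit_arma_xy(V, J, p, q):
    """Least-squares fit of the multivariate model of Eq. 7:
    V_k = sum_{i=1..p} a_i V_{k-i} + sum_{i=0..q} b_i J_{k-i}.

    V : (m, N) EEG channels; J : (n, N) source activity.
    Returns lists of m x m matrices a_1..a_p and m x n matrices b_0..b_q.
    """
    m, N = V.shape
    n = J.shape[0]
    start = max(p, q)
    rows, targets = [], []
    for k in range(start, N):
        past_V = [V[:, k - i] for i in range(1, p + 1)]   # lagged EEG samples
        past_J = [J[:, k - i] for i in range(0, q + 1)]   # current/lagged sources
        rows.append(np.concatenate(past_V + past_J))
        targets.append(V[:, k])
    X = np.array(rows)                                    # (N-start, m*p + n*(q+1))
    Y = np.array(targets)                                 # (N-start, m)
    coef, *_ = np.linalg.lstsq(X, Y, rcond=None)
    coef = coef.T                                         # (m, m*p + n*(q+1))
    a = [coef[:, i * m:(i + 1) * m] for i in range(p)]
    b = [coef[:, p * m + i * n: p * m + (i + 1) * n] for i in range(q + 1)]
    return a, b
```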
In addition to the aforementioned multivariate ARMA model, our method also incorporates a second time series model that accounts for nonstationary EEG signals. This model is known as the autoregressive integrated moving average (ARIMA) model. The ARIMA(p,d,q) model is defined as follows:
\({V}_{K}=\sum _{i=1}^{p}{a}_{i}{V}_{K-i}+ \sum _{i=0}^{q}{b}_{i}{J}_{K-i}+\sum _{i=1}^{d}{c}_{i}{\left(1-Z\right)}^{d}{Y}_{i}\) (Eq. 8)
where Z is the delay (backshift) operator, defined as follows:
\(\left(1-Z\right){Y}_{i}={Y}_{i}-{Y}_{i-1}\)
\({\left(1-Z\right)}^{2}{Y}_{i}={Y}_{i}-2{Y}_{i-1}+{Y}_{i-2}\) (Eq. 9)
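The differencing operator of Eq. 9 is straightforward to implement; a minimal sketch:

```python
import numpy as np

def difference(Y, d):
    """Apply the differencing operator (1 - Z)^d of Eq. 9, where Z is the
    delay operator: (1 - Z) Y_i = Y_i - Y_{i-1}.  Each pass shortens the
    series by one sample."""
    out = np.asarray(Y, dtype=float)
    for _ in range(d):
        out = out[1:] - out[:-1]
    return out
```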
In this section, the source activities are considered as the input component of the model, while EEG signals are considered as the output component. The aim is to model each sample of the EEG signal using its delayed samples and source activities. To achieve this, ARMA matrices (denoted as 'a' and 'b') and ARIMA matrices (denoted as 'a', 'b', and 'c') are calculated for each sample. The size of these matrices depends on the model orders.
Then, various features are extracted from these model parameters for the purpose of classification. Firstly, to reduce complexity, these parameters are divided into two classes, and several features are extracted from each class:
- Class 1: Parameters with small variation, i.e., a standard deviation lower than 20% of the mean value of the array. The mean values of these parameters contribute to the feature vectors.
- Class 2: Parameters with greater variation than those in Class 1. For this class, the following statistical features are selected for the feature vector:
  - Mean value of the array signal.
  - Standard deviation of the array signal.
  - Kurtosis of the array signal.
  - Skewness of the array signal.
  - Entropy of the array signal.
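The class split and per-array features above might be implemented as follows; the histogram-based Shannon entropy estimator is our assumption, since the paper does not specify how entropy is computed:

```python
import numpy as np
from scipy.stats import kurtosis, skew

def class_of(x):
    """Class 1 if the array varies little (std < 20% of |mean|), else Class 2."""
    return 1 if np.std(x) < 0.2 * abs(np.mean(x)) else 2

def statistical_features(x, n_bins=16):
    """Mean, std, kurtosis, skewness, and Shannon entropy of one parameter
    time series.  Entropy uses a histogram estimate (an assumption)."""
    hist, _ = np.histogram(x, bins=n_bins)
    p = hist[hist > 0] / len(x)
    entropy = -np.sum(p * np.log2(p))
    return np.array([np.mean(x), np.std(x), kurtosis(x), skew(x), entropy])
```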
Next, the horizontal [59] and natural visibility graphs [60, 61] of each parameter time series are constructed, and the following features of these graphs are added to the feature vectors. For a detailed treatment of visibility graphs, readers are referred to [60, 62, 63].
- The mean of the graph node degrees
- The standard deviation of the graph node degrees
- The mean length of the shortest path from each node to the other nodes
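A minimal construction of the natural visibility graph and its degree-based features is sketched below; the horizontal visibility graph differs only in its visibility criterion, and the shortest-path feature can be computed from the adjacency matrix with a breadth-first search:

```python
import numpy as np

def natural_visibility_graph(y):
    """Adjacency matrix of the natural visibility graph of series y:
    samples i and j are connected when every intermediate sample k lies
    strictly below the straight line joining (i, y_i) and (j, y_j)."""
    N = len(y)
    A = np.zeros((N, N), dtype=int)
    for i in range(N):
        for j in range(i + 1, N):
            visible = all(
                y[k] < y[i] + (y[j] - y[i]) * (k - i) / (j - i)
                for k in range(i + 1, j)
            )
            if visible:
                A[i, j] = A[j, i] = 1
    return A

def graph_degree_features(A):
    """Mean and standard deviation of the node degrees."""
    deg = A.sum(axis=1)
    return deg.mean(), deg.std()
```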
Following feature extraction and the creation of the feature vector, it becomes important to reduce the dimensionality of this vector to simplify complexity and enhance classification accuracy. Principal Component Analysis (PCA) is a highly recognized and effective method for feature vector dimension reduction. PCA aims to transform a set of potentially correlated high-dimensional features into a new set of uncorrelated variables known as principal components. This transformation retains the most important information from the original features while decreasing their dimensionality. PCA begins by normalizing or centering each feature by subtracting its mean value. Then, a correlation matrix is formed from these normalized features, and the eigenvalues of this matrix are calculated. The eigenvectors corresponding to high-magnitude eigenvalues are selected and arranged to construct a transformation matrix. This matrix helps project the original feature vector onto a lower-dimensional space defined by these selected eigenvectors. By discarding eigenvectors with low eigenvalues, PCA effectively reduces dimensionality while maintaining critical information.
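A compact sketch of this PCA procedure via eigen-decomposition is given below; it works on the covariance of the centered features, whereas the correlation-matrix variant described above additionally divides each feature by its standard deviation:

```python
import numpy as np

def pca_transform(X, n_components):
    """PCA via eigen-decomposition of the covariance of the centered
    feature matrix X (samples x features).  Returns the projected data
    and the transformation matrix of top eigenvectors."""
    Xc = X - X.mean(axis=0)                       # center each feature
    C = np.cov(Xc, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(C)          # ascending eigenvalues
    W = eigvecs[:, ::-1][:, :n_components]        # top-variance directions
    return Xc @ W, W
```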
In the final phase of the algorithm, a Support Vector Machine (SVM) is used as the classifier to distinguish individuals with autism from neurotypical controls. SVM is selected for its strong capability in classifying high-dimensional features: it finds the hyperplane that maximizes the margin between the training classes, determined by the closest points, known as support vectors. Although only briefly described here, comprehensive accounts of SVM's mechanisms and optimization techniques are available in the literature. SVM offers robust classification performance and is widely applied across fields owing to its ability to handle complex feature spaces and manage classification tasks efficiently.
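As an illustration of this final stage, the sketch below trains an SVM on hypothetical reduced feature vectors; the RBF kernel and C value are illustrative choices, since the paper does not report its SVM hyperparameters:

```python
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Hypothetical reduced feature vectors: one row per subject, with labels
# 1 = autism and 0 = neurotypical control (synthetic stand-in data).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, (20, 10)),
               rng.normal(1.5, 1.0, (20, 10))])
y = np.array([0] * 20 + [1] * 20)

# Standardize features, then fit an RBF-kernel SVM (illustrative settings).
clf = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0))
clf.fit(X, y)
train_acc = clf.score(X, y)
```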