A Novel Wrapper and Filter-based Feature Dimensionality Reduction Methods for Anomaly Intrusion Detection in Wireless Sensor Networks

doi:10.21203/rs.3.rs-2110149/v1

Download PDF

Research Article

A Novel Wrapper and Filter-based Feature Dimensionality Reduction Methods for Anomaly Intrusion Detection in Wireless Sensor Networks

https://doi.org/10.21203/rs.3.rs-2110149/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Wireless Sensor Networks (WSNs) are the most important technology currently available. WSNs are widely utilized in applications such as business, military, healthcare, smart cities, smart grids, and smart homes. All WSNs implementations demand that sensor nodes and the base station communicate securely. The adversary compromises sensor nodes to deploy diverse attacks into the WSNs. Therefore, an appropriate Intrusion Detection System (IDS) is required to guard against security attacks in WSNs. IDS are crucial for preventing and detecting security breaches. WSNs should have an IDS to assure the reliability, availability, and security of the service. Network intrusion detection is the practice of detecting malicious activity within a network by examining its traffic flow. However, feature dimensionality reduction is critical in IDS, as finding anomalies in high-dimensional network traffic is a lengthy procedure. The selection of features affects the speed of the analysis. In this paper, we proposed an anomaly IDS to detect eight different forms of attacks in WSNs using a wrapper and filter-based feature dimensionality reduction methodologies. The suggested work employed a wrapper-based method with the firefly algorithm (FFA) embedded in the wrapper for feature selection (FS), as well as a filter method with Principal component analysis (PCA) for feature dimensionality reduction. The classifiers random forest (RF) and naïve Bayes (NB) were used to classify the obtained features from both wrapper-based FFA and filter-based PCA. The empirical analysis was carried out on the high-dimensional UNSW-NB15 data. The findings revealed that the wrapper-based FFA-RF achieved an accuracy of 99.98%, f1 score of 100%, precision of 100%, AUC of 100%, and recall of 100%. While, the FFA-NB yielded an accuracy of 99.74%, an F1 score of 99.65%, a precision of 99.38%, an AUC of 99.92%, and a recall of 99.93%. On the other hand, the filter-based PCA-RF achieves an accuracy of 99.99%, an f1-score of 99.97%, a precision of 99.98%, an AUC of 100%, and a recall of 99.97%. While, the PCA-NB gave an accuracy of 97.16%, precision of 97.12%, F1 score of 98.85%, AUC of 99.75%, and recall of 99.50%. This showed that the wrapper-based FFA feature dimensionality reduction methods outperformed the filter-based PCA feature dimensionality approaches in detecting generic, exploits, DoS, fuzzers, backdoors, reconnaissance, and worms’ attacks in WSNs layers. However, in terms of time-critical applications, the filter-based methods required low training time to build the models when compared with the wrapper-based approaches.

Wireless Sensor Networks

Intrusion Detection System

Firefly algorithm

Principal component analysis

Dimensionality Reduction

Random Forest

Intelligent wireless sensor networks (WSNs) have emerged with the expansion and development of technologies such as wireless communication, microcomputer electrical systems, microelectronics, signal processing, and computer networks [1]. WSNs are a type of diverse system that consists of small sensing devices equipped with general-purpose computing units[2]. WSNs are self-possessed of hundreds or even thousands of reduced nodes that are wireless, are even self-organizing, low-power, and are deployed to control and monitor the environment[3],[4].

WSNs are currently widely employed in defense, aerospace, military, medical and health, environmental monitoring, and industrial facilities, among other applications [5],[6],[7]. Additionally, in future applications, for example, observing pollution, building security, traffic on the highway monitoring, wildfire monitoring, and the quality of water monitoring is possible to include WSNs values into their systems. WSNs offer numerous recompenses, including the ability to transform raw data into valuable combined and categorized information[8].

Concerns about security have grown particularly severe in systems that use WSN [9],[10], [11], [12], [13]. Indeed, security in WSNs presents a unique set of issues not encountered in other forms of wireless networks. Using security mechanisms such as cryptography, key management, and authentication, WSNs can be made more secure. Nonetheless, these methods will not be enough to thwart all foreseeable threats. As malicious nodes in WSNs, that is, nodes that seem to be genuine members of the network but are operating on behalf of a third party, are capable of launching a wide range of assaults, WSNs are susceptible to a variety of threats [14], [15]. An additional layer of security, like an Intrusion Detection System (IDS), is necessary [16],[17].

Correct implementation of an IDS on wired connections can notice participating nodes' misconduct and notify other network nodes to take necessary countermeasures. Though, an IDS strategy developed for wired connections cannot be easily used for WSNs due to their unique network features, which include restricted processing power, battery, and memory. An IDS is a critical security device against both insider and outsider assaults in WSNs[18]. It involves the detection of malicious nodes or misbehavior. When an IDS discovers an improperly behaving sensor node, it seeks to separate it from the remainder of the network.

The IDS system is classified into two groups based on the detection. The anomaly detection compares all behavior to the average activity, whereas the signature detection identifies harmful traffic patterns, necessitating a database update to store any novel attack patterns [19]. Anomaly-based IDS have been a hot topic in IDS research due to their success in detecting unintentional attacks[20].

WSNs [21], have been developed as a result of advancements in microelectronic systems technology, digital electronics, and wireless communications in recent years. WSN is a self-organizing network composed of dozens to thousands of sensor nodes linked by wireless links [23]. These wireless sensors are small, low-powered, cost-effective, versatile, and communicate across short distances [16], [24]. The sensing, gathering, processing, and connectivity of sensor nodes are autonomous [25]. WSNs are among the most key technologies, and they are being implemented at an unprecedented rate[26]. They have been employed and deployed in a variety of situations and for a variety of goals, resulting in a variety of applications—military, disaster management, habitat monitoring, and environmental[22]. Due to their dispersed nature, cost, size, and power limits, WSNs impose severe constraints on node resources like energy, computation speed, memory, and communication bandwidth.

WSNs, on the other hand, imposes severe constraints on node resources such as energy, memory, computing speed, and communication bandwidth due to their distributed nature, cost, size, and power constraints. These constraints impose many constraints on sensor battery life, distributed signal processing efficiency, network security, and data processing [22]. Nevertheless, the two most critical issues are the sensors' lifetime (i.e., their functioning period) and the network's security [10], [28].

Due to network restrictions, it is difficult to secure applications launched using WSNs from a security perspective. To fulfill their functions, these types of networks are typically located in distant and perilous areas [23], [29]. Nevertheless, hazardous environments are frequently left ignored. As a result, WSNs lack physical protection, such as gateways or switches to monitor data flow, resulting in the possibility of node compromise as well as insufficient protection and network security[13],[23]. Consequently, it is crucial to safeguard these networks from breaches and assaults, especially in programs that rely on security services. Effective security methods are required to safeguard and secure against threats. WSNs frequently have one (or even more) centralized control units referred to as base stations (s). A base station often serves as an entry to another network, acts as a data storage midpoint, and serves as an input method entry point. Moreover, the base station is referred to as the sink. Each sensor node contributes to the creation of a route, with each tree's root serving as a starting point. The base position has a greater capacity for power and storage than another sensor network. Typically, the base location has sufficient battery life to last the life of the sensor nodes, sufficient memory to store cryptographic keys, superior CPUs, and the ability to communicate with other WSNs [9].

Guaranteeing a high standard of confidence for serious applications that utilize WSNs is critical for protecting their infrastructures and data from intrusions. As a result, abnormal actions and intrusions should be detected using an IDS. Sensors collect data from the environment in which they are deployed and then communicate it to the base station node in a WSN. External attackers should be protected from information, as cryptographic security is not completely effective at securing this information. As a result, a secondary layer of defense, such as IDS, is necessary[24]. Network traffic is monitored using IDS and delivers notifications to the base position if any sensor detects malicious attacks, as seen in Fig. 1. The black cycle denotes a sensor node, the red star denotes a cluster head, the white star denotes an intruder, D denotes the distance between the invader and the cluster head, A denotes the sensors area and R denotes the cluster area.

IDS technique can be employed as the first line of defense to minimize potential attacks. Numerous attack types are possible over WSNs, including Sinkhole attacks, Packet dropping attacks, and Sybil attacks[25]. Packet-dropping attacks sometimes referred to as packet loss, are among the most disruptive and devastating threats to WSNs [32]. Packet-dropping attacks interfere with the normal operation of a network by discarding received data packets or control messages rather than forwarding them to other nodes [33]. Feature dimensionality reduction selection is a strategy for identifying the most relevant features and utilizing them to develop robust and accurate IDS models[26]. The purpose of this work is to develop and test a new intrusion detection approach based on firefly and PCA algorithms using RF and NB for classification, that models many forms of security invasions and is suitable for execution on restricted devices. The resultant IDS can be thought of as a robust decentralized decision support tool capable of providing critical information about potential security issues in WSNs. This paper's significant contributions are summarized below.

To propose a wrapper-based FFA feature dimensionality reduction in WSNs attacks detection.
To propose a filter-based PCA feature dimensionality reduction in the detection of attacks in WSNs.
To develop and implement IDS that meet WSNs protocol criteria using the UNSW-NB15 data instead of a dataset that does not match real-world WSNs scenarios.
To perform classification with RF and NB models on the reduced dimension of data.

The remaining sections are organized as follows. In Section 2, related work is described. Section 3 then describes the recommended approach, followed by Section 4, which describes the experimental and comparative evaluation outcomes. In Section 5, the conclusion and future work are presented.

Related works

In WSNs, IDS is performed using ML techniques. Anomaly IDS powered by machine learning builds an explicitly or implicitly model of the investigated patterns that is updated periodically to optimize system performance based on prior results[7]. Using the hypergrid KNN method, the authors [34] proposed a web-based methodology for detecting random flaws and cyberattacks. This solution reduces computational and communication complexity by reforming anomaly from the hypersphere detection zone to the hypercube detection region. Garofalo et al. [27] used Decision trees to construct a distributed model for detecting sinkhole attacks in WSNs. The author generates both regular and attacks traffic using network simulator 3 (NS3). IDS is formed of both local and central agents. Ma et al. [28] performed ID on the NSL-KDD data utilizing SC and a DNN.

The authors [29] suggested a dispersed model for intrusion detection in WSNs based on fuzzy Q-learning and game theory. To identify and defend against intrusions at the sink node and BS, a game theory technique was utilized, while a fuzzy Q-learning method was utilized to change the game theory to predict upcoming attacks. The authors [30] proposed a methodology for detecting localization assaults in WSNs. The author employs a layered de-noising autoencoder to detect attacks on the localization program.

The authors [31] suggested a model for identifying flooding and blackhole assaults that incorporates fuzzy C-Means (FCM), one-class support vector machines (SVM), and sliding windows. The authors first standardized the test data using Z-score normalization, then utilized FCM to detect noisy data, one class SVM to identify attack traffic that was comparable to normal traffic, and the sliding window technique to assess if the data is being attacked or not. The authors [32] suggested a cluster-based WSN IDS paradigm. This idea employed two subsystems to identify intrusion in CH: RF, a spectral grouping of applications with noise based on enhanced density, and noise-based spectral grouping of applications (E-DBSCAN). RF is utilized to identify known attacks, whereas E-DBSCAN is utilized to identify unknown attacks. Almomani et al. [41] develop a novel IDS dataset (WSN-DS) for WSNs by simulating five distinct attack scenarios: blackhole assault, normal, flooding assault, gray hole, and scheduling assault.

Otoum et al. [33] compare IDS based on ML versus IDS based on deep learning for WSN. The authors concluded that while deep learning-based IDS is more accurate than machine learning-based IDS, it takes longer to identify threats. Tan et al. [34] performed class imbalance using the SMOTE technique and then utilized the RF algorithm to perform intrusion detection on the KDDCup'99 data. The bulk of researchers validates IDS models offline for WSN using the KDD dataset. However, the KDD dataset has an imbalance of classes. Inaccurate results were obtained because of the imbalanced dataset. The authors [35] suggested an RF algorithm-based model for detecting black holes, floods, scheduling, and gray holes.

Mansouri et al. [36] presented a centralized solution for detecting command injection attacks, response attacks, DoS attacks, and reconnaissance attacks using ANN. The authors employ gray wolf optimization (GWO) and Evolutionary System (ES) techniques to obtain optimal ANN weights. The authors [37] suggested a dispersed approach for spotting cyber threats in WSNs based on swarm intelligence and A.I. Through theoretical analysis, the authors demonstrate that AI with fluid intelligence has a high degree of accuracy and a low proportion of false alarms.

Nithiyanandam et al. [47] simulate the WSN using the NS2 network simulator and gather network traffic under normal and attack settings. The author proposed a strategy based on ACO and PSO for very accurate sinkhole attack detection. Sun et al. [48] proposed a distributed IDS for WSN based on Adaboost, the artificial fish swarm (AFS) algorithm, and a cultural algorithm (CA). Hierarchical AdaBoost is utilized to identify anomalies in sensor nodes, CHs, and sink nodes. CA and AFS with backpropagation are utilized to determine BS misuse. The model is trained using the NSL-KDD dataset. By utilizing GA, Singh et al. [38] developed an energy-efficient IDS for clustered WSNs. While A.I.-based intrusion detection achieves highly accurate in WSN, it is not easy to scale and is susceptible to overfitting during training.

Sedjelmaci et al. [39] presented a hybrid approach for detecting network layer assaults by combining a specification-based approach (selective forwarding, hello flood, wormhole, and blackhole) and an anomaly-based. Yan et al. [40] suggested a hybrid IDS that incorporates anomaly-based backpropagation networks (BPNs) with misuse-based intrusion detection algorithms. The authors applied anomaly and abuse detection algorithms to the KDDCup'99 dataset. Using signature and anomaly detection, the authors [52] proposed a distributed and lightweight technique for detecting energy depletion threats. The authors detect anomalies using an artificial immune system inspired by human white blood cells.

Subba et al. [41] presented a hybrid approach for detecting intrusions into networks with several layers. While a hybrid approach to intrusion detection improves accuracy, it also adds complexity [42]. Analysis of intrusion detection methodologies in WSNs reveals that the great majority of researchers employ machine learning algorithms to detect intrusions in WSNs. However, machine learning approaches require additional time for training and testing, as well as additional memory for deploying the model [42].

The network intrusion detection dataset was used by certain researchers to test the accuracy of their method. A majority of researchers utilized the KDDCup'99 dataset. In contrast, the KDDCup'99 data gathering was designed for wired networks and not wireless networks. As a result, KDDCup'99 is unsuitable for WSNs. Also, as noticed in the literature, some researchers use network simulators to construct their dataset and then use the simulated dataset to perform intrusion detection. Additionally, as seen in the literature, the majority of previous research focused on potential solutions from the perspective of certain WSN attack types. In WSN, the majority of the researchers used ML methods to detect intrusions. An ML technique necessitates more time and memory space for training and testing, as well as additional memory space for sensor node deployment. As a result, there is potential to construct a compact ML model for conducting intrusion detection in WSNs to reduce the amount of memory required to install a model. The majority of available strategies focus on a single form of attack on a single layer of the WSN, with no attention paid to assaults on other layers. Consequently, it is crucial to develop a cross-layer IDS capable of detecting a variety of threats that may vary at different WSN levels. Another key issue in the literature is that the majority of datasets used for the experimental study are KDDCUP 99 and NSLKDD, which lack real-life features and are incapable of adjusting to network changes. This is why the majority of the IDS-WSNs aren't suitable for use in a production environment.

Unlike, the existing studies, we present an IDS in WSNs based on wrapper FFA and filter PCA feature dimensionality reduction. To satisfy, the time and memory requirements issues, we used FFA and PCA approaches to eliminate the redundant features and to create a faster learning and training time for the RF and NB models. In addition, as against the existing studies that focused on a single attack on a layer in WSNs and the use of datasets that do not represent real-life WSN scenarios. In the context of this, we used a recent dataset that can adapt to network changes and comprises real-life properties. The attacks in this dataset include generic, exploits, DoS, fuzzers, backdoors, reconnaissance, and worms’ attacks in WSNs layers.

3.1 Proposed IDS in Wireless Sensor Networks

We present an IDS for sensor nodes in this part that is capable of detecting ongoing surveillance, backdoor, exploits, denial-of-service, fuzzers, analysis, worms, generic attacks, and shellcode. To design our system, we anticipated that a routing layer based on connection quality measurements would be used to create a route tree that leads to the base position. The IDS is decentralized, it consists of identical IDS clients running on each network node. Then, the IDS clients interact with one another to establish an agreement over an intrusion occurrence. Each IDS client's capability can be abridged as follows:

Network Surveillance: Each IDS client monitors the net in real time, capturing and examining individual packets traveling through its immediate neighborhood. The fact that all message in a WSN is a natural audit source for the IDS client because it happens over the air so each node can eavesdrop on data in its vicinity.

Intrusion/Attacks Detection: Each IDS user detects attacks using a reference implementation method, that is, it looks for nonconformities from typical user-defined rules-based behavior. The network manager must develop and incorporate the appropriate measure to each assault that the IDS should identify in the nodes has its own set of rules.

Making of Decision: Because of its myopia view of its immediate area, a node might be unable to determine definitively whether a node is an intruder. However, even if it is, the network cannot trust it, as it may be malicious. As a result, when an IDS client detects an abnormality, a cooperative mechanism with surrounding nodes is begun, bringing all of them to a shared deduction.

Alertness: Each node is equipped with a reaction device that enables it to react to an incursion state. Based on these purposes, we construct the suggested IDS in WSNs in accordance with the system architecture depicted in Fig. 2.

3.2 Data Normalization

Data normalization is a pre-processing method that scales data or transformed it to safeguard that each attribute contributes similarly. The success of ML algorithms is contingent upon the availability of high-quality data for developing a generalized prediction model of the classification problem[43]. In this paper, normalization was performed using the min-max technique.

This normalization technique is capable of exactly retaining all relationships within the data, ensuring that no bias is introduced. Each feature is contained within the classifier's acceptable range of values when the min-max method is used, but the fundamental distribution of the associated features inside the new value range remains unaffected. The UNSW-NB-15 dataset contains both discrete and continuous-valued variables. When the feature's discrete and continuous values are mixed, the range of the feature's values becomes distinct.

3.3 Wrapper Dimensionality Reduction with Firefly algorithm

Yang [44] created the firefly (FF) algorithm in 2009 as a bio-inspired program that simulates the social behavior of fireflies. These insects emit light, and each species emits light in a unique pattern. The attractiveness of a firefly is proportional to the intensity or brilliance of its light. The social behavior of fireflies can be expressed as an optimization algorithm if the objective function is the light intensity of each bug. Due to the characteristics of the firefly, the firefly algorithm is primarily used to solve complex problems[45]. According to the genetic makeup of FF, any FF can be fascinated by another FF, and they make no gender distinctions. Two critical aspects of the firefly algorithm are[44] the modulation of brightness and the derivation of attractiveness. Thus, the attraction of two fireflies i and j changes with distance, as does their brightness, which diminishes with distance from their source. Additionally, the absorption coefficient of the media influences attractiveness. Thus, the brightness of a firefly located within a radius (r) of another firefly with a primary brightness R is

R (k) = R₀ e⁻ (1)

Where R₀ denotes the initial brightness; k denotes the distance between any two fireflies, and ∂ is a light absorption coefficient that regulates the light intensity decline. Because the attractiveness of a firefly is relative to the brightness perceived by another FF, the appeal Z of a FF is described as;

Z (k) = Z₀ e⁻ (2)

Where Z₀ denotes the attractiveness when k equals 0. The jth firefly is then drawn to the ith firefly, and the movement is defined by

U_iⁱ⁺¹ = Z₀ e⁻ ² (u_i^t – u^t_j ) + β (D-0.5) (3)

where ꞵ is the randomization limit, and D is a consistently distributed random number generator within 0 and 1. The letter 't' denotes the number of iterations. L (l = 1...L) is the number of dimensions, and kji is the distance between both the jth and ith fireflies, as specified by the Eq. (4).

k = || u_j – u_i || = $\sqrt{{\sum }_{l=1}^{L}(ujl-uil)}$² (4)

3.4 Filter Dimensionality Reduction with Principal Component Analysis

PCA is a method for dimension reduction that is utilized to select and extract data features[46]. Feature selection is a technique for lowering the bulk of data in order to translate it into meaningful features. PCA minimizes the number of variables by employing orthogonal linear mixtures of the unique parameters with the greatest alteration[47]. The following section summarizes the fundamentals of PCA[48]. Assume that b₁, b₂, b₃,...b_m are m-dimensional stochastic input data accounts denoted by the matrix B_nm as illustrated in Eq. (5).

B_n*m = = [ b₁,b_{2, …,} b_m ] (5)

Mean: Assume that b₁, b₂, b₃,...bn represent the random variables with the sample size n. As demonstrated in Eq. (6), the average of the dataset is an arbitrary parameter

B = $\frac{1}{n}{\sum }_{j}^{n}Bj$ (6)

Standard Deviation: Using the standard distance from the data set, the standard deviation is computed. Bj at a particular point B must be established. T is calculated by calculating the square of the distance among all data points and the average set. As illustrated in Eq. (7), to produce a positive square root, the data points are counted and partitioned.

T = $\surd \frac{1}{n}{\sum }_{j=1}^{m}(Bj-B)$² (7)

Covariance: The covariance arrangement is nearly identical to the variance arrangement as seen in Eq. (8).

Cov(B, C) = $\frac{{\sum }_{j=1}^{m}\left(Bj-B\right)(Cj-C)}{n}$ (8)

Eigenvectors and eigenvalues of a matrix: If Y is a nxm matrix, then B₀ is its eigenvector, where is a scalar representing the eigenvalue X and B₀.

Cumulative proportion: As stated in Eq. (9), the cumulative fraction of sample variance described by the first w principal components is determined

$$\frac{\lambda 1+\lambda 2+\lambda 3+\dots +\lambda w}{\lambda 1+\lambda 2+\lambda 3+\dots +\lambda q}$$

Where q connotes the number of variables and λw is wth eigenvalue

Mahalanobis Distance: The Mahalanobis distance is utilized to calculate the distance amid each point in multivariate spaces and the general centroid or mean, based on the data in Eq. (10) has a covariance structure.

Xj = $\surd (Xj-XT)(Xj-X)$ (10)

where Xj data value vector at row j, X mean vector, T − 1 inverse of the covariance matrix.

3.5 Naïve Bayes

Naïve bayes (NB) is a probabilistic algorithm that is simple to use[49]. This method assigns Bayes' theorem and believes that the class parameter value determines all independent and non-dependent properties[50]. The supervised ML model NB is used to solve categorization problems. The benefit of employing NB is that it just needs a modest quantity of training data to figure out the estimated parameter needed in the classification stage[51]. In many cases, NB works far better in real-world circumstances than it has in the past.

3.6 Random Forest

Random Forests (RF) are methods for supervised learning[52]. RF is a type of ML that combines numerous decision trees into an algorithm for finding an accurate and dependable prediction equation. These countless trees are generated at random and programmed for a specific activity, which becomes the model's conclusion. RF is commonly used in the prediction of DDoS attacks [53] and anomaly detection.

3.7 Performance Measures for IDS-WSN

The accuracy, F1, AUC, and Precision-Recall curve can all be used to evaluate the performance of an intrusion detection algorithm.

3.7.1 Accuracy

The percentage of instances successfully categorized is known as DA. Eq. (11) was used to calculate the accuracy of the IDS-WSNs.

Accuracy =$\frac{TP+TN}{TP+FP+TN+FN}$ (11)

3.7.2 Recall

The percentage of real aberrant flows successfully diagnosed is referred to as recall. The recall formula is represented by Eq. (12).

Recall =$\frac{TP}{TP+FN }$ (12)

3.7.3 Precision

Positive predictive value is another name for it. It denotes the proportion of TP to the sum of true positives. The formula used to calculate precision is shown in Eq. (13).

Precision = $\frac{TP}{TP+FP }$ (13)

3.7.4 F1 Score

The harmonic average of recall and precision, as shown in Eq. (14), is the F1 score. When the IDS model uses an unbalanced input dataset, the f1 score is the ideal statistic to verify the performance of the IDS.

F-measure = 2 x $\frac{Precision X Recall}{Precision+Recall }$ (14)

Any intrusion detection method must have a high accuracy, and f1 score.

This section contains the findings of the experiments. Tables 1,2 and 3 lists the evaluation criteria for the anomaly detection methods used on the UNSW-NB15 sensor networks dataset. The accuracy, precision, F1 score, AUC, and training time are all listed in these tables.

4.1 Experimental Analysis of Wrapper Dimensionality Reduction and Classification with NB and RF

The evaluation criteria of the RF and NB classification for the reduced features from FFA based on the f1-score, AUC, precision, recall, and accuracy are shown in Table 1.

Table 1

Performance measures of the proposed FFA-RF and FFA-NB models
Techniques	Accuracy	Precision	F1-score	AUC	Recall
FFA-RF	99.98	100	100	100	100
FFA-NB	99.74	99.38	99.65	99.92	99.93

The proposed FFA-RF achieves an accuracy of 99.98%, precision of 100%, F1 score of 100%, AUC of 100%, and recall of 100% as shown in Fig. 3. While, the FFA-NB gave an accuracy of 99.74%, precision of 99.38%, F1 score of 99.65%, AUC of 99.92%, and recall of 99.93%. This finding showed that the FFA-RF model outperformed the FFA-NB model based on the features received from the wrapper FFA.

4.2 Computational Time of FFA-RF and FFA-NB algorithms

The real computing time spent training and executing the RF and NB classifiers for data training is recorded in Table 2, which is expressed in the total seconds spent on the training process.

Table 2

The training time of the RF and NB algorithms
Algorithms	Training Time (Seconds)
RF	44.12
NB	68.53

The time taken by each of the classifiers is considered in the case of time-critical applications. The RF took 44.12 seconds for training while the NB time taken was 68.53 seconds. This indicates that the NB took more time for training.

4.3 Experimental Analysis of Filter Dimensionality Reduction and Classification with NB and RF

Table 3 shows the evaluation criteria for the RF and NB classification of reduced features from PCA using the f1-score, AUC, precision, recall, and accuracy.

Table 3

Performance of the proposed PCA-RF and PCA-NB models
Techniques	Accuracy	Precision	F1-score	AUC	Recall
PCA-RF	99.99	99.98	99.97	100	99.97
PCA-NB	97.16	97.12	98.85	99.75	99.50

The proposed PCA-RF achieves an accuracy of 99.99%, precision of 99.98%, F1 score of 99.97%, AUC of 100%, and recall of 99.97% as illustrated in Fig. 4. While, the PCA-NB gave an accuracy of 97.16%, precision of 97.12%, F1 score of 98.85%, AUC of 99.75%, and recall of 99.50%.

This finding showed that the PCA-RF model outperformed the PCA-NB model based on the features received from the filter PCA.

As can be seen from the results findings of wrapper-based FS (FFA-RF and FFA-NB) and filter-based FS (PCA-RF and PCA-NB) shown in Fig. 5, in terms of the classification performance. The proposed wrapper methods (FFA-RF and FFA-NB) gave outstanding results when compared to the filter approaches (PCA-RF and PCA-NB). This finding corroborates the fact that in terms of performance, the wrapper FS methods can exhibit more competitive results than the filter methods.

4.4 Computational Time of PCA-RF and PCA-NB algorithms

Table 4 shows the actual computation time spent training and executing the RF and NB classifiers for data training, represented in total seconds spent on the training process.

Table 4

Time is taken to train the PCA-RF and PCA-NB algorithms
Algorithms	Training Time (Seconds)
PCA-RF	0.11
PCA-NB	0.02

As observed in Table 4, the time taken by the PCA-RF is 0.11 seconds and the PCA-NB gave 0.02 seconds. This showed that the time taken by the PCA-RF model is higher than the PCA-NB model. From the two experimental analysis findings of both wrapper FFA-based dimensionality reduction and filter-based PCA dimension reduction, in terms of time taken to train the model.

The wrapper-based FS (FFA-RF and FFA-NB) strategies gave more time for training than the filter methods (PCA-RF, and PCA-NB). This is as a result of the number of calculations needed to get the subset of features. The predictor produces a new model for each subgroup evaluation. Because of its brute force character, the method's temporal complexity is an obvious critique.

Intrusion detection for WSNs is a crucial topic in the realm of WSNs security. The goal of this research is to provide an ML intrusion detection technique that can effectively reduce attacks while using little computation and resources. A UNSW-NB15 dataset for WSN was used to classify the assaults to achieve this goal. In the WSNs layers, generic, exploits, DoS, fuzzes, backdoors, reconnaissance, and worm attacks are all considered. Attacks were identified using 75% of the data for training and 25% for testing based on the FFA-NB, FFA-RF, PCA-NB, and PCA-RF models. The experimental findings of the FFA-RF revealed an accuracy of 99.74%, FFA-NB gave an accuracy of 99.98%, PCA-RF yielded an accuracy of 99.99%, and PCA-NB gave an accuracy of 97.16%. These findings suggest that FFA-NB trained on the UNSW-NB15 dataset is highly useful in categorizing multiple attacks, as it was able to obtain good classification accuracy even when there were many attacks. This study, which examines several different attacking models, adds to the knowledge base. It would derive findings in terms of choosing the appropriate protocols to use in a WSNs application that was accurately described in real-time. The need of addressing security earlier in the network development process is emphasized in this study. Without such, a malicious attack will progressively target inherited weaknesses in this communication protocol and other software. Furthermore, this strategy is simple to use and has a high degree of generalization. It may be widely used to increase the impact of ID for WSNs in the field of wireless sensor network security. This approach can be expanded in the future to incorporate other forms of data link layer assaults, such as sinkholes and Hello flood. Threats on protocols other than LEACH, as well as attacks on other layers of the WSNs, can be considered. It's also feasible to experiment with different classifiers and unsupervised machine learning techniques.

Declarations Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Availability of data and material

Not Applicable

Competing interests

The authors declare that they have no competing interests.

Funding

Not Applicable

Author contributions

All authors contributed to all the steps of conducting this work and writing this manuscript. All the authors read and approved the final manuscript.

Acknowledgments

Not Applicable

R. Zhang and X. Xiao, “Intrusion detection in wireless sensor networks with an improved NSA based on space division,” J. Sensors, vol. 2019, no. 1, 2019, doi: 10.1155/2019/5451263.
M. Safaldin, M. Otair, and L. Abualigah, “Improved binary gray wolf optimizer and SVM for intrusion detection system in wireless sensor networks,” J. Ambient Intell. Humaniz. Comput., vol. 12, no. 2, pp. 1559–1576, 2021, doi: 10.1007/s12652-020-02228-z.
S. Abdollahzadeh and N. J. Navimipour, “Deployment strategies in the wireless sensor network: A comprehensive review,” Comput. Commun., vol. 91–92, pp. 1–16, 2016, doi: 10.1016/j.comcom.2016.06.003.
Y. K. Saheed, “Performance Improvement of Intrusion Detection System for Detecting Attacks on Internet of Things and Edge of Things,” in Artificial Intelligence for Cloud and Edge Computing. Internet of Things (Technology, Communications and Computing), S. Misra, T. K. A., V. Piuri, and L. Garg, Eds. Springer, Cham, 2022, pp. 321–339.
A. Abduvaliyev, A. S. K. Pathan, J. Zhou, R. Roman, and W. C. Wong, “On the vital areas of intrusion detection systems in wireless sensor networks,” IEEE Commun. Surv. Tutorials, vol. 15, no. 3, pp. 1223–1237, 2013, doi: 10.1109/SURV.2012.121912.00006.
B. B. Zarpelão, R. S. Miani, C. T. Kawakani, and S. C. de Alvarenga, “A survey of intrusion detection in Internet of Things,” J. Netw. Comput. Appl., vol. 84, pp. 25–37, 2017, doi: 10.1016/j.jnca.2017.02.009.
A. Ghosal and S. Halder, “A survey on energy efficient intrusion detection in wireless sensor networks,” J. Ambient Intell. Smart Environ., vol. 9, no. 2, pp. 239–261, 2017, doi: 10.3233/AIS-170426.
Y. Maleh and A. Ezzati, “Lightweight intrusion detection scheme for wireless sensor networks,” IAENG Int. J. Comput. Sci., vol. 42, no. 4, pp. 347–354, 2015.
S. H. Jokhio, I. A. Jokhio, and A. H. Kemp, “Light‐weight framework for security‐sensitive wireless sensor networks applications,” IET Wirel. Sens. Syst., vol. 3, no. 4, pp. 298–306, 2013, doi: 10.1049/iet-wss.2012.0127.
N. Aley and S. Kolte, “A Review on Intrusion Detection Schemes in Wireless Sensor Network,” vol. 3, no. 10, pp. 810–813, 2014.
E. Benkhelifa, T. Welsh, and W. Hamouda, “A critical review of practices and challenges in intrusion detection systems for IoT: Toward universal and resilient systems,” IEEE Commun. Surv. Tutorials, vol. 20, no. 4, pp. 3496–3509, 2018, doi: 10.1109/COMST.2018.2844742.
W. Site, “A Survey on Security Challenges in Wireless Sensor Networks Rana Hameed Hussain 1 1 Dep . of Computer Science , Faculty Science Computers and Abstract University of Thi-Qar Journal Vol . 12 No . 3 SEP 2017 2 . The Need to the Security,” vol. 12, no. 3, 2017.
X. Liu, M. Abdelhakim, P. Krishnamurthy, and D. Tipper, “Identifying Malicious Nodes in Multihop IoT Networks using Dual Link Technologies and Unsupervised Learning,” Open J. Internet ofThings, vol. 4, no. 1, pp. 109–125, 2018.
A. Agah, S. K. Das, K. Basu, and M. Asadi, “Intrusion detection in sensor networks: A non-cooperative game approach,” Proc. - Third IEEE Int. Symp. Netw. Comput. Appl. NCA 2004, pp. 343–346, 2004, doi: 10.1109/NCA.2004.1347798.
I. Krontiris, T. Dimitriou, and T. Giannetsos, “Intrusion Detection of Sinkhole Attacks in WSN,” Int. Symp. Algorithms Exp. Sens. Syst. Wirel. Networks Distrib. Robot., pp. 150–161, 2008, [Online]. Available: https://link.springer.com/content/pdf/10.1007%2F978-3-540-77871-4_14.pdf.
H. Y. Lin and T. C. Chiang, “Intrusion detection mechanisms based on queuing theory in remote distribution sensor networks,” Adv. Mater. Res., vol. 121–122, pp. 58–63, 2010, doi: 10.4028/www.scientific.net/AMR.121-122.58.
I. Onat and A. Miri, “An intrusion detection system for wireless sensor networks,” 2005 IEEE Int. Conf. Wirel. Mob. Comput. Netw. Commun. WiMob’2005, vol. 3, pp. 253–259, 2005, doi: 10.1109/WIMOB.2005.1512911.
T. Giannetsos, I. Krontiris, T. Dimitriou, and F. C. Freiling, “Intrusion detection in wireless sensor networks,” Secur. RFID Sens. Networks, pp. 321–340, 2016.
S. Agrawal and J. Agrawal, “Survey on Anomaly Detection using Data Mining Techniques,” Procedia - Procedia Comput. Sci., vol. 60, pp. 708–713, 2015, doi: 10.1016/j.procs.2015.08.220.
A. A. Aburomman, M. Bin, and I. Reaz, “A novel SVM-kNN-PSO ensemble method for intrusion detection system,” vol. 38, pp. 360–372, 2016.
F. Karray, M. W. Jmal, A. Garcia-Ortiz, M. Abid, and A. M. Obeid, “A comprehensive survey on wireless sensor node hardware platforms,” Comput. Networks, vol. 144, pp. 89–110, 2018, doi: 10.1016/j.comnet.2018.05.010.
L. B. Oliveira et al., “SecLEACH-On the security of clustered sensor networks,” Signal Processing, vol. 87, no. 12, pp. 2882–2895, 2007, doi: 10.1016/j.sigpro.2007.05.016.
A. C. Ferreira, M. A. Vilaça, L. B. Oliveira, E. Habib, H. C. Wong, and A. A. Loureiro, “On the security of cluster-based communication protocols for wireless sensor networks,” Lect. Notes Comput. Sci., vol. 3420, no. I, pp. 449–458, 2005, doi: 10.1007/978-3-540-31956-6_53.
C. Guo, Y. Zhou, Y. Ping, Z. Zhang, G. Liu, and Y. Yang, “A distance sum-based hybrid method for intrusion detection,” 2013, doi: 10.1007/s10489-013-0452-6.
P. Dewal, G. S. Narula, V. Jain, and A. Baliyan, Security attacks in wireless sensor networks: A survey, vol. 729. Springer Singapore, 2018.
Y. K. Saheed, M. O. Arowolo, and A. U. Tosho, “An Efficient Hybridization of K-Means and Genetic Algorithm Based on Support Vector Machine for Cyber Intrusion Detection System,” Int. J. Electr. Eng. Informatics, vol. 14, no. 2, pp. 426–442, 2022, doi: 10.15676/ijeei.2022.14.2.11.
A. Garofalo, C. Di Sarno, and V. Formicola, “Enhancing intrusion detection in wireless sensor networks through decision trees,” Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 7869 LNCS, pp. 1–15, 2013, doi: 10.1007/978-3-642-38789-0_1.
T. Ma, F. Wang, J. Cheng, Y. Yu, and X. Chen, “A hybrid spectral clustering and deep neural network ensemble algorithm for intrusion detection in sensor networks,” Sensors (Switzerland), vol. 16, no. 10, 2016, doi: 10.3390/s16101701.
S. Shamshirband, A. Patel, N. B. Anuar, M. L. M. Kiah, and A. Abraham, “Cooperative game theoretic approach using fuzzy Q-learning for detecting and preventing intrusions in wireless sensor networks,” Eng. Appl. Artif. Intell., vol. 32, no. 2008, pp. 228–241, 2014, doi: 10.1016/j.engappai.2014.02.001.
H. Wang, Y. Wen, and D. Zhao, “Identifying localization attacks in wireless sensor networks using deep learning,” J. Intell. Fuzzy Syst., vol. 35, no. 2, pp. 1339–1351, 2018, doi: 10.3233/JIFS-169677.
H. Qu, L. Lei, X. Tang, and P. Wang, “A Lightweight Intrusion Detection Method Based on Fuzzy Clustering Algorithm for Wireless Sensor Networks,” Adv. Fuzzy Syst., vol. 2018, 2018, doi: 10.1155/2018/4071851.
S. Otoum, B. Kantarci, and H. T. Mouftah, “Detection of Known and Unknown Intrusive Sensor Behavior in Critical Applications,” IEEE Sensors Lett., vol. 1, no. 5, pp. 1–4, 2017, doi: 10.1109/lsens.2017.2752719.
S. Otoum, B. Kantarci, and H. T. Mouftah, “On the Feasibility of Deep Learning in Sensor Network Intrusion Detection,” IEEE Netw. Lett., vol. 1, no. 2, pp. 68–71, 2019, doi: 10.1109/lnet.2019.2901792.
X. Tan et al., “Wireless sensor networks intrusion detection based on SMOTE and the random forest algorithm,” Sensors (Switzerland), vol. 19, no. 1, 2019, doi: 10.3390/s19010203.
T. T. H. Le, T. Park, D. Cho, and H. Kim, “An Effective Classification for DoS Attacks in Wireless Sensor Networks,” Int. Conf. Ubiquitous Futur. Networks, ICUFN, vol. 2018-July, pp. 689–692, 2018, doi: 10.1109/ICUFN.2018.8436999.
A. Mansouri, B. Majidi, and A. Shamisa, “Metaheuristic neural networks for anomaly recognition in industrial sensor networks with packet latency and jitter for smart infrastructures,” Int. J. Comput. Appl., vol. 43, no. 3, pp. 257–266, 2021, doi: 10.1080/1206212X.2018.1533613.
S. Bitam, S. Zeadally, and A. Mellouk, “Bio-inspired cybersecurity for wireless sensor networks,” IEEE Commun. Mag., vol. 54, no. 6, pp. 68–74, 2016, doi: 10.1109/MCOM.2016.7497769.
S. Singh and R. S. Kushwah, “Energy efficient approach for intrusion detection system for WSN by applying optimal clustering and genetic algorithm,” ACM Int. Conf. Proceeding Ser., vol. 12-13-Augu, 2016, doi: 10.1145/2979779.2979840.
S. M. S. and M. F. Hichem Sedjelmaci, “An efficient intrusion detection framework in cluster-based wireless sensor networks,” Secur. Commun. Networks, vol. 5, no. June, pp. 422–437, 2012, doi: 10.1002/sec.
K. Q. Yan, S. C. Wang, S. S. Wang, and C. W. Liu, “Hybrid Intrusion Detection System for enhancing the security of a cluster-based Wireless Sensor Network,” Proc. - 2010 3rd IEEE Int. Conf. Comput. Sci. Inf. Technol. ICCSIT 2010, vol. 1, pp. 114–118, 2010, doi: 10.1109/ICCSIT.2010.5563886.
B. Subba, S. Biswas, and S. Karmakar, “A game theory based multi layered intrusion detection framework for VANET,” Futur. Gener. Comput. Syst., vol. 82, pp. 12–28, 2018, doi: 10.1016/j.future.2017.12.008.
O. A. Osanaiye, A. S. Alfa, and G. P. Hancke, “Denial of Service Defence for Resource Availability in Wireless Sensor Networks,” IEEE Access, vol. 6, no. c, pp. 6975–7004, 2018, doi: 10.1109/ACCESS.2018.2793841.
Y. K. Saheed and F. E. Hamza-Usman, “Feature Selection with IG-R for Improving Performance of Intrusion Detection System,” Int. J. Commun. Networks Inf. Secur, vol. 12, no. 3, pp. 338–344, 2020.
X.-S. Yang, “Furefly Algorithms for Multimodal Optimization,” in SAGA 2009, LNCS, 2009, pp. 169–178.
R. Moazenzadeh, B. Mohammadi, S. Shamshirband, and K. W. Chau, “Coupling a firefly algorithm with support vector regression to predict evaporation in northern iran,” Eng. Appl. Comput. Fluid Mech., vol. 12, no. 1, pp. 584–597, 2018, doi: 10.1080/19942060.2018.1482476.
D. Granato, J. S. Santos, G. B. Escher, B. L. Ferreira, and R. M. Maggio, “Use of principal component analysis (PCA) and hierarchical cluster analysis (HCA) for multivariate association between bioactive compounds and functional properties in foods: A critical perspective,” Trends Food Sci. Technol., vol. 72, no. 2018, pp. 83–90, 2018, doi: 10.1016/j.tifs.2017.12.006.
Y. K. Saheed, U. A. Baba, and M. A. Raji, “Big Data Analytics for Credit Card Fraud Detection Using Supervised Machine Learning Models,” in Big Data Analytics in the Insurance Market (Emerald Studies in Finance, Insurance, and Risk Management), K. Sood, B. Balusamy, S. Grima, and P. Marano, Eds. Emerald Publishing Limited, 2022, pp. 31–56.
B. Sweta et al., “A Novel PCA-Firefly Based XGBoost Classification Model for Intrusion Detection in Networks,” Electron., vol. 9, no. 2, p. 219, 2020.
Y. Kayode Saheed, A. Idris Abiodun, S. Misra, M. Kristiansen Holone, and R. Colomo-Palacios, “A machine learning-based intrusion detection for detecting internet of things network attacks,” Alexandria Eng. J., vol. 61, no. 12, pp. 9395–9409, 2022, doi: 10.1016/j.aej.2022.02.063.
D. A. Effendy, K. Kusrini, and S. Sudarmawan, “Classification of intrusion detection system (IDS) based on computer network,” Proc. - 2017 2nd Int. Conf. Inf. Technol. Inf. Syst. Electr. Eng. ICITISEE 2017, vol. 2018-January, pp. 90–94, 2018, doi: 10.1109/ICITISEE.2017.8285566.
Y. K. Saheed, A. O. Akanni, and M. O. Alimi, “INFLUENCE OF DISCRETIZATION IN CLASSIFICATION OF BREAST CANCER DISEASE,” Univ. PITESTI Sci. Bull. Electron. Comput. Sci., vol. 18, no. 2, pp. 13–20, 2018.
Y. K. Saheed, “A Binary Firefly Algorithm Based Feature Selection Method on High Dimensional Intrusion Detection Data,” in Illumination of Artificial Intelligence in Cybersecurity and Forensics. Lecture Notes on Data Engineering and Communications Technologies, S. Misra and C. Arumugam, Eds. Springer Cham, 2022.
R. Doshi, N. Apthorpe, and N. Feamster, “Machine learning DDoS detection for consumer internet of things devices,” Proc. - 2018 IEEE Symp. Secur. Priv. Work. SPW 2018, no. Ml, pp. 29–35, 2018, doi: 10.1109/SPW.2018.00013.

No competing interests reported.

Download PDF

Version 1

posted

You are reading this latest preprint version

A Novel Wrapper and Filter-based Feature Dimensionality Reduction Methods for Anomaly Intrusion Detection in Wireless Sensor Networks

Status:

Version 1

Abstract

Figures

Introduction

Materials And Methodology

3.1 Proposed IDS in Wireless Sensor Networks

3.2 Data Normalization

3.3 Wrapper Dimensionality Reduction with Firefly algorithm

3.4 Filter Dimensionality Reduction with Principal Component Analysis

3.5 Naïve Bayes

3.6 Random Forest

3.7 Performance Measures for IDS-WSN

3.7.1 Accuracy

3.7.2 Recall

3.7.3 Precision

3.7.4 F1 Score

Results And Discussion

4.1 Experimental Analysis of Wrapper Dimensionality Reduction and Classification with NB and RF

4.2 Computational Time of FFA-RF and FFA-NB algorithms

4.3 Experimental Analysis of Filter Dimensionality Reduction and Classification with NB and RF

4.4 Computational Time of PCA-RF and PCA-NB algorithms

Conclusion and future work

Declarations

References

Additional Declarations

Status:

Version 1