In this paper, a dataset is created to decide the necessity of spectrum handoff. Various machine learning classification algorithms are employed to learn the optimal decision boundary from the training data. Figures 7 to 10 show the training-set and test-set results for 100 and 500 users for various ML algorithms: Logistic Regression, the KNN algorithm, the SVM algorithm, the Naïve Bayes classifier, Decision Tree classification, and the Random Forest algorithm. These plots are made using the two independent variables, i.e., distance from the base station on the x-axis and power of each user on the y-axis. Each graph shows two regions, blue and yellow; the data points correspond to the users in the dataset, and each region contains the observations the model assigns to that class.
The blue observations are those for which the requirement of spectrum handoff (the dependent variable) is predicted to be 0, i.e., users who are under the coverage of base station 1. The yellow observations are those for which the requirement of spectrum handoff is predicted to be 1, meaning the user is not under the coverage of base station 1. Blue users therefore do not require spectrum handoff; once such a user crosses the boundary line, handoff is required. The models predict well, although a few data points fall in the wrong region; to quantify this error, we use the confusion matrix. The classification shown in Figs. 7(a), (b) and 9(a), (b) is the linear boundary produced by logistic regression; the remaining models use non-linear boundaries. The boundary shown in Figs. 7(c), (d) and 9(c), (d) is irregular because the K-NN algorithm classifies each point by its nearest neighbours. It also separates the users into their categories: the blue region for those who do not require handoff and the yellow region for those who do. Although the model shows good results, some yellow and blue points still lie in the wrong regions; this is not a serious issue, and tolerating it prevents overfitting. The SVM output shown in Figs. 7(e), (f) and 9(e), (f) is similar: a hyperplane separates the two classes into the blue and yellow regions.
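The blue/yellow decision regions in Figs. 7 to 10 are typically rendered by evaluating the trained classifier on a dense grid of (distance, power) points and colouring each grid point by its predicted class. A minimal sketch follows, with a toy K-NN model standing in; the training points, axis ranges, and grid resolution are illustrative assumptions, not the paper's data.

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

# Toy training data: [distance (m), power (dBm)]; label 1 = handoff required.
X = np.array([[100, -40], [200, -45], [300, -50],
              [700, -75], [800, -80], [900, -85]], dtype=float)
y = np.array([0, 0, 0, 1, 1, 1])
knn = KNeighborsClassifier(n_neighbors=3).fit(X, y)

# Dense grid over the feature plane (assumed ranges).
dd, pp = np.meshgrid(np.linspace(0, 1000, 200), np.linspace(-90, -30, 200))
grid = np.column_stack([dd.ravel(), pp.ravel()])

# 0 -> blue region (no handoff), 1 -> yellow region (handoff).
region = knn.predict(grid).reshape(dd.shape)
print("fraction of plane in the yellow region:", region.mean())
# With matplotlib, plt.contourf(dd, pp, region) would draw the two regions
# and a scatter of the users would overlay the data points.
```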
The Naïve Bayes classifier (see Figs. 8(a), (b) and 10(a), (b)) produces a fine Gaussian boundary that segregates the data points well. Some of its predictions are still in error, as the confusion matrix shows, but it remains a good classifier. The decision tree classification output shown in Figs. 8(c), (d) and 10(c), (d) differs from the other models: it splits the data with horizontal and vertical lines on the Distance and Power variables, because the tree tries to capture all the data. Figures 8(e), (f) and 10(e), (f) (Random Forest algorithm output) are very similar to the decision tree classifier. In the Random Forest classifier, we have taken 10 trees, each of which predicts Yes or No for the handoff; the classifier takes the majority of these predictions as its result. The number of incorrect predictions is minimal and the overfitting issue is avoided; changing the number of trees in the classifier would give different results.
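The six classifiers discussed above can be sketched with scikit-learn as follows. The synthetic (distance, power) dataset, the 600 m coverage rule, and the 25% test split are illustrative assumptions, not the authors' data; only the choice of models (including the 10-tree random forest) follows the text.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
n = 100
distance = rng.uniform(0, 1000, n)    # distance from base station 1 (m), assumed range
power = rng.uniform(-90, -30, n)      # received power (dBm), assumed range
X = np.column_stack([distance, power])
y = (distance > 600).astype(int)      # 1 = handoff required (toy labelling rule)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)
scaler = StandardScaler().fit(X_train)
X_train, X_test = scaler.transform(X_train), scaler.transform(X_test)

models = {
    "Logistic Regression": LogisticRegression(),
    "KNN Algorithm": KNeighborsClassifier(n_neighbors=5),
    "SVM Algorithm": SVC(kernel="linear"),
    "Naive Bayes Classifier": GaussianNB(),
    "Decision Tree Classification": DecisionTreeClassifier(random_state=0),
    "Random Forest Algorithm": RandomForestClassifier(n_estimators=10, random_state=0),
}
for name, model in models.items():
    model.fit(X_train, y_train)
    print(name, "test accuracy:", model.score(X_test, y_test))
```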
In Tables 1 and 2, the predicted output and real test output are given for 100 and 500 users, respectively. We can clearly see that some values in the prediction vector differ from the real vector values. These are called prediction errors and are highlighted in the tables for better understanding. To count the correct and incorrect predictions, we use the confusion matrix: a table whose rows represent the actual classes the model should have produced and whose columns represent the classes the algorithm predicted, as shown in Figs. 11 and 12. Diagonal entries are correct predictions, while off-diagonal entries are wrong predictions, so the errors are easy to see. With the confusion matrix, we can measure the quality of the model. In Fig. 11(a), the confusion matrix has 0 + 3 = 3 incorrect predictions and 19 + 3 = 22 correct predictions. The numbers of correct and incorrect predictions are shown in Figs. 11 and 12 for 100 and 500 users.
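The confusion matrix described for Fig. 11(a) can be reproduced directly from the logistic regression vectors in Table 1 (100-user case); scikit-learn's convention of rows = actual class and columns = predicted class is assumed here.

```python
from sklearn.metrics import confusion_matrix

# Test set and predict set vectors, copied from Table 1 (Logistic Regression).
y_test = [1,0,0,0,0,0,1,0,0,0,0,0,0,0,0,1,1,1,0,0,0,1,0,0,0]
y_pred = [1,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0]

cm = confusion_matrix(y_test, y_pred)
tn, fp, fn, tp = cm.ravel()
print(cm)
# [[19  0]
#  [ 3  3]]
print("correct:", tn + tp, "incorrect:", fp + fn)  # correct: 22 incorrect: 3
```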
Table 1
Predicted output and real test output of various ML algorithms for 100 users

Logistic Regression
  Test set:    [1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 1 1 0 0 0 1 0 0 0]
  Predict set: [1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0]

KNN Algorithm
  Test set:    [1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 1 1 0 0 0 1 0 0 0]
  Predict set: [1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 1 1 0 0 0 0 0 0 0]

SVM Algorithm
  Test set:    [1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 1 1 0 0 0 1 0 0 0]
  Predict set: [1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 1 1 0 0 0 0 0 0 0]

Naïve Bayes Classifier
  Test set:    [1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 1 1 0 0 0 1 0 0 0]
  Predict set: [1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 1 1 0 0 0 0 0 0 0]

Decision Tree Classification
  Test set:    [1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 1 1 0 0 0 1 0 0 0]
  Predict set: [1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 1 1 0 0 0 1 0 0 0]

Random Forest Algorithm
  Test set:    [1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 1 1 0 0 0 1 0 0 0]
  Predict set: [1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 1 1 0 0 0 1 0 0 0]
Table 2
Predicted output and real test output of various ML algorithms for 500 users

Logistic Regression
  Test set:    [0 1 0 0 1 1 0 1 1 0 1 0 0 1 0 1 0 1 0 0 1 1 0 0 0 0 1 1 1 0 1 0 0 0 1 0 0 1 0 1 1 0 1 1 0 0 0 0 1 1 0 0 1 0 0 0 1 0 0 1 0 1 0 1 1 0 0 0 1 0 0 0 1 0 0 1 1 0 0 1 1 0 0 0 0 0 1 0 0 0 1 0 1 0 0 1 0 1 0 0 0 0 0 0 1 0 0 0 1 0 1 0 0 0 0 0 0 1 1 0 0 0 0 1 0]
  Predict set: [0 1 1 0 0 0 0 0 0 0 1 0 0 1 0 1 0 1 0 0 1 1 0 0 0 0 1 1 0 1 0 0 0 0 1 0 0 1 0 1 0 0 0 1 0 0 1 0 0 1 1 0 1 0 0 0 1 0 0 0 0 1 0 0 1 0 0 0 1 0 0 0 1 0 0 1 1 0 0 0 1 0 0 1 0 1 0 0 0 0 0 1 1 1 0 1 0 1 0 0 0 1 0 0 1 0 1 0 1 1 1 0 0 0 1 1 0 0 1 0 0 0 0 1 0]

KNN Algorithm
  Test set:    [0 1 0 0 1 1 0 1 1 0 1 0 0 1 0 1 0 1 0 0 1 1 0 0 0 0 1 1 1 0 1 0 0 0 1 0 0 1 0 1 1 0 1 1 0 0 0 0 1 1 0 0 1 0 0 0 1 0 0 1 0 1 0 1 1 0 0 0 1 0 0 0 1 0 0 1 1 0 0 1 1 0 0 0 0 0 1 0 0 0 1 0 1 0 0 1 0 1 0 0 0 0 0 0 1 0 0 0 1 0 1 0 0 0 0 0 0 1 1 0 0 0 0 1 0]
  Predict set: [1 1 1 0 1 1 0 1 1 0 1 0 0 1 1 1 0 1 0 0 1 1 0 0 0 0 1 1 1 1 1 0 0 0 1 0 0 1 0 1 1 0 0 1 0 0 1 0 1 1 0 1 1 0 0 0 1 0 0 1 0 1 0 0 1 0 0 0 1 0 0 0 1 0 0 1 1 0 1 0 0 0 0 0 0 0 0 0 0 0 1 1 1 0 0 1 0 1 0 1 0 1 0 0 1 0 1 0 0 0 1 0 0 0 1 0 0 1 1 0 0 0 0 1 0]

SVM Algorithm
  Test set:    [0 1 0 0 1 1 0 1 1 0 1 0 0 1 0 1 0 1 0 0 1 1 0 0 0 0 1 1 1 0 1 0 0 0 1 0 0 1 0 1 1 0 1 1 0 0 0 0 1 1 0 0 1 0 0 0 1 0 0 1 0 1 0 1 1 0 0 0 1 0 0 0 1 0 0 1 1 0 0 1 1 0 0 0 0 0 1 0 0 0 1 0 1 0 0 1 0 1 0 0 0 0 0 0 1 0 0 0 1 0 1 0 0 0 0 0 0 1 1 0 0 0 0 1 0]
  Predict set: [0 1 1 0 0 0 0 0 0 0 1 0 0 1 0 1 0 1 0 0 1 1 0 0 0 0 1 1 0 1 0 0 0 0 1 0 0 1 0 1 0 0 0 1 0 0 0 0 0 1 1 0 1 0 0 0 1 0 0 0 0 1 0 0 1 0 0 0 1 0 0 0 1 0 0 1 1 0 0 0 1 0 0 0 0 1 0 0 0 0 0 1 1 1 0 1 0 1 0 0 0 1 0 0 1 0 1 0 1 1 1 0 0 0 1 1 0 0 1 0 0 0 0 1 0]

Naïve Bayes Classifier
  Test set:    [0 1 0 0 1 1 0 1 1 0 1 0 0 1 0 1 0 1 0 0 1 1 0 0 0 0 1 1 1 0 1 0 0 0 1 0 0 1 0 1 1 0 1 1 0 0 0 0 1 1 0 0 1 0 0 0 1 0 0 1 0 1 0 1 1 0 0 0 1 0 0 0 1 0 0 1 1 0 0 1 1 0 0 0 0 0 1 0 0 0 1 0 1 0 0 1 0 1 0 0 0 0 0 0 1 0 0 0 1 0 1 0 0 0 0 0 0 1 1 0 0 0 0 1 0]
  Predict set: [0 1 1 0 1 1 0 1 0 0 1 0 0 1 0 1 0 1 0 0 1 1 0 0 0 0 1 1 0 1 1 0 0 0 1 0 0 1 0 1 1 0 0 1 0 0 1 0 0 1 0 1 1 0 0 0 1 0 0 0 0 1 0 0 1 0 0 0 1 0 0 0 1 0 0 1 1 0 1 0 1 0 0 0 0 0 0 0 0 0 0 1 1 1 0 1 0 1 0 1 0 1 0 0 1 0 0 0 1 0 1 0 0 0 1 0 0 0 1 0 0 0 0 1 0]

Decision Tree Classification
  Test set:    [0 1 0 0 1 1 0 1 1 0 1 0 0 1 0 1 0 1 0 0 1 1 0 0 0 0 1 1 1 0 1 0 0 0 1 0 0 1 0 1 1 0 1 1 0 0 0 0 1 1 0 0 1 0 0 0 1 0 0 1 0 1 0 1 1 0 0 0 1 0 0 0 1 0 0 1 1 0 0 1 1 0 0 0 0 0 1 0 0 0 1 0 1 0 0 1 0 1 0 0 0 0 0 0 1 0 0 0 1 0 1 0 0 0 0 0 0 1 1 0 0 0 0 1 0]
  Predict set: [0 1 1 0 1 1 1 1 1 0 1 0 0 1 1 1 0 1 0 0 1 1 0 0 0 0 0 1 0 1 1 0 0 0 1 0 0 1 0 1 1 0 1 1 0 0 0 0 1 1 0 0 1 0 0 0 1 0 0 1 0 1 0 0 1 0 0 0 1 0 0 0 1 0 0 1 1 0 1 0 1 0 1 1 0 1 0 0 0 0 0 1 1 1 0 1 0 1 0 1 0 1 0 0 1 0 1 0 0 0 1 0 0 0 1 0 0 1 1 0 0 0 0 1 0]

Random Forest Algorithm
  Test set:    [0 1 0 0 1 1 0 1 1 0 1 0 0 1 0 1 0 1 0 0 1 1 0 0 0 0 1 1 1 0 1 0 0 0 1 0 0 1 0 1 1 0 1 1 0 0 0 0 1 1 0 0 1 0 0 0 1 0 0 1 0 1 0 1 1 0 0 0 1 0 0 0 1 0 0 1 1 0 0 1 1 0 0 0 0 0 1 0 0 0 1 0 1 0 0 1 0 1 0 0 0 0 0 0 1 0 0 0 1 0 1 0 0 0 0 0 0 1 1 0 0 0 0 1 0]
  Predict set: [1 1 1 0 1 1 0 1 1 0 1 0 0 1 1 1 0 1 0 0 1 1 0 0 0 0 1 1 1 1 1 0 0 0 1 0 0 1 0 1 1 0 1 1 0 0 1 0 1 1 1 0 1 0 0 0 1 0 0 1 0 1 0 0 1 0 0 0 1 0 0 0 1 0 0 1 1 0 1 1 1 0 1 0 0 1 0 0 0 0 0 1 1 1 0 1 0 1 0 1 0 1 0 0 1 0 1 0 0 1 1 0 0 0 1 1 0 1 1 0 0 0 0 1 0]
We analyzed the performance of the ML algorithms on various parameters, namely Accuracy, Precision, Sensitivity, Specificity, F1_score, and the Confusion Matrix [30], by varying the number of users; the results are presented in Tables 3 and 4. Accuracy measures how often the model is correct overall. Precision is the fraction of predicted positives that are actually positive. Sensitivity (recall), on the other hand, is the fraction of actual positives the model correctly identifies, so it is useful for assessing how well the model predicts a positive outcome. Specificity is the corresponding measure for the negative class: the fraction of actual negatives the model correctly identifies. The F1 score is the harmonic mean of precision and sensitivity; it does not take the true-negative count into account. It is observed from Tables 3 and 4 that as the number of test users increases, the number of prediction errors grows. The system is therefore expected to become more efficient with fewer test users and more trained ones.
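As a consistency check, the logistic regression row of Table 3 follows from the standard metric definitions applied to its confusion-matrix counts (TN = 19, FP = 0, FN = 3, TP = 3, as derived from Table 1):

```python
# Confusion-matrix counts for logistic regression, 100 users.
tn, fp, fn, tp = 19, 0, 3, 3

accuracy    = (tp + tn) / (tp + tn + fp + fn)   # 22/25 = 0.88
precision   = tp / (tp + fp)                    # 3/3   = 1.0
sensitivity = tp / (tp + fn)                    # 3/6   = 0.5  (recall)
specificity = tn / (tn + fp)                    # 19/19 = 1.0
f1 = 2 * precision * sensitivity / (precision + sensitivity)  # 0.666...

print(accuracy, precision, sensitivity, specificity, round(f1, 3))
# 0.88 1.0 0.5 1.0 0.667
```

These values match the Logistic Regression row of Table 3 (0.88, 1.0, 0.5, 1.0, 0.666).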
Table 3
Performance analysis of various ML algorithms for 100 users

Algorithm                     | Accuracy | Precision | Sensitivity | Specificity | F1_score
Logistic Regression           | 0.88     | 1.0       | 0.5         | 1.0         | 0.666
KNN Algorithm                 | 0.96     | 1.0       | 0.833       | 1.0         | 0.909
SVM Algorithm                 | 0.92     | 1.0       | 0.666       | 1.0         | 0.8
Naïve Bayes Classifier        | 0.92     | 1.0       | 0.666       | 1.0         | 0.8
Decision Tree Classification  | 1.0      | 1.0       | 1.0         | 1.0         | 1.0
Random Forest Algorithm       | 0.96     | 1.0       | 0.833       | 1.0         | 0.909
Table 4
Performance analysis of various ML algorithms for 500 users

Algorithm                     | Accuracy | Precision | Sensitivity | Specificity | F1_score
Logistic Regression           | 0.776    | 0.704     | 0.673       | 0.835       | 0.688
KNN Algorithm                 | 0.856    | 0.769     | 0.869       | 0.848       | 0.816
SVM Algorithm                 | 0.792    | 0.738     | 0.673       | 0.860       | 0.704
Naïve Bayes Classifier        | 0.84     | 0.782     | 0.782       | 0.873       | 0.782
Decision Tree Classification  | 0.832    | 0.735     | 0.847       | 0.822       | 0.787
Random Forest Algorithm       | 0.832    | 0.711     | 0.913       | 0.784       | 0.799