A Novel Deep Encrypted Network Traffic Discriminator in Software Defined Network (SDN)

doi:10.21203/rs.3.rs-2054446/v1

Download PDF

Research Article

A Novel Deep Encrypted Network Traffic Discriminator in Software Defined Network (SDN)

https://doi.org/10.21203/rs.3.rs-2054446/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Nowadays, Internet users are rising and need to be supplied with an adoptable quality of service (QoS). Network traffic classification is one of the essential functions that can lead the internet service provider (ISP) to provide required network resources rationally. In facing new flows, the network traffic classification accuracy improvement can play a critical role in network performance, QoS, and security improvement. In this paper, we propose a novel classification model, including (1) a deep autoencoder and (2) a classifier to improve the network traffic classification accuracy in facing new network flows. The deep autoencoder is designed and evaluated in this article with the mean square error (MSE) metric. The proposed deep autoencoder has advanced the model to extract the effective features from the training set more accurately than other methods like the manual method or shallow neural network model. Three distinct classifiers are considered to be added to the deep autoencoder and make it more accurate. The transfer learning is used to add the distinct classifiers, namely logistic regression, random forest, decision tree, and Support Vector Machine (SVM), as a layer to the proposed model. The proposed deep classification model is evaluated with accuracy and f-score measures. The simulation results show that the proposed model has more accuracy and f-score than Convolutional Neural Network (CNN). The UNB ISCX VPN-nonVPN dataset is used for training and testing the model. Software Defined Network (SDN) architecture is used for the proposed model to be deployed because this architecture has made the network more programmable and flexible than the traditional closed networks.

Network traffic classification

Application identification

Machine learning

Deep learning

Auto-encoder

Software Defined Networking (SDN)

Network traffic classification plays a significant role in network performance, QoS and security [1][2]. Network traffic classification is a network component implemented in the network equipment to identify the traffic based on the different applications [3]. There are various approaches for network traffic classification, including port-based, deep packet inspection (DPI) and machine learning-based methods. The port-based method uses port numbers to discriminate the network traffic and detect the application based on the port number used for the destination. The Internet Assignment Number Authority (IANA) has registered the port number [4]. For example, E-mail uses port number twenty-five for sending protocol called SMTP and 110 for receiving called POP3, and port eighty has been registered for the web application. The port number cannot provide the network requirements in peer-to-peer (P2P) applications in which port numbers are changeable [3]. The DPI has been proposed to overcome the port number limitation; therefore, this approach analyzes the packet's payload to find a specific pattern (i.e. Signatures). The applications are classified based on their characteristics. The pattern in the first step should be extracted and kept up-to-date because this method can detect the evolution of the applications [5]. The drawback of DPI appears when the encrypted traffic has been used increasingly; hence, this method has a high computational overhead and is not adaptable to users' privacy [6]. The machine learning-based method is another approach to classify the network traffic dynamically while it can detect and support the encrypted traffic in traditional and SDN architecture [3], [4], [7]–[8]. The researchers have proposed a machine learning-based algorithm using statistical flow features such as packet size, packet length, the time interval between packets, etc., to classify the network traffic. Machine learning techniques generally are categorized into three ones:

Supervised learning

It is a method that extracts the patterns from the dataset with labelled samples. Classification and regression are the common usages based on the trained model extracted from the train set. The classification or prediction is the output of this technique [3].

Unsupervised learning

It is a method that extracts the patterns from the dataset with unlabeled samples. Clustering is one of the unsupervised techniques. There is no need to extract the model from a labelled dataset. The clustering is done based on the similarity among the samples. This similarity is calculated based on the features. The number of classes can be predefined like K-Means. The output of this technique is the clusters based on which their members are selected based on their most similarity [3], [9].

Semi-supervised learning

It is done based on the dataset in which a part of it is labelled, and the other part is unlabeled. This method benefits from both supervised and unsupervised approaches.

The novel model implemented with machine learning needs to be deployed in the network, but it is complex with traditional network architecture; hence, SDN, a cutting-edge paradigm, has been supposed to make the network more programmable, agile, flexible, and innovative SDN [10]. The main idea of SDN is the separating the control and data traffic [11][12]–[13]. SDN consists of three layers and three APIs, as shown in Fig.1. \

The Data plane is responsible for passing the flows through the network equipment. Unlike the traditional network, all equipment in this layer requires forwarding only, so the equipment in this layer is called Forwarding Element (FE). This layer is connected to the control layer, which is the brain of the network in SDN, with the southbound API such as OpenFlow, which is open source. The controller is centeralized, which can be physical or conceptual centralized, and it has caused the controller to have a global view of the network. This global view has led the SDN controller to make the optimized decision. The network application is a layer which hosts network applications as a module and is connected with the controller using northbound API like SNMP. Finally, the east-west API connects the controller and scales up the control plane, which is the SDN key challenge [14][15].

The main problem of this paper is network traffic classification which should be implemented in the python module so that it can be deployed in the SDN controller. In this paper, a novel deep autoencoder architecture will be designed to decrease Mean Square Error (MSE). In the continue, four distinct classifiers, consisting of logistic regression, Support Vector Machine (SVM), Decision Tree, and random forest, are considered to be added to the proposed deep autoencoder. The transfer learning is used to select the proper classifier to improve the classification accuracy and f-score. The UNB ISCX VPN-nonVPN dataset is used for training and testing the model. The proposed model, including the deep autoencoder and the proper classifier, is evaluated and compared with the proposed CNN model.

The rest of the paper is organized into five sections. Section 2 is about the related work, and the proposed model will be expressed in section 3. The simulation results are evaluated and reported in section 4, and finally, the paper will be concluded in section 5.

Various studies have been done on network traffic classification to improve the network traffic classification accuracy. This section reviews the articles addressing this problem and proposes the machine learning-based method for network traffic classification.

A. Shrivastav et al. have proposed the classification component using K-means and SVM. They have used KDD Cup 1999 dataset as the training and testing set. Their experiments have shown that the k-means clustering algorithm has reached an accuracy of 90.65 percent and 76 percent with SVM [16].

Canti et al. have worked on the network classification problem and used the random forest algorithm to classify encrypted network traffic. It has provided the network resources based on the applications installed on mobile phones. They used a dataset with 62747 samples. The samples have been distributed among the classes, including Gmail, Twitter, Facebook, Tumblr, Dropbox, Google Plus and Evernote and the number of samples belonging to each class is 11600, 7700, 10120, 1670, 15104, 7813, and 8740, respectively. The proposed model has achieved an accuracy of 95 percent [17].

Yao et al. have suggested a Recurrent neural network (RNN) model called the Short-Term Memory Algorithm (LSTM) to classify the encrypted network traffic. Their proposed model consists of (I) feature extraction and (II) a model extraction component. The ISCX VPN-non-VPN dataset has been used for the training and testing set, which includes 15,545 encrypted samples and 22706 non-encrypted samples. The authors have used a part of the dataset for testing consisting of 1943 encrypted samples and 2838 non-encrypted samples. The proposed model achieved 91.2 percent accuracy compared with the other methods, which had reached 89.8 percent [18].

B. Yamanasawasler et al. have worked to identify the applications and used machine learning to discriminate them. They have used the UNB ISCX for training and testing phases. They have proposed a module consisting of three key components: (I) Data collection, (II) Feature extraction, and (III) Classification. They have made use of four classification algorithms KNN, random forest, J48 and Naive Bayes classifier. Their feature selection is manual in the proposed module. The classification accuracy reached 94% and 90.87% with KNN and random forest algorithms, respectively [19].

G. Draper-Gil et al. have evaluated two classification algorithms to discriminate the encrypted network traffic. They have used Weka to implement the model, and the simulation results show that C4.5 has a recall equal to 92 percent and accuracy equals 80 percent. This algorithm performs better than the KNN in classification. The tarting and testing set in this research have been collected privately in the author's lab [20].

Z. Zou et al. have worked on a deep neural network to classify the encrypted network traffic. They have compared the classification performance between convolutional neural networks (CNN) and Recurrent neural network (RNN) algorithms. They have evaluated the classification performance with precision and recall. The experiment showed that LSTM has better behaviour in traffic classification than CNN, especially in encrypted traffic. They have used ISCX VPN-nonVPN traffic for training and testing [21].

H. Zhou et al. have proposed the classification algorithm based on Min-Max Normalization Convolutional Neural (MMN-CNN) to classify the network classification performance. The authors have used the Moore dataset for training and testing. They achieved an accuracy of 99.30 percent [22].

Z. Fan et al. have used a supervised and unsupervised algorithm to classify the network traffic. The unsupervised algorithm is K-means, and the supervised algorithm is SVM which has been used in this research. The Moore dataset is the training and testing set used in the proposed model. The accuracy of classification reached 98 percent [6].

T. Auld et al. have proposed the Bayesian-based algorithm to classify the network traffic into distinct applications. They have used the Moore dataset. The authors showed that the proposed model reached 95 percent of accuracy. Their experiment also showed that the increase in the number of samples has caused the accuracy to rise from 88.3 percent to 95 percent [23].

M. Lopez-Martin et al. have worked on the Internet of Things (IoT) traffic and proposed a CNN and RNN model to classify IoT network traffic with device detection or attack detection in wireless sensor networks. They have used the RedIRIS dataset and evaluated their model with accuracy, precision, recall, and f-score. They could achieve accuracy and f-score of about 96 percent and 95 percent, respectively [24].

M. Lotfollahi et al. have designed a convolutional neural network (CNN) and a stack autoencoder network (SAE) for traffic classification and compared their classification performance. They have used the ISCX VPN-nonVPN data set for training and testing. The proposed CNN model has better accuracy in identifying the applications. The accuracy which they experienced was 98 percent, and the f-score was 93 percent [25].

O. Aouedi et al. have proposed a model based on a deep autoencoder and a stack sparse autoencoder (SSAE) classifier along with denoising and dropout techniques to classify the network traffic. The authors used a private dataset collected by [26]. The authors reached an accuracy of 89.09% and 89.05% of f-score [27]. The following brief of the related work is shown in Table 1.

Table 1

A brief of related work

Ref. No	Proposed Approach	Classification Performance Metrics	Dataset	Accuracy (%)
[16]	K-Means & SVM	Accuracy	KDD Cup 1999	K-Means=90.65 SVM=76
[17]	Random Forest	Accuracy	Private	95
[18]	LSTM	Accuracy	UNB ISCX VPN-Non-VPN Dataset	91.2
[19]	KNN	Accuracy	UNB ISCX VPN-Non-VPN Dataset	94
[20]	C4.5	Accuracy & Recall	Private	80
[21]	LSTM	Accuracy, Recall	UNB ISCX VPN-Non-VPN Dataset	99
[22]	MMN-CNN	Accuracy	The Moore Dataset	99.30
[6]	K-Means & SVM	Accuracy	The Moore Dataset	98
[23]	Bayesian	Accuracy	The Moore Dataset	95
[24]	CNN & RNN	Accuracy, Precision, Recall & f-score	The RedIRIS	96
[25]	CNN	Accuracy & f-score	UNB ISCX VPN-Non-VPN Dataset	98
[27]	DeepAutoencoder+denoising and dropout techniques	Accuracy, F-score, precision, recall	Private	89.09

According to recently viewed papers in this article, the solutions, measures and their accuracy in network traffic classification are studied, and their datasets are examined.

In this paper, we design two discrimination models to classify network traffic. The first is a model to classify the network traffic into encrypted and not-encrypted traffic, and the second is used to identify the network traffic in the designated applications. Both models have two distinct significant components, including feature extraction and classifier. The deep autoencoder is designed for classification with high accuracy in facing new network flows. The proposed deep autoencoder model will be proven with a minimum MSE measure. The proposed autoencoder will be frozen, and the learned model will be transferred to examine four distinct classifiers. The classifiers containing Logistic Regression (LR), Random Forest (RF), Decision Tree (DT), and Support Vector Machine (SVM) are added to the deep autoencoder to classify the network traffic for both applications (encrypted and non-encrypted network traffic and application identifications). The training and testing dataset will be introduced in the following subsections, and then preprocessing will be discussed. The proposed deep autoencoder with different classifiers will be introduced in the following subsections. The top view of the proposed model is shown in Fig.2.

3-1. Dataset

This research uses the "ISCX VPN-nonVPN" network traffic dataset for training and testing. The Canadian Cyber Security Institute has collected this dataset and published it in the university of New Brunswick (UNB) [20]. The total amount of this dataset is 28 GB. This dataset has captured and collected the regular session and a session over VPN. This dataset has 14 total traffic categories: VOIP, VPN-VOIP, P2P, VPN-P2P, etc. The applications are divided into seven classes for each VPN and regular network traffic. 70 percent of the samples in this dataset are used for training, and 30 percent are for the testing phase.

The application distribution is presented in Table 2. It shows the dataset has 30092 regular traffic samples and 29613 VPN traffic samples.

Table 2

Applications distribution in ISCX-VPN-NONVPN

VPN		Non-VPN
Applications	#No of Samples	Applications	#No of samples
Browsing	9999	Browsing	10000
Chat	2839	Chat	2505
ftp	4704	ftp	3975
E-mail	2444	E-mail	1364
P2P	3415	P2P	4000
Streaming	1115	Streaming	1284
VoIP	5576	VoIP	6485
Total VPN Samples	30092	Total Non-VPN Samples	29613

The application distribution with VPN and Non-VPN clustering is shown in Fig.3. This distribution states that the dataset is imbalanced. More samples are more likely to be predicted; therefore, it should be solved to make the likelihood the same to be classified for each class.

The model tries to use this labelled (70 percent of the dataset) to extract patterns to classify the network traffic into VPN and Non-VPN, in addition to application identification. The rest of the dataset is used for testing because the extracted model has not seen these samples before; therefore, it can give us the performance of the proposed model at facing new network traffic.

3-2. Preprocessing

The preprocessing phase is significant in designing a classification model. To summarize the steps, the taken actions are listed here:

Data quality assessment ensures the features' quality and sample data.
Data transformation is done here because we used the CICflowmeter tool to convert Pcap files to CSV format [28]. The main data collected is based on PCAP format, which should be converted to CSV format.

The labels which are not numerical should be converted to the numerical class. The labels are converted to one-hot-encoding to prepare the dataset for training and testing.
Several unnecessary features should be omitted. For example, Timestamps, source and destination IPs, and flow ID are removed from the training and testing part of the dataset.
The weighted class method is used to make the imbalanced dataset balanced. The weight of samples belonging to its class is calculated as below:

This weight causes the classes to be balanced. The class with more samples is more likely to be predicted in an imbalanced dataset, but the classification of classes will be equal in the balanced dataset.

The normalization is done to make the value of each feature between 0 and 1.

These actions have been taken to preprocess the dataset to prepare the training and test sets.

3-3. The proposed Deep Autoencoder

The autoencoder is a model to extract the features for classification [27]. The multilayer neural network can be used to extract the features and includes the input, hidden, and output layers. The number of hidden layers can increase the deep autoencoder, but this increase is not always necessary and can sometimes decrease the classification accuracy [29]. The hidden layers can improve the feature selection and extraction to improve the classification accuracy. The deep neural network needs to be designed as an autoencoder. To evaluate the autoencoder, the encoder and decoder are connected to cause the encoder model to be evaluated. The encoder and decoder are composed as shown in Fig.2. The better autoencoder should minimize the difference between the value of input and output layers. The number of features reduces the number of nodes called code in Fig.4.

The training phase instruction has been followed to train the AE network. First, the TensorFlow modules must be called to build the model. Neural networks generally consist of three layers: input layer, hidden layer(s), and output layer. Here, the model is built on the same basis. To build a model, the input layer must first be created. The input layer consists of neurons that receive only the input and transmit them to other layers. The number of neurons in the input layer should be equal to the features in the data set. According to the number of features, the input nodes are set at 80. The Dense class is used to implement the layers of the model. The activation function which is used in hidden layers is Relu. The activation function is used for forwarding and backward, but the output layer activation function is different, which is sigmoid in the designed model. The training phase is done with compile() function in python. The proposed model uses the Adam optimizer [30] to update weights. The strategy pseudocode used to design the deep autoencoder is demonstrated in Algorithm.1. The algorithm indicates that the input and output nodes are set based on the dataset features and classes. The number of input nodes is 79, and the output nodes are specified based on the number of classes used for classification. In Algorithm.1, it is stated how the hidden layer with how many nodes are added to the encoder model.

Algorithm 1: The deep autoencoder model evaluation Pseudocode

Input: Input layer, Output layer

Step1à Create the model with input layer and output layer

Step2à Set i = 0

Step3à Add a hidden layer and set i = i +1

Step4à Set the added hidden layer nodes equal to (Input layer. nodes- (i × 5))

Step5à Encoder = Connect all nodes to make a mesh neural network

Step6à Decoder = Invert the Encoder and create the Decoder

Step7à Combined model = Combine the Encoder and Decoder

Step8à Evaluate the combined model and save the MSE between the input and output in a list

Step9à Continue steps 3 to 8 if the number of nodes in the hidden layer is more than the nodes in the output layer; otherwise, stop the loop and exit

Output: The list of MSE evaluated values for each combined Encoder and Decoder

The model is an encoder which should decrease the number of features. The hidden layer is added between the input and output layers, and the encoder needs to be evaluated. The reverse encoder as the decoder is added to the encoder, making the model in which the input and output should be the same. It paves the way to evaluate the model with the difference between input and output values. The hidden layer with the neurons, which has five neurons less than the previous layer, is added. This procedure continues until the number of hidden layer neurons is less than the output layer. The model is a full mesh model; hence, all neurons should be connected to the next and previous layers' neurons.

The model evaluation is done with the mean square error (MSE) as a loss function which should be minimized. Fig.4. and Fig.5. are the output of the training phase with the proposed model. The layers and the number of nodes in each layer, encoder, and decoder, are shown in Fig.5. and Fig.6., respectively.

The input layer has 79 nodes equal to the number of features, and the minimum MSE belongs to the output layer with ten nodes; therefore, the features are reduced to ten. It shows that the classifier must have ten nodes input layers.

The minimum MSE between output and input belongs to the hidden layer with ten neurons. Fig.7 shows the combination of encoder and decoder used to evaluate and achieve the minimum MSE in this research.

3-4. The Classifier Layer

The classifier is a layer which should be added to the autoencoder, so it should be able to classify the input network traffic with only ten features which is the output of the autoencoder. According to the proposed deep autoencoder, an accurate classifier can improve the classification accuracy in facing the new network flows. To find the best classifier, the deep autoencoder has been trained with 100 iterations, and all weights and biases have been trained, so the transfer-learning approach is used to freeze the model. The best classifier is selected among four well-known classification algorithms, including Decision Tree (DT), Random Forest (RF), Support Vector Machine (SVM) and Logistic Regression (LR). These classifiers should receive the ten features and classify the network traffic. This model should be able to do the classification between VPN and Non-VPN and identification among the applications.

Application Identification: the proposed model is shown in Fig.8. This model is a layer that classifies the network traffic with ten features given by the proposed autoencoder into seven classes shown in Table 2. This model is trained for the VPN and Non-VPN separately.
Encrypted Traffic Classification: the proposed model is indicated in Fig.9. This model classifies the network traffic with ten features from the proposed deep autoencoder into VPN or Non-VPN. It predicts that the traffic is VPN or Non-VPN.

After adding the classifier, there are two different models for two distinct responsibilities: VPN and Non-VPN classification and application identification.

3-5. Hyperparameter Tuning

The hyperparameters for the model are developed in python using sklearn.model_selection. The consequences of the GridSearchCV function for each classifier to increase the accuracy are shown in Table 3.

Table 3

Hyperparameter values

Classifier	Tuning parameters & values
Logistic Regression (LR)	C=0.01, penalty=12, solver= 'liblinear'
SVM	C=0.01, gamma=scale, kernel=’rbf’
Decision Tree (DT)	Max depgth=6
Random Forest (RF)	max_features=’log2’, n_estimators=1000

The training phase is done with a cross-validation technique to prevent overfitting. The k-fold method with k=10 has been used to train the model.

3-6. Model Deployment

The proposed model has been deployed in the controller using SDN architecture. The Ryu controller has been used as the controller, and the model has been developed in the Ryu with Python 3. The trained model has been deployed in the controller to classify the encrypted network traffic and identify the applications.

This section evaluates the proposed model with the classification metrics, including accuracy, precision, recall and f-score. The proposed model is compared with similar work using CNN to show the accuracy improvement in the proposed model.

4 − 1. Experiment Situation

The situation that this experiment has been conducted is shown in Table 4.

Table 4

Training, Testing and Simulation Conditions
Hardware/Software	Framework	Description
Hardware	CPU	Intel(R) Core(TM)-i7-2.4GH
	GPU/TPU	Google Colab
	Ram	12 GB
	Hard	2 TB
Software	OS	Windows 10-64bit
	Programming language	Python 3.7.3
	Programming IDE	Spyder 3.3.3 + Jupyter 5.7
	Deep learning software	Tensorflow/python
	Deep learning library	Keras
	Machine Learning library	scikit-learn v0.21.3
	Training and Testing Dataset	ISCX VPN-nonVPN collected in UNB
	Controller	Ryu

4 − 2. Evaluation Metrics

The metrics by which the model is evaluated and compared with other models are accuracy, precision, recall and f-score. In this subsection, the equals these metrics are calculated are defined. Accuracy, Precision, Recall and F-score metrics are presented in Equals (2), (3), (4) and (5), respectively.

$$Accuracy= \frac{(TP+TN)}{(TP+TN+FP+FN)}$$

$$Precision= \frac{TP}{(TP+FP)}$$

$$Recall= \frac{TP}{(TP+FN)}$$

$$F-Score=2 \times \frac{(Precision-Recall) }{(Precision+Recall)}$$

TP, TN, FP and FN are defined as below:

TP (True Positive) refers to samples belonging to the positive class and classified as positive by the classifier correctly.
TN (True Negative) refers to samples belonging to the negative class and classified as negative by the classifier correctly.
FP (False Positive) refers to samples belonging to the negative class but wrongly classified as positive by the classifier.
FN (False Negative) refers to samples belonging to the positive class but wrongly classified as negative by the classifier.

4 − 3. Evaluation of Encrypted Traffic Discrimination

This subsection compares four distinct classifiers with Recall, Precision and F-score metrics in Table 5.

Table 5

The Evaluation of VPN and Non-VPN Discrimination
Network Traffic Type	VPN			Non-VPN
Metrics Classifier	Recall	Precision	F-score	Recall	Precision	F-score
Logistic regression	0.68	0.59	0.63	0.51	0.60	0.55
Random forest	0.97	0.93	0.95	0.92	0.97	0.94
Decision tree	0.94	0.91	0.93	0.90	0.94	0.92
SVM	0.83	0.58	0.68	0.37	0.69	0.48

The model's accuracy in classifying the network traffic facing the new flows is shown in Fig. 10. This bar chart compares four distinct classifiers' accuracy.

According to Fig. 10, random forest discriminated VPN and Non-VPN samples with an accuracy of 99 percent, which is the highest among DT, SVM and LR. The second highest accuracy belongs CNN model proposed in [25]. DT, SVM and LR with 91%, 67 and 56 are the accuracy of other classifiers, respectively.

4–4. Evaluation of Applications Identification

The four different classifiers have been evaluated separately to indicate the model classification performance; thus, precision, recall and f-score of random forest, decision tree, SVM and logistic regression will be indicated in Tables 6, 7, 8 and 9, respectively. Each table shows three metrics, including recall, precision and f-score, for the classification of each application. Each table shows the classification performance metrics of a specific classifier.

Table 6

Applications identification performance of random forest
labels	Recall	Precision	F-score
Chat	0.87	0.90	0.92
Bittorent	0.87	0.96	0.93
E-mail	0.98	0.99	0.99
Facebook audio	0.98	0.98	0.98
Facebook chat	0.87	0.90	0.93
FTPS	0.99	0.99	0.99
Hangout audio	0.95	0.98	0.98
Hangout chat	0.93	0.97	0.99

Table 7

Applications identification performance of decision tree
Labels	Recall	Precision	F-score
Chat	0.80	0.96	0.90
Bittorent	0.77	0.97	0.89
E-mail	0.99	0.97	0.86
Facebook audio	0.98	0.99	0.99
Facebook chat	0.82	0.99	0.98
FTPS	0.98	0.97	0.89
Hangout audio	0.93	0.98	0.98
Hangout chat	0.87	0.95	0.94

Table 8

Applications identification performance of SVM
labels	Recall	Precision	F-score
Chat	0.84	0.84	0.82
Bittorent	0.73	0.73	0.74
E-mail	0.63	0.63	0.65
Facebook audio	0.63	0.63	0.50
Facebook chat	0.61	0.61	0.70
FTPS	0.76	0.76	0.68
Hangout audio	0.68	0.68	0.67
Hangout chat	0.56	0.56	0.70

Table 9

Applications identification performance of logisitic regression
labels	Recall	Precision	F-score
Chat	0.53	0.50	0.52
Bittorent	0.30	0.62	0.41
E-mail	0.44	0.33	0.38
Facebook audio	0.73	0.46	0.56
Facebook chat	0.83	0.50	0.63
FTPS	0.37	0.44	0.40
Hangout audio	0.63	0.60	0.62
Hangout chat	0.89	0.57	0.70

4–5. Evaluation of Encrypted Network Traffic Discriminator Model

In this subsection, the VPN and Non-VPN model is combined with the applications identification model, so these are consecutive models, and their accuracy should be multiplied together. This formula will iterate for the f-score metric. The average accuracy of the proposed model with distinct four classifiers will be compared with the CNN model proposed in [25].

Each sample type has no priority over others, so recall or precision cannot represent the classification performance. Thus, accuracy and f-score metrics can give us more information about the network encrypted traffic classification performance. These metrics, containing accuracy and f-score, will be shown in Fig. 11 and Fig. 12, respectively.

According to Fig. 11, the proposed model has achieved an accuracy of 96%, which shows a 3% improvement compared to similar work in [25]. It illustrates that the proposed model has better accuracy for network traffic discrimination while the several sessions are encrypted. The proposed model has had better behaviour in facing the new network traffic.

As shown in Fig. 12, the proposed model has had a better f-score, which is another metric for comparison, and it is 93% which shows a 2% improvement compared with the CNN model in [25].

4–6. Discussion

In this paper, a global model has been proposed to discriminate the encrypted network traffic. The proposed model is based on a deep autoencoder and a classifier. The model has used the UNB ISCX VPN-nonVPN dataset for training the model. 70 percent of the dataset has been used for the training phase, and 30 percent of the dataset has been used for testing the model. The proposed autoencoder has been frozen till a classifier layer can be added to the output layer. The combined with four distinct classifiers has been evaluated in accuracy and f-score. The proposed model has two sections (1) this module classifies the network flows whether they are encrypted or not, and (2) the other module identifies the applications which have different requirements. These two modules have been evaluated and reported in this paper. Finally, the whole model has been compared with the proposed CNN-based model in [25]. The classification and our experiment show that the proposed model has an improvement equal to 3% in accuracy and an improvement equal to 2% in f-score compared with the CNN model. The proposed model has been defined to be deployed in SDN architecture, so the proposed trained model has been deployed in the Ryu controller to detect the Applications and their requirements. This controller can improve the network traffic classification and resource allocation with QoS improvement.

The network traffic classification accuracy improvement is the main issue of this paper. The proposed model should be deployed in the SDN controller to classify the network traffic. A model composed of the deep autoencoder with a classifier has been proposed, which at the first step, hs discriminates the encrypted and non-encrypted network traffic. Then according to the regular or encrypted traffic, the applications are identified to characterize the network traffic requirements. The proposed model has been deployed in an SDN controller called Ryu and used in the Mininet emulator. The experiment results show that the proposed autoencoder has more accuracy with random forest classifier compared with logistic regression, decision tree and SVM. The model has been trained with 70% of the ISCX VPN-nonVPN collected in UNB. The rest of the dataset has been used for testing. The proposed model has been compared with the CNN model, and simulation results show that the proposed model has 3% more accuracy than CNN and 2% more f-score compared to the CNN model. The model has been deployed in the SDN Ryu controller. However, the resource allocation based on the application requirements has not been completed, so the resource allocation following the application requirement can be the future work. This model can be examined next to the resource allocation module, and the QoS can be evaluated for future research.

Ethics approval and consent to participate

“Not applicable”

Consent for publication

“Not applicable”

Availability of data and materials

“The datasets analyzed during the current study are available in the [https://www.unb.ca/cic/datasets/vpn.html] repository”

Competing interests

“The authors of this paper hey have no competing interests”

Funding

“No funds, grants, or other support was received.”

Authors' contributions

“Alireza Shirmarz designed the general model, and Negin Mohammadi implemented the model under supervision of Alireza Shirmarz. Alireza Shirmarz and Negin Mohammadi made the simulation results and comparison.”

Acknowledgements

“Not applicable”

Y. X. J. Zhang, Y. Xiang, Y. Wang, W. Zhou and Y. Guan, “Network traffic classification using correlation information,” IEEE Trans. Parallel Distrib. Syst., vol. 24, pp. 104–117, 2013, doi: 10.1109/TPDS.2012.98.
A. Shirmarz and A. Ghaffari, “Performance issues and solutions in SDN-based data center: a survey,” Journal of Supercomputing, vol. 76, no. 10, pp. 7545–7593, Oct. 2020, doi: 10.1007/S11227-020-03180-7.
M. Shafiq, X. Yu, A. A. Laghari, and N. Karn, “Network Traffic Classification techniques and comparative analysis using Machine Learning algorithms,” in 2016 2nd IEEE Int. Conf. Comput. Commun., 2016, no. May 2019. doi: 10.1109/CompComm.2016.7925139.
N. Sharma and B. Arora, “Review of Machine Learning Techniques for Network Traffic Classification,” pp. 1–7.
T. Bujlow, V. Carela-Español, and P. Barlet-Ros, “Independent comparison of popular DPI tools for traffic classification,” Computer Networks, vol. 76, pp. 75–89, 2015, doi: 10.1016/j.comnet.2014.11.001.
Z. Fan and R. Liu, “Investigation of Machine Learning Based Network Traffic Classification,” in 2017 Int. Symp. Wirel. Commun. Syst., 2017, pp. 1–6. doi: 10.1109/ISWCS.2017.8108090.
A. Vulpe, I. Girla, R. Craciunescu, and M. G. Berceanu, “Machine Learning based Software-Defined Networking Traffic Classification System,” 2021 IEEE International Black Sea Conference on Communications and Networking, BlackSeaCom 2021, May 2021, doi: 10.1109/BLACKSEACOM52164.2021.9527861.
C. Yu, J. Lan, J. C. Xie, and Y. Hu, “QoS-aware traffic classification architecture using machine learning and deep packet inspection in SDNs,” Procedia Computer Science, vol. 131, pp. 1209–1216, 2018, doi: 10.1016/J.PROCS.2018.04.331.
“6 concepts of Andrew NG’s book: ‘Machine Learning Yearning’ | by Niklas Donges | Towards Data Science.” https://towardsdatascience.com/6-concepts-of-andrew-ngs-book-machine-learning-yearning-abaf510579d4 (accessed Aug. 17, 2022).
H. Kim and N. Feamster, “Improving network management with software defined networking,” IEEE Communications Magazine, vol. 51, no. 2, pp. 114–119, 2013, doi: 10.1109/MCOM.2013.6461195.
P. Amaral, “Machine Learning in Software Defined Networks : Data Collection and Traffic Classification,” in 2016 IEEE 24th Int. Conf. Netw. Protoc., 2016, no. November. doi: 10.1109/ICNP.2016.7785327.
A. Shirmarz and A. Ghaffari, “Taxonomy of controller placement problem (CPP) optimization in Software Defined Network (SDN): a survey,” Journal of Ambient Intelligence and Humanized Computing, vol. 12, no. 12, pp. 10473–10498, Dec. 2021, doi: 10.1007/S12652-020-02754-W.
A. Shirmarz and A. Ghaffari, “Automatic Software Defined Network (SDN) Performance Management Using TOPSIS Decision-Making Algorithm,” Journal of Grid Computing, vol. 19, no. 2, Jun. 2021, doi: 10.1007/S10723-021-09557-Z.
A. Shirmarz and A. Ghaffari, “An adaptive greedy flow routing algorithm for performance improvement in a software‐defined network,” International numerical modeling: Electronic networks, Devices, and Fields-Wiley online library, vol. 33, no. 1, pp. 1–21, 2019, doi: 10.1002/jnm.2676.
A. Shirmarz and A. Ghaffari, “A novel flow routing algorithm based on non-dominated ranking and crowd distance sorting to improve the performance in SDN,” Photonic Network Communications, no. 0123456789, 2021, doi: 10.1007/s11107-021-00951-x.
A. Shrivastav, “Network Traffic Classification using Semi-Supervised Approach,” in IEEE Second Int. Conf. Mach. Learn. Comput. Netw., 2010, pp. 345–349. doi: 10.1109/ICMLC.2010.79.
M. Conti, S. Member, L. V. Mancini, R. Spolaor, and N. V. Verde, “Analyzing Android Encrypted Network Traffic to Identify User Actions,” in IEEE Trans. Inf. Forensics Secur., 2015, no. 1, pp. 114–125. doi: 10.1109/TIFS.2015.2478741.
H. Yao, C. Liu, P. Zhang, and S. Wu, “Identification of Encrypted Traffic Through Attention Mechanism Based Long Short Term Memory,” IEEE Trans. Big Data, vol. PP, no. XX, p. 1, 2019, doi: 10.1109/TBDATA.2019.2940675.
M. E. K. Yamansavascilar, Baris; M. Amac Guvensan, A. Gokhan Yavuz, “Application Identification via Network Traffic Classification.” doi: 10.1109/ICCNC.2017.7876241.
G. Draper-Gil, A. H. Lashkari, M. Saiful, I. Mamun, and A. A. Ghorbani, “Characterization of Encrypted and VPN Traffic Using Time-Related Features,” in 2nd Int. Conf. Inf. Syst. Secur. Priv. (ICISSP 2016), 2016, no. February, pp. 407–4414. doi: 10.5220/0005740704070414.
Z. Zou, J. Ge, H. Zheng, Y. Wu, C. Han, and Z. Yao, “Encrypted Traffic Classification with a Convolutional Long Short-Term Memory Neural Network,” in 2018 IEEE 20th Int. Conf. High Perform. Comput. Commun. IEEE 16th Int. Conf. Smart City; IEEE 4th Int. Conf. Data Sci. Syst., 2018, pp. 329–334. doi: 10.1109/HPCC/SmartCity/DSS.2018.00074.
H. Zhou, Y. Wang, X. Lei, and Y. Liu, “A method of improved CNN traffic classification,” in 2017 13th Int. Conf. Comput. Intell. Secur., 2017, no. 2. doi: 10.1109/CIS.2017.00046.
T. Auld, A. W. Moore, and S. F. Gull, “Bayesian Neural Networks for Internet Traffic Classification,” IEEE Trans. NEURAL NETWORKS, vol. 18, no. 1, pp. 223–239, 2017, doi: 10.1109/TNN.2006.883010.
M. Lopez-Martin, B. Carro, A. Sanchez-Esguevillas, and J. Lloret, “Network Traffic Classifier with Convolutional and Recurrent Neural Networks for Internet of Things,” IEEE Access, vol. 5, pp. 18042–18050, Sep. 2017, doi: 10.1109/ACCESS.2017.2747560.
M. Lotfollahi, M. J. Siavoshani, R. Shirali, H. Zade, and M. Saberian, “Deep packet : a novel approach for encrypted traffic classification using,” Soft Comput., vol. 24, no. 3, pp. 1999–2012, 2020, doi: 10.1007/s00500-019-04030-2.
O. Gervasi et al., “Personalized Service Degradation Policies on OTT Applications Based on the Consumption Behavior of Users,” in Comput. Sci. Its Appl. – ICCSA 2018, 2018, pp. 543–557. doi: 10.1007/978-3-319-95168-3_37.
O. Aouedi, “A Semi-supervised Stacked Autoencoder Approach for Network Traffic Classification”, doi: 10.1109/ICNP49622.2020.9259390.
M. S. I. M. Arash Habibi Lashkari, Gerard Draper-Gil and A. A. Ghorbani, “CICFlowMeter,” https://www.unb.ca/cic/research/applications.html.
S. Zou and F. Zhong, “Network Traffic Classification Based on Deep Learning,” 2018. doi: 10.1088/1742-6596/1087/6/062021.
D. P. Kingma and J. Lei Ba, “ADAM: A METHOD FOR STOCHASTIC OPTIMIZATION”.

No competing interests reported.

Download PDF

Version 1

posted

You are reading this latest preprint version

A Novel Deep Encrypted Network Traffic Discriminator in Software Defined Network (SDN)

Status:

Version 1

Abstract

Figures

1. Introduction

2. Related Work

3. The Proposed Classification Model

4. Experiment Evaluation

5. Conclusion

Declarations

References

Additional Declarations

Status:

Version 1