Deep Q learning is an efficient RL algorithm for learning an environment quickly. Its main advantage is that it learns from batches of experience; as a result, it can predict Q values more quickly and can be applied to larger networks. Here, the authors have designed Deep Q learning for underwater networks using the ns3-ai framework. The details of the implementation are discussed in this section.
A. Deep Q Learning
Deep Q learning uses a neural network to approximate Q values. The sensor's current state is provided as input, and the Q value for every possible action is produced as output. Fig. 2 portrays the difference between the Q values generated by Q learning and Deep Q learning.
Deep Q learning uses the temporal-difference update equation [16] to predict the Q value of every state–action pair. Eq. 1 is used in the design of Deep Q learning for underwater networks.
$$Q\left(s_{t}, a_{t}\right) \leftarrow Q\left(s_{t}, a_{t}\right) + \alpha \left[R_{t+1} + \gamma \max_{a} Q\left(s_{t+1}, a\right) - Q\left(s_{t}, a_{t}\right)\right] \tag{1}$$
Q(st, at) – Q value of the current state and action
α – learning rate
γ – discount factor
Rt+1 – reward earned
Q(st+1, a) – Q value of the next state and action
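As a concrete illustration, the short sketch below applies one tabular update of Eq. 1; the state names and numeric values are hypothetical, chosen only to make the arithmetic visible.

```python
# One tabular update of Eq. 1 (all values are illustrative).
alpha = 0.1   # learning rate
gamma = 0.9   # discount factor

Q = {("s0", "a0"): 0.5, ("s1", "a0"): 1.0, ("s1", "a1"): 2.0}
reward = 1.0  # R_{t+1} earned after taking a0 in s0

# max_a Q(s_{t+1}, a): best Q value reachable from the next state s1
max_next = max(Q[("s1", a)] for a in ("a0", "a1"))  # = 2.0

# Q(s0, a0) <- Q(s0, a0) + alpha * [R_{t+1} + gamma * max_next - Q(s0, a0)]
Q[("s0", "a0")] += alpha * (reward + gamma * max_next - Q[("s0", "a0")])
print(Q[("s0", "a0")])  # 0.5 + 0.1 * (1.0 + 0.9 * 2.0 - 0.5) = 0.73
```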
Steps in the Deep Q network:
1. Preprocess the state of the sensor node and feed it to the deep neural network, which returns the Q values of all possible actions in that state.
2. Select the action using an epsilon-greedy policy or geo-location, as described in [3].
3. Perform the action selected in step 2 in state s and move to state s'. This becomes the preprocessed state of the next sensor node. Store this transition in the buffer as <s, a, r, s'>.
4. Use this replay (experience) buffer to train the model; a uniformly distributed sample is drawn from the buffer for learning.
5. Calculate the loss after training the model, as in Eq. 2.
$$loss = \left(r + \gamma \max_{a^{\prime}} Q\left(s^{\prime}, a^{\prime}; \theta^{\prime}\right) - Q\left(s, a; \theta\right)\right)^{2} \tag{2}$$
This represents the squared difference between the target Q value and the predicted Q value, where θ' denotes the target network weights and θ the current network weights.
6. After every n iterations, update the network weights as calculated from the loss function (Eq. 2).
7. Repeat the above steps for M episodes.
Using this procedure, Deep Q learning predicts Q values for every state.
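The sketch below is one minimal way to realize steps 1, 5, and 6 in PyTorch: a small network approximates Q(s, a; θ), a delayed copy holds the target parameters θ', and the loss is the squared TD error of Eq. 2. The layer sizes and state encoding are assumptions for illustration, not values taken from the paper.

```python
import torch
import torch.nn as nn

class QNetwork(nn.Module):
    """Maps a sensor-state vector to one Q value per action (step 1)."""
    def __init__(self, state_dim: int, n_actions: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, 64), nn.ReLU(),
            nn.Linear(64, n_actions),
        )

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        return self.net(state)

def dqn_loss(policy: QNetwork, target: QNetwork,
             s: torch.Tensor, a: torch.Tensor, r: torch.Tensor,
             s_next: torch.Tensor, gamma: float = 0.9) -> torch.Tensor:
    """Squared TD error of Eq. 2: (r + gamma * max_a' Q(s',a'; theta') - Q(s,a; theta))^2."""
    q_sa = policy(s).gather(1, a.unsqueeze(1)).squeeze(1)         # Q(s, a; theta)
    with torch.no_grad():                                         # theta' held fixed
        target_q = r + gamma * target(s_next).max(dim=1).values  # target of Eq. 2
    return ((target_q - q_sa) ** 2).mean()
```

For step 6, the target parameters θ' can be refreshed every n iterations with `target.load_state_dict(policy.state_dict())`.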
B. Deep Q Based Routing
Deep Q based routing is implemented using ns3-ai. The underwater environment is simulated using the ns3 simulator. The parameters, sensor states, and environment created by the ns3 simulator are given as input to the Deep Q learning algorithm, which analyzes these parameters to approximate Q values for the corresponding state. Algorithm 1 describes Deep Q based routing for finding an optimal path between source and destination.
Algorithm 1: Deep Q Based Routing
Step 1: Simulate a 1500 m × 1500 m underwater environment in ns3
Step 2: Initialize environment parameters such as bandwidth, delay, duration, and number of source nodes
Step 3: Pass the environment and its parameters to the RL algorithm (Python side) through the message interface
Step 4: Run the DQN agent:
  Initialize s, a, r, s'
  Store the initialized values as a transition
  If the number of stored transitions reaches the memory capacity:
    Train the neural network
  Else:
    Initialize s as the source state
    Use congestion window, segments acknowledged, and bytes in flight as observation parameters
    Calculate the reward: reward = segments acknowledged − bytes in flight
    Select the action using Eq. 1 in the neural network
    Find the next state s'
    Store the transition (s, a, r, s') in the memory buffer
Step 5: Repeat the above steps for M episodes
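The inner branch of Algorithm 1 (reward calculation and action selection) can be sketched as follows; `q_network` is assumed to be a model such as the QNetwork above, and the epsilon value is illustrative.

```python
import random
import torch

def select_action(q_network, state: torch.Tensor,
                  epsilon: float, n_actions: int) -> int:
    """Epsilon-greedy choice over the Q values predicted for the current state."""
    if random.random() < epsilon:
        return random.randrange(n_actions)            # explore: random next hop
    with torch.no_grad():
        return int(q_network(state).argmax().item())  # exploit: argmax_a Q(s, a)

def compute_reward(segments_acked: int, bytes_in_flight: int) -> int:
    """Reward from Algorithm 1: segments acknowledged minus bytes in flight."""
    return segments_acked - bytes_in_flight
```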
The ns3-ai framework is used to simulate the algorithm described above. It contains two interfaces: ns3 and Python (i.e., the RL algorithm). A 1500 m × 1500 m underwater environment is created using ns3, and acoustic sensor nodes are created using Aqua-Sim NG. Node IDs and socket IDs are created as part of initialization, and environment parameters are set according to Table 1. Data and events are generated at the source end. To route the data to the destination (sink) node, Deep Q learning is used to find the next optimal node. As part of the ns3-ai framework, the state of the environment and its parameters are passed to the Python interface using a message-passing mechanism. On the RL side, the Deep Q learning agent analyzes the parameters received from ns3 and finds the optimal path.
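The ns3-ai examples typically declare the data exchanged over shared memory as a packed ctypes structure on the Python side, mirroring the C++ struct in ns3. The sketch below follows that convention; the field names are illustrative placeholders derived from the observation parameters named above, not the authors' exact struct.

```python
from ctypes import Structure, c_int32, c_uint32

class UnderwaterEnv(Structure):
    """Observation written by ns3 into shared memory (illustrative fields)."""
    _pack_ = 1
    _fields_ = [
        ("nodeId", c_uint32),          # id of the sensor node holding the packet
        ("socketId", c_uint32),
        ("cWnd", c_uint32),            # congestion window
        ("segmentsAcked", c_uint32),
        ("bytesInFlight", c_uint32),
    ]

class UnderwaterAct(Structure):
    """Action written back by the Python (ai) side (illustrative fields)."""
    _pack_ = 1
    _fields_ = [
        ("nextNode", c_int32),         # node chosen as the next hop
    ]
```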
In Deep Q learning, every transition is stored in the buffer as a quadruple (s, a, r, s'): s represents the state, here the source node that sends the data; a represents the action, here the node to which the data is to be transmitted; r represents the reward earned by the node for choosing the appropriate action; and s' represents the next state, here the next node to which the data is forwarded. Let us discuss how Deep Q learning finds the appropriate action. Initially, the parameters of the RL environment (state, action, reward, and next state) are initialized, and every transition is stored in the buffer. If the number of transitions exceeds the defined capacity, the neural network takes a uniformly distributed sample from the stored transitions and uses it to train the model with Eq. 2. Otherwise, using observation parameters such as segments acknowledged, segment size, and bytes in flight, the Q values and action are calculated using Eq. 1. The appropriate reward is then calculated as discussed in the algorithm, and the next state is identified based on the selected action. This transition is stored in the buffer for further use.
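A minimal replay buffer matching this description is sketched below: transitions are appended as (s, a, r, s') quadruples, and training is triggered once the number of stored transitions reaches the defined capacity, with samples drawn uniformly.

```python
import random
from collections import deque

class ReplayBuffer:
    """Stores (s, a, r, s') transitions and serves uniform minibatches."""
    def __init__(self, capacity: int):
        self.capacity = capacity
        self.buffer = deque(maxlen=capacity)

    def store(self, s, a, r, s_next):
        self.buffer.append((s, a, r, s_next))

    def ready(self) -> bool:
        # Learning starts only when the number of stored transitions
        # reaches the defined capacity, as described above.
        return len(self.buffer) >= self.capacity

    def sample(self, batch_size: int):
        # Uniformly distributed sample of stored transitions.
        return random.sample(self.buffer, batch_size)
```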
The Q value, along with the action, is stored in the shared memory pool shared by the ns3 and ai interfaces. The ns3 simulator then reads the value stored in shared memory and uses it to perform the action on the source node. The source node forwards the data to the node suggested by the Deep Q learning algorithm (ai interface). The updated parameters are again sent to the ai interface through the shared memory pool, and the process is repeated for M episodes.
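Putting the pieces together, the exchange over the shared memory pool can be sketched as the episode loop below. `read_env`, `write_action`, and `to_state_tensor` are hypothetical placeholders for the ns3-ai shared-memory accessors and a state encoder; the other helpers reuse the sketches above.

```python
# Hypothetical episode loop on the Python (ai) side. read_env(), write_action(),
# and to_state_tensor() are placeholders for the ns3-ai shared-memory accessors
# and a state encoder; q_network, select_action, compute_reward, and buffer
# reuse the earlier sketches.
M = 100          # number of episodes (illustrative)
N_ACTIONS = 4    # number of candidate next-hop nodes (illustrative)

for episode in range(M):
    env = read_env()                          # state written by ns3 into shared memory
    s = to_state_tensor(env)
    a = select_action(q_network, s, epsilon=0.1, n_actions=N_ACTIONS)
    write_action(a)                           # ns3 forwards the data to the chosen node
    env_next = read_env()                     # updated parameters sent back by ns3
    r = compute_reward(env_next.segmentsAcked, env_next.bytesInFlight)
    buffer.store(s, a, r, to_state_tensor(env_next))
    if buffer.ready():                        # learn once capacity is reached
        batch = buffer.sample(batch_size=32)
        # ...compute the Eq. 2 loss on the batch and update theta...
```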