Research Article

Learning-Based Path Planning Algorithm in Ocean Currents for Multi-Glider

Algorithm 1

Learning-based path planning algorithm.
(i)Training phase (offline)
(1)for epoch 1 to T do
(2)for step 1 to M do
(3)Randomly select one batch of training data. To restrict the movement based on equation (9)–(11).
(4)Calculate the loss and its gradient based on equation (8).
(5)Update Doc-CNN .
(6)end for
(7)end for
 Planning phase (online)
(8)Receive the initial state ( constant)
change over time and location,
(9)while do
(10)Input into Doc-CNN and output .
(11)Using action based on , equation (7), (9)–(11).
(12)Update state to .
(13)end while
(14)Send path planning results to glider formation.