Complexity

Research Article

Learning-Based Path Planning Algorithm in Ocean Currents for Multi-Glider

Learning-based path planning algorithm.

(i)	Training phase (offline)
(1)	for epoch 1 to T do
(2)	for step 1 to M do
(3)	Randomly select one batch of training data. To restrict the movement based on equation (9)–(11).
(4)	Calculate the loss and its gradient based on equation (8).
(5)	Update Doc-CNN .
(6)	end for
(7)	end for
Planning phase (online)
(8)	Receive the initial state ( constant)
change over time and location,

(9)	while do
(10)	Input into Doc-CNN and output .
(11)	Using action based on , equation (7), (9)–(11).
(12)	Update state to .
(13)	end while
(14)	Send path planning results to glider formation.