Research Article

GMOM: An Offloading Method of Dependent Tasks Based on Deep Reinforcement Learning

Table 2

Training process of the GMOM model.

Algorithm GMOM

(1) Initialize the initial network parameter that the actor network and the critic network share randomly
(2) Initialize the parameter of the old actor network with
(3) For iteration = 1, 2, … do
(4)  for t = 1, 2, …, N do
(5)   for i = 1, 2, …, D do
(6)    the whole episode is collected with the old actor network, and the obtained data is stored in the experience pool D
(7)    calculate the GAE function value for each time step according to formula (14), get and cache it
(8)    calculate the value in each state according to formula (17) and get
(9)   end
(10)   for j = 1, 2, …, H do
(11)   sample batch size sample data to optimize the objective function, update the actor network
(12)   end
(13)   Synchronize the parameters of two actor networks, i.e.,
(14)  end
(15) end