[Retracted] Intelligent Offloading Decision and Resource Allocations Schemes Based on RNN/DQN for Reliability Assurance in Software-Defined Massive Machine-Type Communications
Algorithm 1
SDN-based DQN in MTG resource adjustment and MTG selections.
[DQN-based MTG Resource Adjustment]
(1)
Initial the main, target parameters, and replay buffer, and respectively.
Define number of episodes
(2)
for each step in the episodes, then
(3)
State observation
(4)
DQNagent selects action based on the optimal policy
(5)
Action selection and explore next state and obtain the reward
At each time slot , SDN controller executes the action
(6)
if the size of the size of replay buffer
cache into the replay buffer
(7)
else
Replace queue tail element with the current as the FIFO process.
(8)
End if
Transition to next network state
(9)
Random mini-batch of samples from replay buffer
(10)
Compute the target network value:
(11)
(12)
Compute and minimize the loss:
(13)
(14)
Update the target network , based on the updated :