Wireless Communications and Mobile Computing

Research Article

Resource Allocation in UAV-Assisted Wireless Powered Communication Networks for Urban Monitoring

MJDDPG-based resource allocation algorithm.

Initialize the weights of the main networks and the target networks;
Initialize the experience replay buffer, the exploration variance, and the action exploration probability;
for each training episode do
for each time step in the episode do
Update the environment state and observe the current state and record and ;
Select an action based on the current state;
Execute the action, update the state, and calculate the reward value;
Store the experience tuple in the experience buffer;
if the buffer is full then
Randomly sample a mini-batch from the experience buffer;
Compute the target network values;
Update the critic network by minimizing the critic loss;
Update the actor network along the policy gradient, maximizing the critic's Q-value estimate;
Update the target network parameters;
Update the action exploration parameters;
end if
end for
end for
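
The loop structure above can be illustrated with a minimal, generic DDPG-style sketch in pure Python. It is not the authors' MJDDPG implementation: the toy environment (`toy_env_step`), the linear actor and critic, the feature map, the soft-update rule, and all hyperparameter values are assumptions chosen only to keep the example self-contained. The sketch follows the same order of steps: act with exploration noise, store the experience, and, once the buffer is full, sample a mini-batch, compute target values, update the critic and actor, soft-update the target parameters, and decay the exploration noise.

```python
import random
from collections import deque

# Assumed hyperparameters (illustrative, not taken from the paper).
GAMMA, TAU, LR_ACTOR, LR_CRITIC = 0.9, 0.01, 0.001, 0.01

def features(s, a):
    # Hand-picked features for a linear critic (a stand-in for a neural network).
    return [1.0, s, a, s * a, a * a]

def q_value(theta, s, a):
    return sum(t * f for t, f in zip(theta, features(s, a)))

def policy(w, s):
    # Linear deterministic actor, clipped to the action range [-1, 1].
    return max(-1.0, min(1.0, w[0] * s + w[1]))

def soft_update(target, main, tau=TAU):
    # target <- tau * main + (1 - tau) * target
    return [tau * m + (1.0 - tau) * t for m, t in zip(main, target)]

def toy_env_step(s, a, rng):
    # Hypothetical environment: reward peaks at a = 0.5 * s; next state is random.
    return rng.uniform(-1.0, 1.0), -(a - 0.5 * s) ** 2

def train(episodes=20, steps=50, buffer_size=64, batch_size=16, seed=0):
    rng = random.Random(seed)
    actor, critic = [0.0, 0.0], [0.0] * 5            # main networks
    actor_t, critic_t = list(actor), list(critic)    # target networks
    buffer = deque(maxlen=buffer_size)               # experience replay buffer
    sigma = 0.5                                      # exploration std (assumed)
    for _ in range(episodes):
        s = rng.uniform(-1.0, 1.0)
        for _ in range(steps):
            # Select an action from the current policy plus Gaussian noise.
            a = max(-1.0, min(1.0, policy(actor, s) + rng.gauss(0.0, sigma)))
            s_next, r = toy_env_step(s, a, rng)
            buffer.append((s, a, r, s_next))
            s = s_next
            if len(buffer) == buffer.maxlen:         # learn only once the buffer is full
                batch = rng.sample(list(buffer), batch_size)
                # Critic: one gradient step on the mean squared TD error.
                grad = [0.0] * 5
                for bs, ba, br, bs2 in batch:
                    y = br + GAMMA * q_value(critic_t, bs2, policy(actor_t, bs2))
                    td = q_value(critic, bs, ba) - y
                    for i, f in enumerate(features(bs, ba)):
                        grad[i] += td * f / batch_size
                critic = [c - LR_CRITIC * g for c, g in zip(critic, grad)]
                # Actor: gradient ascent on Q(s, pi(s)); the clip is treated
                # as the identity in the gradient (a simplification).
                gw, gb = 0.0, 0.0
                for bs, _, _, _ in batch:
                    a_pi = policy(actor, bs)
                    dq_da = critic[2] + critic[3] * bs + 2.0 * critic[4] * a_pi
                    gw += dq_da * bs / batch_size
                    gb += dq_da / batch_size
                actor = [actor[0] + LR_ACTOR * gw, actor[1] + LR_ACTOR * gb]
                # Soft-update the targets, then decay the exploration noise.
                actor_t = soft_update(actor_t, actor)
                critic_t = soft_update(critic_t, critic)
                sigma = max(0.05, sigma * 0.999)
    return actor, critic, actor_t, critic_t, sigma, len(buffer)
```

Calling `train()` returns the main and target parameters, the final exploration standard deviation, and the buffer length. In a full implementation the linear models would be replaced by neural networks and the toy environment by the UAV network simulator.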