Research Article

Federated Reinforcement Learning-Based UAV Swarm System for Aerial Remote Sensing

Table 3. Hyperparameters and values used for learning.

Hyperparameter                     Value

Actor network dimension            16 × 256 × 256 × 256 × 5
Critic network dimension           16 × 256 × 256 × 256 × 5
Minibatch size                     5
Number of epochs                   4
Learning rate                      0.0003
Horizon value                      20
Generalized advantage estimator    0.95
Discount factor gamma              0.99
Clipping parameter                 0.2
Value function coefficient         0.5
Optimizer algorithm                Adam
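
The following minimal sketch shows how the values in Table 3 might be collected into a configuration object and used in two typical places. It assumes a clipped policy-gradient learner in the style of PPO, which the clipping parameter, value function coefficient, horizon, and advantage estimator entries suggest but the table itself does not state. Reading the network dimension entries as layer sizes 16, 256, 256, 256, 5 is also an assumption. The names `Hyperparameters`, `compute_gae`, and `clipped_surrogate` are hypothetical and do not come from the article.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Hyperparameters:
    """Values from Table 3; the field names are illustrative only."""
    # Assumption: the dimension entry lists layer sizes from input to output.
    actor_dims: Tuple[int, ...] = (16, 256, 256, 256, 5)
    critic_dims: Tuple[int, ...] = (16, 256, 256, 256, 5)
    minibatch_size: int = 5
    num_epochs: int = 4
    learning_rate: float = 3e-4
    horizon: int = 20
    gae_lambda: float = 0.95  # generalized advantage estimator value
    gamma: float = 0.99       # discount factor
    clip_param: float = 0.2
    value_coef: float = 0.5
    optimizer: str = "Adam"


def compute_gae(rewards: List[float], values: List[float],
                last_value: float, hp: Hyperparameters) -> List[float]:
    """Generalized advantage estimation over one rollout.

    Assumes no episode ends inside the rollout; last_value is the
    critic's estimate for the state that follows the final step.
    """
    advantages = [0.0] * len(rewards)
    gae = 0.0
    next_value = last_value
    # Walk backwards so each step can reuse the accumulated estimate.
    for t in reversed(range(len(rewards))):
        delta = rewards[t] + hp.gamma * next_value - values[t]
        gae = delta + hp.gamma * hp.gae_lambda * gae
        advantages[t] = gae
        next_value = values[t]
    return advantages


def clipped_surrogate(ratio: float, advantage: float,
                      hp: Hyperparameters) -> float:
    """PPO-style clipped objective for one sample (to be maximized)."""
    clipped = min(max(ratio, 1.0 - hp.clip_param), 1.0 + hp.clip_param)
    return min(ratio * advantage, clipped * advantage)
```

Under these assumptions, a rollout of `horizon` steps would be passed through `compute_gae`, then split into minibatches of `minibatch_size` and reused for `num_epochs` passes, with the critic loss weighted by `value_coef`.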