Research Article

Diversity Evolutionary Policy Deep Reinforcement Learning

Table 2

Values of hyperparameter.

HyperparameterValues

Critic/actor learning rate0.0003
Critic/actor hidden layer2
Number of neurons400/300
Critic activationRelu
Actor activationTanh
Discount factor0.99
OptimizerAdam
Soft update coefficient0.005
Experience pool capacity106
Experience pool sample size100
Gauss noiseClip ((0, 0.2), −0.5, 0.5)