Research Article
Diversity Evolutionary Policy Deep Reinforcement Learning
Table 2
Hyperparameter values.
| Hyperparameter | Value |
| --- | --- |
| Critic/actor learning rate | 0.0003 |
| Critic/actor hidden layers | 2 |
| Number of neurons | 400/300 |
| Critic activation | ReLU |
| Actor activation | Tanh |
| Discount factor | 0.99 |
| Optimizer | Adam |
| Soft update coefficient | 0.005 |
| Experience pool capacity | 10^6 |
| Experience pool sample size | 100 |
| Gaussian noise | clip(N(0, 0.2), −0.5, 0.5) |
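The settings in Table 2 can be collected into a single configuration object. The following is a minimal sketch, not the authors' implementation: the class name `Hyperparameters`, its field names, and the helpers `clipped_gaussian_noise` and `soft_update` are hypothetical. It assumes that the experience pool capacity is 10^6, that 0.2 is the standard deviation of the Gaussian noise, and that the soft update coefficient is used in the usual Polyak-averaging form.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Hyperparameters:
    """Values taken from Table 2; the field names are illustrative."""
    learning_rate: float = 3e-4       # shared by critic and actor
    hidden_layers: int = 2
    hidden_units: tuple = (400, 300)  # neurons in the two hidden layers
    critic_activation: str = "relu"
    actor_activation: str = "tanh"
    discount: float = 0.99
    optimizer: str = "adam"
    tau: float = 0.005                # soft update coefficient
    buffer_capacity: int = 10**6      # assumption: the table entry means 10^6
    batch_size: int = 100             # experience pool sample size
    noise_scale: float = 0.2          # assumption: standard deviation
    noise_clip: float = 0.5


def clipped_gaussian_noise(hp, rng):
    """Draw one sample of clip(N(0, noise_scale), -noise_clip, noise_clip)."""
    noise = rng.gauss(0.0, hp.noise_scale)
    return max(-hp.noise_clip, min(hp.noise_clip, noise))


def soft_update(target, online, tau):
    """Polyak averaging: target <- (1 - tau) * target + tau * online."""
    return [(1.0 - tau) * t + tau * o for t, o in zip(target, online)]


if __name__ == "__main__":
    hp = Hyperparameters()
    rng = random.Random(0)
    print(clipped_gaussian_noise(hp, rng))
    print(soft_update([0.0, 1.0], [1.0, 1.0], hp.tau))
```

Keeping the values in one frozen dataclass makes it harder for a training script to drift from the reported settings, and the two helpers show how the noise and soft update entries would typically be consumed.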