Research Article

Intercept Guidance of Maneuvering Targets with Deep Reinforcement Learning

Table 2

Hyperparameters for TD3 algorithm.

ParameterValue

Number of hidden layers2
BATCH_SIZE32
Replay buffer size50000
Actor learning rate10-5
Critic learning rate
Policy noise0.2
Noise bound0.5
Soft update factor 0.01
Discounting factor γ0.95
Delay steps5
Gradient optimizerAdam