Research Article

Intercept Guidance of Maneuvering Targets with Deep Reinforcement Learning

Table 3

Policy and value function network architecture.

LayerPolicy networkValue network

Input layer12
Hidden13010
Hidden24015
Output11