Intercept Guidance of Maneuvering Targets with Deep Reinforcement Learning

<div>Comparison of convergence performance between the DDPG and TD3 model: (a) cumulative reward comparison of several different learning rates of DDPG, (b) miss-distance comparison of several different learning rates of DDPG, (c) cumulative reward comparison of several different learning rates of our guidance algorithm, (d) miss-distance comparison of several different learning rates of our guidance algorithm.</div>

International Journal of Aerospace Engineering

fig7

Figure 7

Figure 7: Intercept Guidance of Maneuvering Targets with Deep Reinforcement Learning