Complexity

Research Article

Deep Reinforcement Learning for UAV Intelligent Mission Planning

Learning curves for the fighter-jammer scenario. The shaded region represents the standard deviation of average evaluation over three trails.