Research Article

Representation Enhancement-Based Proximal Policy Optimization for UAV Path Planning and Obstacle Avoidance

Figure 13

Instability of per-episode cumulative reward.
(a) Instability in Scenario A
(b) Instability in Scenario B