Research Article

Supervised Reinforcement Learning for ULV Path Planning in Complex Warehouse Environment

Figure 7

The collision times per episode in a dynamic environment.