Research Article

Supervised Reinforcement Learning for ULV Path Planning in Complex Warehouse Environment

Figure 9: Average reward in a simple fixed-point environment.