Research Article

Supervised Reinforcement Learning for ULV Path Planning in Complex Warehouse Environment

Figure 12

Average reward in a complex fixed-point environment.