Research Article
Unbiased Model-Agnostic Metalearning Algorithm for Learning Target-Driven Visual Navigation Policy
Figure 3
The steps-dependent learning curves of our MAML model without inequality minimization in metatesting phase. The X-axis indicates the number of moving steps taken; the Y-axis indicates the mean trajectory length of current episode as agent explores.