Unbiased Model-Agnostic Metalearning Algorithm for Learning Target-Driven Visual Navigation Policy

<div>The learning curves of our initial MAML model Ours(loc)/Ours(glo) and UMAML model Ours(UM-loc)/Ours(UM-glo) applied into untrained bedroom scenes. The results demonstrate that Ours(UM-loc) and Ours(UM-glo) all achieve better performances than Ours(loc) and Ours(glo), as the UMAML initial models explicitly minimize the inequality of losses over sampled tasks.</div>

Computational Intelligence and Neuroscience

fig4

Figure 4

Figure 4: Unbiased Model-Agnostic Metalearning Algorithm for Learning Target-Driven Visual Navigation Policy