Deep Q-Network with Predictive State Models in Partially Observable Domains

<div>Comparison of RPSR-DQN to model-free methods. Plots show the performance for three methods on all tasks: (a) CartPole-v1, (b) Swimmer-v1, and (c) Reacher-v1.</div>

Mathematical Problems in Engineering

Figure 3
