Research Article
Deep Q-Network with Predictive State Models in Partially Observable Domains
Figure 3
Comparison of RPSR-DQN to model-free methods. Plots show the performance for three methods on all tasks: (a) CartPole-v1, (b) Swimmer-v1, and (c) Reacher-v1.
(a) |
(b) |
(c) |