Research Article

Limiting Dynamics for Q-Learning with Memory One in Symmetric Two-Player, Two-Action Games

Figure 10

Phase diagram for the possible BRNs in the prisoner’s dilemma for different values of the parameter: 0.1 (a), 0.3 (b), 0.75 (c), and 0.99 (d).