Research Article

Limiting Dynamics for Q-Learning with Memory One in Symmetric Two-Player, Two-Action Games

Table 1

All strategy pairs solve the Bellman equations self-consistently for the prisoner’s dilemma, their structural and behavioural types, the conditions for their existence, and the regions of Figure 2 in which they exist.

Policy-pairTypeConditionsRegions

Sym, NCCAlways
Sym, NCC
Sym, CC