Research Article
Limiting Dynamics for Q-Learning with Memory One in Symmetric Two-Player, Two-Action Games
Table 1
All strategy pairs solve the Bellman equations self-consistently for the prisoner’s dilemma, their structural and behavioural types, the conditions for their existence, and the regions of Figure 2 in which they exist.
|