Research Article
Limiting Dynamics for Q-Learning with Memory One in Symmetric Two-Player, Two-Action Games
Figure 9
Diagram showing the structure of an interaction between the two players during a batch. The batch size is so that and remains fixed during the batch, while is updated.