Research Article

Application of Reinforcement Learning in Multiagent Intelligent Decision-Making

Table 2: Regret matrix of the first round.

                      Player 2
Player 1      2 (2)     4 (1)     6 (1)
1 (4)         0, 0      2, 0      4, 0
3 (2)         1, 0      0, 1      2, 1
5 (1)         1, 2      1, 0      0, 1