Research Article

Application of Reinforcement Learning in Multiagent Intelligent Decision-Making

Table 1

Definition of Q value under different algorithms.

SingleNashRegret

Q
Updated QLargest value under the next stateProduct of agent’s united Nash strategy and Q valueQ value of minimum action under next state’s regret value