Research Article
Application of Reinforcement Learning in Multiagent Intelligent Decision-Making
Table 1
Definition of Q value under different algorithms.
| | Single | Nash | Regret |
| Q | | | | Updated Q | Largest value under the next state | Product of agent’s united Nash strategy and Q value | Q value of minimum action under next state’s regret value |
|
|