Research Article

Flipit Game Deception Strategy Selection Method Based on Deep Reinforcement Learning

Figure 9

The reward of defender at each stages.