Research Article

Flipit Game Deception Strategy Selection Method Based on Deep Reinforcement Learning

Figure 8

Convergence performance of MFD-PPO under different hyper-parameters.