Research Article

Flipit Game Deception Strategy Selection Method Based on Deep Reinforcement Learning

Figure 12

The delay comparison of MFD-PPO, MFD-A2C and MFD-PSO.