Research Article

Flipit Game Deception Strategy Selection Method Based on Deep Reinforcement Learning

Figure 4

The MFD-PPO algorithm architecture.