Research Article

Flipit Game Deception Strategy Selection Method Based on Deep Reinforcement Learning

Table 2

The simulation parameters setting.

Simulation parametersValue

Attacker’s strategy [0.01–0.15]
Defender’s strategy [1–100]
Defender’s cost 5
Hyperparameter 0.2
Learning rate
Batch size128