Research Article
Flipit Game Deception Strategy Selection Method Based on Deep Reinforcement Learning
Table 2
The simulation parameters setting.
| Simulation parameters | Value |
| Attacker’s strategy | [0.01–0.15] | Defender’s strategy | [1–100] | Defender’s cost | 5 | Hyperparameter | 0.2 | Learning rate | | Batch size | 128 |
|
|