International Journal of Intelligent Systems

Research Article

Flipit Game Deception Strategy Selection Method Based on Deep Reinforcement Learning

Convergence performance of MFD-A2C and MFD-PPO algorithms.