Research Article

A Multiphase Semistatic Training Method for Swarm Confrontation Using Multiagent Deep Reinforcement Learning

Table 5

The main parameters of training.

Hyperparameters

Batch size2048Buffer size20480Learning rate3.0e − 05Beta0.01
Epsilon0.2Lambda0.95Num epoch3Time horizon128

Network settingReward signals
Hidden units512Num layers3Gamma0.99Strength1