Research Article

Learning Attentional Communication with a Common Network for Multiagent Reinforcement Learning

Table 1

Training parameters of the MAACCN algorithm.

ParametersValueDescriptions

Lr0.0005The learning rate
Epsilon1Probability of exploration
Min_epsilon0.05Minimum probability of exploration
Anneal_steps50000The annealing steps of exploration
T_max2000000The total step size of training
N_episodes1The number of episodes sampled at an epoch
Evaluate_cycle100The interval of the evaluation cycle
Evaluate_epoch32Frequency of evaluation
Batch_size32The batch data size for training
Buffer_size5000The size of the buffer
Target_update_cycle200The update interval of the target network
hidden_dim64The dimension of a hidden layer
Head8The number of the multihead