UAVs Maneuver Decision-Making Method Based on Transfer Reinforcement Learning
Table 1
The training method of 1vs1 confrontation maneuver decision-making based on transfer learning.
Step
Different scenario requirements from simple to difficult
1
Set that there is only one attack UAV in the battlefield environment and train the UAV to avoid collision with obstacles and boundaries until it can reach the target area
2
Use the strategy of the attack UAV in step 1 and add a defense UAV in the environment. The maneuverability of the defense UAV is not as good as that of the attack UAV. The defense UAV is trained to avoid collision with obstacles and boundaries, and we perform the task of intercepting and attacking the attack UAV
3
Use the strategy of the defense UAV trained in step 2. It is set that the attack UAV can detect the defense UAV in advance. Use the transfer strategy and the nontransfer strategy for training, respectively