Wireless Communications and Mobile Computing

Research Article

UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient

Control of an agent under the reinforcement learning model.