Research Article

Suppressing Uncommanded Roll-Yaw Motion by Jet Flow Control Based on Reinforcement Learning

Table 1

TD3 hyperparameters.

ParameterValue

OptimizerAdam [19]
Number of hidden layers (all networks)2
Number of hidden units per layer256
Critic learning rate
Actor learning rate
Discount factor ()0.99
Exploration noise0.1
Policy noise0.2
Range to clip policy noise0.5
Target smoothing coefficient ()0.005
Number of samples per minibatch256
Policy update frequency2
Activation functionReLU (rectified linear unit) [20]