Research Article

Deep Reinforcement Learning-Based Joint Satellite Scheduling and Resource Allocation in Satellite-Terrestrial Integrated Networks

Table 2

Parameters of the PSDDQN algorithm.

Parameter                                    Value

Batch size                                   32
Learning rate α                              0.005
Number of leaves of SumTree structure        2000
Number of neurons in input layer             156
Number of neurons in hidden layers           64, 32
Number of neurons in output layer            24
Discount rate γ                              0.9
Activation function                          ReLU
Optimizer                                    Gradient descent optimizer
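
To make the settings in Table 2 concrete, the following is a minimal sketch of a Q-network with the listed layer sizes (156, 64, 32, 24) and ReLU activations, together with a one-step target that uses the discount rate γ = 0.9. It is an illustration under stated assumptions, not the authors' implementation: the names PSDDQNConfig, init_network, q_values, count_parameters, and td_target are hypothetical, the weight initialization and the linear output layer are assumptions, and the target assumes that "DDQN" in the algorithm name refers to the double DQN update (the online network selects the action, the target network evaluates it). The SumTree replay memory and the gradient descent update are not shown.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class PSDDQNConfig:
    """Hyperparameters copied from Table 2 (class name is hypothetical)."""
    batch_size: int = 32
    learning_rate: float = 0.005          # alpha
    sumtree_leaves: int = 2000            # capacity of the SumTree structure
    layer_sizes: tuple = (156, 64, 32, 24)  # input, two hidden, output
    discount: float = 0.9                 # gamma


def init_network(sizes, rng):
    """Return one (weights, biases) pair per dense layer.

    The Gaussian scale below is an assumption; the source does not
    state how the weights are initialized.
    """
    layers = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        scale = (2.0 / n_in) ** 0.5
        weights = [[rng.gauss(0.0, scale) for _ in range(n_in)]
                   for _ in range(n_out)]
        biases = [0.0] * n_out
        layers.append((weights, biases))
    return layers


def q_values(layers, state):
    """Forward pass: ReLU on hidden layers, linear output (assumed)."""
    x = list(state)
    for i, (weights, biases) in enumerate(layers):
        x = [sum(w * v for w, v in zip(row, x)) + b
             for row, b in zip(weights, biases)]
        if i < len(layers) - 1:
            x = [max(0.0, v) for v in x]
    return x


def count_parameters(sizes):
    """Number of trainable weights and biases in the dense network."""
    return sum(n_in * n_out + n_out
               for n_in, n_out in zip(sizes[:-1], sizes[1:]))


def td_target(reward, next_q_online, next_q_target, discount, done=False):
    """Double-DQN style target (assumed): the online network picks the
    greedy next action and the target network supplies its value."""
    if done:
        return reward
    best = max(range(len(next_q_online)), key=next_q_online.__getitem__)
    return reward + discount * next_q_target[best]


if __name__ == "__main__":
    cfg = PSDDQNConfig()
    net = init_network(cfg.layer_sizes, random.Random(0))
    q = q_values(net, [0.0] * cfg.layer_sizes[0])
    print(len(q), count_parameters(cfg.layer_sizes))
```

With these layer sizes the network has 156·64 + 64 + 64·32 + 32 + 32·24 + 24 = 12920 trainable parameters, and each forward pass returns 24 Q-values, one per output neuron.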