Research Article
Joint Optimization for MEC Computation Offloading and Resource Allocation in IoV Based on Deep Reinforcement Learning
Table 2
Main hyperparameters of the De-DDPG.
| | Parameters | Value |
| | Size of the first hidden layer for actor and critic | 300 | | Size of the second hidden layer for actor and critic | 300 | | Learning rate of actor and critic / | 0.0001/0.001 | | Size of experienced memory | 20000 | | Parameters for OU noise | 0.15, 0.15, 0.10 | | Discount factor | 0.95 | | Penalty for failed task execution | 8 | | Total number of all episodes | 1000 | | Total time periods of one episode | 110 |
|
|