Research Article
Research on Fresh Product Logistics Transportation Scheduling Based on Deep Reinforcement Learning
| Parameter | Parameter description | Value |
| Enc-net | Encoder parameters | | Dec-net | Decoder parameters | | | Discount rate | 1.0 | | Learning rate | 0.001 | | Learning rate | 0.002 | | | [0.1, 1.0] | C | Parameter update interval | 0 | | Capacity of experience pool D | 5000 |
|
|