Research Article
Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning
| Method | lifetime | Lead time | | |
| PAQ-DQN | 2 | 0 | 5394.735 | 21.519 | | 1 | 2761.297 | 371.397 | PAQ-A2C | 2 | 0 | 5367.027 | 16.139 | | 1 | 2739.912 | 330.639 | Q-learning | 2 | 0 | 4437.220 | 87.911 | | 1 | | 0 |
|
|