Research Article
Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning
Table 5
for proposed algorithm and optimal policy.
| Method | Lifetime | | | | |
| PAQ-DQN | 2 | 5377.021 | 5691.355 | 10.477 | 94.477 | 3 | 5514.710 | 5691.253 | 5.884 | 96.898 | 4 | 5409.587 | 5664.803 | 8.507 | 95.495 |
| PAQ-A2C | 2 | 5385.653 | 5662.379 | 9.224 | 95.113 | 3 | 5590.559 | 5674.625 | 2.802 | 98.519 | 4 | 5428.072 | 5677.201 | 8.304 | 95.612 |
|
|