Research Article
Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning
Table 6
with fixed ordering cost.
| Method | Lifetime | Lead time | | (%) | | | |
|---|---|---|---|---|---|---|---|
| PAQ-DQN | 4 | 0 | 50 | 98 | 15041.55 | 16172.39 | 93.01 |
| PAQ-DQN | 4 | 1 | 50 | 98 | 14484.18 | 15576.99 | 92.98 |
| PAQ-DQN | 4 | 0 | 25 | 98 | 15753.94 | 16507.79 | 95.43 |
| PAQ-DQN | 4 | 1 | 25 | 98 | 14544.00 | 15436.53 | 94.22 |
| PAQ-A2C | 4 | 0 | 50 | 98 | 15116.77 | 16178.22 | 93.44 |
| PAQ-A2C | 4 | 1 | 50 | 98 | 14223.13 | 15423.03 | 92.22 |
| PAQ-A2C | 4 | 0 | 25 | 98 | 15813.49 | 16502.16 | 95.83 |
| PAQ-A2C | 4 | 1 | 25 | 98 | 14896.70 | 15858.32 | 93.94 |
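Several column labels of Table 6 did not survive in this copy. The values themselves suggest that the last column is the sixth column expressed as a percentage of the seventh. The following minimal sketch checks that reading against every row. It is an inference from the numbers only, not something the surviving header confirms; the function name `ratio_percent` and the tuple layout are illustrative assumptions.

```python
# Rows of Table 6 as:
# (method, lead time, 4th column, 6th column, 7th column, last column in %).
# The meaning of the unlabeled columns is NOT confirmed by the source header.
rows = [
    ("PAQ-DQN", 0, 50, 15041.55, 16172.39, 93.01),
    ("PAQ-DQN", 1, 50, 14484.18, 15576.99, 92.98),
    ("PAQ-DQN", 0, 25, 15753.94, 16507.79, 95.43),
    ("PAQ-DQN", 1, 25, 14544.00, 15436.53, 94.22),
    ("PAQ-A2C", 0, 50, 15116.77, 16178.22, 93.44),
    ("PAQ-A2C", 1, 50, 14223.13, 15423.03, 92.22),
    ("PAQ-A2C", 0, 25, 15813.49, 16502.16, 95.83),
    ("PAQ-A2C", 1, 25, 14896.70, 15858.32, 93.94),
]


def ratio_percent(numerator, denominator):
    """Return numerator as a percentage of denominator, rounded to 2 decimals."""
    return round(100.0 * numerator / denominator, 2)


# Collect rows whose recomputed percentage differs from the reported one
# by more than a rounding tolerance.
mismatches = [
    row for row in rows
    if abs(ratio_percent(row[3], row[4]) - row[5]) > 0.015
]

for method, lead_time, col4, col6, col7, reported in rows:
    print(method, lead_time, col4, ratio_percent(col6, col7), reported)

print("rows that disagree with the reported percentage:", len(mismatches))
```

If the assumed relationship holds, `mismatches` stays empty, so the last column can be recomputed from the two columns before it.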