Research Article
Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning
Table 13
for proposed algorithm and optimal policy.
| Method | Lifetime | | | | |
| PAQ-DQN | 2 | 5349.735 | 5670.955 | 10.707 | 94.336 | 3 | 5574.282 | 5693.097 | 3.960 | 97.913 | 4 | 5435.068 | 5685.511 | 8.348 | 95.595 | PAQ-A2C | 2 | 5367.027 | 5695.111 | 10.936 | 94.239 | 3 | 5627.395 | 5686.337 | 1.964 | 98.963 | 4 | 5504.487 | 5694.402 | 6.330 | 96.665 |
|
|