Research Article
Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning
Table 11
Results for dynamic pricing with positive lead time.
| Method | Lifetime | | | | |
| PAQ-DQN | 2 | 2761.297 | 2648.036 | 371.397 | 371.135 | 3 | 4832.157 | 4624.937 | 51.167 | 73.137 | 4 | 4917.960 | 4787.649 | 28.737 | 42.665 |
| PAQ-A2C | 2 | 2739.912 | 2492.352 | 330.639 | 329.539 | 3 | 4647.317 | 4490.300 | 94.006 | 102.618 | 4 | 4807.079 | 4638.502 | 84.877 | 84.269 |
|
|