Research Article

Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning

Table 4

Results.

MethodlifetimeLead time

PAQ-DQN205377.02118.747
13242.104254.905

PAQ-A2C205385.65321.222
13081.546232.000

Q-learning204800.62231.376
1592.721151.462