Research Article

Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning

Table 8

Comparison for PAQ-DQN.

LifetimeLead time

402515753.9416013.562.450
12514544.0014911.0636.870
405015041.5515241.451.920
15014484.1814771.2197.6756.97