Research Article

Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning

Table 13

for proposed algorithm and optimal policy.

MethodLifetime

PAQ-DQN25349.7355670.95510.70794.336
35574.2825693.0973.96097.913
45435.0685685.5118.34895.595
PAQ-A2C25367.0275695.11110.93694.239
35627.3955686.3371.96498.963
45504.4875694.4026.33096.665