Research Article

Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning

Table 6

with fixed ordering cost.

MethodLifetimeLead time (%)

PAQ-DQN40509815041.5516172.3993.01
1509814484.1815576.9992.98
0259815753.9416507.7995.43
1259814544.0015436.5394.22

PAQ-A2C40509815116.7716178.2293.44
1509814223.1315423.0392.22
0259815813.4916502.1695.83
1259814896.7015858.3293.94