Research Article

Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning

Table 7

for the learned policies.

LifetimeLead time

402516013.5616525.7596.90
12514911.0615540.6295.95
405015421.4516197.4594.10
15014771.2115583.8994.79