Research Article

Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning

Table 12

Results.

MethodlifetimeLead time

PAQ-DQN205394.73521.519
12761.297371.397
PAQ-A2C205367.02716.139
12739.912330.639
Q-learning204437.22087.911
10