Research Article

Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning

Table 5

for proposed algorithm and optimal policy.

MethodLifetime

PAQ-DQN25377.0215691.35510.47794.477
35514.7105691.2535.88496.898
45409.5875664.8038.50795.495

PAQ-A2C25385.6535662.3799.22495.113
35590.5595674.6252.80298.519
45428.0725677.2018.30495.612