Research Article

Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning

Table 11

Results for dynamic pricing with positive lead time.

MethodLifetime

PAQ-DQN22761.2972648.036371.397371.135
34832.1574624.93751.16773.137
44917.9604787.64928.73742.665

PAQ-A2C22739.9122492.352330.639329.539
34647.3174490.30094.006102.618
44807.0794638.50284.87784.269