Research Article

Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning

Table 3

Results for dynamic pricing with positive lead time.

MethodLifetime

PAQ-DQN23242.1042911.139254.905291.997
34734.7664455.81440.439110.146
44840.4434714.71234.06628.642

PAQ-A2C23081.5462305.394232.000573.749
34513.7864474.915207.470130.278
44919.3322278.32449.802748.916