Research Article

Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning

Table 2

The parameter values.

ParameterValuesDescription

30Periods of the planning horizon
{2,3,4}Lifetime of perishables
0Initial inventory position of all ages
0.9Discount factor
{0, 1, 2}Lead time
{25, 50}Fixed ordering cost
22.5Unit variable cost
0.22Unit holding cost
10.78Unit penalty cost
10unit disposal cost
Function of selling price
1Initial exploration rate
{0.01, 0.001, 0.0001}Learning rate
Exploration rate decay parameter