Complexity

Research Article

Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning

The parameter values.


Parameter	Values	Description

	30	Periods of the planning horizon
	{2,3,4}	Lifetime of perishables
	0	Initial inventory position of all ages
	0.9	Discount factor
	{0, 1, 2}	Lead time
	{25, 50}	Fixed ordering cost
	22.5	Unit variable cost
	0.22	Unit holding cost
	10.78	Unit penalty cost
	10	unit disposal cost
		Function of selling price
	1	Initial exploration rate
	{0.01, 0.001, 0.0001}	Learning rate
		Exploration rate decay parameter