Research Article
Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning
| Parameter | Values | Description |
| --- | --- | --- |
| | 30 | Periods of the planning horizon |
| | {2, 3, 4} | Lifetime of perishables |
| | 0 | Initial inventory position of all ages |
| | 0.9 | Discount factor |
| | {0, 1, 2} | Lead time |
| | {25, 50} | Fixed ordering cost |
| | 22.5 | Unit variable cost |
| | 0.22 | Unit holding cost |
| | 10.78 | Unit penalty cost |
| | 10 | Unit disposal cost |
| | | Function of selling price |
| | 1 | Initial exploration rate |
| | {0.01, 0.001, 0.0001} | Learning rate |
| | | Exploration rate decay parameter |
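To make the size of the experimental design concrete, the following is a minimal sketch that stores the tabulated values and enumerates the combinations of the multi-valued parameters. The dictionary keys and the function name `problem_instances` are hypothetical labels, not notation from the article, and the full-factorial crossing of lifetime, lead time, and fixed ordering cost is an assumption. The two rows whose values are not listed in the table (the function of selling price and the exploration rate decay parameter) are deliberately omitted.

```python
from itertools import product

# Single-valued parameters copied from the table; key names are hypothetical.
FIXED = {
    "horizon": 30,
    "initial_inventory": 0,
    "discount_factor": 0.9,
    "unit_variable_cost": 22.5,
    "unit_holding_cost": 0.22,
    "unit_penalty_cost": 10.78,
    "unit_disposal_cost": 10,
    "initial_exploration_rate": 1,
}

# Multi-valued problem parameters copied from the table.
VARIED = {
    "lifetime": [2, 3, 4],
    "lead_time": [0, 1, 2],
    "fixed_ordering_cost": [25, 50],
}

# Candidate learning rates listed in the table.
LEARNING_RATES = [0.01, 0.001, 0.0001]


def problem_instances():
    """Enumerate every combination of the varied parameters.

    Assumption: the parameters are crossed full-factorially.
    """
    keys = list(VARIED)
    combos = product(*(VARIED[k] for k in keys))
    return [{**FIXED, **dict(zip(keys, combo))} for combo in combos]


instances = problem_instances()
print(len(instances))                        # number of problem instances
print(len(instances) * len(LEARNING_RATES))  # instances x learning rates
```

Under this assumption, the grid contains 3 × 3 × 2 = 18 problem instances, or 54 training configurations if each instance is paired with each of the three learning rates.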