Research Article
Intelligent Inventory Control via Ruminative Reinforcement Learning
Algorithm 1: RSarsa algorithm.
(L00) Initialize .
(L01) Observe .
(L02) Determine by policy .
(L03) For each period,
(L04)     observe , and ;
(L05)     determine by policy ;
(L06)     calculate ;
(L07)     update ;
(L08)     for each ,
(L09)         calculate with ,
(L10)         determine ,
(L11)         calculate ,
(L12)         update
(L13)     until ruminated all ;
(L14)     set and
(L15) until termination.
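The control flow of Algorithm 1 can be illustrated with a minimal tabular sketch in Python. It is not the authors' implementation. The listing does not spell out the update formulas, so the temporal-difference error below uses the standard Sarsa form, which is an assumption. The names `rsarsa`, `inventory_step`, and `make_epsilon_greedy`, the toy cost figures, the capacity of 5, and the use of one transition function for both the real step and the ruminated steps are all hypothetical choices made only so that the example runs on its own. The rumination loop here also revisits the action that was actually taken, which is one possible reading of "for each".

```python
import random
from collections import defaultdict


def inventory_step(stock, order, demand, capacity=5):
    """Hypothetical one-period inventory model: returns (next_stock, reward)."""
    available = min(stock + order, capacity)
    sold = min(available, demand)
    next_stock = available - sold
    # Assumed economics: revenue 4 per unit sold, cost 1 per unit ordered,
    # holding cost 0.5 per unit left in stock.
    reward = 4.0 * sold - 1.0 * order - 0.5 * next_stock
    return next_stock, reward


def make_epsilon_greedy(actions, epsilon, rng):
    """Return an epsilon-greedy policy over a tabular Q (an assumed policy)."""
    def policy(Q, s):
        if rng.random() < epsilon:
            return rng.choice(actions)
        return max(actions, key=lambda a: Q[(s, a)])
    return policy


def rsarsa(step_model, actions, policy, s0, xis, alpha=0.1, gamma=0.9):
    """Sarsa with a rumination pass over all actions, following steps L00-L15."""
    Q = defaultdict(float)                      # (L00) initialize the value table
    s = s0                                      # (L01) observe the initial state
    a = policy(Q, s)                            # (L02) choose the first action
    for xi in xis:                              # (L03) one pass per period
        # (L04) observe the outcome and the extra information xi (here: demand)
        s_next, r = step_model(s, a, xi)
        a_next = policy(Q, s_next)              # (L05)
        # (L06) assumed standard Sarsa temporal-difference error
        td = r + gamma * Q[(s_next, a_next)] - Q[(s, a)]
        Q[(s, a)] += alpha * td                 # (L07)
        for a_alt in actions:                   # (L08) ruminate over every action
            # (L09) what would have happened under a_alt with the same xi
            s_alt, r_alt = step_model(s, a_alt, xi)
            a_alt_next = policy(Q, s_alt)       # (L10)
            td_alt = r_alt + gamma * Q[(s_alt, a_alt_next)] - Q[(s, a_alt)]  # (L11)
            Q[(s, a_alt)] += alpha * td_alt     # (L12)
        s, a = s_next, a_next                   # (L14) move to the next period
    return Q                                    # (L15) stop when periods run out


# Small demonstration on randomly drawn demands (seeded for repeatability).
rng = random.Random(0)
demo_actions = [0, 1, 2, 3]
demo_demands = [rng.randint(0, 3) for _ in range(2000)]
Q_demo = rsarsa(inventory_step, demo_actions,
                make_epsilon_greedy(demo_actions, 0.1, rng), 0, demo_demands)
```

Because every period updates the value of each candidate action from the same observed demand, the table fills in far faster than with plain Sarsa, at the price of needing a model that can replay the period under a different action.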