Research Article
Research on Vibration Reduction Control Based on Reinforcement Learning
Algorithm 1
The updating method of value function for the different reinforcement learning algorithms.
| Input: environment E; action space A; initial state ; discount coefficient ; learning rate ; | (1) | | (2) | | (3) | For | (4) | reward and transfer status generated by performing actions in environment E; | (5) | | (6) | | (7) | | (8) | | (9) | End | | Output: strategy |
|