Research Article
Research on Vibration Reduction Control Based on Reinforcement Learning
Algorithm 1
Value-function update method for the different reinforcement learning algorithms.
| | Input: environment E; action space A; initial state; discount coefficient; learning rate |
| (1) | |
| (2) | |
| (3) | For |
| (4) | Reward and next state generated by performing the action in environment E |
| (5) | |
| (6) | |
| (7) | |
| (8) | |
| (9) | End |
| | Output: strategy |
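The update formulas of Algorithm 1 did not survive in the text above, so the following is only a minimal sketch of the loop it outlines, assuming the standard textbook tabular forms of Sarsa and Q-learning rather than the authors' exact equations. The names `epsilon_greedy`, `td_update`, `train`, and `step`, as well as the exploration rate `epsilon`, are hypothetical and are not taken from the article.

```python
import random
from collections import defaultdict


def epsilon_greedy(Q, state, actions, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: Q[(state, a)])


def td_update(Q, s, a, r, s_next, a_next, actions, alpha, gamma, method):
    """Apply one temporal-difference update to Q[(s, a)] in place."""
    if method == "sarsa":
        # On-policy: bootstrap from the action actually selected next.
        target = r + gamma * Q[(s_next, a_next)]
    elif method == "q_learning":
        # Off-policy: bootstrap from the greedy action in the next state.
        target = r + gamma * max(Q[(s_next, b)] for b in actions)
    else:
        raise ValueError(f"unknown method: {method}")
    Q[(s, a)] += alpha * (target - Q[(s, a)])


def train(step, actions, s0, alpha, gamma, epsilon=0.1, n_steps=1000,
          method="q_learning", seed=0):
    """Act, observe reward and next state, update the value, move on."""
    rng = random.Random(seed)
    Q = defaultdict(float)  # assumed initialisation: all values start at 0
    s = s0
    a = epsilon_greedy(Q, s, actions, epsilon, rng)
    for _ in range(n_steps):
        r, s_next = step(s, a)  # reward and next state from environment E
        a_next = epsilon_greedy(Q, s_next, actions, epsilon, rng)
        td_update(Q, s, a, r, s_next, a_next, actions, alpha, gamma, method)
        s, a = s_next, a_next
    # Output: the greedy strategy with respect to the learned values.
    visited = {state for (state, _) in Q}
    policy = {st: max(actions, key=lambda b: Q[(st, b)]) for st in visited}
    return Q, policy
```

Here `step(s, a)` stands in for the environment E and is assumed to return a `(reward, next_state)` pair. The two algorithms differ only in the bootstrap target inside `td_update`, which is why a single loop can serve both.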