Research Article
Novel Model-free Optimal Active Vibration Control Strategy Based on Deep Reinforcement Learning
| | Initialization step: initial DDPG agent, initial state vector | | | repeat | | | Observe state and get action | | | Execute in the environment | | | Observe the next state | | | Get the reward | | | If is terminal, then reset the initial states and start a new episode | | | If it is time to update, then | | | DDPG | | | end if | | | until convergence |
|