Research Article
Novel Model-free Optimal Active Vibration Control Strategy Based on Deep Reinforcement Learning
| Initialization step: initial DDPG agent, initial state vector | | repeat | | Observe state and get action | | Execute in the environment | | Observe the next state | | Get the reward | | If is terminal, then reset the initial states and start a new episode | | If it is time to update, then | | DDPG | | end if | | until convergence |
|