Research Article
Path-Integral-Based Reinforcement Learning Algorithm for Goal-Directed Locomotion of Snake-Shaped Robot
Algorithm 1
The algorithm steps of goal-directed locomotion.
| Initialize: | | repeat | | (for each episode) Initialize: | | While is not | | If (Probability less than) | | choose | | Else | | Random choose | | End | | If (There is no collision) | | Take action, returnand | | Update | | repeat | | | | until | | until The task of goal-directed locomotion is finished | | End |
|