Research Article

Diversity Evolutionary Policy Deep Reinforcement Learning

Table 1

The test environment in the MuJoCo benchmark.

EnvironmentAction dimension/state dimensionTask goals

Hopper-v23/11Make a two-dimensional one-legged robot hop forward as fast as possible
HalfCheetah-v26/17Make the 2D cheetah robot run fast
Ant-v28/111Make a four-legged creature walk forward as fast as possible
Walker2d-v26/17Make a two-dimensional bipedal robot walk forward as fast as possible