Research Article

Multilayer Deep Deterministic Policy Gradient for Static Safety and Stability Analysis of Novel Power Systems

Table 1

The classification of deep reinforcement learning methods.

| Type | Action space | Methods |
| --- | --- | --- |
| Value-based | Discrete | Q-learning, DQN, state-action-reward-state-action |
| Policy-based | Discrete or continuous | Policy gradient |
| Actor-critic | Discrete or continuous | Actor-critic, PPO, trust region policy optimization |
| Actor-critic | Continuous | DDPG, twin-delayed DDPG, soft actor-critic |
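
The action-space column of Table 1 can be illustrated with a minimal sketch. It contrasts how a value-based method picks among a finite set of actions with how a deterministic actor of the DDPG family outputs a bounded continuous action. All function names, the single linear layer, and the Gaussian exploration noise are assumptions made for illustration; this is not the multilayer DDPG architecture of the article.

```python
import math
import random


def greedy_discrete_action(q_values):
    """Value-based selection: return the index of the largest Q-value."""
    return max(range(len(q_values)), key=lambda a: q_values[a])


def deterministic_continuous_action(state, weights, bias, action_limit):
    """Deterministic actor sketch: a = action_limit * tanh(w . s + b).

    A single linear layer stands in for the actor network (assumption).
    The tanh squashing keeps the action inside [-action_limit, action_limit].
    """
    pre_activation = sum(w * s for w, s in zip(weights, state)) + bias
    return action_limit * math.tanh(pre_activation)


def with_exploration_noise(action, sigma, action_limit, rng):
    """Add zero-mean Gaussian noise, then clip back into the valid range."""
    noisy = action + rng.gauss(0.0, sigma)
    return max(-action_limit, min(action_limit, noisy))


if __name__ == "__main__":
    rng = random.Random(0)
    # Discrete case: three candidate actions scored by hypothetical Q-values.
    print(greedy_discrete_action([0.1, 0.7, 0.3]))
    # Continuous case: one real-valued action from a hypothetical 2-D state.
    a = deterministic_continuous_action([0.5, -0.2], [0.8, 0.4], 0.0, 2.0)
    print(with_exploration_noise(a, 0.1, 2.0, rng))
```

The discrete routine has to enumerate every action, which is why value-based methods are listed only under discrete action spaces, whereas the actor maps a state directly to a real-valued action without any enumeration.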