Research Article
Action-Based Load Balancing Technique in Cloud Network Using Actor-Critic-Swarm Optimization
| Term | Meaning | Term | Meaning |
| --- | --- | --- | --- |
|  | Agent |  | State-value function |
|  | Initial state |  | Policy function |
|  | Next state |  | Actor learning rate |
| & | Episode & iteration |  | Critic learning rate |
|  | Actor parameter |  | TD error |
|  | Critic parameter |  | Advantage function |
|  | Eligibility trace |  | Policy gradient |
|  | Trace decay rate | γ | Discount factor |
|  | Reward |  | Fitness value |
|  | Personal best |  | Global best |