Input: An arbitrary initial policy at given . |
Output: The optimal policy for all user queues . |
Procedure: |
() Choose as the stopping criterion for all user queues. |
() for queue to do |
() repeat |
() Set iteration index . |
() Calculate and by solving (2.6) and (3.2), respectively. |
() Determine the gradient such that: |
|
() Do line search along the gradient, choose the right step size . |
() Update service rates . |
() Set . |
() Until or . |
() end for |