Research Article
Asymptotic Optimality and Rates of Convergence of Quantized Stationary Policies in Continuous-Time Markov Decision Processes
Algorithm 1
(An iteration algorithm).
| Step 1. (Initialization). Choose any , let in Assumption 1, and for each , let | | | | Step 2. (Iteration). For each , let | | | | where . | | Step 3. (Approximation value). If | | | | where , go on step 4, otherwise increment by 1 and return to step 2. | | Step 4. (-optimal policy). For each , choose | | | | and is -optimal policy. |
|