Research Article
Deep Q-Learning Network Model for Optimizing Transit Bus Priority at Multiphase Traffic Signal Controlled Intersection
Figure 10
Training reward value iteration results. (a) Training iteration results for parameter values of 0.1, 0.25, 0.5, and 0.75. (b) Training iteration results for a parameter value of 0.9.