Research Article

Deep Q-Learning Network Model for Optimizing Transit Bus Priority at Multiphase Traffic Signal Controlled Intersection

Figure 10

Training reward value iteration results. (a) Training iteration results (when  = 0.1,  = 0.25,  = 0.5,  = 0.75). (b) Training iteration results (when  = 0.9).
(a)
(b)