Research Article

Optimizing the Junction-Tree-Based Reinforcement Learning Algorithm for Network-Wide Signal Coordination

Table 1

The given Q-value matrix of the virtual road network.

Phase combinationIntersection and phaseIntersection and phaseIntersection and phaseIntersection and phase
12133424

Combination 1AA4AA8AA7AA8
Combination 2AB5AB7AB6AB9
Combination 3BA3BA5BA4BA5
Combination 4BB7BB9BB6BB7