Research Article
Hybrid Online and Offline Reinforcement Learning for Tibetan Jiu Chess
Figure 7
Comparison of learning efficiency with and without the 2D normal distribution. (a) 2D normalization off. (b) 2D normalization on. (c) 2D normalization on, with a special board type.