Research Article

Hybrid Online and Offline Reinforcement Learning for Tibetan Jiu Chess

Figure 7

Comparison of learning efficiency with or without 2D normal distribution. (a) 2D normalization off. (b) 2D normalization on. (c) 2D normalization on and making a special board type.
(a)
(b)
(c)