Complexity

Research Article

Reinforcement Learning with Probabilistic Boolean Network Models of Smart Grid Devices

Maximum reward obtained for the normal operation mode of the IPR in one year of operation.