Review Article

The Applicability of Reinforcement Learning Methods in the Development of Industry 4.0 Applications

Table 2

-test table of principle captured classes by RL method types.

Principle capturedMarkov decision processMultiarmed banditDynamicTemporal differenceValue function approximationPolicy gradientMultiagentEdge computing

Prediction, forecasting, estimation, planning1.96−0.580.12−0.323.080.31−0.39−4.17

Detection, recognition, prevention, avoidance, protection−3.091.51−1.11−0.170.9988, 0.9782, 0.97850.390.9935, 0.8769, 0.87852.2−1.381.65

Evaluation, assessment−0.820.63−0.08−0.62−1.21−0.62.96−0.27

Classification, clustering−3.610.561.281.411.540.65−2.881.05

Decision making3.12.47−0.841.730.950.02−10.823.39

Allocation, assignment, resource management−2.69−1.950.21−11.18−4.742.0810.547.74

Scheduling, queuing, planning2.17−1.33−2.171.241.61−0.04−2.631.16

Control2.67−1.342.357.52−1.72−4.934.27−10.82