Research Article

[Retracted] Marketing System Construction and Risk Analysis Based on Random Forest of Machine Learning

Table 4

Important parameters of the random forest model.

ParameterExplainImpact on model performance

N_estimatorsNumber of base evaluators. The default is 100It does not affect the complexity of a single tree, but too small is easy to underfit, and too large is easy to overfit
Max_depthMaximum tree depth, default maximum depthMaximum depth, i.e., maximum complexity, reduces max_depth to reduce model complexity
Min_samples_leafThe minimum number of leaf nodes is 1 by defaultThe default value is the highest complexity, increasing min_samples_leaf to reduce model complexity
Min_samples_splitThe minimum number of samples required for internal node division is 2 by defaultThe default value is the highest complexity, increasing min_samples_split to reduce model complexity
Max_featureThe maximum number of features is auto by default, and the total number of features is squaredThe default value is the intermediate complexity, reducing max_feature to reduce the complexity of the model; on the contrary, it increases the complexity of the model