Research Article

Research on the Prediction of Nonbreakeven Financial Products’ Yield of Commercial Banks Based on Machine Learning

Table 2

The parameters of the random forest model.

ParametersDescriptions and settings

n_estimatorsThe number of classifiers, also called the number of iterations, is the number of decision trees in the forest
CriterionThe standard used for splitting nodes, the default is gini
Max_depthMaximum depth of trees
Min_samples_splitMinimum number of samples required to split a node inside the tree, defaults to 2
Min_samples_leafMinimum number of samples required at leaf nodes, defaults to 1
Min_weight_fraction_leafMinimum weighted score in the sum of weights at all leaf nodes, defaults to 0
Max_featuresThe number of features to consider when finding the best segmentation, the default is none; that is, all features are considered
Max_leaf_nodesThe maximum number of leaf nodes, which must be an integer, the default is none
Min_impurity_decreaseIf the decrement of the split index is greater than this value, then split, default is 0
BootstrapWhether there is a randomly selected sample to put back, the default is true
Oob_scoreWhether to use out-of-bag samples to evaluate the quality of the model, set to true
N_jobsThe number of parallel calculations, the default is none
Random_stateControls the randomness of bootstrap and randomness of the selected samples. In order to facilitate the adjustment of other parameters, this parameter adopts the default value
VerboseControls verbosity when fitting and predicting, default is 0
Warm_startWhether to use the trained model and add more base learners to it, set to false