Research Article

Research on the Prediction of Nonbreakeven Financial Products’ Yield of Commercial Banks Based on Machine Learning

Table 3

The parameter of the random forest model.

ParametersDescriptions and settings

TaskTrain
ObjectiveModel training target, which can be selected as a regression model or binary classification model
Boosing_typeBase learner, for gbdt
MetricMetrics as a function of evaluating the optimal model
Leaning_rateLearning rate, the default is 0.1, the smaller the value, the more accurate the learning
n_estimatorsThe number of iterations in the classifier
Num_leavesThe number of leaves in the tree, the indicator should be less than 2 to the power of max_depth
Max_depthTree depth
Feature_fractionThe feature sampling ratio for building the tree, which ranges from 0 to 1
Bagging_fractionThe sample sampling ratio for building the tree, the data range is 0 to 1
Max_binThe maximum bin value, generally equal to the number of features
Min_data_in_leafThe minimum number of samples for each leaf node, when the leaf node is smaller than this value, the tree will no longer be split
Min_gain_to_splitThe tree stops splitting when the leaf node is smaller than this value