Research Article
Research on Credit Risk Prediction under Unbalanced Dataset Based on Ensemble Learning
| Input: dataset D = {(, )}in, xi ∈ RT, random sampling times N, feature weight threshold β; | | Output: the filtered variable feature set S | (1) | Set the feature weight to 0, F = True, and the set S is empty | (2) | for i = 1 to N do | (3) | if F == True | | Randomly draw a sample xi from the minority sample, F = False | (4) | else | | A sample is randomly selected from the majority of samples, F = True | (5) | Use formulas (4) and (5) to calculate the probability that xi is used to update feature weights | (6) | Generate a random number from 0 to 1 μ | (7) | if μ < | | Find the nearest neighbor sample H from the similar samples of , and find the nearest neighbor sample M from different classes | | for i = 1 to T do | | Update the feature weight according to formula (1) | (8) | for i = 1 to T do | | if ≥ β | | Add the i-th feature to the set S | (9) | return S |
|