Research Article
Optimization of Tourism Information Analysis System Based on Big Data Algorithm
Algorithm 1
D3 NEW AND ID3 IMPROVED ALGORITHM.
| Input: R:a set of noncategorical attributes | | D: the categorical | | T: training set | | Output: a decision tree | | Begin | | If T is null | | Return empty flags or single data point flags | | If the records in T all have the same classification mark | | Return the classification value with a single node flag | | Assign threshold and confidence | | For all attribute X in D | | Calculate the obtained value of (x, T) | | Let be the maximum value obtained by (x, T) | | Let x = Xi | | Let W be the attribute with the greatest gain | | Calculate the percentage of each class in the data set and the subset of different values of the decision attribute W; | | If the percent value < x then | | Not counted in the queue | | End if | | End |
|