Research Article
Optimization of Tourism Information Analysis System Based on Big Data Algorithm
Algorithm 1
D3 NEW AND ID3 IMPROVED ALGORITHM.
| | Input: R:a set of noncategorical attributes | | | D: the categorical | | | T: training set | | | Output: a decision tree | | | Begin | | | If T is null | | | Return empty flags or single data point flags | | | If the records in T all have the same classification mark | | | Return the classification value with a single node flag | | | Assign threshold and confidence | | | For all attribute X in D | | | Calculate the obtained value of (x, T) | | | Let be the maximum value obtained by (x, T) | | | Let x = Xi | | | Let W be the attribute with the greatest gain | | | Calculate the percentage of each class in the data set and the subset of different values of the decision attribute W; | | | If the percent value < x then | | | Not counted in the queue | | | End if | | | End |
|