Research Article

Optimization of Tourism Information Analysis System Based on Big Data Algorithm

Algorithm 1

D3 NEW AND ID3 IMPROVED ALGORITHM.
Input: R:a set of noncategorical attributes
 D: the categorical
 T: training set
Output: a decision tree
Begin
 If T is null
  Return empty flags or single data point flags
If the records in T all have the same classification mark
  Return the classification value with a single node flag
Assign threshold and confidence
For all attribute X in D
  Calculate the obtained value of (x, T)
  Let be the maximum value obtained by (x, T)
   Let x = Xi
 Let W be the attribute with the greatest gain
   Calculate the percentage of each class in the data set and the subset of different values of the decision attribute W;
 If the percent value < x then
  Not counted in the queue
 End if
End