Research Article
Support Vector Machine with Ensemble Tree Kernel for Relation Extraction
Input: | () Initial labeled text set I with little syntactic parsing processing | () Unlabeled text set II with large syntactic parsing processing | Output: Label information of text set II: M-II | // The labeled text set I is equally divided into two data set I1 and I2. | , | , | Begin | Do | // Training two SVMs based on tree kernel function, namely, svm1 and svm2, on I1 and I2, respectively | svm1.train and I1 → M1.model | svm2.train and I2 → M2.model | // classify the relations on text set II by using two trained svm models (M1.model and M2.model), and then generate different | classification result on the same text set II, candidate-I1 and candidate-I2 | M1.model and II → candidate-I1 | M2.model and II → candidate-I2 | // choose common samples from candidate-I1 and candidate-I2, and generate the new seed set, candidate-I | candidate-I ≔ candidate-I1 ∩ candidate-I2 | Until complete the task of classification in text set II | End | Printf label information of text set II: M-II |
|