Research Article
Support Vector Machine with Ensemble Tree Kernel for Relation Extraction
| Input: | | () Initial labeled text set I with little syntactic parsing processing | | () Unlabeled text set II with large syntactic parsing processing | | Output: Label information of text set II: M-II | | // The labeled text set I is equally divided into two data set I1 and I2. | | , | | , | | Begin | | Do | | // Training two SVMs based on tree kernel function, namely, svm1 and svm2, on I1 and I2, respectively | | svm1.train and I1 → M1.model | | svm2.train and I2 → M2.model | | // classify the relations on text set II by using two trained svm models (M1.model and M2.model), and then generate different | | classification result on the same text set II, candidate-I1 and candidate-I2 | | M1.model and II → candidate-I1 | | M2.model and II → candidate-I2 | | // choose common samples from candidate-I1 and candidate-I2, and generate the new seed set, candidate-I | | candidate-I ≔ candidate-I1 ∩ candidate-I2 | | Until complete the task of classification in text set II | | End | | Printf label information of text set II: M-II |
|