Research Article
WASTK: A Weighted Abstract Syntax Tree Kernel Method for Source Code Plagiarism Detection
Table 2
The sizes of datasets.
| Fold # | Type of set | Number of code pairs |
| --- | --- | --- |
| 1 | Training set | 14810 |
| 1 | Testing set | 7404 |
| 2 | Training set | 14809 |
| 2 | Testing set | 7405 |
| 3 | Training set | 14809 |
| 3 | Testing set | 7405 |
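In every fold of Table 2 the training and testing counts sum to 22214 code pairs, which is consistent with a three-fold split in which one fold serves as the testing set and the other two as the training set. The following is a minimal sketch of that arithmetic, not the authors' procedure: the function name `k_fold_sizes` is hypothetical, and the choice to place the smaller fold first is an assumption made only to match the order in the table.

```python
def k_fold_sizes(n_items, k):
    """Return a (train_size, test_size) tuple for each of k near-equal folds."""
    base, extra = divmod(n_items, k)
    # Assumption: the last `extra` folds get one additional item,
    # so the smaller folds come first (this matches the order in Table 2).
    test_sizes = [base + (1 if i >= k - extra else 0) for i in range(k)]
    # Each fold's training set is everything outside its testing set.
    return [(n_items - t, t) for t in test_sizes]


total_pairs = 14810 + 7404  # total taken from fold 1 of Table 2
sizes = k_fold_sizes(total_pairs, 3)

for fold, (train, test) in enumerate(sizes, start=1):
    print(f"Fold {fold}: training={train}, testing={test}")
```

Because 22214 is not divisible by 3, two of the testing sets hold one more pair than the third, which accounts for the 7404 versus 7405 difference in the table.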