Research Article
WASTK: A Weighted Abstract Syntax Tree Kernel Method for Source Code Plagiarism Detection
Table 2
The sizes of datasets.
| | Fold # | Type of set | Number of code pairs |
| | 1 | Training set | 14810 | | 1 | Testing set | 7404 |
| | 2 | Training set | 14809 | | 2 | Testing set | 7405 |
| | 3 | Training set | 14809 | | 3 | Testing set | 7405 |
|
|