Research Article

WASTK: A Weighted Abstract Syntax Tree Kernel Method for Source Code Plagiarism Detection

Table 2

The sizes of datasets.

Fold # Type of setNumber of code pairs

1Training set14810
1Testing set7404

2Training set14809
2Testing set7405

3Training set14809
3Testing set7405