Research Article
BERT-PPII: The Polyproline Type II Helix Structure Prediction Model Based on BERT and Multichannel CNN
Table 2
The dataset under less strict definition (NonStrict_data).
| Dataset | Number of sequence | Number of PPII | Number of non-PPII | Total |
| Training set | 7121 | 64490 | 1554142 | 1618432 | Test set | 1781 | 15880 | 379276 | 395156 | Independent test set | 1001 | 8639 | 208785 | 217424 |
|
|