Research Article
Plant MicroRNA Prediction by Supervised Machine Learning Using C5.0 Decision Trees
Table 3
Base set of attributes.
| | Attribute | Description of the attribute in relation to the control or candidate sequence |
| | chromLen/position | The ratio of the length of the chromosome over the position on that chromosome | | ShannonEntropyNorm | Shannon entropy normalized to the sequence length | | G% | Percentage of G base composition | | C% | Percentage of C base composition | | T% | Percentage of T base composition | | A% | Percentage of A base composition | | DuplexEnergy | The duplex energy between the miRNAs:miRNAs* | | DuplexEnergyNorm | The duplex energy normalized to the length of the duplex structure | | MaxMismatch | Maximum number of mismatches in the duplex structure based on both sides of the structure | | minMatchPercent | Minimum % match based on length of the duplex structure both sides of the structure | | DeltaG | Minimum free energy for the stem loop | | DeltaGnorm | Minimum free energy normalized to the length of the stem loop | | longestDotSet | Longest run of mismatches in the stem loop | | longestBracketSet | Longest run of matches in the stem loop | | loopCountNorm | Number of loop heads normalized to the length of the stem loop |
|
|