Research Article
Textual Backdoor Attack for the Text Classification System
Figure 13
Accuracy of target model on original sentences from the MR dataset and attack success rate of the backdoor samples according to the proportion of backdoor samples in the input dataset. Each pair of bars represents the performance of the target model trained using a different proportion of backdoor samples. The trigger text was “movie.” (a) Trigger at the beginning of a sentence. (b) Trigger at the end of a sentence.
(a) |
(b) |