Research Article

Vietnamese Sentiment Analysis under Limited Training Data Based on Deep Neural Networks

Table 5

Mean accuracy (with standard deviation) over 10 runs of the deep learning model with various data augmentation techniques for Vietnamese sentiment analysis.

Dataset    (1)             (2)             (3)             (4)             (5)             (6)             (7)

Dataset 1  0.764 ± 0.05    0.838 ± 0.01    0.818 ± 0.03    0.843 ± 0.02    0.844 ± 0.01    0.840 ± 0.01    0.847 ± 0.0003
Dataset 2  0.415 ± 0.06    0.518 ± 0.03    0.438 ± 0.10    0.433 ± 0.07    0.520 ± 0.01    0.449 ± 0.05    0.367 ± 0.0003
Dataset 3  0.742 ± 0.05    0.797 ± 0.01    0.683 ± 0.10    0.742 ± 0.07    0.780 ± 0.01    0.782 ± 0.01    0.801 ± 0.0001
Dataset 4  0.757 ± 0.02    0.802 ± 0.01    0.710 ± 0.07    0.738 ± 0.09    0.782 ± 0.01    0.786 ± 0.01    0.746 ± 0.0001

(1) With preprocessing techniques; (2) EDA; (3) sentence shuffling; (4) back translation; (5) contextual substitution (); (6) contextual substitution ( +); (7) sentence embedding mixup (the standard deviations are very small).
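
Each cell in the table summarizes 10 runs as a mean accuracy and a standard deviation. A minimal sketch of that aggregation follows. It assumes the sample standard deviation (n - 1 denominator), since the source does not state which estimator was used. The function name `summarize_runs` and the accuracy values are hypothetical and are not taken from the paper.

```python
import statistics


def summarize_runs(accuracies):
    """Return (mean, sample standard deviation) of per-run accuracies.

    Assumption: the sample estimator (n - 1 denominator) is used; the
    source does not say whether sample or population deviation is reported.
    """
    if len(accuracies) < 2:
        raise ValueError("need at least two runs to estimate a standard deviation")
    return statistics.mean(accuracies), statistics.stdev(accuracies)


# Hypothetical accuracies from 10 runs; these are NOT values from the paper.
runs = [0.80, 0.82, 0.81, 0.79, 0.83, 0.80, 0.82, 0.81, 0.80, 0.82]
mean_acc, std_acc = summarize_runs(runs)

# Format in the same "mean +/- std" style as the table cells.
print(f"{mean_acc:.3f} +/- {std_acc:.3f}")
```

If the population standard deviation is wanted instead, `statistics.pstdev` can replace `statistics.stdev`; with only 10 runs the two differ noticeably, so it is worth stating which one a table reports.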