Research Article

Analysis and Recognition of Food Safety Problems in Online Ordering Based on Reviews Text Mining

Table 1

List of prior studies on online reviews text sentiment analysis.

StudyBuilding feature vectorsTechniques usedKey findings

Huang et al. [11]Bag of words, Word2vec, and TF-IDFSVM, RF, GBDT, XGBoostGBDT has the best classification performance
Barrientos et al. [26]Bag of words, Word2vec, and TF-IDFSVM, LR, KNN, RFThe best performance result was achieved by the combination of the text encoder TF-IDF and the SVM classifier with linear kernel
Zahoor et al. [27]TF-IDFNB, SVM, LR, RFRF algorithm achieves the maximum accuracy in sentiment analysis
Yang et al. [28]Word2vecXGBoostThe accuracy of XGBoost model in predicting emotional polarity is 0.896
Anisha et al. [29]TF-IDF; bag of wordsSVM, RF, NB, LR, RNN, LSTM, BiLSTM, CNNThe verification accuracy of LSTM method is the highest
Liu et al. [30]Word2vec, BertSVM, CNN, LSTM, BiLSTMBiLSTM model has a higher improvement in F1 value compared with other models
Duan et al. [31]BertDict-BertDict-Bert model is better than the BERT-only model, especially when the training set is relatively small
Zeng et al. [32]Word2vecBiLSTM, SVM, RF, XGBOOST, LSTMBiLSTM model achieved good results on F1 and accuracy
Wu et al. [33]BertCNN, RNN, FastText, RCNNOn the basis of Bert, RCNN combined with attention mechanism is used to extract the context features of reviews text, which can improve the accuracy of model classification
Li et al. [34]BertCNN, BiLSTMBLSTM can solve the connection between the words and the semantic
Maslej-Krešňáková et al. [35]TF-IDF, pretrained word embeddingsFFNN, CNN, LSTM, GRU, BiLSTMThe combined structure classification accuracy of BiLSTM + CNN network is high

SVM: support vector machine; RF: random forest; GBDT: Gradient Boosting Decision Tree; XGBoost: Extreme Gradient Boosting; LR: logistic regression; KNN: K-nearest neighbors; NB: Naive Bayes; RNN: recurrent neural network; LSTM: long short-term memory; BiLSTM: bidirectional long short-term memory; CNN: convolutional neural network; RCNN: recurrent convolutional neural networks; FFNN: feedforward neural network; GRU: gated recurrent unit.