Research Article

A Computational Linguistic Approach for Gender Prediction Based on Vietnamese Names

Table 3

Gender prediction of N-gram by using full name and without family name on the GenderVN1.0 dataset.

N-gramUsing full nameWithout family name
Logistics regressionNäive BayesRandom forestLogistics regressionNäive BayesRandom forest

1-gram90.490.088.290.990.390.5
2-gram70.370.070.468.468.468.4
3-gram54.854.854.853.053.053.0

Bold shows best accuracy obtained.