Research Article

A Topic Recognition Method of News Text Based on Word Embedding Enhancement

Table 2

The information of the 20NewsGroup dataset.

Topic categoryLenSizeTopic categoryLenSize

comp.graphics163.0973rec.autos126.3990
comp.os.ms.windows.misc160.6985rec.motorcycles118.4996
comp.sys.ibm.pc.hardware116.7982rec.sport.baseball131.2994
comp.sys.mac.hardware109.0963rec.sport.hockey155.3999
comp.windows.x174.3988misc.forsale95.1975
talk.politics.misc232.7775sci.crypt189.6991
talk.politics.guns189.9910sci.electronics121.8984
talk.politics.mideast269.4940sci.med173.8990
alt.atheism183.0799sci.space174.9987
soc.religion.Christian194.2997talk.religion.misc192.1628