Research Article

SynoExtractor: A Novel Pipeline for Arabic Synonym Extraction Using Word2Vec Word Embeddings

Table 5

Statistics of Arabic Gigaword Third Edition corpus.

SourceFilesDOCsWords

Agence France Presse152147612798436
Assabah28658715410
Al Hayat142171502378353
An Nahar134193732449340
Ummah Press2412014645
Xinhua News Agency6756165348551
Total5475767991994735