Research Article

SynoExtractor: A Novel Pipeline for Arabic Synonym Extraction Using Word2Vec Word Embeddings

Table 4

Statistics of the KSUCCA corpus [23].

Genre          Number of texts    Number of words    Percentage (%)

Religion       150                23645087           46.73
Linguistics    56                 7093966            14.02
Literature     104                7224504            14.28
Science        42                 6429133            12.71
Sociology      32                 2709774            5.36
Biography      26                 3499948            6.92
Total          410                50602412           100
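
The percentage column in Table 4 appears to be each genre's share of the total word count. The following minimal sketch recomputes it from the word counts in the table, assuming the formula is words / total words × 100, rounded to two decimals. The variable names are illustrative and do not come from the article.

```python
# Word counts per genre, copied from Table 4 (KSUCCA corpus statistics).
word_counts = {
    "Religion": 23645087,
    "Linguistics": 7093966,
    "Literature": 7224504,
    "Science": 6429133,
    "Sociology": 2709774,
    "Biography": 3499948,
}

# Total number of words across all six genres.
total_words = sum(word_counts.values())

# Assumption: percentage = genre words / total words * 100, rounded to 2 decimals.
percentages = {
    genre: round(100 * count / total_words, 2)
    for genre, count in word_counts.items()
}

# Print one row per genre, then the total row.
for genre, pct in percentages.items():
    print(f"{genre:<12} {word_counts[genre]:>10} {pct:>6.2f}")
print(f"{'Total':<12} {total_words:>10}")
```

Under this assumption, the recomputed total and percentages agree with the values reported in the table.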