Research Article
Rapid Text Retrieval and Analysis Supporting Latent Dirichlet Allocation Based on Probabilistic Models
Table 3
Analysis of data according to English grammar based on LDA.
| Word statistics | Word count | Cumulative | Percentage of cumulative |
| Syllables | 2045 | 2045 | 51.58 | Sentences | 126 | 2171 | 3.18 | Unique words | 371 | 2542 | 9.36 | Average word length (char) | 4.6 | 2546.6 | 0.12 | Average sentence length (word) | 10.2 | 2556.8 | 0.26 | Monosyllabic words (1 syllable) | 749 | 3305.8 | 18.83 | Polysyllabic words (≥3 syllables) | 179 | 3484.8 | 4.52 | Syllables per word | 1.6 | 3486.4 | 0.04 | Paragraph | 79 | 3565.4 | 1.99 | Difficult Words | 399 | 3964.4 | 10.07 |
|
|