Research Article
A Systematic Comparison of Data Selection Criteria for SMT Domain Adaptation
Table 1
Proportions of domains of general corpus.
| | Domain | Sent. number | % |
| | News | 279,962 | 24.60 | | Novel | 304,932 | 26.79 | | Law | 48,754 | 4.28 | | Miscellaneous | 504,396 | 44.33 |
| | Total | 1,138,044 | 100.00 |
|
|