Research Article

iSentenizer- : Multilingual Sentence Boundary Detection Model

Table 10

Results on the Brown, WSJ, and Tycho Brahe corpus.

CorpusCandidatesRecallPrecision -Score

WSJ corpusiSentenizer98.48%99.18%98.83%
Punkt93.08%57.84%71.34%
MxTerminator93.08%60.24%73.14%

Brown corpusiSentenizer99.41%99.98%99.70%
Punkt96.30%99.95%98.09%
MxTerminator96.30%99.98%98.11%

Tycho Brahe corpusiSentenizer99.40%99.86%99.63%
Punkt79.83%99.90%88.74%
MxTerminator79.83%99.98%88.77%