string - Sentences division -
i have divide text separated sentences.
ok. seem simple.
just search "." or "?" or "!" , add next sentence array.
but unfortunately not great , simple.
how can avoid situation when:
washington, d.c.
will splitted for: "washington, d" , "c".
or
“one time set off explosive under chair of our teacher, mrs. thurman."
is splitted on:
"one time set off explosive under chair of our teacher, mrs"
and
"thurman"
maybe database acronyms contains "." ?
thanks tips in advance!
check out nltk. has out-of-the-box solutions problems described
Comments
Post a Comment