string - Sentences division -


i have divide text separated sentences.

ok. seem simple.

just search "." or "?" or "!" , add next sentence array.

but unfortunately not great , simple.

how can avoid situation when:

washington, d.c.

will splitted for: "washington, d" , "c".

or

“one time set off explosive under chair of our teacher, mrs. thurman."

is splitted on:

"one time set off explosive under chair of our teacher, mrs"

and

"thurman"

maybe database acronyms contains "." ?

thanks tips in advance!

check out nltk. has out-of-the-box solutions problems described


Comments

Popular posts from this blog

All overlapping substrings matching a java regex -

c++ - Using OpenSSL in a multi-threaded application -

php - Deleting/Renaming a locked file -