string - Sentences division -


i have divide text separated sentences.

ok. seem simple.

just search "." or "?" or "!" , add next sentence array.

but unfortunately not great , simple.

how can avoid situation when:

washington, d.c.

will splitted for: "washington, d" , "c".

or

“one time set off explosive under chair of our teacher, mrs. thurman."

is splitted on:

"one time set off explosive under chair of our teacher, mrs"

and

"thurman"

maybe database acronyms contains "." ?

thanks tips in advance!

check out nltk. has out-of-the-box solutions problems described


Comments

Popular posts from this blog

c# - SVN Error : "svnadmin: E205000: Too many arguments" -

c# - Copy ObservableCollection to another ObservableCollection -

All overlapping substrings matching a java regex -