Given a string:
c = 'A problem. She said: "I don\'t know about it."'
And an attempt to tokenize it:
>>> from nltk.tokenize import sent_tokenize
>>> for sindex, sentence in enumerate(sent_tokenize(c)):
...     print(str(sindex) + ": " + sentence)
...
0: A problem.
1: She said: "I don't know about it.
2: "
>>>
Why does NLTK split the closing quote of the second sentence (index 1) off into a separate third sentence (index 2)? Is there a way to correct this behavior?
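
To make the desired result concrete, here is a minimal post-processing sketch. It does not explain or change what the tokenizer does internally. It only glues any fragment made up entirely of closing quotes or brackets back onto the preceding sentence. The helper name `merge_dangling_closers` and the `CLOSERS` set are hypothetical, not part of NLTK, and the list `pieces` is simply the tokenizer output shown above copied by hand. It also assumes the closing mark originally followed the sentence with no space in between.

```python
# Hypothetical helper, not part of NLTK: a workaround sketch only.
# Characters treated as "closing" marks; extend as needed (assumption).
CLOSERS = set('"\')]}')

def merge_dangling_closers(sentences):
    """Attach fragments made only of closing marks to the previous sentence."""
    merged = []
    for sentence in sentences:
        stripped = sentence.strip()
        is_only_closers = bool(stripped) and all(ch in CLOSERS for ch in stripped)
        if merged and is_only_closers:
            # Assumes no whitespace separated the closer from the sentence.
            merged[-1] += stripped
        else:
            merged.append(sentence)
    return merged

# The tokenizer output from the question, copied by hand.
pieces = ['A problem.', 'She said: "I don\'t know about it.', '"']
fixed = merge_dangling_closers(pieces)
for sindex, sentence in enumerate(fixed):
    print(str(sindex) + ": " + sentence)
```

With the three fragments above, this yields two sentences, the second ending in the closing quote. A leading fragment that consists only of closing marks is left alone, since there is nothing before it to attach to.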