This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author r.david.murray
Recipients barry, bryced, r.david.murray
Date 2018-10-03.17:59:47
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1538589587.25.0.545547206417.issue34881@psf.upfronthosting.co.za>
In-reply-to
Content
You could also play with just making a parser that is a simplified version of get_unstructured, producing a....maybe call it ASCIIOnlyUnstructuredTokenList...that would have as_ew_allowed set to False.  That might not produce optimal results, but it would be better than the current situation.
History
Date User Action Args
2018-10-03 17:59:47r.david.murraysetrecipients: + r.david.murray, barry, bryced
2018-10-03 17:59:47r.david.murraysetmessageid: <1538589587.25.0.545547206417.issue34881@psf.upfronthosting.co.za>
2018-10-03 17:59:47r.david.murraylinkissue34881 messages
2018-10-03 17:59:47r.david.murraycreate