This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author jberg
Recipients Neui, SilentGhost, eryksun, jberg, ncoghlan
Date 2020-05-24.17:40:20
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1590342020.19.0.285394291128.issue35883@roundup.psfhosted.org>
In-reply-to
Content
In fact that python one-liner works with just about everything else that you can throw at it, just not something that "looks like utf-8 but isn't".

And of course adding LC_CTYPE=ascii or something like that fixes it, as you'd expect. Then the "surrogateescape" works fine, since mbstowcs() won't try to decode it as utf-8.
History
Date User Action Args
2020-05-24 17:40:20jbergsetrecipients: + jberg, ncoghlan, SilentGhost, eryksun, Neui
2020-05-24 17:40:20jbergsetmessageid: <1590342020.19.0.285394291128.issue35883@roundup.psfhosted.org>
2020-05-24 17:40:20jberglinkissue35883 messages
2020-05-24 17:40:20jbergcreate