
Author jberg
Recipients Neui, SilentGhost, eryksun, jberg, ncoghlan
Date 2020-05-24.17:26:36
Message-id <1590341196.17.0.756431238441.issue35883@roundup.psfhosted.org>
In-reply-to
Content
Pretty sure this is still an issue; I see it on current git master.

This seems to work around it?

https://p.sipsolutions.net/603927f1537226b3.txt

Basically, it seems that mbstowcs() and mbrtowc() on glibc in a UTF-8 locale just blindly decode even invalid UTF-8 into a wchar_t value that is too large (beyond the Unicode range), rather than failing.
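
To illustrate the kind of check that would catch this, here is a minimal sketch. It is not the patch linked above. The names is_valid_code_point() and checked_mbrtowc() are hypothetical, and the sketch assumes a 32-bit wchar_t holding Unicode code points, as on glibc. It calls mbrtowc() and then treats any result outside the Unicode scalar range as a decoding error.

```c
#include <errno.h>
#include <stddef.h>
#include <string.h>
#include <wchar.h>

/* Hypothetical helper: 1 if c is a Unicode scalar value, i.e. at most
 * 0x10FFFF and not in the surrogate range 0xD800..0xDFFF; 0 otherwise. */
static int is_valid_code_point(unsigned long c)
{
    if (c > 0x10FFFFUL)
        return 0;
    if (c >= 0xD800UL && c <= 0xDFFFUL)
        return 0;
    return 1;
}

/* Hypothetical wrapper: same return convention as mbrtowc(), except that
 * a decoded value outside the Unicode scalar range is reported as an
 * encoding error ((size_t)-1 with errno set to EILSEQ).
 * Assumption: wchar_t is 32 bits wide and stores code points directly. */
static size_t checked_mbrtowc(wchar_t *out, const char *s, size_t n,
                              mbstate_t *state)
{
    wchar_t wc = 0;
    size_t r = mbrtowc(&wc, s, n, state);

    /* Pass through "invalid" (-1) and "incomplete" (-2) unchanged. */
    if (r == (size_t)-1 || r == (size_t)-2)
        return r;

    /* A negative wchar_t converts to a huge unsigned value and is rejected. */
    if (!is_valid_code_point((unsigned long)wc)) {
        memset(state, 0, sizeof(*state));  /* back to the initial state */
        errno = EILSEQ;
        return (size_t)-1;
    }

    if (out != NULL)
        *out = wc;
    return r;
}
```

A caller would use checked_mbrtowc() in place of mbrtowc() in its decoding loop. Whether the surrogate range should also be rejected depends on what the caller expects, so that part is a design choice rather than something taken from the report.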