This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author pitrou
Recipients Arfrever, ezio.melotti, lemburg, ncoghlan, pitrou, r.david.murray, serhiy.storchaka, vstinner
Date 2014-09-23.11:23:47
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <>
The encoding used impacts the result:

>>> s = 'abc\udcc3\udca9'
>>> s.encode('ascii', 'surrogateescape').decode('ascii', 'replace')
>>> s.encode('utf-8', 'surrogateescape').decode('utf-8', 'replace')

The original string ('abc\udcc3\udca9') was obtained by decoding a valid utf-8 string with the 'ascii' codec and the 'surrogateescape' error handler.

If anything, the default encoding should probably be sys.getfilesystemencoding().
Date User Action Args
2014-09-23 11:23:47pitrousetrecipients: + pitrou, lemburg, ncoghlan, vstinner, ezio.melotti, Arfrever, r.david.murray, serhiy.storchaka
2014-09-23 11:23:47pitrousetmessageid: <>
2014-09-23 11:23:47pitroulinkissue18814 messages
2014-09-23 11:23:47pitroucreate