Author vstinner
Recipients benjamin.peterson, ezio.melotti, grahamd, lemburg, ncoghlan, pitrou, pje, serhiy.storchaka, vstinner
Date 2014-08-25.01:06:06
Message-id <1408928766.47.0.264699325796.issue22264@psf.upfronthosting.co.za>
In-reply-to
Content
I don't think that applications are prepared to handle surrogate characters, so I'm not sure that the default error handler should be "surrogateescape". In my experience, the text is later encoded to UTF-8 (or latin1 or ascii), and you then get an error from the encoder.

Just one example: issue #11186.
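
To illustrate the failure mode described above, here is a minimal sketch. The byte string is a made-up input (an assumption, not taken from issue #11186): it is Latin-1 data that is not valid UTF-8. Decoding with "surrogateescape" succeeds silently, and the problem only shows up later, at the point where the text is encoded with the default strict handler.

```python
# Hypothetical input: "café" encoded as Latin-1, which is not valid UTF-8.
raw = b"caf\xe9"

# Decoding with surrogateescape does not fail; the undecodable byte 0xE9
# is smuggled into the string as the lone surrogate U+DCE9.
text = raw.decode("utf-8", errors="surrogateescape")

# Later, code that is unaware of the surrogate encodes the text strictly.
error = None
try:
    text.encode("utf-8")
except UnicodeEncodeError as exc:
    # The strict UTF-8 encoder refuses lone surrogates.
    error = exc

# Only an encoder that also uses surrogateescape restores the original bytes.
roundtrip = text.encode("utf-8", errors="surrogateescape")
```

The error is raised far from the place where the bad bytes entered the program, which is what makes it hard for applications to handle.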