This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author serhiy.storchaka
Recipients Arfrever, ezio.melotti, ncoghlan, pitrou, r.david.murray, serhiy.storchaka, vstinner
Date 2014-08-24.13:49:12
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1408888152.16.0.546650225121.issue18814@psf.upfronthosting.co.za>
In-reply-to
Content
What problem is purposed to solve clean_surrogate_escapes()? Could you please provide user scenario or two?

Possible alternative implementation is:

def clean_surrogate_escapes(s):
    return s.encode('utf-8', 'surrogatepass').decode('utf-8', 'replace')

It can be faster for some data (for mostly ASCII with rare surrogates it is superfast). For other data 'utf-16' can be better choice.
History
Date User Action Args
2014-08-24 13:49:12serhiy.storchakasetrecipients: + serhiy.storchaka, ncoghlan, pitrou, vstinner, ezio.melotti, Arfrever, r.david.murray
2014-08-24 13:49:12serhiy.storchakasetmessageid: <1408888152.16.0.546650225121.issue18814@psf.upfronthosting.co.za>
2014-08-24 13:49:12serhiy.storchakalinkissue18814 messages
2014-08-24 13:49:12serhiy.storchakacreate