Author ezio.melotti
Recipients ezio.melotti, gvanrossum, kennyluck, lemburg, loewis, tchrist, vstinner
Date 2012-01-30.08:51:05
SpamBayes Score 4.41798e-06
Marked as misclassified No
Message-id <1327913466.02.0.834023803478.issue12892@psf.upfronthosting.co.za>
In-reply-to
Content
Thanks for the patch!

>  * fix an error in the error handler for utf-16-le. (In, Python3.2 
> b'\xdc\x80\x00\x41'.decode('utf-16-be', 'ignore') returns "\x00" 
> instead of "A" for some reason)

This should probably be done on a separate patch that will be applied to 3.2/3.3 (assuming that it can go to 3.2).  Rejecting surrogates will go in 3.3 only.  (Note that lot of Unicode-related code changed between 3.2 and 3.3.)

> Should we really reject lone surrogates for UTF-7?

No, I meant only UTF-8/16/32; UTF-7 is fine as is.
History
Date User Action Args
2012-01-30 08:51:06ezio.melottisetrecipients: + ezio.melotti, lemburg, gvanrossum, loewis, vstinner, tchrist, kennyluck
2012-01-30 08:51:06ezio.melottisetmessageid: <1327913466.02.0.834023803478.issue12892@psf.upfronthosting.co.za>
2012-01-30 08:51:05ezio.melottilinkissue12892 messages
2012-01-30 08:51:05ezio.melotticreate