This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author ezio.melotti
Recipients ezio.melotti, lemburg, serhiy.storchaka, vstinner, wiml
Date 2013-03-04.18:10:43
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1362420644.05.0.788358961833.issue15866@psf.upfronthosting.co.za>
In-reply-to
Content
> I doubt about '\ud83d\udc9d' on wide build. Is it right to encode it as 
> b'&#128157;' and not as b'&#55357;&#56477;'?

I don't think so.  IIRC surrogates are invalid in UTF-32, and certainly shouldn't be recombined.

> This will be compatible with narrow build but will break compatibility
> with 3.3+. What is less evil?

I think it's better to be compatible with 3.3+.  This is anyway a rather obscure corner case.

Do you want to propose a new patch?
History
Date User Action Args
2013-03-04 18:10:44ezio.melottisetrecipients: + ezio.melotti, lemburg, vstinner, wiml, serhiy.storchaka
2013-03-04 18:10:44ezio.melottisetmessageid: <1362420644.05.0.788358961833.issue15866@psf.upfronthosting.co.za>
2013-03-04 18:10:44ezio.melottilinkissue15866 messages
2013-03-04 18:10:43ezio.melotticreate