Author gvanrossum
Recipients Arfrever, belopolsky, ezio.melotti, gvanrossum, mrabarnett, tchrist
Date 2011-08-26.21:11:23
SpamBayes Score 3.47014e-09
Marked as misclassified No
Message-id <1314393084.09.0.900456558475.issue12736@psf.upfronthosting.co.za>
In-reply-to
Content
I presume this applies to builtin str methods like .lower(), right?  I think it is a good thing to do for Python 3.3.

We'd need to define what should happen in edge cases, e.g. when (against all odds) a string happens to contain a lone surrogate or some other code point or sequence of code points that the Unicode standard considers illegal.  I think it should not fail but just leave those code points alone.

Does this require us to import more data files from the Unicode standard?  By itself that doesn't scare me.

Would this also affect .islower() and friends?
History
Date User Action Args
2011-08-26 21:11:24gvanrossumsetrecipients: + gvanrossum, belopolsky, ezio.melotti, mrabarnett, Arfrever, tchrist
2011-08-26 21:11:24gvanrossumsetmessageid: <1314393084.09.0.900456558475.issue12736@psf.upfronthosting.co.za>
2011-08-26 21:11:23gvanrossumlinkissue12736 messages
2011-08-26 21:11:23gvanrossumcreate