Message 143036 - Python tracker

Author	gvanrossum
Recipients	Arfrever, belopolsky, ezio.melotti, gvanrossum, mrabarnett, tchrist
Date	2011-08-26.21:11:23

Content
I presume this applies to built-in str methods like .lower(), right?  I think it is a good thing to do for Python 3.3.

We'd need to define what should happen in edge cases, e.g. when (against all odds) a string happens to contain a lone surrogate or some other code point or sequence of code points that the Unicode standard considers illegal.  I think it should not fail but just leave those code points alone.
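
To make the edge case concrete, here is a minimal sketch of the "leave those code points alone" behavior. It assumes CPython 3.3 or later and only shows what later releases do in practice, not something settled in this message; the sample string is hypothetical.

```python
# Hypothetical sample: a lone high surrogate (U+D800) between ordinary letters.
s = "AB\ud800CD"

# Case conversion should not raise; the surrogate has no case mapping,
# so it is expected to pass through unchanged.
lowered = s.lower()
uppered = lowered.upper()

# ascii() is used for printing because a lone surrogate cannot be
# encoded to UTF-8 on most terminals.
print(ascii(lowered))
print(ascii(uppered))
```

The letters around the surrogate are converted as usual, and the string length is preserved because nothing here expands under case mapping.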

Does this require us to import more data files from the Unicode standard?  By itself that doesn't scare me.

Would this also affect .islower() and friends?
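
As a rough illustration of how the conversion methods and the predicates relate, the sketch below assumes CPython 3.3 or later, where str.upper() uses full case mappings. The sample word is an arbitrary choice, and the comments describe behavior in those later releases rather than a decision made in this thread.

```python
# Arbitrary sample containing U+00DF (LATIN SMALL LETTER SHARP S),
# whose full uppercase mapping is the two-character sequence "SS".
word = "stra\u00dfe"

upper_word = word.upper()          # full mapping may change the length
length_changed = len(upper_word) != len(word)

# The predicates look at character properties and do not perform a mapping.
sharp_s_is_lower = "\u00df".islower()
surrogate_is_lower = "\ud800".islower()  # uncased code point

print(upper_word, length_changed, sharp_s_is_lower, surrogate_is_lower)
```

One consequence worth noting: with full mappings, code that assumes len(s.upper()) == len(s) no longer holds for every string.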
