
Author belopolsky
Recipients belopolsky, eric.smith, ezio.melotti, lemburg, pitrou
Date 2010-11-25.05:01:37
Message-id <1290661300.37.0.975605523775.issue10521@psf.upfronthosting.co.za>
In-reply-to
Content
On Wed, Nov 24, 2010 at 3:37 PM, Marc-Andre Lemburg
<report@bugs.python.org> wrote:
..
> I don't think we should change that for the formatting methods.

That's a reasonable position.  What about

>>> unicodedata.category('\N{OLD ITALIC LETTER A}')
'Lo'
>>> '\N{OLD ITALIC LETTER A}'.isalpha()
False

The str.isalpha() method is underspecified in the reference manual,
but a comment in unicodectype.c describes Py_UNICODE_ISALPHA as
follows:

/* Returns 1 for Unicode characters having the category 'Ll', 'Lu', 'Lt',
   'Lo' or 'Lm', 0 otherwise. */

I don't have a wide build handy, but I am fairly sure that
'\N{OLD ITALIC LETTER A}'.isalpha() would return True there.  The result
above is simply a consequence of surrogates being considered non-letters:

>>> [c.isalpha() for c in '\N{OLD ITALIC LETTER A}']
[False, False]
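
The surrogate explanation can be checked without a narrow build.  The
sketch below is only an illustration, not code from the tracker: it
assumes a Python 3.3 or later interpreter, where a str holds whole code
points, and it imitates narrow-build storage by splitting the character
into UTF-16 code units.  The helper name utf16_code_units is
hypothetical.

```python
import unicodedata

ch = "\N{OLD ITALIC LETTER A}"  # U+10300, outside the BMP


def utf16_code_units(s):
    """Split s into UTF-16 code units (hypothetical helper).

    This imitates how a narrow build stored a non-BMP character:
    as a pair of surrogate code units.
    """
    data = s.encode("utf-16-be")
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]


units = utf16_code_units(ch)

# General category of each code unit taken on its own.
unit_categories = [unicodedata.category(chr(u)) for u in units]

# isalpha() applied to each lone surrogate, as in the list comprehension above.
unit_isalpha = [chr(u).isalpha() for u in units]

print([hex(u) for u in units])   # the two surrogate code units
print(unit_categories)           # surrogates have category 'Cs'
print(unit_isalpha)              # 'Cs' is not a letter category
print(unicodedata.category(ch), ch.isalpha())  # the whole code point
```

Each surrogate has category 'Cs', which is not among the letter
categories listed in the unicodectype.c comment, so a per-code-unit
check reports False even though the full code point is 'Lo'.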
History
Date                 User        Action  Args
2010-11-25 05:01:40  belopolsky  set     recipients: + belopolsky, lemburg, pitrou, eric.smith, ezio.melotti
2010-11-25 05:01:40  belopolsky  set     messageid: <1290661300.37.0.975605523775.issue10521@psf.upfronthosting.co.za>
2010-11-25 05:01:37  belopolsky  link    issue10521 messages
2010-11-25 05:01:37  belopolsky  create