Author ezio.melotti
Recipients ezio.melotti, gvanrossum, lemburg, loewis, mrabarnett, tchrist, terry.reedy
Date 2011-10-01.02:15:35
SpamBayes Score 1.84402e-07
Marked as misclassified No
Message-id <1317435343.47.0.967017479927.issue12753@psf.upfronthosting.co.za>
In-reply-to
Content
Attached a new patch that adds support for named sequences (still needs some test and can probably be improved).

> There are certainly advantages to that strategy: you don't have to
> deal with [\N{sequence}] issues.

I assume with [] you mean a regex character class, right?

> If the argument to unicode.lookup() and be any of name, alias, or 
> sequence, that seems ok. 

With my latest patch, all 3 are supported.

> \N{} should still do aliases, though, since those don't have the 
> complication that sequences have.

\N{} will only support names and aliases (maybe this can go in 2.7/3.2 too).

> You may wish unicode.name() to return the alias in preference,
> however. That's what we do.  And of course, there is no issue of 
> sequences there.

This can be done for 3.3, but I wonder if it might create problems.  People might use unicodedata.name() to get a name and use it elsewhere, and the other side might not be aware of aliases.
History
Date User Action Args
2011-10-01 02:15:43ezio.melottisetrecipients: + ezio.melotti, lemburg, gvanrossum, loewis, terry.reedy, mrabarnett, tchrist
2011-10-01 02:15:43ezio.melottisetmessageid: <1317435343.47.0.967017479927.issue12753@psf.upfronthosting.co.za>
2011-10-01 02:15:42ezio.melottilinkissue12753 messages
2011-10-01 02:15:42ezio.melotticreate