Message 144716 - Python tracker

Author	ezio.melotti
Recipients	ezio.melotti, gvanrossum, lemburg, loewis, mrabarnett, tchrist, terry.reedy
Date	2011-10-01.02:15:35
Message-id	<1317435343.47.0.967017479927.issue12753@psf.upfronthosting.co.za>

Content

Attached a new patch that adds support for named sequences (it still needs some tests and can probably be improved).

> There are certainly advantages to that strategy: you don't have to
> deal with [\N{sequence}] issues.

I assume that by [] you mean a regex character class, right?

> If the argument to unicode.lookup() can be any of name, alias, or
> sequence, that seems ok.

With my latest patch, all three are supported.
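As a rough illustration of what "all three" means, here is a minimal sketch. It assumes Python 3.3 or later, where unicodedata.lookup() accepts names, aliases and named sequences; the particular characters are just examples picked from the Unicode data files, not taken from the patch.

```python
import unicodedata

# Formal character name.
by_name = unicodedata.lookup("LATIN CAPITAL LETTER OI")

# Alias (a correction alias for the same code point, U+01A2).
by_alias = unicodedata.lookup("LATIN CAPITAL LETTER GHA")

# Named sequence: the result is a string of more than one code point.
by_sequence = unicodedata.lookup("LATIN SMALL LETTER R WITH TILDE")

print(ascii(by_name), ascii(by_alias), ascii(by_sequence))
```

The name and the alias resolve to the same single character, while the named sequence yields a two-character string ('r' followed by a combining tilde).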

> \N{} should still do aliases, though, since those don't have the 
> complication that sequences have.

\N{} will only support names and aliases (maybe this can go in 2.7/3.2 too).
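To make the difference concrete, here is a small sketch, again assuming Python 3.3 or later. The helper decode_escape() is a hypothetical name used only for this example; it runs the \N{...} escape through the unicode_escape codec at run time so the failure can be caught, instead of being a SyntaxError at compile time.

```python
# An alias works inside a \N{...} escape, just like a formal name.
alias_literal = "\N{LATIN CAPITAL LETTER GHA}"

def decode_escape(name):
    # Build the text \N{name} and decode it with the unicode_escape codec,
    # which resolves character names much like a string literal does.
    return ("\\N{%s}" % name).encode("ascii").decode("unicode_escape")

alias_decoded = decode_escape("LATIN CAPITAL LETTER GHA")

# A named sequence is not accepted by the escape.
try:
    decode_escape("LATIN SMALL LETTER R WITH TILDE")
    sequence_rejected = False
except UnicodeDecodeError:
    sequence_rejected = True

print(ascii(alias_literal), ascii(alias_decoded), sequence_rejected)
```

Keeping sequences out of \N{} means an escape always stands for exactly one code point, which sidesteps the character class question raised above.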

> You may wish unicode.name() to return the alias in preference,
> however. That's what we do.  And of course, there is no issue of 
> sequences there.

This can be done for 3.3, but I wonder if it might create problems. People might use unicodedata.name() to get a name and then pass it to some other system, and that system might not know about aliases.
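For reference, a short sketch of the round trip in question, assuming Python 3.3 or later. In the released versions I am aware of, unicodedata.name() returns the formal name, not the alias, so the output stays usable by consumers that only know formal names.

```python
import unicodedata

# Look the character up through its alias...
ch = unicodedata.lookup("LATIN CAPITAL LETTER GHA")

# ...and ask for its name: the formal name comes back, not the alias.
formal = unicodedata.name(ch)

# The formal name round-trips through lookup().
round_trip_ok = unicodedata.lookup(formal) == ch

print(formal, round_trip_ok)
```

If name() returned the alias instead, a receiver whose name table lacks aliases would fail to resolve it, which is the concern described above.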
