Message 339203 - Python tracker

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author	dscorbett
Recipients	docs@python, dscorbett, ezio.melotti, vstinner
Date	2019-03-30.14:41:13
SpamBayes Score	-1.0
Marked as misclassified	Yes
Message-id	<1553956873.19.0.268771747568.issue36486@roundup.psfhosted.org>
In-reply-to

Content
In `unicodedata`, the functions `lookup` and `name` have some bugs and inconsistencies. `lookup` matches case-insensitively, except for the algorithmic names of Hangul syllables and CJK unified ideographs, which must be in all caps. The documentation does not explain how character names are fuzzily matched. `lookup` accepts names like “CJK UNIFIED IDEOGRAPH-04E00”, where the code point has a leading zero. `lookup` and `name` don’t implement rule NR2, defined in chapter 4 of Unicode, for Tangut ideographs’ names.

In `unicodedata`, the functions `lookup` and `name` have some bugs and inconsistencies.

`lookup` matches case-insensitively, except for the algorithmic names of Hangul syllables and CJK unified ideographs, which must be in all caps. The documentation does not explain how character names are fuzzily matched.

`lookup` accepts names like “CJK UNIFIED IDEOGRAPH-04E00”, where the code point has a leading zero.

`lookup` and `name` don’t implement rule NR2, defined in chapter 4 of Unicode, for Tangut ideographs’ names.

History
Date	User	Action	Args
2019-03-30 14:41:13	dscorbett	set	recipients: + dscorbett, vstinner, ezio.melotti, docs@python
2019-03-30 14:41:13	dscorbett	set	messageid: <1553956873.19.0.268771747568.issue36486@roundup.psfhosted.org>
2019-03-30 14:41:13	dscorbett	link	issue36486 messages
2019-03-30 14:41:13	dscorbett	create