Message 339203
In `unicodedata`, the functions `lookup` and `name` have some bugs and inconsistencies.
`lookup` matches case-insensitively, except for the algorithmic names of Hangul syllables and CJK unified ideographs, which must be in all caps. The documentation does not explain how character names are fuzzily matched.
`lookup` accepts names like “CJK UNIFIED IDEOGRAPH-04E00”, where the code point has a leading zero.
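A small probe can show how `lookup` treats these spellings on a given interpreter. This is only a sketch: `try_lookup` and the `probes` list are hypothetical names introduced here, and the results for the lowercase algorithmic names and the leading-zero name may differ between Python versions, so no output is asserted.

```python
import unicodedata


def try_lookup(name):
    """Return the character for `name`, or None if lookup raises KeyError.

    Hypothetical helper for this report, not part of unicodedata.
    """
    try:
        return unicodedata.lookup(name)
    except KeyError:
        return None


# Pairs of canonical and variant spellings; which variants are accepted
# depends on the interpreter being tested.
probes = [
    "LATIN SMALL LETTER A",
    "latin small letter a",         # ordinary name, lowercase
    "HANGUL SYLLABLE GA",
    "hangul syllable ga",           # algorithmic Hangul name, lowercase
    "CJK UNIFIED IDEOGRAPH-4E00",
    "cjk unified ideograph-4e00",   # algorithmic CJK name, lowercase
    "CJK UNIFIED IDEOGRAPH-04E00",  # leading zero in the code point
]

for name in probes:
    print(f"{name!r:35} -> {try_lookup(name)!r}")
```

A `None` in the right-hand column means `lookup` raised `KeyError` for that spelling.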
`lookup` and `name` don’t implement rule NR2, defined in chapter 4 of the Unicode Standard, for the names of Tangut ideographs.
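For reference, rule NR2 derives a name by appending the code point, in uppercase hexadecimal with at least four digits, to a fixed prefix. The following is a rough sketch of that rule, not the `unicodedata` implementation: `nr2_name` is a hypothetical helper, and it does no range checking, since the set of Tangut code points depends on the Unicode version.

```python
import unicodedata


def nr2_name(prefix, code_point):
    """Build an NR2-style name: prefix + uppercase hex code point (min. 4 digits).

    Hypothetical helper; the caller is responsible for passing a code point
    that actually belongs to the range the prefix applies to.
    """
    return f"{prefix}{code_point:04X}"


# CJK unified ideographs already follow this rule in unicodedata.
assert nr2_name("CJK UNIFIED IDEOGRAPH-", 0x4E00) == unicodedata.name("\u4e00")

# U+17000 is the first Tangut ideograph. Compare the NR2 name with what
# this interpreter reports; name() returns the default (None) if it has no name.
expected = nr2_name("TANGUT IDEOGRAPH-", 0x17000)
actual = unicodedata.name(chr(0x17000), None)
print("expected:", expected)
print("actual:  ", actual)
```

If `actual` is `None` or differs from `expected`, the interpreter under test does not apply NR2 to Tangut ideographs.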

Date | User | Action | Args
2019-03-30 14:41:13 | dscorbett | set | recipients: + dscorbett, vstinner, ezio.melotti, docs@python
2019-03-30 14:41:13 | dscorbett | set | messageid: <1553956873.19.0.268771747568.issue36486@roundup.psfhosted.org>
2019-03-30 14:41:13 | dscorbett | link | issue36486 messages
2019-03-30 14:41:13 | dscorbett | create |