Message287077
unicodedata.normalize("NFC", ...) gives the wrong result for Hangul strings that contain \u1176 (HANGUL JUNGSEONG A-O).
>>> from unicodedata import normalize
>>> normalize("NFC", "\u1100\u1176\u11a8")
'깍'
=> The result should be "\u1100\u1176\u11a8" (unchanged), not '깍' (\uae4d), because \u1176 lies just past the last modern medial vowel (\u1175).
I have attached a patch for this issue. (It fixes the boundary check for modern medial vowels.)
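
For reference, here is a minimal pure-Python sketch of the Hangul composition arithmetic from the Unicode Standard, with the medial vowel range checked as a strict upper bound. It is only an illustration and not the C code in unicodedata or the attached patch; the helper name compose_hangul is hypothetical, and it assumes the bug is an off-by-one that lets vowel index 21 (\u1176) through.

```python
# Constants from the Unicode Standard's Hangul syllable composition arithmetic.
S_BASE, L_BASE, V_BASE, T_BASE = 0xAC00, 0x1100, 0x1161, 0x11A7
L_COUNT, V_COUNT, T_COUNT = 19, 21, 28
S_COUNT = L_COUNT * V_COUNT * T_COUNT


def compose_hangul(text):
    """Compose modern conjoining jamo into precomposed syllables.

    Hypothetical helper for illustration; it handles only the Hangul
    step of NFC, not the rest of the normalization algorithm.
    """
    out = []
    for ch in text:
        code = ord(ch)
        if out:
            last = ord(out[-1])
            l_index = last - L_BASE
            v_index = code - V_BASE
            # L + V -> LV. The vowel index must be strictly below V_COUNT,
            # so \u1176 (index 21) is rejected.
            if 0 <= l_index < L_COUNT and 0 <= v_index < V_COUNT:
                out[-1] = chr(S_BASE + (l_index * V_COUNT + v_index) * T_COUNT)
                continue
            s_index = last - S_BASE
            t_index = code - T_BASE
            # LV + T -> LVT. Only an LV syllable (no trailing consonant yet)
            # can take a trailing consonant.
            if (0 <= s_index < S_COUNT and s_index % T_COUNT == 0
                    and 0 < t_index < T_COUNT):
                out[-1] = chr(last + t_index)
                continue
        out.append(ch)
    return "".join(out)


# \u1176 is not a modern medial vowel, so nothing is composed.
print(ascii(compose_hangul("\u1100\u1176\u11a8")))
# \u1161 is a modern medial vowel, so the three jamo compose to one syllable.
print(ascii(compose_hangul("\u1100\u1161\u11a8")))
```

With a non-strict comparison (v_index <= V_COUNT), the first input would instead compose to \uae4d, which matches the wrong output reported above.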

Date | User | Action | Args
2017-02-06 04:27:52 | pusnow | set | recipients: + pusnow, vstinner, ezio.melotti
2017-02-06 04:27:52 | pusnow | set | messageid: <1486355272.29.0.0894514518656.issue29456@psf.upfronthosting.co.za>
2017-02-06 04:27:52 | pusnow | link | issue29456 messages
2017-02-06 04:27:51 | pusnow | create |