Message264706
I tested http://minaret.info/test/normalize.msp and compared its results with Python:
(1)
꾸ᆧ (afb8 11a7) --NFC or NFKC--> 꾸ᆧ (afb8, 11a7) === same as Python
꾸ᆧ (afb8 11a7) --NFD or NFKD--> 꾸ᆧ (1101 116e, 11a7) === same as Python
(2)
꾸ᆧ (1101 116e 11a7) --NFC or NFKC--> 꾸 (afb8) === same as Python
꾸ᆧ (1101 116e 11a7) --NFD or NFKD--> 꾸ᆧ (1101 116e, 11a7) === same as Python
(3)
꾸ᆧ㤺 (afb8 11a7 2f8a1) --NFC or NFKC--> 꾸ᆧ㤺 (afb8, 11a7, 393a) === DIFFERENT from Python: Python eats the U+11A7 character
꾸ᆧ㤺 (afb8 11a7 2f8a1) --NFD or NFKD--> 꾸ᆧ㤺 (1101 116e, 11a7, 393a) === same as Python
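
The Python side of this comparison can be reproduced with a short script along these lines. This is only a sketch: the helper names `codepoints` and `normalize_all` and the `CASES` table are made up for illustration, and the NFC/NFKC output for cases (2) and (3) depends on the Python version being run, so no expected output is given for those.

```python
import unicodedata


def codepoints(s):
    """Return the code points of s as space-separated lowercase hex."""
    return " ".join(f"{ord(ch):04x}" for ch in s)


def normalize_all(s):
    """Normalize s with all four forms; return {form: code points as hex}."""
    return {
        form: codepoints(unicodedata.normalize(form, s))
        for form in ("NFC", "NFKC", "NFD", "NFKD")
    }


# The three inputs tested above (labels are illustrative only).
CASES = {
    "(1)": "\uafb8\u11a7",
    "(2)": "\u1101\u116e\u11a7",
    "(3)": "\uafb8\u11a7\U0002f8a1",
}

if __name__ == "__main__":
    for label, text in CASES.items():
        print(label, codepoints(text))
        for form, result in normalize_all(text).items():
            print(f"  {form:<4} -> {result}")
```

Comparing the printed NFC/NFKC line for case (3) with the website result above shows whether a given interpreter still drops U+11A7.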

Date | User | Action | Args
2016-05-03 09:35:40 | vstinner | set | recipients: + vstinner, arigo, ezio.melotti
2016-05-03 09:35:40 | vstinner | set | messageid: <1462268140.51.0.974105555681.issue26917@psf.upfronthosting.co.za>
2016-05-03 09:35:40 | vstinner | link | issue26917 messages
2016-05-03 09:35:40 | vstinner | create |