Message93513
Amaury, you are absolutely correct, \ud801 is not a valid unicode glyph,
however I am not giving Python \ud801, I am giving Python '𐑑' (==
'\U00010451').
I am attaching a different short example that demonstrates that Python
is mishandling UTF-8 on both the interactive terminal and in scripts, u.py
The output should be the same, but on Python 3.1.1 compiled for wide
unicode it reports two different values. As someone on #python-dev
found '𐑑'.encode('utf-16').decode('utf-16') outputs the correct value. |
|
Date |
User |
Action |
Args |
2009-10-03 23:04:04 | ArcRiley | set | recipients:
+ ArcRiley, loewis, amaury.forgeotdarc, ezio.melotti |
2009-10-03 23:04:04 | ArcRiley | set | messageid: <1254611044.42.0.753938429452.issue7045@psf.upfronthosting.co.za> |
2009-10-03 23:04:02 | ArcRiley | link | issue7045 messages |
2009-10-03 23:04:02 | ArcRiley | create | |
|