This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author ArcRiley
Recipients ArcRiley
Date 2009-10-03.03:46:45
SpamBayes Score 1.110223e-15
Marked as misclassified No
Message-id <1254541613.35.0.521074293953.issue7045@psf.upfronthosting.co.za>
In-reply-to
Content
The following is a minimal example which does not work under Python
3.1.1 but functions as expected on Pyhton 2.6 and 3.0.

Python 3.1.1 believes the single UTF-8 glyph is two entirely different
(and illegal) unicode characters:

Traceback (most recent call last):
  File "t.py", line 2, in <module>
    print('𐑛')
UnicodeEncodeError: 'utf-8' codec can't encode character '\ud801' in
position 0: surrogates not allowed

Test system is Ubuntu 9.10-beta 32-bit
History
Date User Action Args
2009-10-03 03:46:54ArcRileysetrecipients: + ArcRiley
2009-10-03 03:46:53ArcRileysetmessageid: <1254541613.35.0.521074293953.issue7045@psf.upfronthosting.co.za>
2009-10-03 03:46:46ArcRileylinkissue7045 messages
2009-10-03 03:46:45ArcRileycreate