Author opstad
Recipients opstad
Date 2010-06-08.16:15:21
Message-id <1276013724.21.0.174818104611.issue8941@psf.upfronthosting.co.za>
In-reply-to
Content
The utf-32 little-endian codec works fine, but the big-endian codec produces incorrect results when decoding a code point outside the BMP:

Python 3.1.2 (r312:79360M, Mar 24 2010, 01:33:18) 
[GCC 4.0.1 (Apple Inc. build 5493)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> str(b'\x00\x00\x01\x00', 'utf-32le') # works
'\U00010000'
>>> str(b'\x00\x01\x00\x00', 'utf-32be') # doesn't work
'\ud800\x02'
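
For reference, here is a minimal sketch of what both calls should return. It decodes the same byte strings by hand with the struct module instead of the built-in codecs. The helper name decode_utf32 is hypothetical, chosen for illustration only; it is not part of the standard library and is not how the codec is implemented.

```python
import struct

def decode_utf32(data, byteorder):
    """Decode BOM-less UTF-32 by hand (hypothetical helper, illustration only).

    byteorder is 'little' or 'big'.
    """
    if len(data) % 4:
        raise ValueError("UTF-32 data must be a multiple of 4 bytes")
    # '<' selects little-endian, '>' big-endian; 'I' is an unsigned 32-bit int.
    prefix = "<" if byteorder == "little" else ">"
    code_points = struct.unpack(prefix + "I" * (len(data) // 4), data)
    return "".join(chr(cp) for cp in code_points)

le = decode_utf32(b'\x00\x00\x01\x00', 'little')
be = decode_utf32(b'\x00\x01\x00\x00', 'big')

# Both byte strings encode the single code point U+10000.
print(ascii(le), ascii(be), le == be)
```

Both inputs are the 32-bit value 0x00010000 in their respective byte orders, so a correct utf-32be decoder should give the same string as the utf-32le one.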
History
Date                 User    Action  Args
2010-06-08 16:15:24  opstad  set     recipients: + opstad
2010-06-08 16:15:24  opstad  set     messageid: <1276013724.21.0.174818104611.issue8941@psf.upfronthosting.co.za>
2010-06-08 16:15:22  opstad  link    issue8941 messages
2010-06-08 16:15:21  opstad  create