Message 107326 - Python tracker


Author	opstad
Recipients	opstad
Date	2010-06-08.16:15:21
Message-id	<1276013724.21.0.174818104611.issue8941@psf.upfronthosting.co.za>

Content

The utf-32 little-endian codec works fine, but the big-endian codec produces incorrect results when decoding U+10000:

Python 3.1.2 (r312:79360M, Mar 24 2010, 01:33:18) 
[GCC 4.0.1 (Apple Inc. build 5493)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> str(b'\x00\x00\x01\x00', 'utf-32le') # works
'\U00010000'
>>> str(b'\x00\x01\x00\x00', 'utf-32be') # doesn't work
'\ud800\x02'
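
The following is a minimal sketch of how the report could be turned into a self-checking script. It assumes an interpreter in which both codecs behave correctly; on the affected 3.1.2 build the big-endian assertion would be expected to fail. The variable names (`code_point`, `le_bytes`, `be_bytes`) are illustrative only and do not come from the original report.

```python
import struct

# U+10000 is the first code point outside the Basic Multilingual Plane.
code_point = 0x10000

# Build the two UTF-32 byte sequences directly from the integer value,
# so the test input does not depend on the codec under test.
le_bytes = struct.pack('<I', code_point)  # little-endian, no BOM
be_bytes = struct.pack('>I', code_point)  # big-endian, no BOM

# These are the same byte strings used in the session above.
assert le_bytes == b'\x00\x00\x01\x00'
assert be_bytes == b'\x00\x01\x00\x00'

decoded_le = le_bytes.decode('utf-32-le')
decoded_be = be_bytes.decode('utf-32-be')

# Both byte orders encode the same character, so both decodes should agree.
assert decoded_le == '\U00010000'
assert decoded_be == '\U00010000'
```

Packing the code point with `struct` keeps the expected bytes independent of the UTF-32 encoder, so a bug on the encoding side would not mask a bug on the decoding side.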

History
Date	User	Action	Args
2010-06-08 16:15:24	opstad	set	recipients: + opstad
2010-06-08 16:15:24	opstad	set	messageid: <1276013724.21.0.174818104611.issue8941@psf.upfronthosting.co.za>
2010-06-08 16:15:22	opstad	link	issue8941 messages
2010-06-08 16:15:21	opstad	create