
Author ezio.melotti
Recipients PeterL, ezio.melotti, pitrou
Date 2010-05-30.20:20:52
Message-id <1275250854.2.0.79229973458.issue8859@psf.upfronthosting.co.za>
Content
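I think the problem is the default encoding used when you call unicode() without specifying an encoding. Decoded as ISO-8859-1, the byte '\xa0' becomes U+00A0 (NO-BREAK SPACE), which split() treats as whitespace; decoded as UTF-8, the two bytes '\xc5\xa0' form the single character U+0160: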
>>> '\xc5\xa0'.decode('iso-8859-1').split()
[u'\xc5']
>>> '\xc5\xa0'.decode('utf-8').split()
[u'\u0160']
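For readers on Python 3, the same comparison can be sketched with a bytes object. This is only an illustration under the assumption that the input is the two-byte sequence shown above; the variable names are hypothetical and not taken from the original report.

```python
# Python 3 sketch; names such as `raw` and `as_latin1` are illustrative only.
raw = b'\xc5\xa0'  # the two bytes used in the session above

# ISO-8859-1 maps each byte to one character: U+00C5 and U+00A0.
as_latin1 = raw.decode('iso-8859-1')
# UTF-8 reads both bytes as a single character: U+0160.
as_utf8 = raw.decode('utf-8')

# str.split() with no argument splits on Unicode whitespace,
# and U+00A0 (NO-BREAK SPACE) counts as whitespace.
latin1_parts = as_latin1.split()
utf8_parts = as_utf8.split()

print(latin1_parts)
print(utf8_parts)
print(as_latin1[1].isspace())
```

Passing the encoding explicitly, rather than relying on a default, keeps the result independent of the interpreter's configuration.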