Author mrabarnett
Recipients gangesmaster, mark.dickinson, mrabarnett
Date 2012-12-23.18:12:33
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1356286353.8.0.380793055064.issue16741@psf.upfronthosting.co.za>
In-reply-to
Content
It occurred to me that the truncation of the string when building the error message could cause a UnicodeDecodeError:

>>> int("1".ljust(199) + "\u0100")
Traceback (most recent call last):
  File "<pyshell#0>", line 1, in <module>
    int("1".ljust(199) + "\u0100")
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xc4 in position 199: unexpected end of data

This is because it's truncating a UTF-8 string, and the truncation is in the middle of a multi-byte sequence.
History
Date User Action Args
2012-12-23 18:12:34mrabarnettsetrecipients: + mrabarnett, mark.dickinson, gangesmaster
2012-12-23 18:12:33mrabarnettsetmessageid: <1356286353.8.0.380793055064.issue16741@psf.upfronthosting.co.za>
2012-12-23 18:12:33mrabarnettlinkissue16741 messages
2012-12-23 18:12:33mrabarnettcreate