Author malin
Recipients ezio.melotti, hyeshik.chang, lemburg, loewis, malin, serhiy.storchaka, vstinner
Date 2015-05-05.00:52:19
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1430787140.94.0.235676634957.issue24117@psf.upfronthosting.co.za>
In-reply-to
Content
I found another bug in hz codec.
hz encoding uses 7-bit ASCII to represent Chinese characters, it was popular in USENET networks in the late 1980s and early 1990s.

I will do more check and fix them together, then I will invite you to review the patch.


u = 'hi~python'
b = u.encode('hz')   # bug in this step, the right sequence should be b"hi~~python"
print(b)    # the output is b"hi~python"

u = b.decode('hz')   # so can't decode, UnicodeDecodeError raised
print(u)
History
Date User Action Args
2015-05-05 00:52:21malinsetrecipients: + malin, lemburg, loewis, hyeshik.chang, vstinner, ezio.melotti, serhiy.storchaka
2015-05-05 00:52:20malinsetmessageid: <1430787140.94.0.235676634957.issue24117@psf.upfronthosting.co.za>
2015-05-05 00:52:20malinlinkissue24117 messages
2015-05-05 00:52:20malincreate