This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author vstinner
Recipients benjamin.peterson, doerwalter, ezio.melotti, larry, lemburg, loewis, pitrou, serhiy.storchaka, vstinner
Date 2014-02-09.10:53:13
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1391943194.27.0.134649517107.issue20571@psf.upfronthosting.co.za>
In-reply-to
Content
The test tries to decode a partial UTF-8 bytes string. The problem is that codecs.code_page_decode() doesn't implement fully partial decoders. The decoder only supports partial decoding for a few code pages: 932, 936, 949, 950, and 1361. The partial decoding is currently based on IsDBCSLeadByteEx():
http://msdn.microsoft.com/en-us/library/windows/desktop/dd318667%28v=vs.85%29.aspx

It may be possible to enhance decoders, but it's not a regression from Python 3.3 and so can be done in Python 3.5.

Please just skip failing tests for CP_UTF8 (cp 65001) and maybe other Windows code pages in test_codecs.

(I don't have time to write a patch to skip, sorry.)
History
Date User Action Args
2014-02-09 10:53:14vstinnersetrecipients: + vstinner, lemburg, loewis, doerwalter, pitrou, larry, benjamin.peterson, ezio.melotti, serhiy.storchaka
2014-02-09 10:53:14vstinnersetmessageid: <1391943194.27.0.134649517107.issue20571@psf.upfronthosting.co.za>
2014-02-09 10:53:14vstinnerlinkissue20571 messages
2014-02-09 10:53:13vstinnercreate