
Author serhiy.storchaka
Recipients ezio.melotti, serhiy.storchaka
Date 2013-01-16.10:46:44
Message-id <1358333205.49.0.474914292402.issue16979@psf.upfronthosting.co.za>
In-reply-to
Content
The error handler in unicode_escape_decode() consumes one or more extra bytes after an illegal escape sequence, so characters that follow the bad escape are missing from the output.

>>> import codecs
>>> codecs.unicode_escape_decode(br'\u!@#', 'replace')
('�', 5)
>>> codecs.unicode_escape_decode(br'\u!@#$', 'replace')
('�@#$', 6)

raw_unicode_escape_decode() handles the same input correctly and keeps every byte after the illegal escape:

>>> codecs.raw_unicode_escape_decode(br'\u!@#', 'replace')
('�!@#', 5)
>>> codecs.raw_unicode_escape_decode(br'\u!@#$', 'replace')
('�!@#$', 6)
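
To see which bytes each decoder attributes to the illegal escape, one option is to register a custom error handler that records the span it is given. This is only a sketch for inspection: the handler name 'record_span' and the helper probe() are made up for this example, and the spans printed may differ between Python versions depending on whether the fix is present.

```python
import codecs

# Spans (start, end) that the decoder reported as invalid during one call.
spans = []

def _record_span(exc):
    # Hypothetical handler: remember the reported span, substitute U+FFFD,
    # and ask the decoder to resume right after the span.
    if not isinstance(exc, UnicodeDecodeError):
        raise exc
    spans.append((exc.start, exc.end))
    return ('\ufffd', exc.end)

codecs.register_error('record_span', _record_span)

def probe(decode, data):
    # Run one decoder over data and return (text, consumed, reported spans).
    spans.clear()
    text, consumed = decode(data, 'record_span')
    return text, consumed, list(spans)

for decode in (codecs.unicode_escape_decode, codecs.raw_unicode_escape_decode):
    print(decode.__name__, probe(decode, br'\u!@#$'))
```

If the text returned by unicode_escape_decode() is missing characters that lie outside the reported span, the decoder skipped bytes after the error handler returned, which is the behaviour described above.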

See also issue16975.
History
Date                 User              Action  Args
2013-01-16 10:46:45  serhiy.storchaka  set     recipients: + serhiy.storchaka, ezio.melotti
2013-01-16 10:46:45  serhiy.storchaka  set     messageid: <1358333205.49.0.474914292402.issue16979@psf.upfronthosting.co.za>
2013-01-16 10:46:45  serhiy.storchaka  link    issue16979 messages
2013-01-16 10:46:44  serhiy.storchaka  create