Author serhiy.storchaka
Recipients Vyacheslav.Rafalskiy, serhiy.storchaka, vstinner
Date 2017-08-29.18:58:09
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1504033090.01.0.0289178744717.issue31303@psf.upfronthosting.co.za>
In-reply-to
Content
Simpler reproducer:

>>> import xml.etree.ElementTree
>>> xml.etree.ElementTree.XML(b'<key attr="' + b'x'*1023 + b'\xc3\xa0&quot;"/>')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/serhiy/py/cpython/Lib/xml/etree/ElementTree.py", line 1315, in XML
    parser.feed(text)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xc3 in position 1023: invalid continuation byte

Seems this is a regression in the Expat library.
History
Date User Action Args
2017-08-29 18:58:10serhiy.storchakasetrecipients: + serhiy.storchaka, vstinner, Vyacheslav.Rafalskiy
2017-08-29 18:58:10serhiy.storchakasetmessageid: <1504033090.01.0.0289178744717.issue31303@psf.upfronthosting.co.za>
2017-08-29 18:58:09serhiy.storchakalinkissue31303 messages
2017-08-29 18:58:09serhiy.storchakacreate