Author ezio.melotti
Recipients ezio.melotti, iko, r.david.murray
Date 2014-01-17.18:35:24
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1389983724.84.0.451305962633.issue20288@psf.upfronthosting.co.za>
In-reply-to
Content
Thanks for the report, this is indeed a bug.
This behavior was covered by a test (see Lib/test/test_htmlparser.py:164), but _run_check feeds the chars one by one to the parser, and in that case it works correctly.  While feeding the parser a whole chunk I was able to reproduce the bug.  This should be fixed, and the behavior of _run_check should probably be changed too -- maybe it could test both the char-by-char and the regular feeding.
History
Date User Action Args
2014-01-17 18:35:24ezio.melottisetrecipients: + ezio.melotti, iko, r.david.murray
2014-01-17 18:35:24ezio.melottisetmessageid: <1389983724.84.0.451305962633.issue20288@psf.upfronthosting.co.za>
2014-01-17 18:35:24ezio.melottilinkissue20288 messages
2014-01-17 18:35:24ezio.melotticreate