This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author ezio.melotti
Recipients eric.araujo, ezio.melotti
Date 2012-02-07.11:56:33
SpamBayes Score 0.009046208
Marked as misclassified No
Message-id <1328615796.36.0.0693404155355.issue13960@psf.upfronthosting.co.za>
In-reply-to
Content
html.parser fails to handle the following invalid comments:
<! foo >
<! bar -->
<! -- baz -->
The attached patch follows the HTML5 specs [0], and parses them as "bogus comments".  Currently the patch fixes the problem only when strict=False, but it might be better to make this the default behavior and apply it to 2.7 too.

[0]: http://www.w3.org/TR/html5/tokenization.html#bogus-comment-state
History
Date User Action Args
2012-02-07 11:56:36ezio.melottisetrecipients: + ezio.melotti, eric.araujo
2012-02-07 11:56:36ezio.melottisetmessageid: <1328615796.36.0.0693404155355.issue13960@psf.upfronthosting.co.za>
2012-02-07 11:56:35ezio.melottilinkissue13960 messages
2012-02-07 11:56:35ezio.melotticreate