This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author Matt.Basta
Recipients Hunanyan, Matt.Basta, cpalmer, eric.araujo, ezio.melotti, fantoozler, fdrake, friday, georg.brandl, gsf, momat, orsenthil, r.david.murray, yotam
Date 2011-07-27.03:24:51
SpamBayes Score 3.614195e-08
Marked as misclassified No
Message-id <1311737091.98.0.678032127976.issue670664@psf.upfronthosting.co.za>
In-reply-to
Content
The number of problems produced by this bug can be greatly reduced by adding a relatively small check to the parser. Currently, <script> and <style> tags call set_cdata_mode(), which sets self.interesting to HTMLParser.interesting_cdata. This is bad because it searches for ANY closing tag, rather than a closing tag which matches the opening tag.

Alexander's fix solved about half the problem, but it didn't handle ending tags as text. I've fixed this and added some tests.

This is my first patch, so if there's a better way that I could be submitting this, input would be appreciated.
History
Date User Action Args
2011-07-27 03:24:52Matt.Bastasetrecipients: + Matt.Basta, fdrake, georg.brandl, yotam, orsenthil, fantoozler, gsf, cpalmer, ezio.melotti, eric.araujo, r.david.murray, momat, Hunanyan, friday
2011-07-27 03:24:51Matt.Bastasetmessageid: <1311737091.98.0.678032127976.issue670664@psf.upfronthosting.co.za>
2011-07-27 03:24:51Matt.Bastalinkissue670664 messages
2011-07-27 03:24:51Matt.Bastacreate