Author Michel.Leunen
Recipients Michel.Leunen
Date 2012-04-09.20:01:55
Message-id <1334001716.14.0.206404588397.issue14538@psf.upfronthosting.co.za>
In-reply-to
Content
HTMLParser fails to parse this structure of tags: 

'<a></a><script></script><meta><meta / ><body></body>'

Parsing stops after the first meta tag, ignoring the remainder of the input.

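from HTMLParser import HTMLParser
# process_html is presumably the reporter's own HTMLParser subclass;
# its definition is not included in this report.
parser = process_html()
parser.feed('<a></a><script></script><meta><meta / ><body></body>')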

Python 2.7.2+ Ubuntu 11.10
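
Since the report does not include the definition of process_html, here is a minimal sketch of a self-contained reproduction. TagLogger is a hypothetical stand-in, not the reporter's class; it only records the start and end tags the parser reports, so it is easy to see where parsing stops. The import fallback is an assumption added so the same script can be tried on both Python 2 and Python 3.

```python
try:
    from html.parser import HTMLParser  # Python 3 module name
except ImportError:
    from HTMLParser import HTMLParser   # Python 2 module name, as in the report


class TagLogger(HTMLParser):
    """Hypothetical stand-in for process_html: records each tag event."""

    def __init__(self):
        HTMLParser.__init__(self)
        self.events = []

    def handle_starttag(self, tag, attrs):
        self.events.append(("start", tag))

    def handle_endtag(self, tag):
        self.events.append(("end", tag))


parser = TagLogger()
# Same input as in the report; note the '<meta / >' with a space before '>'.
parser.feed('<a></a><script></script><meta><meta / ><body></body>')

# Print every tag event the parser actually delivered.
for kind, tag in parser.events:
    print("%s %s" % (kind, tag))
```

Going by the description above, an affected interpreter would be expected to print nothing after the first meta start tag, whereas a parser that handles '<meta / >' should also report the body start and end tags.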
History
Date                 User           Action  Args
2012-04-09 20:01:56  Michel.Leunen  set     recipients: + Michel.Leunen
2012-04-09 20:01:56  Michel.Leunen  set     messageid: <1334001716.14.0.206404588397.issue14538@psf.upfronthosting.co.za>
2012-04-09 20:01:55  Michel.Leunen  link    issue14538 messages
2012-04-09 20:01:55  Michel.Leunen  create