Message153043
The attached patch fixes a few problems with HTMLParser on 2.7.
Instead of raising error when invalid markup is detected, the parser now consumes the invalid input and proceeds. This patch is a partial backport of #1486713.
After this two more patches will follow.
The first will get rid of errors raised while parsing declarations and should also solve #13576:
def unknown_decl(self, data):
- self.error("unknown declaration: %r" % (data,))
+ pass
The second will take care of "bogus comments" (see #13960).
Once this is done HTMLParser should be able to parse (almost) everything. I'm planning to commit this before the release of 2.7.3. |
|
Date |
User |
Action |
Args |
2012-02-10 13:45:58 | ezio.melotti | set | recipients:
+ ezio.melotti, benjamin.peterson, eric.araujo, r.david.murray |
2012-02-10 13:45:58 | ezio.melotti | set | messageid: <1328881558.78.0.634398241399.issue13987@psf.upfronthosting.co.za> |
2012-02-10 13:45:58 | ezio.melotti | link | issue13987 messages |
2012-02-10 13:45:57 | ezio.melotti | create | |
|