You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
bpo-1486713: HTMLParser : A auto-tolerant parsing mode
Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.
Show more details
GitHub fields:
assignee=Noneclosed_at=<Date2010-12-03.04:14:13.145>created_at=<Date2004-06-18.19:33:18.000>labels= ['type-feature', 'library']
title='HTMLParser lukewarm on bogus bare attribute chars'updated_at=<Date2010-12-03.04:14:13.143>user='https://bugs.python.org/mkc'
I tripped over the same problem mentioned in bug bpo-921657 (HTMLParser.py), except that my bogus attribute
char is '|' instead of '@'.
May I suggest that HTMLParser either require strict
compliance with the HTML spec, or alternatively that it
accept everything reasonable? The latter approach
would be much more useful, and it would also be
valuable to have this decision documented.
In particular, 'attrfind' needs to be changed to accept
(following the '=\s*') something like the subpattern
given for 'locatestarttagend' (see the "bare value" line).
Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.
Show more details
GitHub fields:
bugs.python.org fields:
The text was updated successfully, but these errors were encountered: