Message146774
I think <x><y z=""o"" /></x> should be parsed as <x><y z="" /></x>, and the o"" should be ignored.
<x><y z="""" /></x> should be parsed as <x><y z="" /></x>, and the last two "" should be ignored. This is what Firefox seems to do.
Currently the parser doesn't seem to handle extraneous data in the start tag too well, because the locatestarttagend_tolerant regex looks for (more or less) well-formed attributes.
Attached a patch for test_htmlparser with the two examples provided by Kevin.
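For anyone who wants to try the two inputs without applying the patch, here is a minimal sketch. It is not the attached test, only an illustration: EventCollector and collect are hypothetical names made up for this example, and only the public html.parser.HTMLParser API is used. It records the start and end tags, so you can see whether the parser gets as far as </x>. How the extraneous o"" and "" are reported (ignored, or returned as extra attributes) depends on the Python version, so no expected output is given.

```python
from html.parser import HTMLParser


class EventCollector(HTMLParser):
    """Hypothetical helper: record tag events to see how far parsing got."""

    def __init__(self):
        super().__init__()
        self.events = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs as seen by the parser
        self.events.append(("starttag", tag, attrs))

    def handle_endtag(self, tag):
        self.events.append(("endtag", tag))


def collect(source):
    """Feed one document to a fresh parser and return the recorded events."""
    parser = EventCollector()
    parser.feed(source)
    parser.close()
    return parser.events


if __name__ == "__main__":
    # The two malformed start tags discussed in this message.
    for sample in ('<x><y z=""o"" /></x>', '<x><y z="""" /></x>'):
        print(sample)
        for event in collect(sample):
            print("   ", event)
```

If the parser stops at the malformed start tag, the events for y and the closing x will be missing from the output.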
Date | User | Action | Args
2011-11-01 13:21:42 | ezio.melotti | set | recipients: + ezio.melotti, eric.araujo, r.david.murray, teoryn
2011-11-01 13:21:42 | ezio.melotti | set | messageid: <1320153702.19.0.00632884125491.issue12629@psf.upfronthosting.co.za>
2011-11-01 13:21:41 | ezio.melotti | link | issue12629 messages
2011-11-01 13:21:41 | ezio.melotti | create |