Author tungwaiyip
Recipients
Date 2005-02-15.07:04:13
SpamBayes Score
Marked as misclassified
Message-id
In-reply-to
Content
When parsing the document below using sgmllib:

<html>
<!-BAD COMMENT->hello
</html>

The incorrect declaration is returned with hello as one 
single character data:

  "<!-BAD COMMENT->hello"

markupbase should have treated it as an error (to be 
consistent with it strict treatment in _scan_name).

I believe the line 73 of markupbase.py should be

        if rawdata[j:j+2] in ("-", ""):

intead of 

        if rawdata[j:j+1] in ("-", ""):

Also note that the condition in line 79 will not be true

    if rawdata[j:j+1] == '--'
History
Date User Action Args
2008-01-20 09:57:29adminlinkissue1122916 messages
2008-01-20 09:57:29admincreate