
Author xkjq
Recipients xkjq
Date 2015-01-01.18:47:07
Message-id <1420138028.63.0.996553076818.issue23144@psf.upfronthosting.co.za>
Content
If convert_charrefs is set to True, the final data section (the text after the last tag) is not returned by feed(). It is held until the next tag is encountered.

---
from html.parser import HTMLParser

class MyHTMLParser(HTMLParser):
    def __init__(self):
        super().__init__(convert_charrefs=True)
    def handle_starttag(self, tag, attrs):
        print("Encountered a start tag:", tag)
    def handle_endtag(self, tag):
        print("Encountered an end tag :", tag)
    def handle_data(self, data):
        print("Encountered some data  :", data)

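parser = MyHTMLParser()

# First call: the trailing " bar" is not reported here.
parser.feed("foo <a>link</a> bar")
print("")
# Second call: the held-back " bar" comes out merged with "spam ".
parser.feed("spam <a>link</a> eggs")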

---

gives

Encountered some data  : foo 
Encountered a start tag: a
Encountered some data  : link
Encountered an end tag : a

Encountered some data  :  barspam 
Encountered a start tag: a
Encountered some data  : link
Encountered an end tag : a


With 'convert_charrefs = False' it works as expected.
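
A possible workaround, sketched here under the assumption that the held-back text is only buffered and not lost: call close() after the last feed(), which is documented to force processing of all buffered data. CollectingParser and its chunks attribute are hypothetical names used for illustration; they are not part of html.parser.

```python
from html.parser import HTMLParser


class CollectingParser(HTMLParser):
    """Hypothetical helper: records every chunk passed to handle_data()."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


parser = CollectingParser()
parser.feed("foo <a>link</a> bar")
# On affected versions the trailing " bar" may still be buffered at this point.
parser.close()  # force processing of any buffered data

# Joining the chunks gives the full text regardless of when each chunk arrived.
text = "".join(parser.chunks)
print(repr(text))  # 'foo link bar'
```

This only helps when the input really is complete. If more data for the same document is still to come, the text between the two feed() calls may be reported in pieces that differ from one Python version to another, so comparing the joined text is safer than comparing individual chunks.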
History
Date                 User  Action  Args
2015-01-01 18:47:08  xkjq  set     recipients: + xkjq
2015-01-01 18:47:08  xkjq  set     messageid: <1420138028.63.0.996553076818.issue23144@psf.upfronthosting.co.za>
2015-01-01 18:47:08  xkjq  link    issue23144 messages
2015-01-01 18:47:07  xkjq  create