Author ezio.melotti
Recipients eric.araujo, ezio.melotti, r.david.murray
Date 2012-06-24.02:45:42
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1340505943.38.0.45190182563.issue15156@psf.upfronthosting.co.za>
In-reply-to
Content
HTMLParser has an internal method called unescape [0] used to convert named character references to the equivalent characters, and it does so by using html.entities.name2codepoint to recreate the equivalent of html.entities.entityrefs with the addition of &apos;.
Now that the html5 entities have been added to html.entities, the parser should use them instead of name2codepoint.

[0]: see Lib/html/parser.py:500
History
Date User Action Args
2012-06-24 02:45:43ezio.melottisetrecipients: + ezio.melotti, eric.araujo, r.david.murray
2012-06-24 02:45:43ezio.melottisetmessageid: <1340505943.38.0.45190182563.issue15156@psf.upfronthosting.co.za>
2012-06-24 02:45:42ezio.melottilinkissue15156 messages
2012-06-24 02:45:42ezio.melotticreate