Message 163702 - Python tracker


Author	ezio.melotti
Recipients	eric.araujo, ezio.melotti, r.david.murray
Date	2012-06-24.02:45:42
Message-id	<1340505943.38.0.45190182563.issue15156@psf.upfronthosting.co.za>

Content
HTMLParser has an internal method called unescape [0] that converts named character references to the equivalent characters. It does so by using html.entities.name2codepoint to recreate the equivalent of html.entities.entitydefs, with the addition of &apos;.
Now that the HTML5 entities have been added to html.entities, the parser should use them instead of name2codepoint.

[0]: see Lib/html/parser.py:500
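
A rough sketch of the difference, not the actual parser code: build_legacy_table and lookup_html5 are hypothetical helper names used only for illustration. The first rebuilds a name-to-character table from name2codepoint and adds apos by hand, as described above; the second looks the name up in html.entities.html5, whose keys include the trailing semicolon.

```python
from html.entities import html5, name2codepoint


def build_legacy_table():
    """Hypothetical helper: rebuild a name -> character table from
    name2codepoint and add 'apos', which the HTML 4 table lacks."""
    table = {name: chr(codepoint) for name, codepoint in name2codepoint.items()}
    table["apos"] = "'"
    return table


def lookup_html5(name):
    """Hypothetical helper: look up a reference name (without '&' and ';')
    in the HTML5 table. Returns None when the name is unknown."""
    return html5.get(name + ";")


legacy = build_legacy_table()

# 'apos' has to be patched into the legacy table by hand...
print("apos" in name2codepoint)   # False
print(legacy["apos"])             # '

# ...whereas the HTML5 table already defines it.
print(lookup_html5("apos"))       # '

# The HTML5 table also covers many names the legacy table does not know.
html5_only = [key for key in html5 if key.rstrip(";") not in legacy]
print(len(legacy), len(html5), len(html5_only))
```

The html5 dict maps names directly to strings rather than to code points, so a parser that switches to it would no longer need the chr() step or the manual apos entry.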

History
Date	User	Action	Args
2012-06-24 02:45:43	ezio.melotti	set	recipients: + ezio.melotti, eric.araujo, r.david.murray
2012-06-24 02:45:43	ezio.melotti	set	messageid: <1340505943.38.0.45190182563.issue15156@psf.upfronthosting.co.za>
2012-06-24 02:45:42	ezio.melotti	link	issue15156 messages
2012-06-24 02:45:42	ezio.melotti	create