This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author christian.heimes
Recipients christian.heimes, corona10, hongweipeng, Валентин Dreyk
Date 2020-06-19.07:33:54
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1592552035.04.0.632406147313.issue40873@roundup.psfhosted.org>
In-reply-to
Content
According to https://en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_references#cite_ref-semicolon_1-64 the trailing semicolon can be omitted for the named entity "reg". That means "&reg" and "&reg;" are equivalent.

saxutils.unescape() only handles '<', '>', and '&' by default. You have to pass in a dictionary to unescape other entities.
History
Date User Action Args
2020-06-19 07:33:55christian.heimessetrecipients: + christian.heimes, corona10, hongweipeng, Валентин Dreyk
2020-06-19 07:33:55christian.heimessetmessageid: <1592552035.04.0.632406147313.issue40873@roundup.psfhosted.org>
2020-06-19 07:33:55christian.heimeslinkissue40873 messages
2020-06-19 07:33:54christian.heimescreate