This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author liudongmiao@gmail.com
Recipients liudongmiao@gmail.com
Date 2009-07-31.07:45:51
SpamBayes Score 0.00011989395
Marked as misclassified No
Message-id <1249026354.39.0.530341625498.issue6611@psf.upfronthosting.co.za>
In-reply-to
Content
HTMLParser (Python 2.6.2) Cannot deal with mixture of arbitrary data and
character reference. 

In line 365-373, replaceEntities(s) returns unichr(charref) in unicode,
which cannot be a mixture with arbitrary data in str.

A fix way: replace unichr(c) with unichr(c).encode('utf-8').
History
Date User Action Args
2009-07-31 07:45:54liudongmiao@gmail.comsetrecipients: + liudongmiao@gmail.com
2009-07-31 07:45:54liudongmiao@gmail.comsetmessageid: <1249026354.39.0.530341625498.issue6611@psf.upfronthosting.co.za>
2009-07-31 07:45:52liudongmiao@gmail.comlinkissue6611 messages
2009-07-31 07:45:52liudongmiao@gmail.comcreate