This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author ezio.melotti
Recipients eric.araujo, ezio.melotti, rednaks
Date 2012-03-13.00:11:30
SpamBayes Score 2.7137295e-09
Marked as misclassified No
Message-id <1331597491.43.0.934801601049.issue14251@psf.upfronthosting.co.za>
In-reply-to
Content
I test this again and indeed a bare s.decode() is not enough to fix the problem.  The attribute might contain non-ascii characters, and that will result in an error (see for example the "test.py" script attached to #3932).  The correct solution is to decode the page before passing it to the parser.
History
Date User Action Args
2012-03-13 00:11:31ezio.melottisetrecipients: + ezio.melotti, eric.araujo, rednaks
2012-03-13 00:11:31ezio.melottisetmessageid: <1331597491.43.0.934801601049.issue14251@psf.upfronthosting.co.za>
2012-03-13 00:11:30ezio.melottilinkissue14251 messages
2012-03-13 00:11:30ezio.melotticreate