This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author r.david.murray
Recipients r.david.murray, wrohdewald
Date 2014-10-27.19:54:32
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1414439672.9.0.023281877548.issue22746@psf.upfronthosting.co.za>
In-reply-to
Content
If you look at the file, you'll find that the data is in utf-8 (at least if your locale is a utf-8 locale).  However, html is by default interpreted as latin-1, so that's what the webrowser displays when you pass the file on disk to it.  If you add "encoding='latin-1'" to your open call, your script will work.  What you do if you need to display non-latin1 characters, I don't know.  (See https://bugzil.la/760050, for example).

Note: the above is for python3.  I don't remember how you do the equivalent in python2...a naive codecs.open call just got me a UnicodeDecodeError.
History
Date User Action Args
2014-10-27 19:54:32r.david.murraysetrecipients: + r.david.murray, wrohdewald
2014-10-27 19:54:32r.david.murraysetmessageid: <1414439672.9.0.023281877548.issue22746@psf.upfronthosting.co.za>
2014-10-27 19:54:32r.david.murraylinkissue22746 messages
2014-10-27 19:54:32r.david.murraycreate