This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author rurwin
Recipients amaury.forgeotdarc, bugok, effbot, nnorwitz, rurwin
Date 2008-11-14.17:32:12
SpamBayes Score 3.12686e-10
Marked as misclassified No
Message-id <1226683937.45.0.657283928936.issue1767933@psf.upfronthosting.co.za>
In-reply-to
Content
Here is a patch of my quick hack, more for interest than any suggestion
it gets used. Although it does produce good output so long as you avoid
the BOM.

The full solution is beyond my (very weak) Python skills. The character
encoding is tied in with XML character substitution (&amp; etc. and
hexadecimal representation of multibyte characters). I could disentangle
it, but I probably wouldn't produce optimal Python, or indeed anything
that wouldn't inspire mirth and/or incredulity.

NB. The workaround suggested by Fredrik Lundh doesn't solve our
particular problems, since the downsize to UTF-8 causes the multi-byte
characters to be represented in hex. Our software doesn't read those. (I
know that's our problem.)
History
Date User Action Args
2008-11-14 17:32:17rurwinsetrecipients: + rurwin, effbot, nnorwitz, amaury.forgeotdarc, bugok
2008-11-14 17:32:17rurwinsetmessageid: <1226683937.45.0.657283928936.issue1767933@psf.upfronthosting.co.za>
2008-11-14 17:32:16rurwinlinkissue1767933 messages
2008-11-14 17:32:15rurwincreate