Author loewis
Recipients lars.gustaebel, lemburg, loewis, vstinner
Date 2010-06-10.18:40:55
Message-id <4C1131B6.6080208@v.loewis.de>
In-reply-to <4C101F12.7090102@egenix.com>
Content
>> 7-zip encodes "à" (U+00e0) as 0x85 (1 byte), and "é" (U+00e9) as 0x82 (1 byte). I don't know this encoding.
>
> That's an old DOS code page used in Europe: CP850

There is a good chance that they use it because it is the OEM code page 
on the system.
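
The byte values quoted above can be checked against Python's own codecs. This is only a quick sketch for comparison; the choice of candidate code pages is an assumption made for illustration, and it says nothing about what 7-zip does internally.

```python
# Encode the two characters from the report in a few candidate encodings.
# The list of codecs is illustrative, not taken from 7-zip.
samples = ["à", "é"]
candidates = ("cp850", "cp437", "cp1252", "utf-8")

encoded = {
    codec: [ch.encode(codec).hex() for ch in samples]
    for codec in candidates
}

for codec, values in encoded.items():
    print(codec, values)

# cp850 yields 85 and 82, the single bytes reported for 7-zip.
```

Note that cp437 produces the same two bytes for these characters, so these samples alone cannot tell the two DOS code pages apart.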

In any case, I think that both cp850 and cp1252 are inherently incorrect 
for tarfiles (despite these tools using them). tar is a POSIX thing, and 
these encodings have nothing to do with POSIX.

So using UTF-8 is a reasonable choice, IMO. The other reasonable choice 
would be ASCII.
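
To illustrate why the choice matters, here is a minimal sketch using the tarfile module's encoding argument on an in-memory archive. The file name and payload are made up for the example, and this is not the change proposed in the issue; it only shows that a name written as UTF-8 reads back correctly with UTF-8 and is misread with cp850.

```python
import io
import tarfile

# Hypothetical member name and contents, chosen only for illustration.
name = "é.txt"
payload = b"hello"

# Write a GNU-format archive in memory, storing the name as UTF-8 bytes.
buf = io.BytesIO()
with tarfile.open(fileobj=buf, mode="w",
                  format=tarfile.GNU_FORMAT, encoding="utf-8") as tar:
    info = tarfile.TarInfo(name)
    info.size = len(payload)
    tar.addfile(info, io.BytesIO(payload))


def member_names(data, encoding):
    """Return the member names of an archive, decoded with `encoding`."""
    with tarfile.open(fileobj=io.BytesIO(data), mode="r",
                      encoding=encoding) as tar:
        return tar.getnames()


utf8_names = member_names(buf.getvalue(), "utf-8")
cp850_names = member_names(buf.getvalue(), "cp850")

print(utf8_names)   # the original name round-trips
print(cp850_names)  # the same bytes decoded as cp850 give a different name
```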
History
Date                 User    Action  Args
2010-06-10 18:40:58  loewis  set     recipients: + loewis, lemburg, lars.gustaebel, vstinner
2010-06-10 18:40:56  loewis  link    issue8784 messages
2010-06-10 18:40:55  loewis  create