This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author vstinner
Recipients alanmcintyre, vstinner
Date 2011-01-21.12:07:08
SpamBayes Score 0.00067192456
Marked as misclassified No
Message-id <1295611629.36.0.303977621119.issue10972@psf.upfronthosting.co.za>
In-reply-to
Content
7zip and WinRAR uses the same algorithm than ZipFile._encodeFilename(): try cp437 or use UTF-8. Eg. if a filename contains ∞ (U+221E), it is encoded to UTF-8.

WinZIP encodes all filenames to cp437: ∞ (U+221E) is replaced by 8 (U+0038), ☺ (U+263A) is replaced by... U+0001!

7zip, WinRAR and WinZIP are able to decode UTF-8 filenames (handle correctly the unicode flag).
History
Date User Action Args
2011-01-21 12:07:09vstinnersetrecipients: + vstinner, alanmcintyre
2011-01-21 12:07:09vstinnersetmessageid: <1295611629.36.0.303977621119.issue10972@psf.upfronthosting.co.za>
2011-01-21 12:07:08vstinnerlinkissue10972 messages
2011-01-21 12:07:08vstinnercreate