This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author dholth
Recipients dholth, docs@python, r.david.murray, serhiy.storchaka, terry.reedy
Date 2016-06-23.15:46:23
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1466696784.31.0.0667812978991.issue27344@psf.upfronthosting.co.za>
In-reply-to
Content
The current documentation says "Note There is no official file name encoding for ZIP files. If you have unicode file names, you must convert them to byte strings in your desired encoding before passing them to write(). WinZip interprets all file names as encoded in CP437, also known as DOS Latin."

This is bad advice because if you convert the filenames to bytes before passing them to zipfile, it won't remember that they should be unicode. Instead it should say

"The ZIP file format supports Unicode filenames. If you have unicode filenames, zipfile will encode them to and from utf-8 internally. If you pass bytes filenames to write() then they will be stored without a specified encoding."

I am not sure what current versions of WinZip or Windows file manager do.
History
Date User Action Args
2016-06-23 15:46:24dholthsetrecipients: + dholth, terry.reedy, r.david.murray, docs@python, serhiy.storchaka
2016-06-23 15:46:24dholthsetmessageid: <1466696784.31.0.0667812978991.issue27344@psf.upfronthosting.co.za>
2016-06-23 15:46:24dholthlinkissue27344 messages
2016-06-23 15:46:23dholthcreate