This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author loewis
Recipients loewis, monson
Date 2012-08-09.08:47:48
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1344502069.88.0.874625002017.issue15602@psf.upfronthosting.co.za>
In-reply-to
Content
You are mistaken: there *is* a character set specification for file names in zip files, see

http://www.pkware.com/documents/casestudies/APPNOTE.TXT

Appendix D says

"The ZIP format has historically supported only the original IBM PC character encoding set, commonly referred to as IBM Code Page 437.  This limits storing file name characters to only those within the original MS-DOS range of values and does not properly support file names in other character encodings, or languages."

Using bytes objects for file names is not acceptable; in Python 3, file names are (unicode) strings.

Adding a new parameter is an option, and already discussed in issue 10614 .

People using non-437 code sets should really start using UTF-8 encoded file names in the zip files, and set the general purpose bit 11.

Closing this report as a duplicate.
History
Date User Action Args
2012-08-09 08:47:49loewissetrecipients: + loewis, monson
2012-08-09 08:47:49loewissetmessageid: <1344502069.88.0.874625002017.issue15602@psf.upfronthosting.co.za>
2012-08-09 08:47:49loewislinkissue15602 messages
2012-08-09 08:47:48loewiscreate