
Author ivan.sorokin.tech
Recipients ivan.sorokin.tech
Date 2020-10-04.11:21:41
Message-id <1601810501.41.0.603959888811.issue41928@roundup.psfhosted.org>
In-reply-to
Content
See the attached sample. The well-known unzip command-line tool lists its contents correctly:

$ unzip -l 23.zip
Archive:  23.zip
  Length      Date    Time    Name
---------  ---------- -----   ----
    81408  2012-10-23 19:03   Β' ΦΑΣΗ ΠΕ06 ΣΧΟΛΕΙΑ ΕΑΕΠ (ΙΝΤ).xls
---------                     -------
    81408                     1 file

But ZipFile lists the same file inside this archive as:
ü' öÇæå Åä06 æòÄèäêÇ äÇäÅ (êîÆ).xls

This happens because ZipFile completely ignores the Unicode Path Extra Field (header ID 0x7075) in the zip header, so the name is decoded from the legacy filename field instead.

See the .ZIP specification for details on this field's meaning and usage:
https://pkware.cachefly.net/webdocs/casestudies/APPNOTE.TXT
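
For illustration, here is a minimal sketch of how the 0x7075 field could be read from the raw extra data that ZipFile already exposes. It is not a proposed patch. The helper names unicode_path_from_extra and list_names are hypothetical, and the sketch assumes the field layout as I read it in the specification (1-byte version, 4-byte CRC-32 of the legacy name, then the UTF-8 name) and that ZipFile decoded the legacy name as cp437.

```python
import struct
import zipfile
import zlib

UNICODE_PATH_ID = 0x7075  # header ID of the Unicode Path Extra Field


def unicode_path_from_extra(extra, raw_name):
    """Return the UTF-8 name from a 0x7075 extra field, or None.

    Hypothetical helper. `extra` is the raw extra data of one entry,
    `raw_name` is the undecoded legacy filename (bytes).
    """
    pos = 0
    while pos + 4 <= len(extra):
        header_id, size = struct.unpack_from("<HH", extra, pos)
        data = extra[pos + 4:pos + 4 + size]
        pos += 4 + size
        # Skip other fields and truncated or too-short payloads.
        if header_id != UNICODE_PATH_ID or len(data) != size or size < 5:
            continue
        version, name_crc = struct.unpack_from("<BL", data, 0)
        # The stored CRC-32 must match the legacy name, otherwise the
        # field is stale and should be ignored.
        if version != 1 or name_crc != zlib.crc32(raw_name):
            continue
        try:
            return data[5:].decode("utf-8")
        except UnicodeDecodeError:
            return None
    return None


def list_names(path):
    """Hypothetical helper: entry names, preferring the Unicode path."""
    names = []
    with zipfile.ZipFile(path) as zf:
        for info in zf.infolist():
            name = info.filename
            if not info.flag_bits & 0x800:  # legacy name is not UTF-8
                try:
                    # Assumption: the name was decoded as cp437.
                    raw = info.orig_filename.encode("cp437")
                except UnicodeEncodeError:
                    raw = None
                if raw is not None:
                    name = unicode_path_from_extra(info.extra, raw) or name
            names.append(name)
    return names
```

Calling list_names("23.zip") on the attached sample would be the way to try it. The CRC check is there so that a stale Unicode name is ignored if another tool rewrote the legacy name without updating the extra field.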
History
Date                 User               Action  Args
2020-10-04 11:21:41  ivan.sorokin.tech  set     recipients: + ivan.sorokin.tech
2020-10-04 11:21:41  ivan.sorokin.tech  set     messageid: <1601810501.41.0.603959888811.issue41928@roundup.psfhosted.org>
2020-10-04 11:21:41  ivan.sorokin.tech  link    issue41928 messages
2020-10-04 11:21:41  ivan.sorokin.tech  create