Author serhiy.storchaka
Recipients Steffen Kampmann, jaraco, paul.moore, serhiy.storchaka, steve.dower, tim.golden, vstinner, zach.ware
Date 2016-07-30.04:55:51
Message-id <1469854552.06.0.373693637864.issue24672@psf.upfronthosting.co.za>
Content
See also issue16700.

On Windows there are two sets of APIs: Unicode and bytes. Modern filesystems store file names as Unicode (UTF-16), and the system encodes them to bytes for the bytes API. Unfortunately, this encoding is lossy. If a character cannot be encoded in the current code page, Windows tries to find the closest equivalent (for example, by dropping diacritics) and silently replaces the character with "?" if it cannot find anything appropriate. We can't do anything about this from the Python side except use the Unicode API.
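
To make the loss concrete, here is a rough, platform-independent sketch of the behaviour described above. It does not call any Windows API; the helper name approximate_best_fit, the choice of cp1252 as the "current code page", and the diacritic-stripping rule are all assumptions made for illustration. The real best-fit tables used by Windows differ in detail.

```python
import unicodedata


def approximate_best_fit(name: str, codepage: str = "cp1252") -> bytes:
    """Hypothetical helper: imitate a lossy Unicode-to-code-page conversion.

    This only approximates what the message describes; it is not the
    algorithm Windows actually uses.
    """
    out = bytearray()
    for ch in name:
        try:
            # Character exists in the code page: encode it unchanged.
            out += ch.encode(codepage)
            continue
        except UnicodeEncodeError:
            pass
        # Otherwise try the "closest equivalent": decompose the character
        # and drop combining marks (that is, drop the diacritics).
        base = "".join(
            c for c in unicodedata.normalize("NFD", ch)
            if not unicodedata.combining(c)
        )
        try:
            out += base.encode(codepage) if base else b"?"
        except UnicodeEncodeError:
            # Nothing appropriate found: silently substitute "?".
            out += b"?"
    return bytes(out)


# U+0101 (a with macron) and U+65E5 (a CJK character) are not in cp1252;
# U+00E9 (e with acute) is.
original = "\u0101\u65e5\xe9.txt"
encoded = approximate_best_fit(original)
round_tripped = encoded.decode("cp1252")
print(encoded, round_tripped == original)
```

Once the name has passed through the bytes form, the original cannot be recovered, so a path obtained this way may refer to no existing file or to a different file. Passing str paths to functions such as os.listdir() avoids the conversion.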
History
Date                 User              Action  Args
2016-07-30 04:55:52  serhiy.storchaka  set     recipients: + serhiy.storchaka, paul.moore, jaraco, vstinner, tim.golden, zach.ware, steve.dower, Steffen Kampmann
2016-07-30 04:55:52  serhiy.storchaka  set     messageid: <1469854552.06.0.373693637864.issue24672@psf.upfronthosting.co.za>
2016-07-30 04:55:51  serhiy.storchaka  link    issue24672 messages
2016-07-30 04:55:51  serhiy.storchaka  create