Author gvanrossum
Recipients Jukka Aho, amaury.forgeotdarc, asvetlov, benjamin.peterson, gvanrossum, vstinner
Date 2009-03-30.14:21:41
At the sprint, Andrew Svetlov, Martin von Loewis and I looked into this
a bit, and discovered that Andrew's Vista copy uses a Russian locale for
the filesystem encoding (despite using English as the language).  In
this locale, a-umlaut cannot be represented in the ANSI code page (which
has only 256 values), because the Russian locale uses those byte values
to represent Cyrillic.
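
A minimal sketch of the failure, assuming the Russian ANSI code page in question is Windows-1251 (Python codec name "cp1251"); the message does not name the code page, and the helper `representable` is hypothetical:

```python
def representable(text, encoding):
    """Return True if every character of text can be encoded in encoding."""
    try:
        text.encode(encoding)
    except UnicodeEncodeError:
        return False
    return True


A_UMLAUT = "\u00e4"  # LATIN SMALL LETTER A WITH DIAERESIS

# Assumption: cp1252 stands in for a Western European ANSI code page,
# cp1251 for the Russian one.
print(representable(A_UMLAUT, "cp1252"))
print(representable(A_UMLAUT, "cp1251"))

# The byte that cp1252 uses for a-umlaut is a Cyrillic letter in cp1251.
print(ascii(b"\xe4".decode("cp1251")))
```

With only 256 byte values available, the code page has no slot left for a-umlaut, so a path containing it cannot be passed through a bytes-based API under this locale.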

As long as the import code (written in C) uses bytes in the filesystem
encoding to represent paths, this problem will remain.

Two possible solutions would be to switch to Brett's importlib, or to
change the import code to use wide characters everywhere (like
posixmodule.c).  Both are extremely risky and a lot of work, and I don't
expect we'll get to this for 3.1.

(In 2.x the same problem exists, but is perhaps less real because module
names are limited to ASCII.)

We also discovered another problem, which I'll report separately: the
*module* name is decoded as UTF-8, while the *path* name uses the
filesystem encoding...
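
A rough illustration of why mixing the two encodings matters, again assuming "cp1251" as the filesystem encoding; the function `encodings_agree` is hypothetical and is not part of the import code:

```python
def encodings_agree(name, module_encoding="utf-8", fs_encoding="cp1251"):
    """Return True if name encodes to the same bytes under both encodings."""
    return name.encode(module_encoding) == name.encode(fs_encoding)


# A pure-ASCII name is byte-identical in both encodings.
print(encodings_agree("spam"))

# A Cyrillic name is not: UTF-8 needs two bytes, cp1251 one.
CYRILLIC_DE = "\u0434"
print(encodings_agree(CYRILLIC_DE))
print(CYRILLIC_DE.encode("utf-8"), CYRILLIC_DE.encode("cp1251"))
```

Any non-ASCII name therefore yields different byte strings depending on which of the two encodings a given piece of code assumes.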
Date                 User        Action  Args
2009-03-30 14:21:44  gvanrossum  set     recipients: + gvanrossum, amaury.forgeotdarc, vstinner, benjamin.peterson, Jukka Aho, asvetlov
2009-03-30 14:21:44  gvanrossum  set     messageid: <>
2009-03-30 14:21:42  gvanrossum  link    issue4352 messages
2009-03-30 14:21:41  gvanrossum  create