Author vstinner
Recipients eric.araujo, ezio.melotti, flox, mandel, santoso.wijaya, vstinner
Date 2011-10-26.22:50:03
SpamBayes Score 1.45216e-08
Marked as misclassified No
Message-id <1319669404.0.0.604649290929.issue13207@psf.upfronthosting.co.za>
In-reply-to
Content
Python 2 uses byte strings. If characters are not encodable to the ANSI code page, Windows replaces them by question marks. See the issue #13247 for another example (in Python 3 when using explicitly the bytes API). To be able to support characters not encodable to the ANSI code page, you have to use Unicode *everywhere*.

Because Python 2 doesn't have access to the Unicode environment and uses bytes in most cases, I don't think that we can fix this issue in Python 2.

I close this issue because it would require too much work to fix this issue in Python 2, whereas it already works in Python 3.  Move to Python 3 is the best solution of this issue.
History
Date User Action Args
2011-10-26 22:50:04vstinnersetrecipients: + vstinner, ezio.melotti, eric.araujo, flox, santoso.wijaya, mandel
2011-10-26 22:50:04vstinnersetmessageid: <1319669404.0.0.604649290929.issue13207@psf.upfronthosting.co.za>
2011-10-26 22:50:03vstinnerlinkissue13207 messages
2011-10-26 22:50:03vstinnercreate