Author lemburg
Recipients ezio.melotti, lemburg, pitrou
Date 2010-01-15.12:57:20
Message-id <1263560243.28.0.697614270067.issue5905@psf.upfronthosting.co.za>
In-reply-to
Content
This happens because the C library's strftime() API is used to build the localized month names. With your locale setting, you get French month names encoded in Latin-1, and those cannot be decoded as UTF-8 because of the accented characters in them.

This works in Python 2.x since PyUnicode_FromString() et al. convert Latin-1 strings to Unicode.
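
To illustrate the difference between the two decoding behaviours, here is a minimal Python sketch. It does not call strftime() or the C API; the byte string is only an assumed stand-in for what a French Latin-1 locale might return for February, and try_decode() is a hypothetical helper named for this example.

```python
# Illustrative bytes for the French month name "février" in Latin-1.
# This is an assumed stand-in for locale-dependent strftime() output.
raw_month = "f\u00e9vrier".encode("latin-1")  # b'f\xe9vrier'


def try_decode(raw, encoding):
    """Return the decoded text, or None if the bytes are invalid in that encoding."""
    try:
        return raw.decode(encoding)
    except UnicodeDecodeError:
        return None


# 0xE9 starts a multi-byte sequence in UTF-8 but is not followed by
# continuation bytes here, so strict UTF-8 decoding fails.
as_utf8 = try_decode(raw_month, "utf-8")

# Latin-1 maps every byte to the code point of the same value, so it never fails.
as_latin1 = try_decode(raw_month, "latin-1")

print(as_utf8)
print(as_latin1)
```

Decoding as Latin-1 always succeeds, which is why a Latin-1 based conversion would not hit this error, while a strict UTF-8 conversion rejects the accented byte.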

Apparently, this was changed in Python 3.x without checking the header file or the Python 2.x implementation, both of which mandate Latin-1 as the input encoding. Even the Python 3.x header still says that PyUnicode_FromString() converts from Latin-1 to Unicode.

No idea why time.strptime() even bothers with these month names, though, since neither the format string nor the string being parsed contains literal month names.
History
Date User Action Args
2010-01-15 12:57:23 lemburg set recipients: + lemburg, pitrou, ezio.melotti
2010-01-15 12:57:23 lemburg set messageid: <1263560243.28.0.697614270067.issue5905@psf.upfronthosting.co.za>
2010-01-15 12:57:21 lemburg link issue5905 messages
2010-01-15 12:57:20 lemburg create