Author dmalcolm
Recipients dmalcolm
Date 2010-01-20.23:04:07
SpamBayes Score 2.91205e-09
Marked as misclassified No
Message-id <>
Currently sys.std[in|out|err] are set to the locale encoding but only if they are directly connected to a tty (and PYTHONIOENCODING is not set in the environment to override things).

The conditionality on "isatty" leads to lots of unexpected changes when scripts are added to shell pipelines, or run from cronjobs.

For example:
[david@brick ~]$ python -c 'print u"\u03b1\u03b2\u03b3"'

[david@brick ~]$ python -c 'print u"\u03b1\u03b2\u03b3"' | less
Traceback (most recent call last):
  File "<string>", line 1, in <module>
UnicodeEncodeError: 'ascii' codec can't encode characters in position 0-2:
ordinal not in range(128)  

("less" will respect the locale encoding).

The attached patch removes the check to see if these streams are connected to a tty, so that the encoding is always set (either to the locale value, or to the "override" value from PYTHONIOENCODING)

With this patch:
$ ./python -c 'print u"\u03b1\u03b2\u03b3"'| less

has output:
[17298 refs]
Date User Action Args
2010-01-20 23:04:11dmalcolmsetrecipients: + dmalcolm
2010-01-20 23:04:10dmalcolmsetmessageid: <>
2010-01-20 23:04:09dmalcolmlinkissue7745 messages
2010-01-20 23:04:08dmalcolmcreate