Author dmalcolm
Recipients dmalcolm
Date 2010-01-20.23:04:07
SpamBayes Score 2.91205e-09
Marked as misclassified No
Message-id <1264028650.99.0.196256865676.issue7745@psf.upfronthosting.co.za>
In-reply-to
Content
Currently sys.std[in|out|err] are set to the locale encoding but only if they are directly connected to a tty (and PYTHONIOENCODING is not set in the environment to override things).

The conditionality on "isatty" leads to lots of unexpected changes when scripts are added to shell pipelines, or run from cronjobs.

For example:
[david@brick ~]$ python -c 'print u"\u03b1\u03b2\u03b3"'
αβγ

[david@brick ~]$ python -c 'print u"\u03b1\u03b2\u03b3"' | less
Traceback (most recent call last):
  File "<string>", line 1, in <module>
UnicodeEncodeError: 'ascii' codec can't encode characters in position 0-2:
ordinal not in range(128)  

("less" will respect the locale encoding).

The attached patch removes the check to see if these streams are connected to a tty, so that the encoding is always set (either to the locale value, or to the "override" value from PYTHONIOENCODING)

With this patch:
$ ./python -c 'print u"\u03b1\u03b2\u03b3"'| less

has output:
αβγ
[17298 refs]
History
Date User Action Args
2010-01-20 23:04:11dmalcolmsetrecipients: + dmalcolm
2010-01-20 23:04:10dmalcolmsetmessageid: <1264028650.99.0.196256865676.issue7745@psf.upfronthosting.co.za>
2010-01-20 23:04:09dmalcolmlinkissue7745 messages
2010-01-20 23:04:08dmalcolmcreate