Author serhiy.storchaka
Recipients benjamin.peterson, ezio.melotti, hynek, pitrou, serhiy.storchaka, stutzbach, vstinner
Date 2014-01-28.20:26:55
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <>
Unlike to io.StringIO, _pyio.StringIO doesn't work with lone surrogates.

>>> import io, _pyio
>>> io.StringIO('\ud880')
<_io.StringIO object at 0xb71426ec>
>>> _pyio.StringIO('\ud880')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/serhiy/py/cpython/Lib/", line 2065, in __init__
  File "/home/serhiy/py/cpython/Lib/", line 1629, in write
    b = encoder.encode(s)
  File "/home/serhiy/py/cpython/Lib/encodings/", line 20, in encode
    return codecs.utf_8_encode(input, self.errors)[0]
UnicodeEncodeError: 'utf-8' codec can't encode character '\ud880' in position 0: surrogates not allowed

Proposed patch adds support of lone surrogates to _pyio.StringIO.
Date User Action Args
2014-01-28 20:26:55serhiy.storchakasetrecipients: + serhiy.storchaka, pitrou, vstinner, benjamin.peterson, stutzbach, ezio.melotti, hynek
2014-01-28 20:26:55serhiy.storchakasetmessageid: <>
2014-01-28 20:26:55serhiy.storchakalinkissue20424 messages
2014-01-28 20:26:55serhiy.storchakacreate