Author serhiy.storchaka
Recipients BreamoreBoy, ezio.melotti, kunkku, lemburg, loewis, martin.panter, serhiy.storchaka, vstinner
Date 2016-05-23.06:02:43
Message-id <1463983367.37.0.631877574355.issue16182@psf.upfronthosting.co.za>
In-reply-to
Content
Yes, the readline module is broken in Python 3. The underlying C library operates on C strings and uses locale-dependent C functions to split them into characters. The Python wrapper always uses UTF-8 to convert between Python strings and C strings, so it works only in UTF-8 locales. get_begidx() and get_endidx() do not work correctly at all for non-ASCII data. We should use the locale encoding for the conversion.
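
To illustrate the index problem, here is a minimal sketch, assuming the C library reports word boundaries as byte offsets into the encoded line while Python code expects character indices. The helper name byte_offset_to_char_index is hypothetical and is not part of the readline module.

```python
def byte_offset_to_char_index(line, byte_offset, encoding="utf-8"):
    """Convert a byte offset into the encoded line to a character index.

    Hypothetical helper for illustration only; it assumes byte_offset
    falls on a character boundary in the given encoding.
    """
    encoded = line.encode(encoding)
    prefix = encoded[:byte_offset]
    return len(prefix.decode(encoding))


# "\u00e9" is one character but two bytes in UTF-8.
line = "\u00e9 foo"

# Offset of the word "foo" as a C library working on bytes would see it.
byte_begidx = line.encode("utf-8").index(b"foo")

# Offset of the same word as Python string indexing needs it.
char_begidx = byte_offset_to_char_index(line, byte_begidx)

# Using the byte offset directly as a string index skips one character too many.
print(byte_begidx, char_begidx, repr(line[byte_begidx:]), repr(line[char_begidx:]))
```

For pure ASCII input the two offsets coincide, which is why the problem only shows up with non-ASCII data.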

The proposed patch makes the readline module use locale-dependent encoding functions instead of the default UTF-8. It also corrects the indices returned by get_begidx() and get_endidx().
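
The encoding mismatch can be sketched in pure Python as follows. This is only an illustration of the idea, not the patch itself: the helper names decode_c_string and encode_for_c are hypothetical, "latin-1" stands in for an arbitrary non-UTF-8 locale encoding, and the use of the surrogateescape error handler is an assumption made here so that undecodable bytes survive a round trip.

```python
import locale


def decode_c_string(raw, encoding=None):
    """Decode bytes received from the C library (hypothetical helper)."""
    if encoding is None:
        # Encoding of the current locale rather than a hard-coded UTF-8.
        encoding = locale.getpreferredencoding(False)
    return raw.decode(encoding, "surrogateescape")


def encode_for_c(text, encoding=None):
    """Encode a Python string for the C library (hypothetical helper)."""
    if encoding is None:
        encoding = locale.getpreferredencoding(False)
    return text.encode(encoding, "surrogateescape")


# Bytes a C library would hold for "\u00e9" in a Latin-1 locale.
raw = "\u00e9".encode("latin-1")

# Decoding with a hard-coded UTF-8 does not recover the character.
wrong = decode_c_string(raw, "utf-8")

# Decoding with the encoding the bytes were actually written in does.
right = decode_c_string(raw, "latin-1")

print(repr(wrong), repr(right))
```

In real use the encoding argument would be left as None so that the current locale decides; the explicit values above only make the example independent of the machine it runs on.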
History
Date User Action Args
2016-05-23 06:02:47 serhiy.storchaka set recipients: + serhiy.storchaka, lemburg, loewis, vstinner, ezio.melotti, BreamoreBoy, martin.panter, kunkku
2016-05-23 06:02:47 serhiy.storchaka set messageid: <1463983367.37.0.631877574355.issue16182@psf.upfronthosting.co.za>
2016-05-23 06:02:47 serhiy.storchaka link issue16182 messages
2016-05-23 06:02:46 serhiy.storchaka create