Message324100
I think you are absolutely right.
> In any case, it seems to be broken for any system that does not have UTF-8 as default.
You likely mean ASCII. Python assumes that LANG=C means ASCII, which is not the case on AIX and HP-UX.
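A quick way to see what a given system actually reports is to ask the locale module and then check whether Python has a codec under that name. This is only a sketch: `locale_codec_name` is a hypothetical helper name, and the result depends on the platform, the environment variables and the Python version (newer versions may coerce the C locale or run in UTF-8 mode).

```python
import codecs
import locale


def locale_codec_name():
    """Return (reported_encoding, resolved_codec_name_or_None).

    Hypothetical helper for illustration; not part of CPython.
    """
    # False: do not call setlocale(); just report the current preference.
    reported = locale.getpreferredencoding(False)
    try:
        return reported, codecs.lookup(reported).name
    except LookupError:
        # The platform reports an encoding name Python has no codec for.
        return reported, None


if __name__ == "__main__":
    reported, resolved = locale_codec_name()
    print("locale reports:", reported)
    print("Python codec:  ", resolved if resolved else "no matching codec")
```

Running it with `LANG=C` on the affected platform should show whether the reported name maps to a Python codec at all.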
Your patch looks reasonable; I will try it on Monday. The problem is that there is no roman8 codec in Python. Maybe ISO-8859-1 will do for the test, but I am still eager to add one.
> I mention this because it seems neither roman8 nor roman9 have 'official' iso names or alias (correct me if I am wrong).
There are no ISO names because this is not an ISO encoding. It is an HP invention, also known as hp-roman8 (roman8, ibm-1051, r8, Cp1051).
Edit: there is roman8 support: https://github.com/python/cpython/blob/e42b705188271da108de42b55d9344642170aa2b/Lib/encodings/hp_roman8.py as well as aliases.
There are a few aliases missing: cp1051, ibm1051 and hp-roman8. This needs an additional PR.
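Which of these names resolve on a particular interpreter can be checked with `codecs.lookup`. The sketch below uses a hypothetical helper, `resolve_codec_names`, and makes no claim about the output, since the result differs between Python versions. Note also that the codec machinery normalizes names before the lookup, so a spelling may resolve even without an explicit entry in the alias table.

```python
import codecs


def resolve_codec_names(names):
    """Map each name to the codec name Python resolves it to, or None.

    Hypothetical helper for illustration; not part of CPython.
    """
    resolved = {}
    for name in names:
        try:
            resolved[name] = codecs.lookup(name).name
        except LookupError:
            # No codec module and no alias matched this name.
            resolved[name] = None
    return resolved


# Names taken from the message above.
candidates = ["hp_roman8", "roman8", "r8", "cp1051", "ibm1051", "hp-roman8"]
for name, target in resolve_codec_names(candidates).items():
    print(f"{name:<10} -> {target}")
```

Any name that prints `None` is a candidate for the alias table in `Lib/encodings/aliases.py`.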
Date | User | Action | Args
2018-08-25 20:17:09 | michael-o | set | recipients: + michael-o, terry.reedy, Michael.Felt
2018-08-25 20:17:09 | michael-o | set | messageid: <1535228229.69.0.56676864532.issue34403@psf.upfronthosting.co.za>
2018-08-25 20:17:09 | michael-o | link | issue34403 messages
2018-08-25 20:17:09 | michael-o | create |