classification
Title: Add encodings of supported in glibc locales
Type: enhancement Stage:
Components: Library (Lib) Versions: Python 3.5
process
Status: open Resolution:
Dependencies: 19459 21081 22681 22682 Superseder:
Assigned To: Nosy List: jwilk, lemburg, loewis, progfou, serhiy.storchaka, vstinner
Priority: normal Keywords:

Created on 2014-10-20 16:49 by serhiy.storchaka, last changed 2014-10-29 05:00 by progfou.

Messages (1)
msg229730 - (view) Author: Serhiy Storchaka (serhiy.storchaka) * (Python committer) Date: 2014-10-20 16:49
There are 6 encodings used in supported by glibc locales:

ARMSCII-8 - Armenian encoding
EUC-TW - Taiwan encoding in EUC family
GEORGIAN-PS - Georgian encoding
KOI8-T - Tajik encoding in KOI family
RK1048 - Kazakh variation of CP1251
TCVN5712-1 - Vietnam encoding

All these encodings are rare nowadays, but the fact that they are supported as encoding of official glibc locales means that they were used in some places in some time and there are some documents in these encodings. May be they are used even nowadays. I think it is worth to add support of all this encodings in Python.

This is a meta-issue. There are requests for support of GEORGIAN-PS (issue19459) and TCVN5712-1 (issue21081).
History
Date User Action Args
2014-10-29 05:00:57progfousetnosy: + progfou
2014-10-28 14:30:07jwilksetnosy: + jwilk
2014-10-20 20:26:42serhiy.storchakasetdependencies: + Add support of KZ1048 (RK1048) encoding
2014-10-20 17:58:59serhiy.storchakasetdependencies: + Add support of KOI8-T encoding
2014-10-20 16:50:51serhiy.storchakasetdependencies: + Python does not support the GEORGIAN-PS charset, missing vietnamese codec TCVN 5712:1993 in Python
2014-10-20 16:49:38serhiy.storchakacreate