classification
Title: Tamil locale is using outdated encoding
Type: enhancement Stage: needs patch
Components: Versions: Python 3.9
process
Status: open Resolution:
Dependencies: Superseder:
Assigned To: Nosy List: Muthu A, ammar2, benjamin.peterson, ezio.melotti, lemburg, vstinner
Priority: normal Keywords:

Created on 2020-05-07 07:41 by Muthu A, last changed 2020-05-13 08:33 by ammar2.

Files
File name Uploaded Description Edit
Screen Shot 2020-05-06 at 11.04.18 PM.png Muthu A, 2020-05-07 07:41
Messages (2)
msg368325 - (view) Author: Muthu A (Muthu A) Date: 2020-05-07 07:41
Tamil locale (TA_IN, TA_SL, TA_SG, TA_MY) is using outdated encoding of TSCII. Tamil community is widely using UTF-8 encoding.
Further, the 'locale' standard library package in Python3 should be updated with these strings.

Should the maintainers desire assistance on this task, as a native speaker of the language, I can propose a patch or recommed other native speakers for this task.
msg368767 - (view) Author: Ammar Askar (ammar2) * (Python triager) Date: 2020-05-13 08:33
Hi Muthu, thanks for reporting this! Looks like this is related to issue20087 whereby the X11 locale data for TA_IN is using TSCII but the glibc supported file has the UTF-8 alias.

Pending a resolution to that bug, I think your best course of action would be to try to get this corrected upstream in x11, the code is here:

https://gitlab.freedesktop.org/xorg/lib/libx11/-/blob/master/nls/locale.alias.pre#L1078

and their issue tracker is here:

https://gitlab.freedesktop.org/xorg/lib/libx11/-/issues


Assuming that it gets accepted upstream, it should be simple enough to regenerate the locale module to use it.
History
Date User Action Args
2020-05-13 08:33:22ammar2setnosy: + ammar2
messages: + msg368767
2020-05-08 20:38:54terry.reedysetnosy: + lemburg, vstinner, benjamin.peterson, ezio.melotti
stage: needs patch
type: enhancement

versions: + Python 3.9
2020-05-07 07:41:48Muthu Acreate