Author malin
Recipients ezio.melotti, hyeshik.chang, lemburg, loewis, malin, serhiy.storchaka, vstinner
Date 2015-05-19.06:28:23
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1432016904.73.0.191780081269.issue24117@psf.upfronthosting.co.za>
In-reply-to
Content
>> If you could provide links to the relevant pages/section we can verify that the codecs are indeed incorrect. 

Here is CP950, 0xC6A1 is not in it.
https://msdn.microsoft.com/zh-cn/goglobal/cc305155

I can provide one link, but there are many variants of BIG5 convert table on the Interenet, so one link doesn't bring persuasion.

In this page: https://moztw.org/docs/big5/
Listed many variants of BIG5 tables, I found 0xC6A1<->U+30FE in this table "Unicode 1.1", the description of it is "it's a terrible table, many errors exist, sadlly many foreigners are using it", but IIRC Python's BIG5 codec is not fully same as that table.

IMO, the most reliable way is reading a lot of stuff, and verify the key points and conflicts with authoritative source, but this way is very hard for foreigners.

Anyway, let's wait Taiwanese and their opinion for whether this should be fixed.
History
Date User Action Args
2015-05-19 06:28:24malinsetrecipients: + malin, lemburg, loewis, hyeshik.chang, vstinner, ezio.melotti, serhiy.storchaka
2015-05-19 06:28:24malinsetmessageid: <1432016904.73.0.191780081269.issue24117@psf.upfronthosting.co.za>
2015-05-19 06:28:24malinlinkissue24117 messages
2015-05-19 06:28:23malincreate