Author malin
Recipients ezio.melotti, inada.naoki, malin, serhiy.storchaka, vstinner
Date 2020-07-18.08:31:13
> But how many new Python web application use CJK codec instead of UTF-8?

A CJK character usually takes 2 bytes in CJK encodings, but 3 bytes in UTF-8.

I tested a Chinese book:
in GBK:     853,025 bytes
in UTF-8: 1,267,523 bytes

For CJK content, UTF-8 is wasteful (about 49% larger in this test), so CJK encodings may not be eliminated.
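
A minimal sketch of how such a size comparison can be reproduced, assuming Python 3. The helper name `encoded_sizes` and the short sample string are illustrative only; this is not the book used for the numbers above.

```python
def encoded_sizes(text, encodings=("gbk", "utf-8")):
    """Return the encoded length in bytes of `text` for each encoding."""
    return {enc: len(text.encode(enc)) for enc in encodings}


# Six common Chinese characters: 2 bytes each in GBK, 3 bytes each in UTF-8.
sample = "中文编码测试"
sizes = encoded_sizes(sample)
print(sizes)  # {'gbk': 12, 'utf-8': 18}
```

ASCII characters take 1 byte in both encodings, so the gap for a real document depends on how much of it is CJK text.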
Date                 User   Action  Args
2020-07-18 08:31:13  malin  set     recipients: + malin, vstinner, ezio.melotti, inada.naoki, serhiy.storchaka
2020-07-18 08:31:13  malin  set     messageid: <>
2020-07-18 08:31:13  malin  link    issue41330 messages
2020-07-18 08:31:13  malin  create