Author malin
Recipients ezio.melotti, inada.naoki, malin, serhiy.storchaka, vstinner
Date 2020-07-18.08:31:13
Message-id <1595061073.57.0.418492875334.issue41330@roundup.psfhosted.org>
In-reply-to
Content
> But how many new Python web application use CJK codec instead of UTF-8?

A CJK character usually takes 2 bytes in a CJK encoding such as GBK, but 3 bytes in UTF-8.

I measured the size of a Chinese book in both encodings:
in GBK:     853,025 bytes
in UTF-8: 1,267,523 bytes
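
A minimal sketch of how such a comparison could be reproduced in Python. The helper name `encoded_sizes` and the sample string are illustrative assumptions, not the script used for the numbers above; a real measurement would read the book's text from a file instead.

```python
def encoded_sizes(text, encodings=("gbk", "utf-8")):
    """Return a dict mapping each encoding name to the encoded byte length."""
    return {enc: len(text.encode(enc)) for enc in encodings}


# Hypothetical sample: two CJK characters.
# Each takes 2 bytes in GBK and 3 bytes in UTF-8.
sample = "中文"
print(encoded_sizes(sample))  # {'gbk': 4, 'utf-8': 6}
```

ASCII characters take 1 byte in both encodings, so the overall ratio for a real document depends on how much of it is CJK text.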

For CJK content, UTF-8 is wasteful (about 49% larger in this test), so CJK encodings may well not be eliminated.