Message373889
> But how many new Python web application use CJK codec instead of UTF-8?
A CJK character usually takes 2-bytes in CJK encodings, but takes 3-bytes in UTF-8.
I tested a Chinese book:
in GBK: 853,025 bytes
in UTF-8: 1,267,523 bytes
For CJK content, UTF-8 is wasteful, maybe CJK encodings will not be eliminated. |
|
Date |
User |
Action |
Args |
2020-07-18 08:31:13 | malin | set | recipients:
+ malin, vstinner, ezio.melotti, methane, serhiy.storchaka |
2020-07-18 08:31:13 | malin | set | messageid: <1595061073.57.0.418492875334.issue41330@roundup.psfhosted.org> |
2020-07-18 08:31:13 | malin | link | issue41330 messages |
2020-07-18 08:31:13 | malin | create | |
|