
Author vstinner
Recipients corona10, ezio.melotti, malin, methane, serhiy.storchaka, vstinner
Date 2020-08-03.23:18:56
Message-id <1596496737.07.0.449100582086.issue41330@roundup.psfhosted.org>
Content
Since the CJK codecs were implemented, unicodeobject.c has gained several optimizations:

* _PyUnicodeWriter for decoders: an API designed with efficiency and PEP 393 (compact strings) in mind
* _PyBytesWriter for encoders: in short, an API to overallocate a buffer
* _Py_error_handler enum and "_Py_error_handler _Py_GetErrorHandler(const char *errors)" function to pass an error handler as an integer rather than a string
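
To make the last two points concrete, here is a rough pure-Python sketch of the ideas, not the actual C code. All names in it (ErrorHandler, get_error_handler, encode_ascii) are hypothetical and only loosely modeled on the enum and lookup function mentioned above. It resolves the error handler name to an integer once per call, and it allocates an upper-bound output buffer up front and trims it once at the end, which is a simplification of what a real overallocating writer would do.

```python
from enum import IntEnum


class ErrorHandler(IntEnum):
    # Hypothetical values for illustration; not the CPython enum.
    STRICT = 1
    IGNORE = 2
    REPLACE = 3
    OTHER = 4


_KNOWN_HANDLERS = {
    "strict": ErrorHandler.STRICT,
    "ignore": ErrorHandler.IGNORE,
    "replace": ErrorHandler.REPLACE,
}


def get_error_handler(errors):
    """Resolve the handler name once; None is treated as strict (assumption)."""
    if errors is None:
        return ErrorHandler.STRICT
    return _KNOWN_HANDLERS.get(errors, ErrorHandler.OTHER)


def encode_ascii(text, errors=None):
    """Toy ASCII encoder: allocate an upper bound, write in place, trim once."""
    handler = get_error_handler(errors)  # single string lookup per call
    out = bytearray(len(text))           # at most one byte per character
    pos = 0
    for i, ch in enumerate(text):
        code = ord(ch)
        if code < 128:
            out[pos] = code
            pos += 1
        elif handler == ErrorHandler.IGNORE:
            continue                     # drop the character
        elif handler == ErrorHandler.REPLACE:
            out[pos] = ord("?")
            pos += 1
        elif handler == ErrorHandler.STRICT:
            raise UnicodeEncodeError(
                "ascii", text, i, i + 1, "ordinal not in range(128)"
            )
        else:
            # This sketch does not implement any other handler.
            raise LookupError(f"unsupported error handler: {errors!r}")
    del out[pos:]                        # shrink to the bytes actually written
    return bytes(out)
```

The inner loop only compares small integers instead of comparing strings for every unencodable character, and the output is never reallocated while encoding. That is the kind of saving the integer error handler and the writer APIs are meant to give, under the assumptions stated above.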

But rewriting the CJK codecs to use these APIs would be a lot of effort, and I'm not sure that it's worth it.
History
Date User Action Args
2020-08-03 23:18:57 vstinner set recipients: + vstinner, ezio.melotti, methane, serhiy.storchaka, malin, corona10
2020-08-03 23:18:57 vstinner set messageid: <1596496737.07.0.449100582086.issue41330@roundup.psfhosted.org>
2020-08-03 23:18:57 vstinner link issue41330 messages
2020-08-03 23:18:56 vstinner create