This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author vstinner
Recipients ezio.melotti, methane, python-dev, r.david.murray, serhiy.storchaka, vstinner
Date 2015-09-21.23:06:40
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <>
Ok, I prepared the code for the UTF-8 optimization.

@Serhiy: would you like to rebase your patch  faster_surrogates_hadling.patch?

Attached utf8.patch is a less optimal implementation which only changes PyUnicode_DecodeUTF8Stateful(). Maybe it's enough?

I would like to see a benchmark here to choose the good compromise between performance and code complexity.
Date User Action Args
2015-09-21 23:06:41vstinnersetrecipients: + vstinner, ezio.melotti, r.david.murray, methane, python-dev, serhiy.storchaka
2015-09-21 23:06:41vstinnersetmessageid: <>
2015-09-21 23:06:41vstinnerlinkissue24870 messages
2015-09-21 23:06:40vstinnercreate