This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author serhiy.storchaka
Recipients loewis, serhiy.storchaka, vstinner
Date 2012-10-31.13:58:45
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1351691925.51.0.209663807894.issue16311@psf.upfronthosting.co.za>
In-reply-to
Content
With the patch UTF-8 decoder 20% slower for some data. UTF-16 decoder 20% faster for some data and 20% slower for other data. UTF-32 decoder slower for many data (even after some optimization, naive code was up to 50% slower). Standard charmap decoder 10% slower. Only UTF-7, unicode-escape and raw-unicode-escape have become much faster (unicode-escape and raw-unicode-escape as with issue16334 patch).

A well optimized decoders do not benefit from the _PyUnicodeWriter, only a slight slowdown. The patch requires some optimization (as for UTF-32 decoder) to reduce the negative effect. Non-optimized decoders will receive the great benefit.
History
Date User Action Args
2012-10-31 13:58:45serhiy.storchakasetrecipients: + serhiy.storchaka, loewis, vstinner
2012-10-31 13:58:45serhiy.storchakasetmessageid: <1351691925.51.0.209663807894.issue16311@psf.upfronthosting.co.za>
2012-10-31 13:58:45serhiy.storchakalinkissue16311 messages
2012-10-31 13:58:45serhiy.storchakacreate