This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author v+python
Recipients barry, mitya57, r.david.murray, v+python
Date 2012-06-24.00:17:43
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1340497068.05.0.903185561314.issue15016@psf.upfronthosting.co.za>
In-reply-to
Content
Patch is interesting, using an encoder to detect validity. However, it suffers from some performance problems for long text that has large ASCII prefixes.

This seems to be an enhancement sort of request rather than a bug... so I wonder why Python 3.2 is listed?

And in Python 3.3 with PEP 393 strings the C API to strings provides a quick way to determine the maximum character in the string... although I see nothing in the PEP about how to access that information from Python. If it is available, it could provide a much quicker precheck rather than multiple attempts to encode strings with large ASCII prefixes only to discover that the next to last character is in (128,255) and the last character is > 255 (which would be about the worst case scenario for the algorithm in the patch).
History
Date User Action Args
2012-06-24 00:17:50v+pythonsetrecipients: + v+python, barry, r.david.murray, mitya57
2012-06-24 00:17:48v+pythonsetmessageid: <1340497068.05.0.903185561314.issue15016@psf.upfronthosting.co.za>
2012-06-24 00:17:46v+pythonlinkissue15016 messages
2012-06-24 00:17:43v+pythoncreate