Author mgiuca
Recipients loewis, mgiuca, orsenthil, thomaspinckney3
Date 2008-07-10.01:54:57
SpamBayes Score 0.00489041
Marked as misclassified No
Message-id <1215654901.53.0.773652038658.issue3300@psf.upfronthosting.co.za>
In-reply-to
Content
OK well here are the necessary changes to the documentation (RST docs
and docstrings in the code).

As I said above, I plan to to extensive testing and add new cases, and I
don't recommend this patch is accepted until that's done.

Patch (parse.py.patch3) is for branch /branches/py3k, revision 64834.

Commit log:

urllib.parse.unquote: Added "encoding" and "errors" optional arguments,
allowing the caller to determine the decoding of percent-encoded octets
(previously implicitly decoded as ISO-8859-1). As per RFC 3986, default
is "utf-8".

urllib.parse.quote: Added "encoding" and "errors" optional arguments,
allowing the caller to determine the encoding of non-ASCII characters
before being percent-encoded (previously characters in range(128, 256)
were encoded as ISO-8859-1, and characters above that as UTF-8). Also
fixed characters greater than 256 not responding to "safe", and also not
being cached.

Doc/library/urllib.parse.rst: Updated docs on quote and unquote to
reflect new interface.

Lib/test/test_urllib.py, Lib/test/test_http_cookiejar.py: Updated test
cases which expected output in ISO-8859-1, now expects UTF-8.

Lib/email/utils.py: Calls urllib.parse.quote and urllib.parse.unquote
with encoding="latin-1", to preserve existing behaviour (which the whole
email module is dependent upon).
History
Date User Action Args
2008-07-10 01:55:01mgiucasetspambayes_score: 0.00489041 -> 0.00489041
recipients: + mgiuca, loewis, orsenthil, thomaspinckney3
2008-07-10 01:55:01mgiucasetspambayes_score: 0.00489041 -> 0.00489041
messageid: <1215654901.53.0.773652038658.issue3300@psf.upfronthosting.co.za>
2008-07-10 01:55:00mgiucalinkissue3300 messages
2008-07-10 01:54:59mgiucacreate