Author tlesher
Recipients tlesher
Date 2008-04-15.15:09:09
Message-id <1208272151.95.0.897886992301.issue2637@psf.upfronthosting.co.za>
Content
The urllib.quote docstring implies that it quotes only characters in RFC
2396's "reserved" set.

However, urllib.quote currently escapes all characters except those in
an "always_safe" list, which consists of alphanumerics and three
punctuation characters, "_.-".

This behavior is contrary to the RFC, which defines "unreserved"
characters as alphanumerics plus the "mark" characters "-_.!~*'()".
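
To make the gap concrete, the two character sets can be compared directly. This is only a sketch: the names ALWAYS_SAFE and RFC2396_UNRESERVED are illustrative, built from the descriptions in this report rather than read out of the urllib source.

```python
import string

# The "always_safe" list as described in this report:
# alphanumerics plus the three punctuation characters "_.-".
ALWAYS_SAFE = set(string.ascii_letters + string.digits + "_.-")

# RFC 2396 "unreserved" = alphanum | mark, where mark is "-_.!~*'()".
RFC2396_MARK = "-_.!~*'()"
RFC2396_UNRESERVED = set(string.ascii_letters + string.digits + RFC2396_MARK)

# Unreserved characters that would still be escaped under the
# behavior described above.
escaped_but_unreserved = "".join(sorted(RFC2396_UNRESERVED - ALWAYS_SAFE))
print(escaped_but_unreserved)
```

Under these assumptions, the difference is the six mark characters "!", "~", "*", "'", "(" and ")".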

The RFC also says:

  Unreserved characters can be escaped without changing the semantics
  of the URI, but this should not be done unless the URI is being used
  in a context that does not allow the unescaped character to appear.

This seems to imply that "always_safe" should correspond to the RFC's
"unreserved" set of "alphanum" | "mark".
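
As a rough illustration of the suggested behavior, the sketch below percent-encodes everything outside the RFC 2396 "unreserved" set. The function quote_unreserved is a hypothetical helper written for Python 3, not the urllib implementation; the default safe="/" and the UTF-8 encoding step are assumptions made for the example.

```python
import string

# RFC 2396 "unreserved" = alphanum | mark ("-_.!~*'()").
RFC2396_UNRESERVED = frozenset(
    string.ascii_letters + string.digits + "-_.!~*'()"
)


def quote_unreserved(text, safe="/"):
    """Hypothetical helper: escape every byte that is neither in the
    RFC 2396 unreserved set nor in the caller-supplied safe string."""
    keep = RFC2396_UNRESERVED | set(safe)
    pieces = []
    for byte in text.encode("utf-8"):  # assumption: UTF-8 for non-ASCII input
        if byte < 128 and chr(byte) in keep:
            pieces.append(chr(byte))
        else:
            pieces.append("%{:02X}".format(byte))
    return "".join(pieces)


# Mark characters pass through unchanged; the space is escaped.
print(quote_unreserved("it's (a) test!~*"))
```

With this choice of set, a string such as "it's (a) test!~*" keeps its apostrophe, parentheses, and other mark characters, and only the spaces are escaped.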
History
Date                 User     Action  Args
2008-04-15 15:09:12  tlesher  set     spambayes_score: 0.0517263 -> 0.051726278; recipients: + tlesher
2008-04-15 15:09:11  tlesher  set     spambayes_score: 0.0517263 -> 0.0517263; messageid: <1208272151.95.0.897886992301.issue2637@psf.upfronthosting.co.za>
2008-04-15 15:09:10  tlesher  link    issue2637 messages
2008-04-15 15:09:10  tlesher  create