This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author madison.may
Recipients anh.le, madison.may, orsenthil
Date 2013-06-05.16:33:44
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1370450025.29.0.0425584954617.issue18140@psf.upfronthosting.co.za>
In-reply-to
Content
urllib.parse.urlsplit() in Python3.3 behaves the same way.  Since urlsplit takes an optional param "allow_fragments", I don't think it should be a high priority issue.  

The relevant code from Python3.3 is below, however:

    if allow_fragments and '#' in url:
        url, fragment = url.split('#', 1)
    if '?' in url:
        url, query = url.split('?', 1)

Note that passwords containing '?' would produce a similar result, which is perhaps mildly more concerning, as there is no flag to ignore the query portion of the url.    

That being said, I'm against making any changes to urlsplit at this point, since that would also require modifying urlunsplit and may in fact make it much more difficult (or impossible) to rejoin a url.  The strength of the very simple implementation we have currently is that it's always reversible.
History
Date User Action Args
2013-06-05 16:33:45madison.maysetrecipients: + madison.may, orsenthil, anh.le
2013-06-05 16:33:45madison.maysetmessageid: <1370450025.29.0.0425584954617.issue18140@psf.upfronthosting.co.za>
2013-06-05 16:33:45madison.maylinkissue18140 messages
2013-06-05 16:33:45madison.maycreate