Author christian.heimes
Recipients LDTech, Mi.Zou, christian.heimes, ezio.melotti, orsenthil, terry.reedy, vajrasky
Date 2013-07-18.12:32:35
Message-id <1374150756.05.0.371499732615.issue17214@psf.upfronthosting.co.za>
In-reply-to
Content
The problem may not be a bug but a deliberate design choice. urllib is rather low level and doesn't implement some browser magic. Browsers handle stuff like 'ä' -> '%C3%A4', ' ' -> '%20' or IDNA, but urllib doesn't. I have always seen it as my responsibility to quote and encode everything myself. Higher-level APIs such as requests are free to implement browser magic.
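
For illustration, the manual quoting and encoding described above could look roughly like the sketch below. The helper name to_ascii_url is hypothetical and not part of urllib; it assumes UTF-8 percent-encoding for the path, query and fragment, uses Python's built-in "idna" codec for the host name, and ignores any user:password part of the URL.

```python
from urllib.parse import quote, urlsplit, urlunsplit


def to_ascii_url(url):
    """Hypothetical helper: return an ASCII-only form of `url`.

    Assumptions: non-ASCII characters are percent-encoded as UTF-8,
    the host is encoded with the built-in "idna" codec, and any
    userinfo (user:password@) is dropped for simplicity.
    """
    parts = urlsplit(url)

    # IDNA-encode the host name, e.g. 'bücher.example' -> 'xn--bcher-kva.example'.
    host = parts.hostname.encode("idna").decode("ascii") if parts.hostname else ""
    netloc = host if parts.port is None else "%s:%d" % (host, parts.port)

    # '%' is listed as safe so existing escapes such as '%20' are not
    # encoded a second time; a stray literal '%' is therefore left alone.
    path = quote(parts.path, safe="/%")
    query = quote(parts.query, safe="=&%")
    fragment = quote(parts.fragment, safe="%")

    return urlunsplit((parts.scheme, netloc, path, query, fragment))


if __name__ == "__main__":
    print(quote("ä"))   # percent-encoded UTF-8 bytes of the umlaut
    print(quote(" "))   # percent-encoded space
    print(to_ascii_url("http://bücher.example/ä b"))
```

The result of such a helper is what would then be passed to urllib.request.Request or urlopen; whether to be this lenient with input is exactly the design question raised in this ticket.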

Contrary to common belief, a URL with an umlaut or a space is *not* a valid URI. From http://docs.python.org/3/library/urllib.request.html#urllib.request.Request:

> url should be a string containing a valid URL.

I suggest that this ticket be closed as "won't fix".