
Author martin.panter
Recipients Babe Hardy, martin.panter
Date 2017-01-18.11:53:01
Message-id <1484740381.61.0.496715318542.issue29305@psf.upfronthosting.co.za>
Content
In general, HTTP URLs are supposed to be ASCII-only. Newer protocols (e.g. RTSP, which is based on HTTP) specifically allow UTF-8 encoding, but it would be wrong for Python’s HTTP library to assume UTF-8 is wanted everywhere. That is especially true for a domain name (e.g. in the full-URL request to a proxy), which should not be UTF-8 encoded.

I suggest working on IRI handling (<https://tools.ietf.org/html/rfc3987>, basically Unicode URLs) in higher-level places like “urllib”. See Issue 3991.
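
To illustrate why the host and the rest of the URL need different treatment, here is a rough sketch of an IRI-to-URI conversion done above the HTTP layer. The helper name `iri_to_uri` is hypothetical and not part of any existing module. The sketch assumes IDNA (Punycode) for the host and percent-encoded UTF-8 for path, query and fragment, and it ignores userinfo and IPv6 literals.

```python
from urllib.parse import quote, urlsplit, urlunsplit

# Characters left untouched in path/query/fragment; "%" is kept so that
# existing percent-escapes are not encoded a second time.
_SAFE = "/?%:@!$&'()*+,;=~"


def iri_to_uri(iri):
    """Hypothetical helper: turn a Unicode IRI into an ASCII-only URI.

    Simplified sketch: userinfo and IPv6 literal hosts are not handled.
    """
    parts = urlsplit(iri)

    # Domain names are converted with IDNA (Punycode), not UTF-8.
    host = parts.hostname or ""
    netloc = host.encode("idna").decode("ascii")
    if parts.port is not None:
        netloc += ":" + str(parts.port)

    # Everything after the host is percent-encoded as UTF-8.
    path = quote(parts.path, safe=_SAFE)
    query = quote(parts.query, safe=_SAFE)
    fragment = quote(parts.fragment, safe=_SAFE)

    return urlunsplit((parts.scheme, netloc, path, query, fragment))


print(iri_to_uri("http://b\u00fccher.example/caf\u00e9?q=\u00fc"))
# http://xn--bcher-kva.example/caf%C3%A9?q=%C3%BC
```

The result is plain ASCII, so it could be handed to the HTTP library as-is, without that library having to guess an encoding.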
History
Date User Action Args
2017-01-18 11:53:01  martin.panter  set  recipients: + martin.panter, Babe Hardy
2017-01-18 11:53:01  martin.panter  set  messageid: <1484740381.61.0.496715318542.issue29305@psf.upfronthosting.co.za>
2017-01-18 11:53:01  martin.panter  link  issue29305 messages
2017-01-18 11:53:01  martin.panter  create