Message 110759 - Python tracker

➜

This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author	mgiuca
Recipients	BreamoreBoy, adamnelson, ajaksu2, collinwinter, eric.araujo, ezio.melotti, mastrodomenico, mgiuca, nagle, orsenthil, pitrou, vak, varmaa, vstinner
Date	2010-07-19.14:26:31
SpamBayes Score	0.004270955
Marked as misclassified	No
Message-id	<1279549593.49.0.862204399415.issue1712522@psf.upfronthosting.co.za>
In-reply-to

Content
OK sure, there are some other things broken, but they are mostly not dealing with string data, but binary data (for example, zlib expects a sequence of bytes, not characters). Just one quick point: > urllib.urlretrieve("file:///tmp/hé") > UnicodeError: URL u'file:///tmp/h\xc3\xa9' contains non-ASCII characters That's precisely correct behaviour. URLs are not allowed to contain non-ASCII characters (that's the whole point of urllib.quote). urllib.quote should accept non-ASCII characters (for conversion into ASCII strings). Other URL processing functions should not accept non-ASCII characters, since they aren't valid URIs.

OK sure, there are some other things broken, but they are mostly not dealing with string data, but binary data (for example, zlib expects a sequence of bytes, not characters).

Just one quick point:

> urllib.urlretrieve("file:///tmp/hé")
> UnicodeError: URL u'file:///tmp/h\xc3\xa9' contains non-ASCII characters

That's precisely correct behaviour. URLs are not allowed to contain non-ASCII characters (that's the whole point of urllib.quote). urllib.quote should accept non-ASCII characters (for conversion into ASCII strings). Other URL processing functions should not accept non-ASCII characters, since they aren't valid URIs.

History
Date	User	Action	Args
2010-07-19 14:26:33	mgiuca	set	recipients: + mgiuca, collinwinter, varmaa, nagle, orsenthil, pitrou, vstinner, ajaksu2, ezio.melotti, eric.araujo, mastrodomenico, vak, adamnelson, BreamoreBoy
2010-07-19 14:26:33	mgiuca	set	messageid: <1279549593.49.0.862204399415.issue1712522@psf.upfronthosting.co.za>
2010-07-19 14:26:31	mgiuca	link	issue1712522 messages
2010-07-19 14:26:31	mgiuca	create