Author mgiuca
Recipients BreamoreBoy, adamnelson, ajaksu2, collinwinter, eric.araujo, ezio.melotti, mastrodomenico, mgiuca, nagle, orsenthil, pitrou, vak, varmaa, vstinner
Date 2010-07-19.14:26:31
SpamBayes Score 0.00427095
Marked as misclassified No
Message-id <1279549593.49.0.862204399415.issue1712522@psf.upfronthosting.co.za>
In-reply-to
Content
OK sure, there are some other things broken, but they are mostly not dealing with string data, but binary data (for example, zlib expects a sequence of bytes, not characters).

Just one quick point:

> urllib.urlretrieve("file:///tmp/hé")
> UnicodeError: URL u'file:///tmp/h\xc3\xa9' contains non-ASCII characters

That's precisely correct behaviour. URLs are not allowed to contain non-ASCII characters (that's the whole point of urllib.quote). urllib.quote should accept non-ASCII characters (for conversion into ASCII strings). Other URL processing functions should not accept non-ASCII characters, since they aren't valid URIs.
History
Date User Action Args
2010-07-19 14:26:33mgiucasetrecipients: + mgiuca, collinwinter, varmaa, nagle, orsenthil, pitrou, vstinner, ajaksu2, ezio.melotti, eric.araujo, mastrodomenico, vak, adamnelson, BreamoreBoy
2010-07-19 14:26:33mgiucasetmessageid: <1279549593.49.0.862204399415.issue1712522@psf.upfronthosting.co.za>
2010-07-19 14:26:31mgiucalinkissue1712522 messages
2010-07-19 14:26:31mgiucacreate