This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author akuchling
Recipients ajaksu2, akuchling, loewis, pboddie, vdupras
Date 2008-02-23.17:52:57
SpamBayes Score 0.0720954
Marked as misclassified No
Message-id <1203789177.99.0.871499052555.issue2124@psf.upfronthosting.co.za>
In-reply-to
Content
The solution of adding caching, If-Modified-Since, etc. is a good one,
but I quail in fear at the prospect of expanding the saxutils resolver
into a fully caching HTML agent that uses a cache across processes.  We
should really be encouraging people to use more capable libraries such
as httplib2 (http://code.google.com/p/httplib2/), but this is slightly
at war 
with the batteries-included philosophy.

So, I propose we:

* add warnings to the urllib, urllib2, saxutil module docs that parsing
can retrieve arbitrary resources over the network, and encourage the
user to use a smarter library such as httplib2.
* update the urllib2 HOWTO to mention this.

I'm willing to do the necessary writing.
History
Date User Action Args
2008-02-23 17:52:58akuchlingsetspambayes_score: 0.0720954 -> 0.0720954
recipients: + akuchling, loewis, pboddie, ajaksu2, vdupras
2008-02-23 17:52:58akuchlingsetspambayes_score: 0.0720954 -> 0.0720954
messageid: <1203789177.99.0.871499052555.issue2124@psf.upfronthosting.co.za>
2008-02-23 17:52:57akuchlinglinkissue2124 messages
2008-02-23 17:52:57akuchlingcreate