Message 138666 - Python tracker


Author	eric.araujo
Recipients	alexis, eric.araujo, michael.mulich, tarek
Date	2011-06-19.21:12:19
Message-id	<1308517940.22.0.541215819689.issue12368@psf.upfronthosting.co.za>

Content
Extract from IRC:
<pumazi> hmm... I'm thinking Crawler's follow_externals flag isn't working as expected
[...]
<pumazi> I'm not sure, my assumption of [its] function could be off
[...]
<merwok> “hosts is a list of hosts allowed to be processed if follow_externals is true (default behavior is to follow all hosts), follow_externals enables or disables following external links (default is false, meaning disabled).”
<pumazi> Well, I was assuming it would disable external downloads
<merwok> I think “external links” are external links to be scraped, not download links
<merwok> But I see your misunderstanding
<pumazi> I see, but wouldn't we want the same restrictions on download links?
[...]
<merwok> IIUC, follow_externals can be disabled because it’s guesswork
<merwok> The info obtained from XML-RPC or the simple interface is not guesswork
<merwok> So I think you could want to disable guessing from external links, but I don’t see why you should care about the origin of the download
<pumazi> trust issues I suppose
<merwok> But the same person can upload a malicious file to PyPI as well as on their site
<merwok> Without reading the code, I think this is the rationale. OTOH, if easy_install and pip can restrict downloads and your user expectations show that it can be needed to restrict downloads, let’s file a bug
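
To make the distinction in the chat concrete, here is a minimal sketch of the two separate decisions being discussed: whether an external page is scraped for more links, and whether a download URL is accepted. It is not the Crawler's actual code. The function names (host_allowed, should_scrape, should_download), the download_hosts parameter, the example URLs, and the use of fnmatch-style host patterns are all assumptions made for illustration; only the hosts and follow_externals names and their documented defaults come from the quoted docs.

```python
from fnmatch import fnmatch
from urllib.parse import urlparse


def host_allowed(url, hosts):
    """Return True if the URL's host matches any glob pattern in hosts."""
    host = urlparse(url).hostname or ""
    return any(fnmatch(host, pattern) for pattern in hosts)


def should_scrape(url, index_url, follow_externals=False, hosts=("*",)):
    """Decide whether a page linked from the index is scraped for more links.

    Mirrors the documented semantics: external pages are only processed
    when follow_externals is true, and then only if their host is in hosts
    (the default pattern "*" follows all hosts).
    """
    if urlparse(url).hostname == urlparse(index_url).hostname:
        return True  # pages on the index itself are not "external"
    return follow_externals and host_allowed(url, hosts)


def should_download(url, download_hosts=("*",)):
    """Hypothetical download restriction, i.e. the behavior pumazi expected.

    Nothing in the quoted docs says the Crawler does this; it is the
    feature this report asks about.
    """
    return host_allowed(url, download_hosts)
```

With these definitions, should_scrape("https://example.org/foo/", "https://pypi.example/simple/") is false by default, while should_download("https://example.org/foo-1.0.tar.gz") is still true. That gap is the behavior that surprised pumazi: disabling follow_externals stops the guesswork of scraping external pages, but places no limit on where a distribution is downloaded from.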

History
Date	User	Action	Args
2011-06-19 21:12:20	eric.araujo	set	recipients: + eric.araujo, tarek, alexis, michael.mulich
2011-06-19 21:12:20	eric.araujo	set	messageid: <1308517940.22.0.541215819689.issue12368@psf.upfronthosting.co.za>
2011-06-19 21:12:19	eric.araujo	link	issue12368 messages
2011-06-19 21:12:19	eric.araujo	create