msg52082 - (view) |
Author: David Cantrell (dcantrell-rh) |
Date: 2007-03-07 05:19 |
A number of base Python modules use gethostbyname() when they should be using getaddrinfo(). The big limitation hit when using gethostbyname() is the lack of IPv6 support.
This first patch is for urllib2.py. It replaces all uses of gethostbyname() with getaddrinfo() instead. getaddrinfo() returns a 5-tuple, so additional code needs to wrap a getaddrinfo() call when replacing gethostbyname() calls. Still should be pretty simple to read.
I'd like to see this patch added to the next stable release of Python, if at all possible. I am working up patches for the other modules I see in the Lib/ subdirectory that could use getaddrinfo() instead of gethostbyname().
|
msg52083 - (view) |
Author: John J Lee (jjlee) |
Date: 2007-07-11 23:57 |
* Where are the tests? A functional test, perhaps in test_urllib2net.py, for IPv6 support in urllib2 would be especially welcome, I think.
* Why does .check_host() not begin with an underscore?
* "check_host" is a poor name. How about "_is_localhost"?
* locals is a built-in function, hence usually considered good style not to use it as a name.
* Is is necessary to call make_host_tuple(searchlist) twice?
* The patch appears to fix several bugs at once (e.g. adding a try: / except: suite around a large part of an existing method to catch socket.error).
|
msg56125 - (view) |
Author: Senthil Kumaran (orsenthil) * |
Date: 2007-09-25 03:37 |
Hi,
The patch attached required a complete rewrite. I am attaching the
modified patch, which will just substitute socket.gethostbyname with a
function gethost_addrinfo which internally uses getaddrinfo and takes
care of the IPv4 or IPv6 addresses translation.
jjlee, skip: let me know your comments on this.
One note we have to keep in mind is, testing on IPv6 address.
For eg. on my system /etc/hosts
10.98.1.6 goofy.goofy.com
#fe80::219:5bff:fefd:6270 localhost
127.0.0.1 localhost
test_urllib2 will PASS for the above.
But if I uncomment the IPv6 address, opening the local file fails. I am
not sure how local file access is done with IPv6 and should urllib2
(local file opening function) itself needs to be modified. Shall check
into that, with next version.
|
msg69209 - (view) |
Author: Facundo Batista (facundobatista) * |
Date: 2008-07-03 17:27 |
What I don't understand here is... if gethostbyname() lacks of IPv6
support, instead of creating a new function why not to add the
functionality to that same function?
Right now gethostbyname() is implemented in C, which would be the
drawback of making it a Python function?
|
msg78716 - (view) |
Author: Derek Morr (dmorr) |
Date: 2009-01-01 18:49 |
Senthil,
I don't think your gethost_addrinfo() function will work. On a v6-
enabled machine, it will only return v6 or v4 names. Shouldn't it
return both (since a machine could have both v4 and v6 addresses)? For
example, on my machine, I have the following addresses for
"localhost": ::1, fe80::1%lo0, 127.0.0.1.
Also, why is the AI_CANONNAME flag set? The canonname field isn't used.
And you only appear to take the last IP address returned (sa[0]).
Shouldn't you return all the addresses?
|
msg78717 - (view) |
Author: Derek Morr (dmorr) |
Date: 2009-01-01 18:52 |
Question: Why does FTPHandler.ftp_open() try to resolve the hostname()?
The hostname will be passed into connect_ftp(), then into
urllib.ftpwrapper(), and eventually into ftplib.FTP.connect(), which is
IPv6-aware.
|
msg78750 - (view) |
Author: Senthil Kumaran (orsenthil) * |
Date: 2009-01-02 03:49 |
Derek,
This patch was along the lines that when IPv6 address is present, return
the first address,which I assumed to be active address and would make
the urllib2 work.
I am not sure, if returning all the addresses would help and how would
we define which address to use?
AI_CANONNAME flag, I don't accurately remember it now. But I had
encountered issues when testing on IPv-4 systems without it.
I am having different opinion on this issue now.
First is, taking from Facundo's comment on having this functionality in
gethostbyname() and implementing it in C.
Second is, the wrapper function and suitable way needs to be defined.
I am sorry, I fail to understand the question on why ftp_open does
hostname resolution. You mean to say without it, if we pass it to
ftplib.FTP.connect() it would work for IPv6 address?
|
msg78754 - (view) |
Author: Derek Morr (dmorr) |
Date: 2009-01-02 04:13 |
My understanding is that the FileHandler checks if the file:// URL
contains the hostname or localhost IP of the local machine (isn't that
what FileHandler.names is for?). So, shouldn't the following URLs all
open the same file:
file:///foo.txt
file://localhost/foo.txt
file://127.0.0.1/foo.txt
file://[::1]/foo.txt
If that is the case, then doesn't FileHandler.names need to have all of
those values in it?
I am a little confused by this though. It looks like
FileHandler.file_open() checks if there is a hostname in the URL, and
if so, uses FTPHandler instead. So why does FileHandler.open_local_file
check the hostname value?
For your other points, gethostbyname() in libc can only handle IPv4
addresses. The IETF defined the getaddrinfo() interface as an IP
version neutral replacement. I would recommend using getaddrinfo().
Yes, FTPHandler creates an urllib.FTPWrapper object. That object calls
into ftplib, which is already IPv6-capable. So, I don't think we need
to do hostname resolution in FTPHandler.
|
msg78759 - (view) |
Author: Senthil Kumaran (orsenthil) * |
Date: 2009-01-02 05:29 |
> I am a little confused by this though. It looks like
> FileHandler.file_open() checks if there is a hostname in the URL, and
> if so, uses FTPHandler instead. So why does FileHandler.open_local_file
> check the hostname value?
You are right. Even I had observed this, but did not dispute it. Let
me try to look into the history to see why it so. Perhaps it needs to
change.
> For your other points, gethostbyname() in libc can only handle IPv4
> addresses. The IETF defined the getaddrinfo() interface as an IP
> version neutral replacement. I would recommend using getaddrinfo().
> Yes, FTPHandler creates an urllib.FTPWrapper object. That object calls
> into ftplib, which is already IPv6-capable. So, I don't think we need
> to do hostname resolution in FTPHandler.
Thanks for the info. I shall look into both in revision of the path.
1) using getaddrinfo() for IP version neutral call.
2) passing the hostname directly to ftplib. ( I am not sure of
consequences, need to investigate).
|
msg84810 - (view) |
Author: Ned Deily (ned.deily) * |
Date: 2009-03-31 15:52 |
Note also Issue5625 - any work for IPv6 should keep in mind that local
hosts may have more than one IP address.
|
msg116619 - (view) |
Author: Mark Lawrence (BreamoreBoy) * |
Date: 2010-09-16 21:49 |
@Senthil should this be assigned to your good self?
|
msg165931 - (view) |
Author: Ramchandra Apte (Ramchandra Apte) * |
Date: 2012-07-20 14:31 |
Bump.
|
|
Date |
User |
Action |
Args |
2022-04-11 14:56:23 | admin | set | github: 44672 |
2012-07-20 14:31:47 | Ramchandra Apte | set | nosy:
+ Ramchandra Apte messages:
+ msg165931
|
2010-11-22 11:13:08 | orsenthil | set | assignee: facundobatista -> orsenthil
nosy:
- BreamoreBoy |
2010-09-16 21:49:37 | BreamoreBoy | set | nosy:
+ BreamoreBoy messages:
+ msg116619
|
2010-07-11 02:18:54 | terry.reedy | set | versions:
+ Python 3.2, - Python 2.6, Python 3.0 |
2010-05-20 20:28:41 | skip.montanaro | set | nosy:
- skip.montanaro
|
2009-03-31 15:52:00 | ned.deily | set | nosy:
+ ned.deily messages:
+ msg84810
|
2009-01-02 05:29:47 | orsenthil | set | messages:
+ msg78759 |
2009-01-02 04:13:47 | dmorr | set | messages:
+ msg78754 |
2009-01-02 03:49:47 | orsenthil | set | messages:
+ msg78750 |
2009-01-01 18:52:52 | dmorr | set | messages:
+ msg78717 |
2009-01-01 18:49:04 | dmorr | set | nosy:
+ dmorr messages:
+ msg78716 |
2008-07-03 17:27:58 | facundobatista | set | assignee: facundobatista messages:
+ msg69209 nosy:
+ facundobatista |
2008-06-30 16:33:37 | orsenthil | set | type: enhancement |
2007-11-23 08:29:27 | christian.heimes | set | versions:
+ Python 2.6, Python 3.0 |
2007-09-25 03:38:12 | orsenthil | set | files:
+ test_urllib2-getaddrinfo.patch |
2007-09-25 03:37:43 | orsenthil | set | files:
+ urllib2-getaddrinfo.patch nosy:
+ orsenthil messages:
+ msg56125 |
2007-08-30 22:26:57 | skip.montanaro | set | nosy:
+ skip.montanaro |
2007-03-07 05:19:06 | dcantrell-rh | create | |