Author eisele
Recipients belopolsky, eisele, loewis, rhettinger
Date 2008-04-11.02:46:13
Message-id <1207881975.23.0.774163354534.issue2607@psf.upfronthosting.co.za>
Content
Sorry for not giving a good example in the first place.
The problem seems to appear only when the dict holds
sufficiently many distinct tuples as keys. In that case I see
running time that looks rather like O(n*n).
Here is an example that shows the problem:

>>> from time import clock
>>> d = {}
>>> t0 = clock()
>>> for i in range(5):
...     for j in range(i*1000000, (i+1)*1000000):
...         d[str(j), str(j)] = j
...     print clock() - t0
...
13.04
39.51
81.86
134.18
206.66
>>>
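
To make the growth easier to judge, the cumulative timings printed above can be turned into per-batch costs. This is only a small sketch over the five numbers from the session. The names `cumulative` and `per_batch` are illustrative, and it is written for Python 3 rather than the Python 2 interpreter used above.

```python
# Cumulative seconds reported in the session, one value per batch of
# 1,000,000 insertions.
cumulative = [13.04, 39.51, 81.86, 134.18, 206.66]

# Cost of each individual batch: the difference between consecutive readings.
per_batch = [round(cumulative[0], 2)] + [
    round(later - earlier, 2)
    for earlier, later in zip(cumulative, cumulative[1:])
]

for batch_number, seconds in enumerate(per_batch, start=1):
    print(batch_number, seconds)
```

With linear behavior every batch would cost about the same. Here the per-batch cost rises from 13.04 s to 72.48 s, which is consistent with the roughly quadratic total described above.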

The same example with str(j)+str(j) as the key works fine.
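
For anyone who wants to compare the two key forms side by side, here is a minimal sketch for Python 3. The helpers `time_inserts`, `tuple_key` and `concat_key` are hypothetical names, not part of the original session. `time.perf_counter` is used because `time.clock` no longer exists in current Python versions. A modern interpreter may not show the same slowdown.

```python
from time import perf_counter  # stand-in for time.clock(), removed in Python 3.8


def time_inserts(make_key, batches=5, batch_size=1000):
    """Insert batches * batch_size keys into a fresh dict.

    Returns the dict and the cumulative elapsed seconds after each batch,
    mirroring the loop structure of the session above.
    """
    d = {}
    timings = []
    t0 = perf_counter()
    for i in range(batches):
        for j in range(i * batch_size, (i + 1) * batch_size):
            d[make_key(j)] = j
        timings.append(perf_counter() - t0)
    return d, timings


def tuple_key(j):
    # Key form that showed the slowdown: a 2-tuple of strings.
    return (str(j), str(j))


def concat_key(j):
    # Key form reported to work fine: one concatenated string.
    return str(j) + str(j)
```

Calling `time_inserts(tuple_key, batch_size=1000000)` and `time_inserts(concat_key, batch_size=1000000)` corresponds to the sizes used in the session. The default `batch_size` is kept small so that a quick trial does not need several gigabytes of memory.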

Sorry if this turns out to be a non-issue. For me it is a
reason to implement functionality in C or Perl
that I would really love to do in Python.
I would call such a thing a performance bug, but
maybe I'm just too demanding...

Best regards,
Andreas