This issue tracker has been migrated to GitHub, and is currently read-only.
For more information, see the GitHub FAQs in the Python's Developer Guide.

Author stutzbach
Recipients amaury.forgeotdarc, kristjan.jonsson, loewis, pitrou, rhettinger, stutzbach, tim.peters
Date 2010-07-10.00:33:33
SpamBayes Score 0.0022037185
Marked as misclassified No
Message-id <AANLkTimp30NPP1E9Jvg0mk4glWOo0q6BnhW4sz--uGwL@mail.gmail.com>
In-reply-to <1278715713.41.0.647823933206.issue9141@psf.upfronthosting.co.za>
Content
2010/7/9 Kristján Valur Jónsson <report@bugs.python.org>

> Your message was classified as spam, I have no idea why, but this is why I
> only noticed it now.
>

Yes, I just noticed that tonight as well.  I filed a bug on the meta-tracker
in the hopes that someone can dig into the underlying cause.

> Your suggestion is interesting, I hadn't thought of that.  Yes, it is
> possible to use the track/untrack functions (I think), but that would mean
> that you would have to monitor your object for every state change and
> reliably detect the transition from one state to another.  Being able to
> query the current state and let gc know:  "No, I cannot be collected as I am
> now" is a much more robust solution from the programmers perspective.
>
> A further difference is this:  If an object isn't tracked, it won't be
> collected if it is part of a cycle, but it will not be put in gc.garbage
> either.  In effect, it will just remain in memory, unreachable, with no
> chance of it ever being released.
>

Yes, I see.  I have used the track/untrack approach, but it was in a very
different situation.  I have a long-running C function which keeps alive a
large number of objects.  At the start of the function, I untrack them all
as a performance optimization, so the garbage collector does not have to
spend time traversing them.  Before the function returns, I track them
again.

I see now why that wouldn't work for your use-case.  Thank you.  I like the
idea of your patch to give opportunities to tell the GC that they are
uncollectable on the fly.

I'm concerned about the performance impact of making tp_traverse do
double-duty, though.  Calling tp_traverse for every object in a cycle will
have the effect of making an extra pass on every reference from every object
participating in the cycle.  For example, consider a large list that's part
of a cycle.  If we call the list's tp_traverse to establish if
it's collectible, list's tp_traverse will call visit() on every item in the
list.  Even though you've made visit() a do-nothing function, that's still a
function call per reference.  It seems a shame to do all of that work
unnecessarily.
Files
File name Uploaded
unnamed stutzbach, 2010-07-10.00:33:30
History
Date User Action Args
2010-07-10 00:33:36stutzbachsetrecipients: + stutzbach, tim.peters, loewis, rhettinger, amaury.forgeotdarc, pitrou, kristjan.jonsson
2010-07-10 00:33:33stutzbachlinkissue9141 messages
2010-07-10 00:33:33stutzbachcreate