concurrent.futures.as_completed() fails when given duplicate Futures #64566

glangford · 2014-01-23T15:00:48Z

BPO	20367
Nosy	@gvanrossum, @tim-one, @mdickinson, @vstinner, @1st1
Files	test_dupfuture.py issue20367.patch issue20367.patch issue20367.patch issue20367.patch

Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.

GitHub fields:

assignee = None
closed_at = <Date 2014-01-27.08:14:07.882>
created_at = <Date 2014-01-23.15:00:48.109>
labels = ['type-bug', 'library']
title = 'concurrent.futures.as_completed() fails when given duplicate Futures'
updated_at = <Date 2014-02-06.00:07:36.755>
user = 'https://bugs.python.org/glangford'

bugs.python.org fields:

activity = <Date 2014-02-06.00:07:36.755>
actor = 'gvanrossum'
assignee = 'none'
closed = True
closed_date = <Date 2014-01-27.08:14:07.882>
closer = 'vstinner'
components = ['Library (Lib)']
creation = <Date 2014-01-23.15:00:48.109>
creator = 'glangford'
dependencies = []
files = ['33658', '33684', '33685', '33686', '33728']
hgrepos = []
issue_num = 20367
keywords = ['patch']
message_count = 19.0
messages = ['208952', '208956', '208957', '208961', '209081', '209084', '209091', '209093', '209102', '209267', '209321', '209338', '209359', '209360', '209361', '209367', '209413', '210352', '210353']
nosy_count = 7.0
nosy_names = ['gvanrossum', 'tim.peters', 'mark.dickinson', 'vstinner', 'python-dev', 'yselivanov', 'glangford']
pr_nums = []
priority = 'normal'
resolution = 'fixed'
stage = 'resolved'
status = 'closed'
superseder = None
type = 'behavior'
url = 'https://bugs.python.org/issue20367'
versions = ['Python 3.3', 'Python 3.4']

glangford · 2014-01-23T15:00:48Z

For a Future f that has not yet completed, concurrent.futures.as_completed([f,f]) yields f twice and then fails with a KeyError.

If the Future has already completed, as_completed([f,f]) will yield f once and does not trigger an exception.

What is the correct behaviour?
as_completed( [f,f] ) -> yield f twice ?
wait( [f,f], return_when=ALL_COMPLETED ) -> yield f twice ?
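For readers who want to try this, here is a minimal sketch of a reproduction. It is not the attached test_dupfuture.py; the `run_demo` helper and the `gate` event are hypothetical names used only for illustration. The Future is held pending with a `threading.Event` so that the duplicate reaches the waiting code path. On an interpreter that includes the fix for this issue, `f` should be yielded once; on an affected version the `as_completed()` call is where the KeyError described above would surface.

```python
import threading
from concurrent.futures import (
    ALL_COMPLETED,
    ThreadPoolExecutor,
    as_completed,
    wait,
)


def run_demo():
    """Pass the same pending Future twice to as_completed() and wait()."""
    gate = threading.Event()
    with ThreadPoolExecutor(max_workers=1) as executor:
        # f stays pending until the gate is opened.
        f = executor.submit(gate.wait)
        # Open the gate shortly after as_completed() has started waiting.
        timer = threading.Timer(0.1, gate.set)
        timer.start()
        yielded = list(as_completed([f, f]))
        timer.join()
        # wait() returns two sets, so duplicates cannot appear in its result.
        done, not_done = wait([f, f], return_when=ALL_COMPLETED)
    return f, yielded, done, not_done


if __name__ == "__main__":
    f, yielded, done, not_done = run_demo()
    print("as_completed yielded f", len(yielded), "time(s)")
    print("wait() done:", len(done), "not_done:", len(not_done))
```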

glangford · 2014-01-23T15:15:44Z

There is a subtlety in the as_completed() code which explains a lot - note that "finished" starts off as a set in the _AcquireFutures block. So if a Future f has already completed,
as_completed( [f,f] )
will only yield f once, because f appears once in the finished set.

Later on when waiter events are processed, "finished" turns into a list because of the line:

finished = waiter.finished_futures

So any duplicates in that list will cause problems in pending.remove(Future).
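The set-versus-list difference can be illustrated with a toy model. This is a sketch of the pattern described above, not the actual concurrent.futures source; `old_style_drain` and `consume` are hypothetical names, and a plain `object()` stands in for a Future.

```python
def old_style_drain(pending, finished):
    """Toy model: yield each finished item, then drop it from pending."""
    for future in finished:
        yield future
        # set.remove() raises KeyError if the item is already gone.
        pending.remove(future)


def consume(pending, finished):
    """Collect what the generator yields and capture a KeyError, if any."""
    seen = []
    error = None
    try:
        for future in old_style_drain(pending, finished):
            seen.append(future)
    except KeyError as exc:
        error = exc
    return seen, error


f = object()  # stand-in for a Future

# "finished" as a set: the duplicate collapses, f is seen once, no error.
seen_set, error_set = consume({f}, {f, f})

# "finished" as a list with a duplicate: f is seen twice, and the second
# pending.remove(f) fails because f was already removed.
seen_list, error_list = consume({f}, [f, f])
```

In this model the list case reproduces the reported symptom: the item is yielded twice and the failure only appears when the generator is resumed after the second yield.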

vstinner · 2014-01-23T15:22:41Z

Since the new asyncio module also has a Future class and functions like as_completed(), this issue also concerns asyncio. concurrent.futures and asyncio should have the same behaviour.

gvanrossum · 2014-01-23T15:47:38Z

I think the caller was wrong to pass in [f, f] in the first place.

In asyncio, the argument is converted into a set before using it, but there's still a bug if you pass it a list containing two references to the same coroutine -- it gets wrapped in two separate Futures. I think the better behavior is to convert to a set first and then map coroutines to Futures.

So concurrent.futures should also convert to a set first.
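As a rough sketch of the "convert to a set first" idea from the caller's side: `as_completed_unique` below is a hypothetical wrapper, not part of concurrent.futures and not the patch attached to this issue. It simply deduplicates the input before handing it to the real `as_completed()`.

```python
from concurrent.futures import Future, as_completed


def as_completed_unique(fs, timeout=None):
    """Hypothetical helper: drop duplicate Futures, then delegate."""
    # Futures are hashable by identity, so set() removes repeated references.
    return as_completed(set(fs), timeout=timeout)


# Two distinct, already-completed Futures; f is passed twice.
f = Future()
f.set_result(1)
g = Future()
g.set_result(2)

results = list(as_completed_unique([f, g, f]))
```

Because a set has no defined order, callers of such a wrapper should not rely on the order in which already-completed Futures come back.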

glangford · 2014-01-24T14:25:12Z

Proposed patch for as_completed(). bpo-20369 fixes wait(), and behaviour is consistent between the two.

vstinner · 2014-01-24T14:28:54Z

> Proposed patch for as_completed().

Could you please try to write a unit test? The unit test should fail without the patch and pass with the patch. Then create a new patch that includes the test.

If it's tricky to write a reliable test reproducing the race condition, you can use unittest.mock to mock some objects.

glangford · 2014-01-24T15:34:04Z

> Could you please try to write a unit test.

Revised patch with unit test for as_completed().

vstinner · 2014-01-24T15:55:05Z

Hum, you should also modify the documentation to make the behaviour explicit. Example: "Duplicate futures are only yielded once".

You may add the same sentence to the asyncio.as_completed() documentation. It looks like the asyncio tests don't test as_completed() with duplicate futures. You may write a new patch to modify the asyncio doc and tests. It should be very similar.

+ completed = [f for f in futures.as_completed( [f1,f1] ) ]

You can just use list(futures.as_completed([f1, f1])). Please put no spaces inside the parentheses (see PEP 8).

+ self.assertEqual( len(completed), 1 )

No spaces inside the parentheses (see PEP 8): self.assertEqual(len(completed), 1).

You may check the list value instead: self.assertEqual(completed, [f1])

(Why "f1" name? There is no f2, maybe rename to f?)
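Putting these review comments together, a test in the suggested style might look like the sketch below. It is not the test from the attached patch; the class name, the test name, the `run_tests` helper and the use of `pow` as the submitted callable are all assumptions made for illustration.

```python
import unittest
from concurrent import futures


class DuplicateFutureTest(unittest.TestCase):
    def test_duplicate_future_yielded_once(self):
        with futures.ThreadPoolExecutor(max_workers=1) as executor:
            future = executor.submit(pow, 2, 10)
            # The same Future is passed twice on purpose.
            completed = list(futures.as_completed([future, future]))
        # Checking the list value also checks its length.
        self.assertEqual(completed, [future])


def run_tests():
    """Run the test case above and return the unittest result object."""
    loader = unittest.defaultTestLoader
    suite = loader.loadTestsFromTestCase(DuplicateFutureTest)
    return unittest.TextTestRunner(verbosity=0).run(suite)


if __name__ == "__main__":
    run_tests()
```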

glangford · 2014-01-24T17:21:44Z

Thanks for the feedback. The new patch is modified for PEP 8, with naming consistent with the other concurrent.futures tests. I think the assertEqual is clearer when it checks the list length, so it is not changed. The docstring is updated.

I suggest asyncio be handled separately.

gvanrossum · 2014-01-26T01:55:28Z

LGTM. But you also need to update Doc/library/concurrent.futures.rst

I see this as a bugfix so it's not necessary to get this in before the beta 3 release tonight.

I will work on an asyncio patch and test.

glangford · 2014-01-26T14:20:26Z

Ah...ok, here is a patch that includes an update to Doc/library/concurrent.futures.rst as well.

python-dev · 2014-01-26T17:59:20Z

New changeset 58b0f3e1ddf8 by Guido van Rossum in branch 'default':
Fix issue bpo-20367: concurrent.futures.as_completed() for duplicate arguments.
http://hg.python.org/cpython/rev/58b0f3e1ddf8

python-dev · 2014-01-26T22:34:04Z

New changeset 1dac8c954488 by Victor Stinner in branch 'default':
Issue bpo-20367: Add Glenn Langford to Misc/ACKS
http://hg.python.org/cpython/rev/1dac8c954488

vstinner · 2014-01-26T22:36:24Z

@guido: Why not fix the issue in Python 3.3 too? You forgot to add Glenn Langford to Misc/ACKS!

@glenn: Congrats on your first commit :)

gvanrossum · 2014-01-26T22:42:35Z

Sorry, I suppose it needs to be backported to 3.3.

If someone wants to do that, please do (I'm afraid I'd mess up the merge).

glangford · 2014-01-26T23:43:54Z

@victor: Thank you, and I appreciate all your advice! I am still learning the dev environment but hope to be helpful. :)

python-dev · 2014-01-27T08:13:52Z

New changeset 791b69f9f96d by Victor Stinner in branch '3.3':
Issue bpo-20367: Fix behavior of concurrent.futures.as_completed() for duplicate
http://hg.python.org/cpython/rev/791b69f9f96d

1st1 · 2014-02-05T23:26:49Z

This one is still not merged in Tulip, right?

gvanrossum · 2014-02-06T00:07:37Z

Correct, if you want to work on it, see http://code.google.com/p/tulip/issues/detail?id=114

glangford mannequin added the stdlib (Python modules in the Lib dir) and type-bug (An unexpected behavior, bug, or error) labels Jan 23, 2014

gvanrossum closed this as completed Jan 26, 2014

gvanrossum reopened this Jan 26, 2014

vstinner closed this as completed Jan 27, 2014

ezio-melotti transferred this issue from another repository Apr 10, 2022
