msg298008 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-09 23:48 |
http://buildbot.python.org/all/builders/AMD64%20FreeBSD%2010.x%20Shared%203.x/builds/557/steps/test/logs/stdio
test_handle_called_with_mp_queue (test.test_logging.QueueListenerTest) ... Warning -- threading_cleanup() failed to cleanup -1 threads after 3 sec (count: 0, dangling: 1)
ok
|
msg298009 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-09 23:55 |
The load average was 3.15:
0:04:33 load avg: 3.15 [176/406] test_logging failed (env changed)
--
Another fail on AMD64 FreeBSD CURRENT Non-Debug 3.x:
http://buildbot.python.org/all/builders/AMD64%20FreeBSD%20CURRENT%20Non-Debug%203.x/builds/568/steps/test/logs/stdio
0:01:56 load avg: 3.45 [ 44/406] test_logging failed (env changed)
...
test_output (test.test_logging.UnixSocketHandlerTest) ... ok
test_output (test.test_logging.UnixDatagramHandlerTest) ... ok
test_output (test.test_logging.UnixSysLogHandlerTest) ... ok
test__all__ (test.test_logging.MiscTestCase) ... ok
test_handle_called_with_mp_queue (test.test_logging.QueueListenerTest) ... Warning -- threading_cleanup() failed to cleanup -1 threads after 4 sec (count: 0, dangling: 1)
ok
test_handle_called_with_queue_queue (test.test_logging.QueueListenerTest) ... ok
test_no_messages_in_queue_after_stop (test.test_logging.QueueListenerTest) ... ok
|
msg298010 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-10 00:01 |
Previous issue which fixed QueueListenerTest of test_logging is bpo-30131:
commit 8ca2f2faefa8dba323a2e4c4b86efb633d7a53cf
Author: Victor Stinner <victor.stinner@gmail.com>
Date: Wed Apr 26 15:56:25 2017 +0200
bpo-30131: test_logging now joins queue threads (#1298)
QueueListenerTest of test_logging now closes the multiprocessing
Queue and joins its thread to prevent leaking dangling threads to
following tests.
Add also @support.reap_threads to detect earlier if a test leaks
threads (and try to "cleanup" these threads).
|
msg298011 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-10 00:16 |
While trying to reproduce the bug, I got:
test_handle_called_with_mp_queue (test.test_logging.QueueListenerTest) ... /usr/home/haypo/cpython/Lib/test/support/__init__.py:1515: ResourceWarning: unclosed <socket.socket fd=6, family=AddressFamily.AF_INET, type=536870913, proto=0, laddr=('127.0.0.1', 8166), raddr=('127.0.0.1', 8167)>
gc.collect()
ok
|
msg298014 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-10 00:30 |
The problem is that multiprocessing.Queue.join_thread() does nothing since the thread wasn't started by a subprocess.
See also bpo-30171: Emit ResourceWarning in multiprocessing Queue destructor.
|
msg298037 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-10 09:17 |
The warning is a race condition which can be reproduced easily on Linux using attached test_handle_called_with_mp_queue-bug.patch, run:
haypo@selma$ ./python -m test --fail-env-changed -m test_handle_called_with_mp_queue test_logging
Run tests sequentially
0:00:00 load avg: 0.22 [1/1] test_logging
Warning -- threading_cleanup() failed to cleanup 20 threads after 0 sec (count: 20, dangling: 21)
Warning -- threading._dangling was modified by test_logging
Before: <_weakrefset.WeakSet object at 0x7fe1df5302c8>
After: <_weakrefset.WeakSet object at 0x7fe1df5338e0>
test_logging failed (env changed)
1 test altered the execution environment:
test_logging
Total duration: 718 ms
Tests result: ENV CHANGED
|
msg298038 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-10 09:27 |
https://github.com/python/cpython/pull/2642 fixes the warning. I tested the change with test_handle_called_with_mp_queue-bug.patch: no more warning.
Sorry, I don't know multiprocessing to understand the purpose of the removed test.
I would like to really make sure that a Queue object doesn't "leak" a thread when I close .close() + .join_thread(). It's surprising that .join_thread() doesn't join anything and leave a thread running in the background. Even if in the common case, when the system load is low, the thread quits quickly thanks to .close().
|
msg298040 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-10 09:35 |
Hum, interesting, created_by_this_process was already removed from Python 2.7 in bpo-4106:
commit 77657e40fa5f43fe6f7ffb6e32da4613dba657e1
Author: Antoine Pitrou <solipsis@pitrou.net>
Date: Wed Aug 24 22:41:05 2011 +0200
Issue #4106: Fix occasional exceptions printed out by multiprocessing on interpreter shutdown.
This bug doesn't seem to exist on 3.2, where daemon threads are killed
before Py_Finalize() is entered.
|
msg298041 - (view) |
Author: Antoine Pitrou (pitrou) * |
Date: 2017-07-10 09:39 |
> I would like to really make sure that a Queue object doesn't "leak" a thread when I close .close() + .join_thread().
I don't understand how this happens. The Finalize object only acts as an atexit handler. When called as a regular finalize, `self._thread` is dead and therefore `_finalize_join()` doesn't do anything.
|
msg298042 - (view) |
Author: Antoine Pitrou (pitrou) * |
Date: 2017-07-10 09:40 |
Oh, that's because you're calling join_thread() explicitly. I see. I agree that the fix looks desirable then.
|
msg298043 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-10 09:40 |
> I don't understand how this happens.
If you run "./python -m test --fail-env-changed -m test_handle_called_with_mp_queue test_logging" with attached test_handle_called_with_mp_queue-bug.patch, no finalizer is registered: .join_thread() does nothing, because created_by_this_process is true.
|
msg298051 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-10 10:21 |
> Oh, that's because you're calling join_thread() explicitly. I see. I agree that the fix looks desirable then.
FYI I added join_thread() in my first attempt to fix "Warning -- threading._dangling was modified by test_logging": bpo-30131, commit 8ca2f2faefa8dba323a2e4c4b86efb633d7a53cf. I expected that join_thread() would... join the thread :-)
|
msg298052 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-10 10:22 |
I suggest to backport the fix up to Python 3.5.
|
msg298054 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-10 10:45 |
New changeset 3b69d911c57ef591ac0c0f47a66dbcad8337f33a by Victor Stinner in branch 'master':
bpo-30886: Fix multiprocessing.Queue.join_thread() (#2642)
https://github.com/python/cpython/commit/3b69d911c57ef591ac0c0f47a66dbcad8337f33a
|
msg298055 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-10 11:43 |
New changeset 69e41807f0851ff1107f949dcdc94dbb0af32acd by Victor Stinner in branch '3.5':
bpo-30886: Fix multiprocessing.Queue.join_thread() (#2642) (#2644)
https://github.com/python/cpython/commit/69e41807f0851ff1107f949dcdc94dbb0af32acd
|
msg298056 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-10 11:43 |
New changeset 7f3d65d6e4f8bebaaf996efb1c1adb67eb1724cb by Victor Stinner in branch '3.6':
bpo-30886: Fix multiprocessing.Queue.join_thread() (#2642) (#2643)
https://github.com/python/cpython/commit/7f3d65d6e4f8bebaaf996efb1c1adb67eb1724cb
|
msg298057 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-10 11:49 |
Ok, I applied my fix to 3.5, 3.6 and master branches.
Thanks for the review Antoine.
|
msg298089 - (view) |
Author: STINNER Victor (vstinner) * |
Date: 2017-07-10 21:40 |
I'm not sure that the bug is fully fixed, I still saw a warning on:
http://buildbot.python.org/all/builders/AMD64%20FreeBSD%2010.x%20Shared%203.x/builds/561/
This build tested the commit aa8d0a24694bea05061f1920ec3f944a9e6799d5 which is more recent than commit 3b69d911c57ef591ac0c0f47a66dbcad8337f33a.
test_handle_called_with_mp_queue (test.test_logging.QueueListenerTest) ...
Warning -- threading_cleanup() failed to cleanup -1 threads after 5 sec (count: 0, dangling: 1)
ok
|
|
Date |
User |
Action |
Args |
2022-04-11 14:58:48 | admin | set | github: 75069 |
2017-07-10 21:40:03 | vstinner | set | messages:
+ msg298089 |
2017-07-10 11:49:09 | vstinner | set | status: open -> closed resolution: fixed messages:
+ msg298057
stage: resolved |
2017-07-10 11:43:22 | vstinner | set | messages:
+ msg298056 |
2017-07-10 11:43:19 | vstinner | set | messages:
+ msg298055 |
2017-07-10 10:47:07 | vstinner | set | pull_requests:
+ pull_request2709 |
2017-07-10 10:46:38 | vstinner | set | pull_requests:
+ pull_request2708 |
2017-07-10 10:45:25 | vstinner | set | messages:
+ msg298054 |
2017-07-10 10:22:06 | vstinner | set | messages:
+ msg298052 versions:
+ Python 3.5, Python 3.6 |
2017-07-10 10:21:47 | vstinner | set | messages:
+ msg298051 |
2017-07-10 09:40:30 | vstinner | set | messages:
+ msg298043 |
2017-07-10 09:40:20 | pitrou | set | messages:
+ msg298042 |
2017-07-10 09:39:14 | pitrou | set | messages:
+ msg298041 |
2017-07-10 09:35:17 | vstinner | set | messages:
+ msg298040 |
2017-07-10 09:27:39 | vstinner | set | nosy:
+ pitrou, davin
title: test_handle_called_with_mp_queue() of test_logging leaks a thread on AMD64 FreeBSD 10.x Shared 3.x -> multiprocessing.Queue.join_thread() does nothing if created and use in the same process |
2017-07-10 09:27:06 | vstinner | set | messages:
+ msg298038 |
2017-07-10 09:24:34 | vstinner | set | pull_requests:
+ pull_request2707 |
2017-07-10 09:17:50 | vstinner | set | files:
+ test_handle_called_with_mp_queue-bug.patch keywords:
+ patch messages:
+ msg298037
|
2017-07-10 00:30:35 | vstinner | set | messages:
+ msg298014 |
2017-07-10 00:16:33 | vstinner | set | messages:
+ msg298011 |
2017-07-10 00:01:09 | vstinner | set | messages:
+ msg298010 |
2017-07-09 23:55:16 | vstinner | set | messages:
+ msg298009 |
2017-07-09 23:48:51 | vstinner | create | |